Unknown

Dataset Information

0

Error bounds on the SCISSORS approximation method.


ABSTRACT: The SCISSORS method for approximating chemical similarities has shown excellent empirical performance on a number of real-world chemical data sets but lacks theoretically proven bounds on its worst-case error performance. This paper first proves reductions showing SCISSORS to be equivalent to two previous kernel methods: kernel principal components analysis and the rank-k Nystro?m approximation of a Gram matrix. These reductions allow the use of generalization bounds on these techniques to show that the expected error in SCISSORS approximations of molecular similarity kernels is bounded in expected pairwise inner product error, in matrix 2-norm and Frobenius norm for full kernel matrix approximations and in root-mean-square deviation for approximated matrices. Finally, we show that the actual performance of SCISSORS is significantly better than these worst-case bounds, indicating that chemical space is well-structured for chemical sampling algorithms.

SUBMITTER: Haque IS 

PROVIDER: S-EPMC3183166 | biostudies-literature | 2011 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Error bounds on the SCISSORS approximation method.

Haque Imran S IS   Pande Vijay S VS  

Journal of chemical information and modeling 20110908 9


The SCISSORS method for approximating chemical similarities has shown excellent empirical performance on a number of real-world chemical data sets but lacks theoretically proven bounds on its worst-case error performance. This paper first proves reductions showing SCISSORS to be equivalent to two previous kernel methods: kernel principal components analysis and the rank-k Nyström approximation of a Gram matrix. These reductions allow the use of generalization bounds on these techniques to show t  ...[more]

Similar Datasets

| S-EPMC8336423 | biostudies-literature
| S-EPMC6690833 | biostudies-literature
| S-EPMC4588586 | biostudies-other
| S-EPMC4391030 | biostudies-literature
| S-EPMC6292514 | biostudies-literature
| S-EPMC7182476 | biostudies-literature
| S-EPMC8607505 | biostudies-literature
| S-EPMC4207653 | biostudies-literature
| S-EPMC7826264 | biostudies-literature
| S-EPMC10062410 | biostudies-literature