
Dataset Information


A structural hierarchy matching approach for molecular similarity/substructure searching.


ABSTRACT: An approach to molecular similarity/substructure searching based on structural hierarchy matching is proposed. Small molecules are divided into two categories, acyclic and cyclic forms. The cyclic forms are further divided into three structural hierarchies: frameworks, complicated rings, and mono-rings. During searching, the similarity coefficient between a structural query and each retrieved molecule is calculated using the hierarchy of the query as the reference. A total of 13,911 chemicals were used in this work; their minimal cyclic and acyclic substructures were extracted and further processed into fuzzy structural fingerprints. The fingerprints then serve as search indices for molecular similarity or substructure searching. Tests show that this approach lets users choose between one-substructure and multi-substructure searching, with sorted results. The algorithm also has the potential to be developed further for molecular similarity searching and substructure analysis.
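
ILLUSTRATION: The abstract does not say which similarity coefficient or fingerprint encoding the authors use, so the following is only a minimal sketch of the general idea of scoring a candidate against the hierarchy levels that the query populates. The Tanimoto coefficient, the set-of-strings fingerprint, the level names, and the function names `tanimoto`, `hierarchy_similarity`, and `rank` are all assumptions made for illustration, not the published method.

```python
from typing import Dict, FrozenSet, List

# Level names follow the abstract's categories; the set-of-fragment-labels
# encoding is a hypothetical stand-in for the paper's fuzzy fingerprints.
LEVELS = ("framework", "complicated_ring", "mono_ring", "acyclic")

Fingerprint = Dict[str, FrozenSet[str]]


def tanimoto(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Tanimoto (Jaccard) coefficient of two sets; 0.0 if both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def hierarchy_similarity(query: Fingerprint, candidate: Fingerprint) -> Dict[str, float]:
    """Score the candidate on each level the query populates (query as reference)."""
    scores = {}
    for level in LEVELS:
        q = query.get(level, frozenset())
        if q:  # levels absent from the query are ignored
            scores[level] = tanimoto(q, candidate.get(level, frozenset()))
    return scores


def rank(query: Fingerprint, library: Dict[str, Fingerprint]) -> List[str]:
    """Return library keys sorted by mean per-level score, best first."""
    def mean_score(fp: Fingerprint) -> float:
        s = hierarchy_similarity(query, fp)
        return sum(s.values()) / len(s) if s else 0.0
    return sorted(library, key=lambda name: mean_score(library[name]), reverse=True)


# Placeholder fragment labels, not data from the 13,911-compound set.
query = {"mono_ring": frozenset({"benzene"}), "acyclic": frozenset({"C=O"})}
library = {
    "A": {"mono_ring": frozenset({"benzene"}), "acyclic": frozenset({"C=O", "N"})},
    "B": {"mono_ring": frozenset({"pyridine"})},
}
print(rank(query, library))
```

Averaging the per-level scores is one arbitrary way to produce a sorted result list; a weighted scheme that favours the framework level would fit the same structure.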

SUBMITTER: Ji SS 

PROVIDER: S-EPMC6272706 | biostudies-literature | 2015 May

REPOSITORIES: biostudies-literature


Publications

A structural hierarchy matching approach for molecular similarity/substructure searching.

Ji Shu-Shen, Dong Hong-Ju, Zhou Xin-Xin, Liu Ya-Min, Zhang Feng-Xue, Wang Qi, Huang Xin-An

Molecules (Basel, Switzerland), 2015-05-15, issue 5



Similar Datasets

| S-EPMC2739202 | biostudies-literature
| S-EPMC2996407 | biostudies-literature
| S-EPMC2908588 | biostudies-literature
| S-EPMC2944279 | biostudies-literature
| S-EPMC4830209 | biostudies-literature
| S-EPMC3125773 | biostudies-literature
| S-EPMC2718661 | biostudies-literature
| S-EPMC6934298 | biostudies-literature