Unknown

Dataset Information

0

QM7-X, a comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules.


ABSTRACT: We introduce QM7-X, a comprehensive dataset of 42 physicochemical properties for ?4.2 million equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures-comprised of constitutional/structural isomers and stereoisomers, e.g., enantiomers and diastereomers (including cis-/trans- and conformational isomers)-as well as 100 non-equilibrium structural variations thereof to reach a total of ?4.2 million molecular structures. Computed at the tightly converged quantum-mechanical PBE0+MBD level of theory, QM7-X contains global (molecular) and local (atom-in-a-molecule) properties ranging from ground state quantities (such as atomization energies and dipole moments) to response quantities (such as polarizability tensors and dispersion coefficients). By providing a systematic, extensive, and tightly-converged dataset of quantum-mechanically computed physicochemical properties, we expect that QM7-X will play a critical role in the development of next-generation machine-learning based models for exploring greater swaths of CCS and performing in silico design of molecules with targeted properties.

SUBMITTER: Hoja J 

PROVIDER: S-EPMC7854709 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

QM7-X, a comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules.

Hoja Johannes J   Medrano Sandonas Leonardo L   Ernst Brian G BG   Vazquez-Mayagoitia Alvaro A   DiStasio Robert A RA   Tkatchenko Alexandre A  

Scientific data 20210202 1


We introduce QM7-X, a comprehensive dataset of 42 physicochemical properties for ≈4.2 million equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures-comprised of constitutional/structural isomers and stereoisomers, e.g., enantiomers and diastereomers (including cis-/trans- a  ...[more]

Similar Datasets

| S-EPMC9174255 | biostudies-literature
| S-EPMC6713865 | biostudies-literature
| S-EPMC6321265 | biostudies-literature
| S-EPMC11362161 | biostudies-literature
| S-EPMC10442335 | biostudies-literature
| S-EPMC5603848 | biostudies-literature
| S-EPMC8179589 | biostudies-literature
| S-EPMC3104521 | biostudies-literature
| S-EPMC7374734 | biostudies-literature
| S-EPMC6478689 | biostudies-literature