Unknown

Dataset Information

0

Incorporating Domain Knowledge and Structure-Based Descriptors for Machine Learning: A Case Study of Pd-Catalyzed Sonogashira Reactions.


ABSTRACT: Machine learning has revolutionized information processing for large datasets across various fields. However, its limited interpretability poses a significant challenge when applied to chemistry. In this study, we developed a set of simple molecular representations to capture the structural information of ligands in palladium-catalyzed Sonogashira coupling reactions of aryl bromides. Drawing inspiration from human understanding of catalytic cycles, we used a graph neural network to extract structural details of the phosphine ligand, a major contributor to the overall activation energy. We combined these simple molecular representations with an electronic descriptor of aryl bromide as inputs for a fully connected neural network unit. The results allowed us to predict rate constants and gain mechanistic insights into the rate-limiting oxidative addition process using a relatively small dataset. This study highlights the importance of incorporating domain knowledge in machine learning and presents an alternative approach to data analysis.

SUBMITTER: Chan K 

PROVIDER: S-EPMC10302643 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Incorporating Domain Knowledge and Structure-Based Descriptors for Machine Learning: A Case Study of Pd-Catalyzed Sonogashira Reactions.

Chan Kalok K   Ta Long Thanh LT   Huang Yong Y   Su Haibin H   Lin Zhenyang Z  

Molecules (Basel, Switzerland) 20230613 12


Machine learning has revolutionized information processing for large datasets across various fields. However, its limited interpretability poses a significant challenge when applied to chemistry. In this study, we developed a set of simple molecular representations to capture the structural information of ligands in palladium-catalyzed Sonogashira coupling reactions of aryl bromides. Drawing inspiration from human understanding of catalytic cycles, we used a graph neural network to extract struc  ...[more]

Similar Datasets

| S-EPMC9354472 | biostudies-literature
| S-EPMC5472585 | biostudies-literature
| S-EPMC6902853 | biostudies-literature
| S-EPMC9178954 | biostudies-literature
| S-EPMC10401178 | biostudies-literature
| S-EPMC9061866 | biostudies-literature
| S-EPMC6438147 | biostudies-literature
| S-EPMC10458182 | biostudies-literature
| S-EPMC10686539 | biostudies-literature
| S-EPMC3164313 | biostudies-literature