Dataset Information

Deep Retrosynthetic Reaction Prediction using Local Reactivity and Global Attention.

ABSTRACT: As a fundamental problem in chemistry, retrosynthesis aims at designing reaction pathways and intermediates for a target compound. The goal of artificial intelligence (AI)-aided retrosynthesis is to automate this process by learning from the previous chemical reactions to make new predictions. Although several models have demonstrated their potentials for automated retrosynthesis, there is still a significant need to further enhance the prediction accuracy to a more practical level. Here we propose a local retrosynthesis framework called LocalRetro, motivated by the chemical intuition that the molecular changes occur mostly locally during the chemical reactions. This differs from nearly all existing retrosynthesis methods that suggest reactants based on the global structures of the molecules, often containing fine details not directly relevant to the reactions. This local concept yields local reaction templates involving the atom and bond edits. Because the remote functional groups can also affect the overall reaction path as a secondary aspect, the proposed locally encoded retrosynthesis model is then further refined to account for the nonlocal effects of chemical reaction through a global attention mechanism. Our model shows a promising 89.5 and 99.2% round-trip accuracy at top-1 and top-5 predictions for the USPTO-50K dataset containing 50 016 reactions. We further demonstrate the validity of LocalRetro on a large dataset containing 479 035 reactions (UTPTO-MIT) with comparable round-trip top-1 and top-5 accuracy of 87.0 and 97.4%, respectively. The practical application of the model is also demonstrated by correctly predicting the synthesis pathways of five drug candidate molecules from various literature.

SUBMITTER: Chen S

PROVIDER: S-EPMC8549044 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Deep Retrosynthetic Reaction Prediction using Local Reactivity and Global Attention.

Chen Shuan S Jung Yousung Y

JACS Au 20210805 10

As a fundamental problem in chemistry, retrosynthesis aims at designing reaction pathways and intermediates for a target compound. The goal of artificial intelligence (AI)-aided retrosynthesis is to automate this process by learning from the previous chemical reactions to make new predictions. Although several models have demonstrated their potentials for automated retrosynthesis, there is still a significant need to further enhance the prediction accuracy to a more practical level. Here we prop ...[more]

PMID: 34723264

Similar Datasets

Project description:The prediction of energy consumption in households is essential due to the reliance on electrical appliances for daily activities. Accurate assessment of energy demand is crucial for effective energy generation, preventing overloads and optimizing energy storage. Traditional techniques have limitations in accuracy and error rates, necessitating advancements in prediction techniques. To enhance prediction accuracy, a proposed smart city system utilizes the Household Energy Consumption dataset, employing deep learning algorithms. In the beginning, data pre-processing addresses missing values and performs feature scaling for normalizing independent variables. Followed by that, Modified Deep CNN-Bi-LSTM (Convolutional Neural Network and Bi-directional Long Short Term Memory) with attention mechanism is utilized for regression which extracts temporal and spatial complex features. Deep CNN extracts features impacting energy consumption whereas Bi-LSTM with attention layer finds suitability for regression as it is capable of modelling irregular trends in the time-series components, where the attention mechanism is implemented to enhance the decoder's ability to selectively focus on the most relevant segments of the input sequence. This is achieved through a weighted integration of all encoded input trajectories, allowing the model to dynamically emphasize the vectors that carry the highest significance for accurate predictions. Based on regression outcomes from analysis taken in hourly, daily and monthly time intervals, enhanced prediction accuracy is estimated through evaluation metrics such as MSE (Mean Square Error), MAPE (Mean Absolute Percentage Error) and RMSE (Root Mean Square Error) which determines the efficacy of the system, where Specifically, the proposed model achieves MSE of 0.123, MAE of 0.22, and MAPE of 324.12. Furthermore, this model demonstrates a training time of 692.12 s and a prediction time of just 1.87 s. Therefore, present research highlights the critical need for accurate energy consumption prediction in households, driven by the increasing reliance on electrical appliances in daily life.

Dataset Information

Deep Retrosynthetic Reaction Prediction using Local Reactivity and Global Attention.

Publications

Deep Retrosynthetic Reaction Prediction using Local Reactivity and Global Attention.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets