
Dataset Information


A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction.


ABSTRACT: Zeolites are porous, aluminosilicate materials with many industrial and "green" applications. Despite their industrial relevance, many aspects of zeolite synthesis remain poorly understood, requiring costly trial-and-error synthesis. In this paper, we create natural language processing techniques and text markup parsing tools to automatically extract synthesis information and trends from zeolite journal articles. We further engineer a data set of germanium-containing zeolites to test the accuracy of the extracted data and to discover potential opportunities for zeolites containing germanium. We also create a regression model that predicts a zeolite's framework density from the synthesis conditions. This model has a cross-validated root mean squared error of 0.98 T/1000 Å³, and many of the model decision boundaries correspond to known synthesis heuristics in germanium-containing zeolites. We propose that this automatic data extraction can be applied to many different problems in zeolite synthesis and enable novel zeolite morphologies.
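For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how a cross-validated root mean squared error can be computed. It is not the authors' model or data: the one-split regression tree (`fit_stump`), the helper names, the single hypothetical feature `ge_fraction`, and the toy framework densities are all assumptions made up for illustration.

```python
import math
import random


def fit_stump(xs, ys):
    """Fit a one-split regression tree: a threshold and two leaf means."""
    best = None
    # Every distinct x except the largest is a candidate threshold,
    # so both sides of the split are always non-empty.
    for t in sorted(set(xs))[:-1]:
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        sse = sum((y - lm) ** 2 for y in left) + sum((y - rm) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    if best is None:  # all x identical: fall back to the overall mean
        m = sum(ys) / len(ys)
        return (xs[0], m, m)
    return best[1:]


def predict(stump, x):
    """Return the leaf mean on the side of the threshold where x falls."""
    t, lm, rm = stump
    return lm if x <= t else rm


def cv_rmse(xs, ys, k=5, seed=0):
    """k-fold cross-validated RMSE: each point is predicted by a model
    that was fitted without it, and the errors are pooled over all folds."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    sq_errors = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        stump = fit_stump([xs[i] for i in train], [ys[i] for i in train])
        sq_errors += [(predict(stump, xs[i]) - ys[i]) ** 2 for i in fold]
    return math.sqrt(sum(sq_errors) / len(sq_errors))


# Toy data, invented for this sketch (NOT taken from the paper):
# a hypothetical germanium fraction and a framework density in T/1000 Å³.
ge_fraction = [i / 19 for i in range(20)]
density = [18.0 if x < 0.5 else 14.0 for x in ge_fraction]

stump = fit_stump(ge_fraction, density)
rmse = cv_rmse(ge_fraction, density)
print("split threshold:", stump[0])
print("cross-validated RMSE:", rmse)
```

The fitted threshold plays the role of a "decision boundary" in the abstract's sense: it is a value of a synthesis variable at which the predicted framework density changes.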

SUBMITTER: Jensen Z 

PROVIDER: S-EPMC6535764 | biostudies-literature | 2019 May

REPOSITORIES: biostudies-literature


Publications

A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction.

Zach Jensen, Edward Kim, Soonhyoung Kwon, Terry Z. H. Gani, Yuriy Román-Leshkov, Manuel Moliner, Avelino Corma, Elsa Olivetti

ACS Central Science, 2019-04-19, 5



Similar Datasets

| S-EPMC9310626 | biostudies-literature
| S-EPMC7731361 | biostudies-literature
| S-EPMC6397530 | biostudies-literature
| S-EPMC6061985 | biostudies-literature
| S-EPMC8725656 | biostudies-literature
2024-05-17 | GSE267438 | GEO
2022-08-14 | GSE184943 | GEO
| S-EPMC3704944 | biostudies-literature