TY - JOUR
T1 - Group Contribution-Based Graph Convolution Network
T2 - Pure Property Estimation Model
AU - Hwang, Sun Yoo
AU - Kang, Jeong Won
N1 - Funding Information:
This work was supported by National Research Foundation of Korea (NRF) grants funded by the Korean government (MSIT) [grant numbers NRF- 2021R1A5A6002853 and NRF-2019M3E6A1064876]
Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2022/9
Y1 - 2022/9
N2 - Properties data for chemical compounds are essential information for the design and operation of chemical processes. Experimental values are reported in the literature, but that are too scarce compared with exploding demand for data. When the data are not available, various estimation methods are employed. The group contribution method is one of the standards and simple techniques used today. However, these methods have inherent inaccuracy due to the simplified representation of the molecular structure. More advanced methods are emerging, including improved molecular representations and handling experimental data. However, such processes also suffer from a lack of valid data for adjusting many parameters. We suggest a compromise between a complex machine learning algorithm and a linear group contribution method in this contribution. Instead of representing a molecule using a graph of atoms, we employed bulkier blocks—a graph of functional groups. The new approach dramatically reduced the number of adjustable parameters for machine learning. The result shows higher accuracy than the conventional methods. The whole process was also examined in various aspects—incorporating uncertainties in the data, the robustness of the fitting process, and detecting outlier data. Graphical Abstract: [Figure not available: see fulltext.].
AB - Properties data for chemical compounds are essential information for the design and operation of chemical processes. Experimental values are reported in the literature, but that are too scarce compared with exploding demand for data. When the data are not available, various estimation methods are employed. The group contribution method is one of the standards and simple techniques used today. However, these methods have inherent inaccuracy due to the simplified representation of the molecular structure. More advanced methods are emerging, including improved molecular representations and handling experimental data. However, such processes also suffer from a lack of valid data for adjusting many parameters. We suggest a compromise between a complex machine learning algorithm and a linear group contribution method in this contribution. Instead of representing a molecule using a graph of atoms, we employed bulkier blocks—a graph of functional groups. The new approach dramatically reduced the number of adjustable parameters for machine learning. The result shows higher accuracy than the conventional methods. The whole process was also examined in various aspects—incorporating uncertainties in the data, the robustness of the fitting process, and detecting outlier data. Graphical Abstract: [Figure not available: see fulltext.].
KW - Artificial neural network
KW - Graph convolution networks
KW - Group contribution method
KW - Machine learning
KW - Thermodynamic property estimation
UR - http://www.scopus.com/inward/record.url?scp=85134247294&partnerID=8YFLogxK
U2 - 10.1007/s10765-022-03060-7
DO - 10.1007/s10765-022-03060-7
M3 - Article
AN - SCOPUS:85134247294
SN - 0195-928X
VL - 43
JO - International Journal of Thermophysics
JF - International Journal of Thermophysics
IS - 9
M1 - 136
ER -