Show simple item record

dc.contributor.authorLeenhouts, Roel J.
dc.contributor.authorMorgan, Nathan
dc.contributor.authorAl Ibrahim, Emad
dc.contributor.authorGreen, William H.
dc.contributor.authorVermeire, Florence H.
dc.date.accessioned2025-04-18T19:22:40Z
dc.date.available2025-04-18T19:22:40Z
dc.date.issued2025-04-13
dc.identifier.issn1385-8947
dc.identifier.urihttps://hdl.handle.net/1721.1/159176
dc.description.abstractSolvation free energy is an important design parameter in reaction kinetics and separation processes, making it a critical property to predict during process development. In previous research, directed message passing neural networks (D-MPNN) have successfully been used to predict solvation free energies and enthalpies in organic solvents. However, solvent mixtures provide greater flexibility for optimizing solvent interactions than monosolvents. This work aims to extend our previous models to mixtures. To handle mixtures in a permutation invariant manner we propose a pooling function; MolPool. With this pooling function, the machine learning models can learn and predict solvation energy and enthalpy for an arbitrary number of molecules in the mixed solvent. The novel SolProp-mix software that applies MolPool to D-MPNN was compared to state-of-the-art architectures for predicting mixture properties and validated with our new database of COSMOtherm calculations; BinarySolv-QM. To improve predictions towards experimental accuracy, the network was then fine-tuned on experimental data in monosolvents. To demonstrate the benefit of this transfer learning methodology, experimental datasets of solvation free energies in binary (BinarySolv-Exp) and ternary (TernarySolv-Exp) solvent mixtures were compiled from data on vapor–liquid equilibria and activity coefficients. The neural network performed comparable in accuracy to the benchmark of COSMOtherm calculations with an MAE of 0.29 kcal/mol and an RMSE of 0.45 kcal/mol for binary mixed solvents. Additionally, the ability to capture trends for a varying mixture composition was validated successfully. Our model’s ability to accurately predict mixture properties from the combination of in silico data and pure component experimental data is promising given the scarcity of experimental data for mixtures in many fields.en_US
dc.description.sponsorshipMachine Learning for Pharmaceutical Discovery and Synthesis Consortiumen_US
dc.publisherElsevier BVen_US
dc.relation.isversionof10.1016/j.cej.2025.162232en_US
dc.rightsCreative Commons Attribution-Noncommercial-ShareAlikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceAuthoren_US
dc.titlePooling solvent mixtures for solvation free energy predictionsen_US
dc.typeArticleen_US
dc.identifier.citationLeenhouts, Roel J., Morgan, Nathan, Al Ibrahim, Emad, Green, William H. and Vermeire, Florence H. 2025. "Pooling solvent mixtures for solvation free energy predictions." Chemical Engineering Journal.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.relation.journalChemical Engineering Journalen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.identifier.doi10.1016/j.cej.2025.162232
dspace.date.submission2025-04-16T18:47:27Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record