Skip to main content
Dryad

Data from: Predictive design of crystallographic chiral separation

Data files

Aug 27, 2025 version files 39.04 GB

Select up to 11 GB of files for download

Abstract

The efficient separation of chiral molecules is a fundamental challenge in the manufacture of pharmaceuticals and light-polarising materials. We developed an approach that combines machine learning with a physics-based representation to predict resolving agents for chiral molecules, using a transformer-based neural network. On historical data, our approach is 4-6 times more accurate than current practice. We further validate the model in a prospective experiment, where we use the model to design a resolution screen for six unseen racemates. We successfully resolved three of the six mixtures in a single round of experiments and obtained an overall 8-to-1 true positive to false negative ratio. Together with this study, we release a previously proprietary dataset of over 6,000 resolution experiments, the largest diastereomeric salt crystallisation dataset to date. More broadly, our approach and open crystallization data lay the foundation for accelerating and reducing the costs of chiral resolutions.