Skip to main content
Dryad

Training and test data for: Not getting in too deep: A practical deep learning approach to routine crystallisation image classification

Cite this dataset

Milne, Jamie et al. (2023). Training and test data for: Not getting in too deep: A practical deep learning approach to routine crystallisation image classification [Dataset]. Dryad. https://doi.org/10.5061/dryad.0k6djhb45

Abstract

These data were used to classify crystallisation experiments in Milne et al., (https://doi.org/10.1101/2022.09.28.509868). Here, four of the most widely-used convolutional deep-learning network architectures that can be implemented without the need for extensive computational resources were compared. It was shown that the classifiers have different strengths that can be combined to provide an ensemble classifier achieving a classification accuracy comparable to that obtained by a large consortium initiative (Bruno et al. PLOS one, 13(6), 2018). Eight classes were used to rank the experimental outcomes, thereby providing detailed information that can be used with routine crystallography experiments to automatically identify crystal formation for drug discovery and pave the way for further exploration of the relationship between crystal formation and crystallisation conditions.

Methods

The images in this dataset were collected at AstraZeneca UK using a Rock imager (Formulatrix) and cropped from the original 1028x960 pixels to 800x800 pixels.

Usage notes

The data files in this submission are in PNG format and are compressed as .zip files. The images for three independent test sets are compressed separately (Test1.zip, Test2.zip, Test3.zip) whilst the images used as the training set are compressed as separate classes (e.g.TrainingClear.zip contains training set images from the class 'Clear').

Funding

Engineering and Physical Sciences Research Council, Award: EP/V519807/1