Training and test data for: Not getting in too deep: A practical deep learning approach to routine crystallisation image classification
Cite this dataset
Milne, Jamie et al. (2023). Training and test data for: Not getting in too deep: A practical deep learning approach to routine crystallisation image classification [Dataset]. Dryad. https://doi.org/10.5061/dryad.0k6djhb45
Abstract
These data were used to classify crystallisation experiments in Milne et al. (https://doi.org/10.1101/2022.09.28.509868). That study compared four of the most widely used convolutional deep-learning network architectures that can be implemented without extensive computational resources. The classifiers were shown to have different strengths, which can be combined in an ensemble classifier whose classification accuracy is comparable to that obtained by a large consortium initiative (Bruno et al., PLOS ONE, 13(6), 2018). Eight classes were used to rank the experimental outcomes. This gives detailed information that can be used in routine crystallography experiments to identify crystal formation automatically for drug discovery, and paves the way for further exploration of the relationship between crystal formation and crystallisation conditions.
Methods
The images in this dataset were collected at AstraZeneca UK using a Rock Imager (Formulatrix) and were cropped from the original 1028x960 pixels to 800x800 pixels.
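The crop arithmetic can be sketched as follows. This record does not say which region of the original frame was kept, so the centred crop used here is an assumption, and the function name center_crop_box is hypothetical. The returned tuple follows the (left, upper, right, lower) box convention used by common imaging libraries such as Pillow.

```python
def center_crop_box(width: int, height: int, size: int) -> tuple[int, int, int, int]:
    """Return a (left, upper, right, lower) box for a centred square crop.

    Assumption: the crop is centred. The dataset description only gives
    the original and final dimensions, not the crop position.
    """
    if size > width or size > height:
        raise ValueError("crop size must fit inside the original image")
    left = (width - size) // 2
    upper = (height - size) // 2
    return (left, upper, left + size, upper + size)


# Dimensions taken from the Methods text: 1028x960 cropped to 800x800.
print(center_crop_box(1028, 960, 800))  # (114, 80, 914, 880)
```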
Usage notes
The data files in this submission are in PNG format and are compressed as .zip files. The images for the three independent test sets are compressed separately (Test1.zip, Test2.zip, Test3.zip), whilst the training set images are compressed as one archive per class (e.g. TrainingClear.zip contains the training set images from the class 'Clear').
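A minimal sketch of indexing the training archives by class, using only the Python standard library. It assumes every training archive follows the Training<Class>.zip pattern shown in the example above and that the archives sit together in one download directory. The helper names (class_from_archive, list_png_members, index_training_archives) are hypothetical, and the internal layout of each archive is not documented here, so the code simply lists every PNG member it finds.

```python
import zipfile
from pathlib import Path

# Assumption: all training archives are named "Training<Class>.zip",
# as in the TrainingClear.zip example from the usage notes.
TRAINING_PREFIX = "Training"


def class_from_archive(archive_name: str) -> str:
    """Extract the class label from an archive name, e.g. 'TrainingClear.zip' -> 'Clear'."""
    stem = Path(archive_name).stem
    if not stem.startswith(TRAINING_PREFIX) or len(stem) == len(TRAINING_PREFIX):
        raise ValueError(f"not a training archive name: {archive_name}")
    return stem[len(TRAINING_PREFIX):]


def list_png_members(archive_path: Path) -> list[str]:
    """Return the sorted names of all PNG files inside a .zip archive."""
    with zipfile.ZipFile(archive_path) as zf:
        return sorted(n for n in zf.namelist() if n.lower().endswith(".png"))


def index_training_archives(download_dir: str) -> dict[str, list[str]]:
    """Map each class label to the PNG member names found in its archive."""
    index = {}
    for archive in sorted(Path(download_dir).glob(f"{TRAINING_PREFIX}*.zip")):
        index[class_from_archive(archive.name)] = list_png_members(archive)
    return index
```

Calling index_training_archives on the download directory returns a dictionary keyed by class label. The test archives (Test1.zip, Test2.zip, Test3.zip) do not match the Training prefix and are skipped; they can be read with list_png_members directly.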
Funding
Engineering and Physical Sciences Research Council, Award: EP/V519807/1