Prediction of mefenamic acid crystal shape by random forest classification

Nakapraves, Siya and Warzecha, Monika and Mustoe, Chantal L. and Srirambhatla, Vijay (2022) Prediction of mefenamic acid crystal shape by random forest classification. Pharmaceutical Research, 39 (12). pp. 3099-3111. ISSN 0724-8741 (https://doi.org/10.1007/s11095-022-03450-4)

[thumbnail of Nakapraves-etal-PR-2022-Prediction-of-mefenamic-acid-crystal-shape]
Preview
Text. Filename: Nakapraves_etal_PR_2022_Prediction_of_mefenamic_acid_crystal_shape.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (1MB)| Preview

Abstract

Objective Particle shape can have a significant impact on the bulk properties of materials. This study describes the development and application of machine-learning models to predict the crystal shape of mefenamic acid recrystallized from organic solvents. Methods Crystals were grown in 30 different solvents to establish a dataset comprising solvent molecular descriptors, process conditions and crystal shape. Random forest classification models were trained on this data and assessed for prediction accuracy. Results The highest prediction accuracy of crystal shape was 93.5% assessed by fourfold cross-validation. When solvents were sequentially excluded from the training data, 32 out of 84 models predicted the shape of mefenamic acid crystals for the excluded solvent with 100% accuracy and a further 21 models had prediction accuracies from 50-100%. Reducing the feature set to only solvent physical property descriptors and supersaturations resulted in higher overall prediction accuracies than the models trained using all available or another selected subset of molecular descriptors. For the 8 solvents on which the models performed poorly (ConclusionsRandom forest classification models using solvent physical property descriptors can reliably predict crystal morphologies for mefenamic acid crystals grown in 20 out of the 28 solvents included in this work. Poor prediction accuracies for the remaining 8 solvents indicate that further factors will be required in the feature set to provide a more generalized predictive morphology model.

ORCID iDs

Nakapraves, Siya, Warzecha, Monika ORCID logoORCID: https://orcid.org/0000-0001-6166-1089, Mustoe, Chantal L. and Srirambhatla, Vijay ORCID logoORCID: https://orcid.org/0000-0002-4492-7567; Florence, Alastair J.