Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/32199
Title: | B3clf: A Resampling-Integrated Machine Learning Framework to Predict Blood-Brain Barrier Permeability |
Authors: | Meng, Fanwang Chen, Jitian Collins-Ramirez, Juan Samuel Ayers, Paul W. |
Department: | Chemistry and Chemical Biology |
Keywords: | blood-brain barrier permeability;central nervous system (CNS);class imbalance;drug discovery;open-source;machine learning |
Publication Date: | 15-Aug-2025 |
Publisher: | ChemRxiv |
Abstract: | Developing accurate, computationally efficient, and reliable predictive models for small molecules' blood-brain barrier (BBB) permeability is challenging due to the class imbalance often found in collections of reference data. We use resampling techniques to address class imbalance and build 24 types of machine learning models, which we developed using comprehensive hyperparameter optimizations. We evaluated our model against those from previous studies, which provides insight into optimal classification models and resampling techniques that are relevant beyond BBB permeability. In addition to classifying unknown compounds on the basis of BBB permeability, the predicted probabilities are provided to facilitate further improvements and comparative benchmarking, and to report the models' confidence in their predictions. To disseminate our findings, we developed B3clf, a highly efficient, user-friendly tool that facilitates BBB permeability prediction, which can be accessed as open-source software https://github.com/theochem/B3clf or as a web app https://huggingface.co/spaces/QCDevs/b3clf. The newly curated external dataset for BBB is hosted at https://github.com/theochem/B3DB. |
URI: | http://hdl.handle.net/11375/32199 |
Appears in Collections: | Student Publications (Not Graduate Theses) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
BBB_Predictions_Manuscript_preprint_2025Aug15.pdf | main text | 4.11 MB | Adobe PDF | View/Open |
BBB_Predictions_Manuscript_preprint_2025Aug15 SI.pdf | Supporting Information | 5.77 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License