Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/32199
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Meng, Fanwang | - |
dc.contributor.author | Chen, Jitian | - |
dc.contributor.author | Collins-Ramirez, Juan Samuel | - |
dc.contributor.author | Ayers, Paul W. | - |
dc.date.accessioned | 2025-08-21T20:34:15Z | - |
dc.date.available | 2025-08-21T20:34:15Z | - |
dc.date.issued | 2025-08-15 | - |
dc.identifier.uri | http://hdl.handle.net/11375/32199 | - |
dc.description.abstract | Developing accurate, computationally efficient, and reliable predictive models for small molecules' blood-brain barrier (BBB) permeability is challenging due to the class imbalance often found in collections of reference data. We use resampling techniques to address class imbalance and build 24 types of machine learning models, which we developed using comprehensive hyperparameter optimizations. We evaluated our model against those from previous studies, which provides insight into optimal classification models and resampling techniques that are relevant beyond BBB permeability. In addition to classifying unknown compounds on the basis of BBB permeability, the predicted probabilities are provided to facilitate further improvements and comparative benchmarking, and to report the models' confidence in their predictions. To disseminate our findings, we developed B3clf, a highly efficient, user-friendly tool that facilitates BBB permeability prediction, which can be accessed as open-source software https://github.com/theochem/B3clf or as a web app https://huggingface.co/spaces/QCDevs/b3clf. The newly curated external dataset for BBB is hosted at https://github.com/theochem/B3DB. | en_US |
dc.description.sponsorship | P.W.A. acknowledges Natural Sciences and Engineering Research Council of Canada (NSERC), the Canada Research Chairs, and the Digital Research Alliance of Canada for financial and computational support. F.M. acknowledges the Banting Postdoctoral Fellowship administered by Canada’s three research granting agencies: the Canadian Institutes of Health Research (CIHR), the Natural Sciences and Engineering Research Council of Canada (NSERC), and the Social Sciences and Humanities Research Council (SSHRC). F.M. also thanks the support of the Alliance’s DRI (Digital Research Infrastructure) EDIA (Equity, Diversity, Inclusion and Accessibility) Champions Pilot Program, Digital Research Alliance of Canada, for financial and computational support. F.M. also thanks the support of the Center for Advanced Computing (CAC) at Queen's University. J.C. acknowledges the Canada Graduate Scholarships Master’s (CGS-M) program for financial support. | en_US |
dc.language.iso | en | en_US |
dc.publisher | ChemRxiv | en_US |
dc.subject | blood-brain barrier permeability | en_US |
dc.subject | central nervous system (CNS) | en_US |
dc.subject | class imbalance | en_US |
dc.subject | drug discovery | en_US |
dc.subject | open-source | en_US |
dc.subject | machine learning | en_US |
dc.title | B3clf: A Resampling-Integrated Machine Learning Framework to Predict Blood-Brain Barrier Permeability | en_US |
dc.type | Preprint | en_US |
dc.contributor.department | Chemistry and Chemical Biology | en_US |
Appears in Collections: | Student Publications (Not Graduate Theses) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
BBB_Predictions_Manuscript_preprint_2025Aug15.pdf | main text | 4.11 MB | Adobe PDF | View/Open |
BBB_Predictions_Manuscript_preprint_2025Aug15 SI.pdf | Supporting Information | 5.77 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License