Welcome to the upgraded MacSphere! We're putting the finishing touches on it; if you notice anything amiss, email macsphere@mcmaster.ca

B3clf: A Resampling-Integrated Machine Learning Framework to Predict Blood-Brain Barrier Permeability

dc.contributor.authorMeng, Fanwang
dc.contributor.authorChen, Jitian
dc.contributor.authorCollins-Ramirez, Juan Samuel
dc.contributor.authorAyers, Paul W.
dc.contributor.departmentChemistry and Chemical Biologyen_US
dc.date.accessioned2025-08-21T20:34:15Z
dc.date.available2025-08-21T20:34:15Z
dc.date.issued2025-08-15
dc.description.abstractDeveloping accurate, computationally efficient, and reliable predictive models for small molecules' blood-brain barrier (BBB) permeability is challenging due to the class imbalance often found in collections of reference data. We use resampling techniques to address class imbalance and build 24 types of machine learning models, which we developed using comprehensive hyperparameter optimizations. We evaluated our model against those from previous studies, which provides insight into optimal classification models and resampling techniques that are relevant beyond BBB permeability. In addition to classifying unknown compounds on the basis of BBB permeability, the predicted probabilities are provided to facilitate further improvements and comparative benchmarking, and to report the models' confidence in their predictions. To disseminate our findings, we developed B3clf, a highly efficient, user-friendly tool that facilitates BBB permeability prediction, which can be accessed as open-source software https://github.com/theochem/B3clf or as a web app https://huggingface.co/spaces/QCDevs/b3clf. The newly curated external dataset for BBB is hosted at https://github.com/theochem/B3DB.en_US
dc.description.sponsorshipP.W.A. acknowledges Natural Sciences and Engineering Research Council of Canada (NSERC), the Canada Research Chairs, and the Digital Research Alliance of Canada for financial and computational support. F.M. acknowledges the Banting Postdoctoral Fellowship administered by Canada’s three research granting agencies: the Canadian Institutes of Health Research (CIHR), the Natural Sciences and Engineering Research Council of Canada (NSERC), and the Social Sciences and Humanities Research Council (SSHRC). F.M. also thanks the support of the Alliance’s DRI (Digital Research Infrastructure) EDIA (Equity, Diversity, Inclusion and Accessibility) Champions Pilot Program, Digital Research Alliance of Canada, for financial and computational support. F.M. also thanks the support of the Center for Advanced Computing (CAC) at Queen's University. J.C. acknowledges the Canada Graduate Scholarships Master’s (CGS-M) program for financial support.en_US
dc.identifier.urihttp://hdl.handle.net/11375/32199
dc.language.isoenen_US
dc.publisherChemRxiven_US
dc.subjectblood-brain barrier permeabilityen_US
dc.subjectcentral nervous system (CNS)en_US
dc.subjectclass imbalanceen_US
dc.subjectdrug discoveryen_US
dc.subjectopen-sourceen_US
dc.subjectmachine learningen_US
dc.titleB3clf: A Resampling-Integrated Machine Learning Framework to Predict Blood-Brain Barrier Permeabilityen_US
dc.typePreprinten_US

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
BBB_Predictions_Manuscript_preprint_2025Aug15.pdf
Size:
4.02 MB
Format:
Adobe Portable Document Format
Description:
main text
Loading...
Thumbnail Image
Name:
BBB_Predictions_Manuscript_preprint_2025Aug15 SI.pdf
Size:
5.63 MB
Format:
Adobe Portable Document Format
Description:
Supporting Information

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: