What is the IMPROVE project?
IMPROVE aims to establish methodology to systematically and rigorously compare supervised learning models. To this end, IMPROVE has four components:
IMPROVE Python Package (improvelib): Python tools and protocols for standardizing model code structure, facilitating modular code allowing to design complex workflows
IMPROVE Workflows Evaluation Schemes: distributed and modular workflows for large-scale model evaluation and downstream model comparison
Benchmark Datasets: standardized datasets (for each scientific application), enabling consistent and rigorous model benchmarking
Community Curated Models: A diverse collection of models from the scientific community (a collocation per application), curated within the IMPROVE framework and organized by application, providing a baseline for comparison
IMPROVE can be applied to a variety of supervised learning models. We have begun with the Drug Response Prediction problem, standardizing pan-cancer, pan-drug community models that predict monotherapy drug response with a standardized benchmark dataset.
For more information
Access the code base https://github.com/JDACS4C-IMPROVE/
References
1. A. Partin et al. “Deep learning methods for drug response prediction in cancer: Predominant and emerging trends”, Frontiers in Medicine, Section Prediction Oncology, 2023
JC. Overbeek and A. Partin et al. “Assessing reusability of deep learning-based monotherapy drug response prediction models trained with omics data”, arXiv, 2024