PerturbBench

Welcome to the PerturbBench project of the BioHackathon Europe 2024!

Overview

The Perturb-Bench Project focuses on assessing the robustness of single-cell perturbation modelling tools by developing a unified benchmarking pipeline that enables fair comparison. Many publications report only on metrics that favour their own tools. The Perturb-Bench project is a collective effort to provide an objective and systematic comparison of these tools across a comprehensive set of metrics, offering a balanced overview of each tool’s capabilities.

To begin, we aim to:

Assess Extrapolation Accuracy: Evaluate how closely the predictions of generative AI (GAI) tools designed to extrapolate unseen events align with ground truth data.
Evaluate Digital Knockout Performance: Investigate the performance of Gene Regulatory Network inference (GRNs) tools conducting digital knockouts by comparing their results to experimental data, such as CRISPR screening outcomes.

Objectives

Benchmarking Robustness: Establish standardized benchmarks to measure methods robustness across diverse metrics, system distributions and datasets.
Tool Development: Create a Nextflow framework to facilitate the testing and evaluation systematically.
Community Collaboration: Engage with the ELIXIR research community to combine multidisciplinary, share findings, methodologies, and best practices.

Schemas

These schemas formalize the methods flows through which Perturb-Bench will effectively compare GAIs and GRNs, ensuring consistent formats for data and results. For instance, common file formats like AnnData objects for sc-expression data/metadata will be required to be fed into metric functions across different scenarios.

Schema 1: GAI (Perturbation Task: Extrapolation to Unseen Events)

Workflow:

Load the dataset.
Pre-process the data (clarify specific pre-processing steps).
Train and test the model:
- Activate the model instance.
- Perform hyperparameter tuning.
- Train the model.
Generate predictions for control and stimulated scenarios (output as AnnData objects).

Outputs:

R² Score: Measures the closeness of predictions to stimulation data distributions.
Distance Metrics: Includes Euclidean distance, E distance, Maximum Mean Discrepancy, etc.

Schema 2: GRN (Perturbation Task: Digital Knockout)

Workflow:

Load the dataset.
Pre-process the data (clarify specific pre-processing steps).
Reconstruct the Gene Regulatory Network (GRN).
Define the target for simulation.
Optionally, specify the cell type for simulation.

Outputs:

KO-Responsive Genes: A list of genes responsive to knockouts.
Validation Metric:
- Compare results against iLINCS ground truth using Jaccard Similarity.
- Optionally, validate with other experimental perturbation datasets.

If you are a participant, please check our guide and do not hesitate to get in contact with us if needed: https://docs.google.com/document/d/1Kp7-LJOEpZaBdUOM_cBLXfu7nC7mcRi0XrMTsDgN6Go/edit?usp=sharin

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
.github		.github
biological_validation		biological_validation
metrics		metrics
nextflow_pipeline		nextflow_pipeline
tools		tools
LICENSE		LICENSE
Loading_datasets_Perturb_Bench.ipynb		Loading_datasets_Perturb_Bench.ipynb
README.md		README.md
TEST_PUSH.md		TEST_PUSH.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PerturbBench

Welcome to the PerturbBench project of the BioHackathon Europe 2024!

Overview

Objectives

Schemas

Schema 1: GAI (Perturbation Task: Extrapolation to Unseen Events)

Schema 2: GRN (Perturbation Task: Digital Knockout)

If you are a participant, please check our guide and do not hesitate to get in contact with us if needed: https://docs.google.com/document/d/1Kp7-LJOEpZaBdUOM_cBLXfu7nC7mcRi0XrMTsDgN6Go/edit?usp=sharin

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PerturbBench

Welcome to the PerturbBench project of the BioHackathon Europe 2024!

Overview

Objectives

Schemas

Schema 1: GAI (Perturbation Task: Extrapolation to Unseen Events)

Schema 2: GRN (Perturbation Task: Digital Knockout)

If you are a participant, please check our guide and do not hesitate to get in contact with us if needed: https://docs.google.com/document/d/1Kp7-LJOEpZaBdUOM_cBLXfu7nC7mcRi0XrMTsDgN6Go/edit?usp=sharin

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages