Skip to content

miladtorabi65/Bioinformatics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

31 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿงฌ Putative Disease Gene Identification & Drug Repurposing for Steatohepatitis


๐Ÿ‘ฅ Authors: Milad Torabi, Possenti Francesca
๐Ÿ”น Sapienza University, Bioinformatics and Network Medicine
๐Ÿ“… January 16, 2025


๐Ÿ“ Overview:
Non-alcoholic steatohepatitis (NASH) is a complex liver disorder characterized by fat accumulation, inflammation, and hepatocellular injury. As the prevalence of NASH rises globally, there is an urgent need to identify putative disease genes and explore potential drug repurposing strategies. This project utilizes network science methods and bioinformatics algorithms to analyze NASH-related genes and repurpose existing drugs as potential therapeutic candidates.


๐Ÿš€ Our goal: Identify putative disease genes & explore drug repurposing using network-based algorithms.

๐ŸŽฏ Project Objectives
๐Ÿ”ฌ Disease Gene Identification:

  • Identify putative disease-associated genes using PPI network analysis.
  • Utilize network-based algorithms (DIAMOnD, DIAble, Diffusion) to identify key genes related to NASH.
  • Integrate multi-omics data (proteomics and genomics) to explore disease mechanisms.
  • Implement drug repurposing strategies by screening approved drugs against the identified genes.
  • Provide insights into novel therapeutic avenues for NASH treatment.

๐Ÿ’Š Drug Repurposing Approach:

  • Screen FDA-approved drugs
  • Identify potential therapeutic candidates
  • Accelerate the drug discovery process

๐Ÿ“š Project Structure

๐Ÿ“‚ Project Root
โ”œโ”€โ”€ ๐Ÿ“˜ Report.pdf
โ”œโ”€โ”€ ๐Ÿ“„ README.md (This File)
โ”œโ”€โ”€ ๐Ÿ“’ Results Folder
โ””โ”€โ”€ ๐Ÿ“’ Main - Project.ipynb (Jupyter Notebook)


๐Ÿ›  Methodology & Workflow:

๐Ÿ“ฅ Step 1: Data Collection

  • Protein-Protein Interaction (PPI) Network: Retrieved from BioGRID database.
  • Gene-Disease Associations (GDAs): Extracted from DisGeNET (Disease Code: C2711227).

๐Ÿ•ธ Step 2: Network Construction & Analysis

  • Built PPI network from BioGRID interactome.
  • Filtered data to include only human physical interactions.
  • Identified the largest connected component (LCC) within the network.

๐Ÿงช Step 3: Disease Gene Identification

  • Applied DIAMOnD, DIAble, and Diffusion algorithms to identify putative disease genes.
  • Prioritized genes based on their connectivity and biological relevance.

๐Ÿ’Š Step 4: Drug Repurposing

  • Mapped identified genes to known drug-target interactions.
  • Screened FDA-approved drugs for potential efficacy against NASH-related pathways.

๐Ÿ— Tech Stack & Tools ๐Ÿ–ฅ Programming Language: Python
๐Ÿ“š Libraries: pandas, numpy, networkx, matplotlib, seaborn
๐Ÿ—„ Databases: BioGRID and DisGeNET databases for gene and interaction data
๐Ÿ“Š Algorithms: DIAMOnD, DIAble, Diffusion(Graph-based algorithms for network analysis)

๐Ÿ“Š Results & Insights
โœ” Identified key genes potentially involved in NASH pathogenesis.
โœ” Mapped existing drugs for repurposing
โœ” Contributed to advancing bioinformatics-driven approaches for liver disease research

๐Ÿ” Potential Impact:
๐Ÿš€ Accelerates drug discovery
๐Ÿฉบ Provides therapeutic targets
๐Ÿ”ฌ Advances precision medicine for NASH

๐Ÿ”ฎ Future Work

  • Expand datasets with multi-omics data
  • Validate findings via experimental studies
  • Incorporate deep learning for better gene-disease association predictions

About

Bioinformatics and Network Medicine Course at Sapienza University

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors