Skip to content
View Parul-Kudtarkar's full-sized avatar
:octocat:
:octocat:

Organizations

@PanKbase

Block or report Parul-Kudtarkar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Parul-Kudtarkar/README.md

PARUL KUDTARKAR

Bioinformatics & AI/ML Lead


Portfolio | Blog | Scholar

What I Do

I design and deliver production-scale genomic data programs, building the infrastructure that turns raw biological data into clinical insight at consortium scale.

I own the full stack: pipeline architecture, model selection, optimization, evaluation and biological interpretation. Not just the code, the system that makes the science reproducible, scalable, and useful to partners who discover the medicine.


Flagship Programs

Common Metabolic Disease Genome Atlas · cmdga.org

$57M FNIH AMP CMD Initiative · Amgen · Eli Lilly · Novo Nordisk · Pfizer

Single-threaded technical owner and principal architect. Built end-to-end data delivery, partner onboarding and enablement infrastructure for T1D and T2D genomic research; executed single-cell ATAC-seq analyses identifying disease-associated chromatin accessibility patterns at consortium scale.

PanKbase · data.pankbase.org

$10M NIH-Funded Diabetes Research Ecosystem

Architected a comprehensive genomic + phenotypic data platform with standardized pipelines, automated QC frameworks and documentation enabling cohort-scale analysis for research partners worldwide.

PerseusAI

Agentic Research Platform for Multi-Omics Discovery

Integrates knowledge graphs, vector embeddings, RAG pipelines and multi-agent LLM frameworks to synthesize multi-omics data for biomarker discovery and therapeutic target identification.


Currently Shipping

Project What Status
alphagenome-explorer Coverage tool for 714 human + 179 mouse biosamples before variant analysis 🟢 Live
tert-alphageome TERT promoter mutation analysis — chromatin, TF binding & transcription from sequence alone 🟢 Live
PanKbase Data Library Python backend and Next.js front-end for consortium data access 🟢 Active


Technical Stack

Bioinformatics & Genomics

NGS Pipelines · Single-Cell Multi-Omics (Seurat, Signac) · Spatial Omics (Scanpy, Squidpy) · ATAC-seq · RNA-seq · Variant Calling · Chromatin Accessibility Profiling · Foundation Models: AlphaGenome · Evo2 · Regulatory Network Analysis · Disease Biomarker Discovery · Target Identification

AI & Machine Learning

TensorFlow · RAG (Retrieval-Augmented Generation) · LangChain · Multi-Agent LLM Systems · Knowledge Graphs · Prompt Engineering

Cloud & DevOps

Python · AWS · Nextflow · Docker · Elasticsearch · ReactJS · TypeScript · Next.js · API Development · CI/CD


Publications

Full list on Google Scholar · 15+ publications


Talks & Service

  • FNIH AMP-CMD Leadership Meetings — Invited speaker, 2021–2025
  • Intel ISEF and Chen Institute Symposium for AI Accelerated Science — Conference Jury
  • Editorial BoardDatabase: The Journal of Biological Databases and Curation
  • Peer Reviewer — AJHG · PLOS Computational Biology · Diabetes · BMC Bioinformatics · Bioinformatics · J. Endocrinology

GitHub Stats Top Languages

Pinned Loading

  1. T2DREAM/t2dream-portal T2DREAM/t2dream-portal Public

    The Common Metabolic Disease Genome Atlas (CMDGA) resource is a component of AMP-CMD being developed at the University of California San Diego as part of a larger consortium of academic, industry a…

    Python 4 8

  2. Gaulton-Lab/HPAP-snATAC-seq Gaulton-Lab/HPAP-snATAC-seq Public

    Jupyter Notebook 1 1

  3. PanKbase/igvfd PanKbase/igvfd Public

    Metadata database for PanKbase project

    Python

  4. PanKbase/igvf-ui PanKbase/igvf-ui Public

    The Next.js-based UI portion of PanKbase that works with igvfd

    JavaScript

  5. tert-alphageome tert-alphageome Public

    Jupyter Notebook 1

  6. PanKbase/PanKbase-data-library-scripts PanKbase/PanKbase-data-library-scripts Public

    Jupyter Notebook