Skip to content

ishaanntyagi/Medical-Image-Captioning

Repository files navigation

Medical Image Caption Generation using Generative Transformers

A full-stack AI-powered system for automated brain tumor prediction and caption generation from MRI scans. This project integrates Vision Transformers (ViT), Knowledge-Aware Networks (KAN), Grad-CAM, and a modern Flask + React interface to make deep learning results interpretable and interactive.


📸 Visual Preview

Step 1 Step 2 Step 3 Step 4 Step 5


🚀 Features

  • 🧠 Tumor prediction from MRI using ViT + KAN
  • 📝 Automatic generation of domain-specific medical captions
  • 🔍 Visual explainability via Grad-CAM
  • 🖼️ Real-time web interface (React + Tailwind CSS)
  • 🔗 Fully integrated backend using Flask and PyTorch

🛠️ Tech Stack

Layer Tools
Model PyTorch, Vision Transformer (ViT), KAN
Explainability Grad-CAM
Backend Flask, Torchvision, Pillow, Joblib
Frontend React, TypeScript, Tailwind CSS, Dropzone
Storage LocalStorage (frontend), .pth, .pkl

📂 Project Structure

Medical-Image-Captioning/
├── frontend/           # React app for UI
├── backend/            # Flask app with model + API
├── outputs/            # Screenshots (used in README)
├── model.pth           # Trained model
├── label_encoder.pkl   # Class mapping
├── app2.py             # Flask API script
├── README.md

🧪 To Use

npm install
npm run dev
python app2.py

👥 Project Team & Acknowledgments

Mentor
Dr. Kamakshi Rautela

📬 Contact

For questions, collaborations, or feedback, feel free to connect:


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors