Medical-Insurance-Claim-Analysis

This project predicts medical insurance charges using machine learning. It involves a complete workflow starting from data cleaning and exploratory analysis to model training and explainability. Multiple regression models are compared, including Linear Regression, Ridge, Lasso, Decision Tree, and Random Forest.

Project Workflow

Data Loading & Cleaning
- Import dataset
- Handle missing values
- Outlier detection & treatment
Exploratory Data Analysis (EDA)
- Summary statistics
- Correlation analysis
- Distribution plots
Feature Engineering
- Encoding categorical variables
- Feature scaling

@ Test Linear regression Assumptions

Modeling The following regression models were implemented and compared:
- Linear Regression → baseline model
- Ridge Regression → handles multicollinearity with L2 regularization
- Lasso Regression → performs feature selection with L1 regularization
- Decision Tree → captures non-linear relationships
- Random Forest → ensemble method for better generalization
Model Evaluation
- Metrics: R², Adjusted R², RMSE, MAE
- Performance comparison across all models
Model Explainability
- Feature Importance (from tree-based models)
- SHAP values for global and local interpretability

Results

-Linear Regression: Provided a good baseline.

-Ridge & Lasso: Reduced overfitting, Lasso also performed feature selection.

-Decision Tree: Captured non-linear patterns but prone to overfitting.

-Random Forest: Best generalization among models.

-SHAP & Feature Importance: Helped explain which features (e.g., smoking status, BMI, age) contribute most to insurance charges.

📈 Future Work

Apply hyperparameter tuning for each model (GridSearchCV, RandomizedSearchCV).
Try advanced models like XGBoost, LightGBM, CatBoost.

*Deploy the model as a Flask/Streamlit web app for predictions.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
R2 VS AD R2.png		R2 VS AD R2.png
README.md		README.md
RMSE VS MAE.png		RMSE VS MAE.png
SHAP.png		SHAP.png
insurance.csv		insurance.csv
my_project (1).ipynb		my_project (1).ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Medical-Insurance-Claim-Analysis

Project Workflow

Results

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Medical-Insurance-Claim-Analysis

Project Workflow

Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages