Skip to content

sachin02-hub/ExploratoryDataAnalysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“Š Advanced Exploratory Data Analysis (EDA) on Synthetic Datasets

Welcome to the repository for Advanced Exploratory Data Analysis (EDA) on synthetic datasets. This project dives deep into data visualization, distribution analysis, correlation checks, and outlier handling techniques, providing a comprehensive walkthrough of modern EDA techniques.

πŸ” Project Overview

This project uses a synthetic dataset to demonstrate:

Univariate and multivariate analysis

Visualization techniques using libraries like Matplotlib and Seaborn

Correlation and distribution exploration

Handling of missing data and outliers

Insights derivation for machine learning model readiness

The analysis is fully documented in a Jupyter Notebook for easy readability and reproducibility.

πŸ“ Files

Data.ipynb: Jupyter notebook containing all EDA steps, visualizations, and insights.

πŸ“– Blog Post

For a detailed walkthrough and insights behind the decisions made in this analysis, check out the blog post here: https://medium.com/@sachinsmanoj02/advanced-exploratory-data-analysis-eda-on-synthetic-datasets-d7c82bf78a14

πŸ‘‰ Advanced Exploratory Data Analysis (EDA) on Synthetic Datasets

πŸ›  Tools & Libraries Used Python

Pandas

NumPy

Seaborn

Matplotlib

Scikit-learn (for preprocessing)

About

Advanced EDA on Dataset to determine viability and various forms of Data wrangling to improve quality

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors