Skip to content

Kirisaki00/DataAnalysis

Repository files navigation

📊 Data Analysis Repository

A collection of data analysis projects and practice notebooks using Python, Pandas, and Scikit-Learn.
This repository focuses on exploring datasets, handling missing values, and building a strong foundation in data preprocessing and analysis.


🧠 What This Repository Covers

  • Data cleaning and preprocessing
  • Handling missing values
  • Exploratory Data Analysis (EDA)
  • Working with real-world datasets
  • Basic machine learning using Scikit-Learn
  • Pandas operations and data manipulation

📂 Datasets Used

  • Airbnb NYC Dataset (AB_NYC_2019.csv)
  • Google Play Store Dataset
  • Custom datasets for practice

🛠️ Tools & Technologies

  • Python 🐍
  • Pandas
  • NumPy
  • Scikit-Learn
  • Jupyter Notebook

📂 Files Included

  • 📓 DataSetWalkthrough.ipynb – Dataset exploration
  • 📓 NullValueFileHandeling.ipynb – Handling missing values
  • 📓 Pandas.ipynb – Pandas operations
  • 📓 sklearn.ipynb – Basic ML with Scikit-Learn
  • 🐍 demo.py – Python script example
  • 📊 Datasets – Airbnb NYC, Google Play Store, and custom data

🎯 Purpose

This repository is created to:

  • Practice data analysis concepts
  • Build a strong foundation in data science
  • Work with real-world datasets
  • Improve problem-solving skills using Python

🚀 Future Improvements

  • Add data visualizations (Matplotlib, Seaborn)
  • Include advanced EDA projects
  • Build end-to-end ML pipelines
  • Add project-based case studies

✨ Author

Anupam Singh
Aspiring Data Analyst & Developer

About

A collection of data analysis notebooks exploring data cleaning, preprocessing, and machine learning using Python, Pandas, and Scikit-Learn.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors