Skip to content
View sachin02-hub's full-sized avatar

Block or report sachin02-hub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sachin02-hub/README.md

πŸ‘‹ Hi, I'm Sachin S M

Welcome to my GitHub! I’m an Entry-Level Data Scientist with industry experience at Rolls-Royce Power Systems, passionate about turning data into actionable insights through machine learning, statistical analysis, and scalable data pipelines.

My work spans exploratory data analysis, predictive modeling, time-series analytics, and applied AI systems deployed in real-world environments.


πŸ” Data Science Focus

  • πŸ“Š Exploratory Data Analysis & Data Wrangling β€” cleaning, feature engineering, insight discovery
  • πŸ“ˆ Predictive Modeling β€” classification, regression, ensemble methods
  • ⏱️ Time-Series Analysis β€” segmentation, anomaly detection, forecasting
  • πŸ“„ Unstructured Data Analytics β€” Retrieval-Augmented Generation (RAG), document intelligence

🏭 Industry Experience

πŸš€ Rolls-Royce Power Systems β€” Data Science Intern

  • Built and deployed machine learning pipelines for time-series classification and thermal prediction on real-world sensor data.
  • Optimized deep learning models for production, reducing model size from ~29 GB to <1 GB with minimal accuracy loss.
  • Accelerated end-to-end inference pipelines using PyTorch GPU acceleration, reducing processing time by ~95%.
  • Developed physics-informed ML models to estimate engine node temperatures with Β±5–6Β°C accuracy, reducing reliance on costly simulations.
  • Collaborated with cross-functional engineering teams and stakeholders to validate model outputs against domain constraints.

πŸ“Œ Featured Data Science Projects

πŸ“‰ Customer Churn Prediction

  • Built and evaluated ensemble ML models achieving 81% accuracy and 87% F1-score.
  • Performed feature engineering and model comparison to identify key churn drivers and derive actionable insights.

πŸ“Š Advanced Exploratory Data Analysis

  • Conducted EDA including data imputation, outlier detection, correlation analysis, and feature distribution analysis.
  • Delivered clean, ML-ready datasets for downstream modeling.

πŸ“„ Retrieval-Augmented Generation (RAG) System

  • Implemented a RAG pipeline using FAISS and transformer models for semantic retrieval and context-aware answer generation over large PDF documents.

🌦️ Weather Prediction (Model Comparison)

  • Compared KNN, Logistic Regression, and Decision Tree models to analyze performance trade-offs in weather prediction tasks.

πŸ› οΈ Tech Stack

Languages & Data:
Python Β· SQL Β· PostgreSQL Β· Excel

Data Science & ML:
Pandas Β· NumPy Β· Scikit-learn Β· Matplotlib Β· PyTorch Β· TensorFlow

Tools & Platforms:
Git Β· Jupyter Β· AWS (Learning)


πŸ“š Research & Intellectual Property

  • Indian Patent (Published): AI-Driven Multi-Factor Authentication Using Hardware Fingerprints
  • IEEE Access: Robust Authentication Using Hardware Fingerprints and AI
  • Springer: Residual Network Depth vs Accuracy for Crop Disease Classification
  • Elsevier (Under Review): Autoencoder-Based Mixed Pixel Correction in Thermal Images

πŸ“« Connect With Me

Thanks for visiting! Always open to discussions on data, machine learning, and real-world problem solving πŸš€

Pinned Loading

  1. ExploratoryDataAnalysis ExploratoryDataAnalysis Public

    Advanced EDA on Dataset to determine viability and various forms of Data wrangling to improve quality

    Jupyter Notebook 1

  2. A-Retrieval-Augmented-Generation-RAG-System-for-Document-Insights A-Retrieval-Augmented-Generation-RAG-System-for-Document-Insights Public

    Jupyter Notebook 1

  3. Customer-Churn-Prediction Customer-Churn-Prediction Public

    Predict customer churn using machine learning models. This project includes data preprocessing, model training, evaluation, and comparison to identify customers likely to leave, helping businesses …

    Jupyter Notebook 1

  4. Weather Weather Public

    Comparison of three distinct algorithms in Machine Learning for Weather Prediction

    Jupyter Notebook 1

  5. CropDiseaseDetection CropDiseaseDetection Public

    Resnet Based Classification on Plant Village Dataset

    Jupyter Notebook 1

  6. Biometric-Device-Classification Biometric-Device-Classification Public

    Jupyter Notebook