SMS Spam Detection App

A machine learning-powered web application that classifies SMS messages as spam or not using NLP techniques and the Multinomial Naive Bayes algorithm. This project includes full model training, evaluation, and a user-friendly Streamlit interface.

📂 Dataset

Source: Kaggle - UCI SMS Spam Collection Dataset
Description: A set of SMS labeled messages as spam or not.

⚙️ Features

Data cleaning and preprocessing
Exploratory Data Analysis (EDA)
Text tokenization using NLTK
Vectorization using TF-IDF
Model comparison using multiple classifiers
Final model: Multinomial Naive Bayes
Evaluation metrics: Accuracy, Precision, Confusion Matrix
Streamlit web app for user interaction

Preview of the app can be accessed from here

📁 Project Structure

📦 buzz-blocker/
├── app.py                  # Streamlit app
├── model.pkl               # Trained Naive Bayes model
├── vectorizer.pkl          # TF-IDF vectorizer
├── spam.csv                # Original dataset
├── spam_utf8.csv           # UTF-8 converted dataset
├── spam-detection.ipynb    # Training and EDA notebook
├── requirements.txt        # Python dependencies
├── LICENSE                 # MIT open-source license
└── README.md               # Contains basic info about the project

🧠 Model Insights

The dataset was vectorized using TF-IDF to capture term importance.
Multiple classifiers were tested (e.g. Logistic Regression, SVM).
Multinomial Naive Bayes gave the best results on precision and accuracy.
The model was saved as model.pkl and used directly in the app.

🛠 Tech Stack

Python, Pandas, Scikit-learn, NLTK
TF-IDF Vectorizer
Streamlit (for frontend)

📄 License

This project is licensed under the MIT License.

📝 Author

Kreesh Modi | IIT Kharagpur Mechanical Engineering

Email: [kreeshmodi2018@gmail.com]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SMS Spam Detection App

📂 Dataset

⚙️ Features

📁 Project Structure

🧠 Model Insights

🛠 Tech Stack

📄 License

📝 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.devcontainer		.devcontainer
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
model.pkl		model.pkl
requirements.txt		requirements.txt
spam-detection.ipynb		spam-detection.ipynb
spam.csv		spam.csv
spam_utf8.csv		spam_utf8.csv
vectorizer.pkl		vectorizer.pkl

Folders and files

Latest commit

History

Repository files navigation

SMS Spam Detection App

📂 Dataset

⚙️ Features

📁 Project Structure

🧠 Model Insights

🛠 Tech Stack

📄 License

📝 Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages