Anime-Recommendations-Project

Introduction

I was an avid anime fan and this lead me to pursue this project of creating a recommendation system for animes a user may enjoy based on their enjoyment of similar animes and similar people. This made me try to create both a content based filtering system and a collaborative filtering system to recommend animes to the user. What I would like to do is try combine both systems into one more effective system which eventually could be deployed as a website. The repo containing the deployment website files can be found here.

What I learnt:

How to replace missing or sub-optimal values in a dataset
Creating pairplot using seaborn to compare numerical data and distributions
Use pandas and collections' counter to analyse frequency of values of multiple elements in strings
Create content based recommendation system using the genres of a show and a user's ratings of watched shows
Create a collaborative filtering recommendation system using scikit surprise and SVD

Included Files:

Notebooks

Data Cleaning.ipynb (Notebook for cleaning the anime and ratings datasets that will be used)
Exploratory Data Analysis.ipynb (Exploring the data and trying to extract insights and patterns before building the recommendation system)
Content Based Filtering.ipynb (Notebook containing the code for the content based recommender)
Collaborative Filtering.ipynb (Notebook containing the code for the collaborative filtering recommender)

Datasets

anime.csv (Original anime dataset)
rating.csv (Original user and rating data)
cleaned_anime.csv (The cleaned and processed anime dataset after data cleaning)
cleaned_rating.csv (The cleaned and processed rating dataset after data cleaning)

Resources and Datasets Used

Python Version: 3.8

Packages:

pandas 1.13
numpy 1.19.2
matplotlib 3.3.2
seaborn 0.11.0
scikit surprise 1.1.1

Anime and Rating Datasets

For this project, I originally used an anime dataset by CooperUnion for data cleaning, data exploration, and model building. However, as I was developing the deployment website, I though maybe I should use a more up to data dataset. This led me to use Marlesson's data which will be use for deployment purposes.

Original dataset: https://www.kaggle.com/CooperUnion/anime-recommendations-database.
Deployment dataset: https://www.kaggle.com/marlesson/myanimelist-dataset-animes-profiles-reviews

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.streamlit		.streamlit
deployment data		deployment data
utilities		utilities
.gitignore		.gitignore
Collaborative Filtering.ipynb		Collaborative Filtering.ipynb
Content Based Filtering.ipynb		Content Based Filtering.ipynb
Data Cleaning.ipynb		Data Cleaning.ipynb
Exploratory Data Analysis.ipynb		Exploratory Data Analysis.ipynb
LICENSE		LICENSE
README.md		README.md
anime.jpg		anime.jpg
blur_image.jpeg		blur_image.jpeg
blur_img.jpg		blur_img.jpg
gui.py		gui.py
hot_encode.jpeg		hot_encode.jpeg
kaggle.json		kaggle.json
logo_img.png		logo_img.png
random_user.txt		random_user.txt
requirements.txt		requirements.txt
tempCodeRunnerFile.py		tempCodeRunnerFile.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anime-Recommendations-Project

Introduction

What I learnt:

Included Files:

Notebooks

Datasets

Resources and Datasets Used

Anime and Rating Datasets

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Anime-Recommendations-Project

Introduction

What I learnt:

Included Files:

Notebooks

Datasets

Resources and Datasets Used

Anime and Rating Datasets

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages