This code shows several machine learning algorithms in-depth for a random regression dataset. I made this because I wanted to see each algorithms sensitivity against over/underfitting. The main library that I'm using is SK-learn which is for python. Take also into account that I built this in collab from Google. The structures is as follows:
- libraries installation, imports, etc.
- Definition of a random dataset
- Drive mounting
- Random dataset generation
- Linear regression algorithm
- KNN
- SVR
- SVR - kernel sensitivity
- Decision tree model
- Decision tree model - Sample split sensitivity
- Decision tree model - Leaf's sensitivity
- Random forest model
- Gradient boosting
- Download the code
- Open it in google colab. I didn't try with Jupyter but I guess that should be more or less the same. As far as I had just checked, this guy made it: https://www.youtube.com/watch?v=L80dn0PiN_U&ab_channel=TechTips
There are several point to tweek. You can see at the beggining of each section how it is made.
Please let us know by writing it down and your thoughts about what caused that bug. Try to explain as much as you can. It will be very appreciate it
I'm thinking about adding ensembled learning techniques actually.