Build a Decision Tree classifier from scratch to identify profitable LendingClub loans.
- Download the LendingClub loan data from Kaggle
- Install the required libraries with pip: pip3 install -r py_libraries.txt
- Run the Jupyter notebook data_preprocessing.ipynb
- Open a terminal and run the data sampling program: python stratified_sampling.py
- Run the main program and follow the on-screen instructions: python main.py
- Data Sampling
- Command: python stratified_sampling.py
- Run this program to perform stratified sampling on the pre-processed data.
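
For readers unfamiliar with the technique, the idea behind stratified sampling can be sketched as follows. This is a minimal illustration using only the standard library, not the code in stratified_sampling.py; the function name `stratified_sample`, the `status` field, and the toy data are all assumptions made for the example.

```python
import random
from collections import defaultdict


def stratified_sample(rows, label_key, fraction, seed=0):
    """Return a sample that keeps roughly the same class proportions as `rows`."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible

    # Group rows by class label so each class is sampled separately.
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[label_key]].append(row)

    sample = []
    for group in by_label.values():
        # Keep at least one row per class so rare classes are not dropped.
        k = max(1, round(len(group) * fraction))
        sample.extend(rng.sample(group, k))
    return sample


# Hypothetical toy data: 80 "paid" loans and 20 "default" loans.
loans = [{"id": i, "status": "paid" if i < 80 else "default"} for i in range(100)]
subset = stratified_sample(loans, "status", fraction=0.1)
```

Sampling each class separately keeps the 80/20 class balance in the subset, which a plain random sample of 10 rows would not guarantee.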
- DT.py - this module contains:
- Two classes: Node and DecisionTree.
- Utility functions for making predictions and evaluating the model.
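
To give a sense of how a module organized this way might fit together, here is a minimal sketch of a `Node` class, a `DecisionTree` class, and prediction and evaluation helpers. It is not the contents of DT.py: the Gini impurity criterion, the `max_depth` parameter, the method names, and the toy features are assumptions chosen for illustration.

```python
from collections import Counter


class Node:
    """Either an internal split (feature, threshold) or a leaf holding a label."""

    def __init__(self, feature=None, threshold=None, left=None, right=None, label=None):
        self.feature = feature
        self.threshold = threshold
        self.left = left
        self.right = right
        self.label = label

    def is_leaf(self):
        return self.label is not None


def gini(labels):
    """Gini impurity of a list of class labels (assumed split criterion)."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


class DecisionTree:
    def __init__(self, max_depth=3):
        self.max_depth = max_depth
        self.root = None

    def fit(self, X, y):
        self.root = self._grow(X, y, depth=0)
        return self

    def _grow(self, X, y, depth):
        majority = Counter(y).most_common(1)[0][0]
        # Stop when the node is pure or the depth limit is reached.
        if len(set(y)) == 1 or depth >= self.max_depth:
            return Node(label=majority)
        best = None  # (weighted impurity, feature index, threshold)
        for f in range(len(X[0])):
            for t in sorted({row[f] for row in X}):
                left = [lab for row, lab in zip(X, y) if row[f] <= t]
                right = [lab for row, lab in zip(X, y) if row[f] > t]
                if not left or not right:
                    continue
                score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
                if best is None or score < best[0]:
                    best = (score, f, t)
        if best is None:  # no threshold separates the rows
            return Node(label=majority)
        _, f, t = best
        li = [i for i, row in enumerate(X) if row[f] <= t]
        ri = [i for i, row in enumerate(X) if row[f] > t]
        return Node(
            feature=f,
            threshold=t,
            left=self._grow([X[i] for i in li], [y[i] for i in li], depth + 1),
            right=self._grow([X[i] for i in ri], [y[i] for i in ri], depth + 1),
        )

    def predict_one(self, row):
        # Walk from the root to a leaf, following the split at each node.
        node = self.root
        while not node.is_leaf():
            node = node.left if row[node.feature] <= node.threshold else node.right
        return node.label


def accuracy(tree, X, y):
    """Fraction of rows whose predicted label matches the true label."""
    return sum(tree.predict_one(r) == lab for r, lab in zip(X, y)) / len(y)


# Hypothetical toy data: [interest_rate, debt_to_income] -> 1 if profitable.
X = [[5.0, 10.0], [6.0, 12.0], [15.0, 30.0], [18.0, 35.0]]
y = [1, 1, 0, 0]
tree = DecisionTree(max_depth=2).fit(X, y)
```

The exhaustive threshold search is quadratic in the number of rows per feature, which is fine for a small sample but is one reason to run the tree on a stratified subset rather than the full dataset.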
- Main