Build Classifier
Parth Patel edited this page Feb 24, 2018 · 3 revisions
This wiki page describes how to use the buildClassifier.py script.
Requirements: Python 2.x and the following Python modules:
- pandas
- NumPy
- SciPy
- Matplotlib
- scikit-learn - This Python package provides a variety of machine learning algorithms. Download - http://scikit-learn.org/stable/install.html#
Parameters:
- n_trees - The number of trees in the forest (Default - 100)
- n_features - The number of features to consider when looking for the best split (Default - 5)
- n_classes - The number of classes; the script performs binary classification (Default - 2, do not change)
- n_folds - The number of folds for k-fold cross-validation (Default - 5)
- n_jobs - The number of jobs to run in parallel (Default - 10)
- Best_k_features - The number of features to select using information gain (Default - 250)
- InputFile - Feature file generated by Generate_Features.py (e.g. 'Features.csv')
python buildClassifier.py
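To make the Best_k_features parameter more concrete, here is a minimal sketch of ranking features by information gain and keeping the top k. It is not the code in buildClassifier.py: the function names (`entropy`, `information_gain`, `select_best_k`) and the toy data are hypothetical, and the sketch assumes discrete feature values, whereas the real feature file may hold continuous values that would need binning or a different estimator.

```python
from collections import Counter
from math import log


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = float(len(labels))
    return -sum((n / total) * log(n / total, 2) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their entropy after grouping by feature value."""
    total = float(len(labels))
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_best_k(features, labels, k):
    """Return the names of the k features with the highest information gain."""
    ranked = sorted(
        features,
        key=lambda name: information_gain(features[name], labels),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy data: three discrete features and a binary class label.
labels = [1, 1, 0, 0]
features = {
    "f_perfect": [1, 1, 0, 0],  # separates the two classes exactly
    "f_noise": [1, 0, 1, 0],    # carries no information about the class
    "f_partial": [1, 1, 1, 0],  # partially informative
}
best = select_best_k(features, labels, k=2)
print(best)
```

With the default Best_k_features of 250, the same idea would keep the 250 highest-ranked columns of the feature file before the forest is trained.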
Results folder contents:
- Performance_Report.txt - Performance measures such as accuracy, sensitivity, specificity, and precision, plus the confusion matrix and the information-gain-based ranking of the Best_k_features
- ROC_Detailed.png
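For readers unfamiliar with the measures listed for Performance_Report.txt, the sketch below shows how they are conventionally derived from a binary confusion matrix. The function name `performance_measures` and the example labels are hypothetical, and the report's actual layout and computation may differ.

```python
def performance_measures(y_true, y_pred):
    """Binary-classification measures, treating label 1 as the positive class.

    Assumes both classes occur in y_true and at least one positive prediction
    is made; otherwise a division by zero would occur.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    return {
        # Rows are true classes (0, 1); columns are predicted classes (0, 1).
        "confusion_matrix": [[tn, fp], [fn, tp]],
        "accuracy": (tp + tn) / float(len(pairs)),
        "sensitivity": tp / float(tp + fn),  # true positive rate
        "specificity": tn / float(tn + fp),  # true negative rate
        "precision": tp / float(tp + fp),
    }


# Hypothetical labels: 4 positive and 6 negative samples.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
report = performance_measures(y_true, y_pred)
for name in ("accuracy", "sensitivity", "specificity", "precision"):
    print("%s: %.3f" % (name, report[name]))
```

Sensitivity and specificity are reported separately because accuracy alone can look good on an imbalanced data set even when one class is rarely predicted correctly.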