Skip to content

Decision Tree in Python : Metaprotein

Rahul Mondal edited this page Apr 21, 2021 · 16 revisions

Decision Tree

We have implemented the following criteria to split nodes in our Decision Trees for comparison of accuracy

  • Gini Index
  • Information Gain (Entropy)

We have taken (2/3)rd of our Metaprotein Dataset to be used as Training Dataset & (1/3)rd to be used as Testing Dataset


Gini Index

Full Tree_gini

Confusion Matrix: Prediction on Test Dataset

Full Tree__conf_matrix_gini

Accuracy = 72/97 = 79.16 %


Information Gain (Entropy)

Full Tree_entropy

Confusion Matrix: Prediction on Test Dataset

Full Tree_conf_entropy

Accuracy = 72/97 = 70.83 %


Clone this wiki locally