I will use a confusion matrix to evaluate the decision tree model that I built for the Income dataset.
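
A minimal sketch of what a confusion matrix counts, in plain Python. The labels and predictions here are hypothetical toy values, not the real Income test split, and the helper `confusion_matrix` is written for illustration (it follows the common rows = true class, columns = predicted class layout).

```python
from collections import Counter

def confusion_matrix(y_true, y_pred, labels):
    """Count (true, predicted) pairs; rows are true classes, columns are predicted classes."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]

# Hypothetical toy data, NOT taken from the Income dataset.
labels = ["<=50K", ">50K"]
y_true = ["<=50K", "<=50K", "<=50K", ">50K", ">50K", "<=50K", ">50K", "<=50K"]
y_pred = ["<=50K", "<=50K", ">50K", ">50K", "<=50K", "<=50K", ">50K", "<=50K"]

matrix = confusion_matrix(y_true, y_pred, labels)
for label, row in zip(labels, matrix):
    print(f"true {label:>5}: {row}")
```

The diagonal cells hold the correct predictions, and the off-diagonal cells hold the two kinds of mistakes.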

Next, I display the classification report for the decision tree classifier on the Income dataset, including its precision, recall, and F1-score values.
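
To show where the numbers in such a report come from, here is a rough sketch that derives per-class precision, recall, and F1-score from a confusion matrix. The counts are hypothetical (chosen only so the arithmetic is easy to follow), and `per_class_report` is an illustrative helper, not a library function.

```python
def per_class_report(matrix, labels):
    """Per-class precision, recall, F1 and support.

    Assumes rows are true classes and columns are predicted classes.
    """
    report = {}
    for i, label in enumerate(labels):
        tp = matrix[i][i]                          # correct predictions for this class
        predicted = sum(row[i] for row in matrix)  # column total: times this class was predicted
        actual = sum(matrix[i])                    # row total: true members of this class (support)
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = {"precision": precision, "recall": recall,
                         "f1": f1, "support": actual}
    return report

# Hypothetical counts, NOT the real results on the Income dataset.
labels = ["<=50K", ">50K"]
matrix = [[850, 50],
          [150, 150]]

report = per_class_report(matrix, labels)
for label, scores in report.items():
    print(f"{label:>6}  precision={scores['precision']:.2f}  "
          f"recall={scores['recall']:.2f}  f1={scores['f1']:.2f}  "
          f"support={scores['support']}")
```

Precision answers "when the model predicts this class, how often is it right?", while recall answers "of the true members of this class, how many did the model find?".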

I will use a confusion matrix again to evaluate the pipelined SVC model.

Again, I display the classification report for the pipelined SVC model on the Income dataset, including its precision, recall, and F1-score values.

I used a confusion matrix for evaluation because it directly compares the counts of True Positives, False Positives, True Negatives, and False Negatives. By looking at the visualization produced by the confusion matrix function, we can quickly see how well the model predicts the "unseen" test data.
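
As a small illustration of those four values, the sketch below names each cell of a 2x2 matrix and computes the accuracy from them. The counts are hypothetical, and treating ">50K" as the positive class is an assumption made only for this example.

```python
# Hypothetical 2x2 confusion matrix, NOT the real results.
# Rows = true class, columns = predicted class, ordered ["<=50K", ">50K"].
# Assumption: ">50K" is treated as the positive class.
matrix = [[850, 50],
          [150, 150]]

(tn, fp), (fn, tp) = matrix

total = tn + fp + fn + tp
accuracy = (tp + tn) / total  # share of all predictions that were correct

print(f"TN={tn}  FP={fp}  FN={fn}  TP={tp}")
print(f"accuracy = {accuracy:.3f}")
```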

I am confident in concluding that both classification algorithms, decision tree and pipelined SVC, perform well on the Income dataset, based on their test accuracy, recall, precision, and F1-score values. Both classifiers achieve a high test accuracy, and notably the same one: 0.84. Their other metrics are also surprisingly similar, although there are slight differences in precision and recall. Concretely, when the decision tree classifier predicts that a person makes less than $50K a year, it is correct 85% of the time, while the pipelined SVC classifier is correct 86% of the time. A difference of only one percentage point in precision is too small to show that the pipelined SVC classifier works better than the decision tree on this dataset. In short, both the decision tree and the pipelined SVC perform well on the Income dataset.
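
One rough way to see why a one-point gap in precision is weak evidence is to compare it with the sampling noise of the estimates. The sketch below uses the binomial standard error; the precisions 0.85 and 0.86 are the values quoted above, but `n_predicted` is an assumed count, since the real number of "<=50K" predictions is not stated here. It also treats the two estimates as independent, which is a simplification because both models were scored on the same test set.

```python
import math

def precision_standard_error(precision, n_predicted):
    """Binomial standard error of a precision estimated from n_predicted predictions."""
    return math.sqrt(precision * (1 - precision) / n_predicted)

# ASSUMPTION: number of "<=50K" predictions made by each model (not from the source).
n_predicted = 5000

se_tree = precision_standard_error(0.85, n_predicted)
se_svc = precision_standard_error(0.86, n_predicted)

# Standard error of the difference, treating the two estimates as independent
# (a simplification, see the note above).
se_diff = math.sqrt(se_tree ** 2 + se_svc ** 2)
z = (0.86 - 0.85) / se_diff

print(f"standard error of the difference: {se_diff:.4f}")
print(f"difference in standard errors (z): {z:.2f}")
```

Under these assumptions the gap is well under two standard errors, so it could plausibly be noise; with a different test-set size the picture could change.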


Even though these two classifiers seem to do an equally good job of classifying this Income dataset, this does not mean that decision tree and SVC will always perform the same and return the same results, since each classifier works differently and has its own characteristics.

Click the button below to download my Jupyter notebook to read more about my evaluation step.

©2022 by ℅ D. Proudly created with Wix.com
