Date: January 30, 2024

Topic: Intro to Decision Trees

Recall

Decision trees conceive nodes that split into either left or right depending on the splitval of that node.

This split is learned from the data, where factors with the highest gini coefficient is used to split the tree first.

Decision trees in Python can be defined by matrices, with each row being the node used to split the tree further down.

This can be done with a numpy array

Notes

Decision Trees

In the context of animals

Example of a Decision Tree (of wine data)

Untitled

In-sample and out-of-sample testing

For Python

Querying and Learning


Determine the best feature to split on

Get the median value based on that feature for the splitval

Randomly select a feature to split on

Randomly select 2 rows and use the mean from that feature for splitval




<aside> 📌 SUMMARY:

</aside>