Date: May 18, 2024

Topic: Learning in Decision Trees

Recall

When creating splits for decision trees, we want to find the best attribute at each node to split the data evenly based on the eventual outcomes

Notes

20 Questions

When playing 20 questions (e.g., if the answer is Michael Jackson):

Applying to Decision Trees

We can follow the same principles when trying to build a decision tree

  1. Pick the best attribute (e.g., best may split the data into half)
  2. Ask a question
  3. Follow the answer path
  4. Repeat the above 3 steps until we get an answer

Making the best split

Untitled


Decision trees can also be used to express logic functions like AND, OR, XOR

Expressiveness in Decision Trees

Boolean AND (A AND B)

Decision Trees can be used to represent AND function (e.g., A AND B)

Untitled

Boolean OR (A OR B)

Untitled

Boolean XOR (A XOR B)

Untitled


ANY is an example of a simple problem, where we have a linear relationship between the attributes and nodes



<aside> 📌 SUMMARY: The best splits in decision trees should separate the data cleanly based on their eventual outcomes. Decision trees can also grow very deep, so we need a good method to search such trees

</aside>