Date: May 19, 2024

Topic: ID3 Algorithm

Recall

Notes

ID3 Algorithm

LOOP:

select_best_attribute_A()
assign_A_as_decision_attribute_for_node()

for each_value_of_A:
	create_descendent_node()
	
sort_training_examples_to_leaves()

if examples_perfectly_created:
	break
else:
	iterate_over_leaves()

Entropy measures randomness in a dataset

The best attribute for each node maximizes the Entropy Gain

Selecting the best attribute (Information Gain)

Formula

Information Gain:

$\text{Gain}(S,A) = \text{Entropy}(S)-\sum_V\frac{|S_v|}{|S|}\cdot\text{Entropy}(S_V)$

Entropy

<aside> 💡 In order to choose the best attribute, we want one to maximize the Information Gain

</aside>


For a decision tree to perform well, it has to consider inductive biases in its algorithm

Inductive Bias

Restriction Bias ($H$)

Preference Bias ($n\in H$)

Applying inductive bias to decision trees


<aside> 📌 SUMMARY: When building a decision tree, we want to obtain the best splits for the attributes and this can be done by maximizing entropy gain

</aside>