Date: January 30, 2024

Topic: Ensemble Learners

Recall

Ensemble learners take predictions from different machine learning models to perform a combined prediction.

Notes

Ensemble Learners

Untitled

Building an ensemble

  1. Train several parameterized polynomials of differing degrees
  2. Train several KNN models using different subsets
  3. Combine the models above into an ensemble

<aside> 📌 SUMMARY: Take several different models (can be the same or different kinds) and weigh the predictions to get a single prediction

</aside>


Date: January 30, 2024

Topic: Bootstrap aggregating - Bagging (Ensemble KNN)

Recall

Create several subsets of data (bags) to train a several KNN models. Combine them together to create an ensemble learner.

Notes

Bagging

Untitled


With bagging, we can get an ensemble model that mitigates overfitting.

Example of Ensemble Models (KNN)

Untitled


<aside> 📌 SUMMARY: Using bagging, we can create an ensemble of models from the different datasets. This leads to a model that can mitigate overfitting

</aside>


Date: January 30, 2024

Topic: Boosting (Ada Boost)

Recall

Ada Boost tries to single out badly performing data points so the model is more exposed to them for subsequent trainings

Notes

Ada Boost (Adaptive Boosting)

Repeat the above for n bags

Repeat the above for n bags

<aside> 💡 However, as we increase the number of bags, certain data points will always be picked. This will lead to overfitting if many bags are used

</aside>


<aside> 📌 SUMMARY: Bagging and boosting are just wrappers for existing methods. They help to reduce error and overfitting

</aside>