Date: September 12, 2025

Topic: Data Cleaning

Recall

Notes

Data Cleaning

Understanding Missing Data

Survey Example

We want to know how affected the population is by depression


We can choose to remove data but may lose valuable information, or impute missing data by choosing from some statistical method

Handling Missing Data

Remove Missing Data

image.png

Impute Missing Data

image.png


Transform the data based on methods suited for the data type

Transforming Data

Highly dependent on input format

Image

Text


Proper data pre-processing allows for faster convergence

Pre-processing Data

image.png


Case Study: Depth Perception

image.png

Filling in Depth (Black Pixels)

image.png

Data Transformation

image.png

Pre-processing

image.png


<aside> 📌 SUMMARY: By doing proper data cleaning, transform and pre-processing, we get much better results during model training

</aside>


Date: September 12, 2025

Topic: Managing Bias


<aside> 📌 SUMMARY: We should maintain fairness

</aside>