Date: August 29, 2025

Topic: Foundations in Statistics

Recall

Random variables hold probabilities for what their actual value is. When querying the random variable with a particular value, we get the probability of that particular value appearing.

Notes

Random Variables

Querying Random Variables


We can have complex queries like in the case of OR, where we combine multiple variables together to get their probability.

Complex Queries

Can use logical operators like OR and AND to modify queries

Combining Multiple Variables

Using OR

image.png


From the Full Joint Distribution table we can derive many probabilities as it shows the probabilities from all worlds.

This allows us to compute the probability of any combination of values for a set of random variables.

Full Joint Distribution

Using the Table







<aside> 📌 SUMMARY: Random vars have probability distributions over their possible values; with multiple variables, the full joint distribution lists probabilities for all “worlds” where we get marginals and OR/AND queries from by summing the appropriate entries. Bayes Rule allows us to invert the conditional probabilities, which is useful as usually we want $P(\text{CAUSE}\mid\text{EFFECT})$ but we have data of $P(\text{EFFECT} \mid \text{CAUSE})$. Using Naive Bayes, we can assume that the effect variables are always independent, allowing for simple calculation.

</aside>