Through fine-tuning, we introduce examples that bias the model towards the fine-tuned dataset.


Through fine-tuning, we are able to change the distributions of the predicted probabilities. For example, instead of the word “company”, the word “elves” is now more likely.

E.g., input “In that time the” most likely outputs “company” due to training on the large corpus, but the fantasy text has the output “elves” instead

The model prefers standard words like “company” and “world”, with “elves” being ranked relatively low
However after fine-tuning, we may get a distribution like this instead, where “elves” is ranked much higher

<aside> 📌 SUMMARY: Fine-tuning is useful when we have a small dataset that we want the model to bias to, but not large enough to train a model from scratch. This helps drive the distribution of outputs towards that dataset. If we have very large models, chances are such specific datasets are already included in the model’s distribution and through careful prompting, we can drive it towards those outputs instead.
</aside>
Instruction tuning is a form of fine-tuning that aligns the model towards a question-answer format, with humans correcting the output so the model is less likely to treat the prompt as a text continuation problem.
<aside> 📌 SUMMARY: Instruction tuning helps align the model towards human conversation structure instead of being solely text generating.
</aside>