Alpaca

Stanford · March 2023

activeOpen Sourcedecoder onlytext
Parameters7B

Why It Matters

Proved that fine-tuning an open model on synthetic instruction data could replicate much of ChatGPT's conversational ability for under $600, sparking a wave of community-driven model development.

Description

Stanford's instruction-tuned version of LLaMA that demonstrated you could create a ChatGPT-like model for under $600. Fine-tuned on 52K instruction-following examples generated by GPT-3.5, it proved that high-quality instruction tuning data matters more than model scale for conversational ability.

Key Innovations

Instruction Tuning
Instruction TuningFine-tuning a model on instruction-response pairs so it follows user commands more reliably.
Distillation
DistillationTraining a smaller 'student' model to mimic a larger 'teacher' model, preserving capability at lower cost.

Family Tree

Built On

Lineage

LLaMAAlpaca