meta-llama · pchng · Apr 24, 2024
diff --git a/MODEL_CARD.md b/MODEL_CARD.md
@@ -4,13 +4,13 @@ Meta developed and released the Meta Llama 3 family of large language models (LL
 
 **Model developers** Meta
 
-**Variations** Llama 3 comes in two sizes — 8B and 70B parameters — in pre-trained and instruction tuned variants.
+**Llama 3 family of models** Llama 3 comes in two sizes — 8B and 70B parameters — in pre-trained and instruction tuned variants.
 
 **Input** Models input text only.
 
 **Output** Models generate text and code only.
 
-**Model Architecture** Llama 3 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
+**Model Architecture** Llama 3 is an auto-regressive language model that uses an optimized transformer architecture. Llama 3 uses a tokenizer with a vocabulary of 128K tokens, and was trained on on sequences of 8,192 tokens. Grouped-Query Attention (GQA) is used for all models to improve inference efficiency. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
 
 
 <table>
@@ -59,7 +59,7 @@ Meta developed and released the Meta Llama 3 family of large language models (LL
 </table>
 
 
-**Llama 3 family of models**. Token counts refer to pretraining data only. Both the 8 and 70B versions use Grouped-Query Attention (GQA) for improved inference scalability.
+Note: Token counts refer to pretraining data only.
 
 **Model Release Date** April 18, 2024.