Update README.md
README.md CHANGED

@@ -5,6 +5,7 @@ base_model:
 ---
 
 The model is derived from Llama-2-7b-hf through pruning using LLM-Streamline **(Streamlining Redundant Layers to Compress Large Language Models, ICLR 2025 Spotlight)**. The entire training process required only 0.06B tokens.
+
 Below are the results of the evaluation using lm-eval:
 | | arc_c | arc_e | boolq | hellaswag | openbookqa | rte | winogrande | Avg |
 |--------------|-------|-------|-------|-----------|------------|------|------------|------|
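
As a reference point, the sketch below shows how the benchmarks listed in the table could be run with the lm-evaluation-harness Python API (`lm_eval`, v0.4+). The checkpoint path is a placeholder, and the batch size is an assumption rather than a value taken from this card:

```python
# Sketch: evaluating the pruned checkpoint on the benchmarks listed above
# with lm-evaluation-harness (lm_eval). The model path is a placeholder.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                    # Hugging Face transformers backend
    model_args="pretrained=PATH_TO_PRUNED_MODEL",  # placeholder: substitute the actual checkpoint
    tasks=[
        "arc_challenge", "arc_easy", "boolq", "hellaswag",
        "openbookqa", "rte", "winogrande",
    ],
    batch_size=8,  # assumed; adjust to available memory
)

# Print the per-task metrics reported by the harness.
for task, metrics in results["results"].items():
    print(task, metrics)
```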