metadata
language:
- en
license: cc-by-nc-4.0
library_name: transformers
tags:
- llama
- knowledge
- recall
- fine-tuned
- fine-tuning
pipeline_tag: text-generation
model-index:
- name: Llama-3.1-8B-knowledge-recall
results:
- task:
type: text-generation
name: Text Generation
dataset:
name: tulu3_mixture_knowledge_recall
type: custom
metrics:
- name: Training Loss
type: loss
value: 1.05
base_model: meta-llama/Llama-3.1-8B
Llama-3.1-8B Knowledge Recall Model
This is a fine-tuned Llama-3.1-8B model specialized for knowledge recall tasks. This checkpoint was released alongside https://arxiv.org/abs/2509.11167.
Model Details
- Base model: Llama-3.1-8B
- Training dataset: tulu3_mixture_knowledge_recall
- Learning rate: 5e-06
- Effective batch size: 128
Export Files
This repository includes export files for state averaging and other advanced techniques.