Collection related to the paper, "Training a Generally Curious Agent" (Project page: https://paprika-llm.github.io/)
-
ftajwar/paprika_Meta-Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 60 • 2 -
ftajwar/paprika_SFT_dataset
Viewer • Updated • 17.2k • 30 • 3 -
ftajwar/paprika_preference_dataset
Viewer • Updated • 5.26k • 20 • 1 -
ftajwar/paprika_Meta-Llama-3.1-8B-Instruct_SFT_only
Text Generation • 8B • Updated • 14