# PRRC-Reasoning Language Model (1.3B Parameters, 30B Tokens)

This repository contains the model described in the paper [Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models](https://huggingface.co/papers/2504.14194).

Code: https://github.com/opendatalab/Meta-rater

## Model Description

This is a 1.3B-parameter, transformer-based, decoder-only language model trained from scratch on 30B tokens selected from the SlimPajama dataset using the **Reasoning** dimension of the PRRC framework. The training data was curated by selecting text with high reasoning complexity, focusing on content that requires multi-step logical analysis and critical thinking.
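Since this is a standard decoder-only causal language model, it can in principle be loaded with the `transformers` library. The sketch below is a minimal, hedged example: the repo id `model_id` is a placeholder (the actual Hub repository name is not stated in this section) and should be replaced with the real one.

```python
# Minimal usage sketch. Assumption: the model is hosted on the Hugging Face
# Hub as a standard causal LM checkpoint; the repo id below is a PLACEHOLDER,
# not confirmed by this model card -- substitute the actual repository name.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "opendatalab/PRRC-Reasoning-1.3B"  # hypothetical repo id


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation of `prompt` with greedy-by-default decoding."""
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("A multi-step logical argument usually begins by"))
```

Decoding parameters (sampling temperature, top-p, etc.) can be passed to `model.generate` as keyword arguments if deterministic greedy decoding is not desired.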