Improve model card: Update pipeline tag, add library name, and link paper
#1
by
nielsr
HF Staff
- opened
This PR enhances the model card for RLinf/RLinf-math-7B by:
- Updating the
pipeline_tagtotext-generationto accurately reflect the model's primary inference capability as a causal language model for mathematical reasoning, as indicated byconfig.jsonandconfiguration.json. - Adding
reinforcement-learningto thetagslist to preserve information about the model's training methodology. - Adding
library_name: transformersto the metadata, as demonstrated by the existingHow to Usesection. This will enable the automatic code snippet on the Hugging Face Hub. - Prominently linking the paper RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training at the top of the model card.
These changes will improve the model's discoverability and provide more complete information for users.
zoeyuchao
changed pull request status to
merged