Improve model card: Update pipeline tag, add library name, and link paper

by nielsr HF Staff - opened Oct 10

←

nielsr

Oct 10

This PR enhances the model card for RLinf/RLinf-math-7B by:

Updating the pipeline_tag to text-generation to accurately reflect the model's primary inference capability as a causal language model for mathematical reasoning, as indicated by config.json and configuration.json.
Adding reinforcement-learning to the tags list to preserve information about the model's training methodology.
Adding library_name: transformers to the metadata, as demonstrated by the existing How to Use section. This will enable the automatic code snippet on the Hugging Face Hub.
Prominently linking the paper RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training at the top of the model card.

These changes will improve the model's discoverability and provide more complete information for users.

zoeyuchao changed pull request status to merged Oct 10

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment