Gensyn
/

Qwen2.5-0.5B-Instruct

Text Generation

text-generation-inference

Model card Files Files and versions

benfielding commited on Mar 31

Commit

317b7eb

·

verified ·

1 Parent(s): e40f103

Re-order text

Files changed (1) hide show

README.md +5 -6

README.md CHANGED Viewed

@@ -15,6 +15,11 @@ library_name: transformers
 # Qwen2.5-0.5B-Instruct
 ## Introduction
 This repo contains an **unmodified version** of the instruction-tuned 0.5B Qwen2.5 model, which has the following features:
 - Type: Causal Language Models
@@ -26,12 +31,6 @@ This repo contains an **unmodified version** of the instruction-tuned 0.5B Qwen2
 - Number of Attention Heads (GQA): 14 for Q and 2 for KV
 - Context Length: Full 32,768 tokens and generation 8192 tokens
-This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm), to finetune locally using peer-to-peer reinforcement learning post-training.
-Once finetuned, the model can be used as normal in any workflow, for details on how to do this please refer to the [original model documentation](https://qwen.readthedocs.io/en/latest/).
-For more details on the original model, please refer to the original repository [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
 ## Requirements
 This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm) system, for details on model requirements when using outside of a swarm, refer to the original Qwen repo [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).

 # Qwen2.5-0.5B-Instruct
 ## Introduction
+This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm), to finetune locally using peer-to-peer reinforcement learning post-training.
+Once finetuned, the model can be used as normal in any workflow, for details on how to do this please refer to the [original model documentation](https://qwen.readthedocs.io/en/latest/).
+For more details on the original model, please refer to the original repository [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
 This repo contains an **unmodified version** of the instruction-tuned 0.5B Qwen2.5 model, which has the following features:
 - Type: Causal Language Models
 - Number of Attention Heads (GQA): 14 for Q and 2 for KV
 - Context Length: Full 32,768 tokens and generation 8192 tokens
 ## Requirements
 This model is intended for use in the [Gensyn RL Swarm](https://www.gensyn.ai/articles/rl-swarm) system, for details on model requirements when using outside of a swarm, refer to the original Qwen repo [here](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).