Add instructions on how to run the model with transformers (#31)
- Update README.md (f2b0fc04dc4a811841bab00bcc5986d4a83004b2)
Co-authored-by: Younes Belkada <[email protected]>
README.md CHANGED
@@ -104,6 +104,27 @@ num1, num2):
# return the sum
```

+## Usage with transformers library
+
+This model is also compatible with the `transformers` library. First run `pip install -U transformers`, then use the snippet below to get started quickly:
+
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+model_id = "mistralai/Codestral-22B-v0.1"
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+
+model = AutoModelForCausalLM.from_pretrained(model_id)
+
+text = "Hello my name is"
+inputs = tokenizer(text, return_tensors="pt")
+
+outputs = model.generate(**inputs, max_new_tokens=20)
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))
+```
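+
+If a CUDA GPU is available, one possible variant (a sketch assuming `torch` is installed, which `transformers` already depends on) moves the model and inputs onto it:
+
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+model_id = "mistralai/Codestral-22B-v0.1"
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+
+# Load as before, then move the weights to the GPU; this assumes the GPU has
+# enough memory for the full-precision weights (see the note on precision below)
+model = AutoModelForCausalLM.from_pretrained(model_id).to("cuda")
+
+inputs = tokenizer("Hello my name is", return_tensors="pt").to("cuda")
+outputs = model.generate(**inputs, max_new_tokens=20)
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))
+```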
+
+By default, `transformers` loads the model in full precision, so you may want to further reduce the memory required to run the model through the optimizations available in the HF ecosystem.
+
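+For example, one such optimization (a sketch assuming a CUDA GPU and the `bitsandbytes` and `accelerate` packages, installable with `pip install bitsandbytes accelerate`) is loading the weights in 4-bit:
+
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
+
+model_id = "mistralai/Codestral-22B-v0.1"
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+
+# Quantizing the weights to 4-bit cuts their memory footprint to roughly a
+# quarter of full precision; device_map="auto" (via accelerate) places them
+# on the available GPU(s)
+quantization_config = BitsAndBytesConfig(load_in_4bit=True)
+model = AutoModelForCausalLM.from_pretrained(
+    model_id,
+    quantization_config=quantization_config,
+    device_map="auto",
+)
+```
+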
## Limitations

The Codestral-22B-v0.1 does not have any moderation mechanisms. We're looking forward to engaging with the community on ways to make the model finely respect guardrails, allowing for deployment in environments requiring moderated outputs.