Writer/palmyra-mini

tperes committed on Sep 11

Commit 2030b09 (verified) · 1 parent: 3605879

Update README.md

Files changed (1)

README.md +21 -0

@@ -119,6 +119,27 @@ output_text = tokenizer.decode(output_id[0][input_ids.shape[1] :])
 print(output_text)
 ```
+## Running with vLLM
+```sh
+vllm serve Writer/palmyra-mini-thinking-b
+```
+```sh
+curl -X POST http://localhost:8000/v1/chat/completions \
+  -H "Content-Type: application/json" \
+  -d '{
+    "model": "Writer/palmyra-mini-thinking-b",
+    "messages": [
+      {
+        "role": "user",
+        "content": "You have a 3-liter jug and a 5-liter jug. How can you measure exactly 4 liters of water?"
+      }
+    ],
+    "max_tokens": 8000,
+    "temperature": 0.2
+  }'
+```
 ## Ethical Considerations
 As with any language model, there is a potential for generating biased or inaccurate information. Users should be aware of these limitations and use the model responsibly.
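
For readers who would rather call the server from Python than from curl, the request added in this commit can be sketched with the standard library alone. This is a minimal sketch, not part of the commit: it assumes the `vllm serve` command above is already running on `localhost:8000`, and that the response follows the OpenAI-style chat completion shape (`choices[0].message.content`). The helper names `build_chat_request` and `ask` are hypothetical.

```python
import json
import urllib.request

# Assumption: a local vLLM server is listening at this address.
BASE_URL = "http://localhost:8000"
# Model name copied verbatim from the curl example in the diff.
MODEL = "Writer/palmyra-mini-thinking-b"


def build_chat_request(prompt, max_tokens=8000, temperature=0.2):
    """Build the same POST request that the curl command sends (hypothetical helper)."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        BASE_URL + "/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt):
    """Send the request and return the first choice's text (hypothetical helper).

    Assumes an OpenAI-style response body: choices[0].message.content.
    """
    with urllib.request.urlopen(build_chat_request(prompt)) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


# Usage (requires the server to be running):
#   print(ask("You have a 3-liter jug and a 5-liter jug. "
#             "How can you measure exactly 4 liters of water?"))
```

Building the request separately from sending it keeps the payload inspectable without a live server.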