tperes commited on
Commit
2030b09
·
verified ·
1 Parent(s): 3605879

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md CHANGED
@@ -119,6 +119,27 @@ output_text = tokenizer.decode(output_id[0][input_ids.shape[1] :])
119
  print(output_text)
120
  ```
121
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
122
  ## Ethical Considerations
123
 
124
  As with any language model, there is a potential for generating biased or inaccurate information. Users should be aware of these limitations and use the model responsibly.
 
119
  print(output_text)
120
  ```
121
 
122
+ ## Running with vLLM
123
+ ```py
124
+ vllm serve Writer/palmyra-mini-thinking-b
125
+ ```
126
+ ```py
127
+ curl -X POST http://localhost:8000/v1/chat/completions \
128
+ -H "Content-Type: application/json" \
129
+ -d '{
130
+ "model": "Writer/palmyra-mini-thinking-b",
131
+ "messages": [
132
+ {
133
+ "role": "user",
134
+ "content": "You have a 3-liter jug and a 5-liter jug. How can you measure exactly 4 liters of water?"
135
+ }
136
+ ],
137
+ "max_tokens": 8000,
138
+ "temperature": 0.2
139
+ }'
140
+ ```
141
+
142
+
143
  ## Ethical Considerations
144
 
145
  As with any language model, there is a potential for generating biased or inaccurate information. Users should be aware of these limitations and use the model responsibly.