Update README.md
README.md
@@ -123,4 +123,18 @@ input_ids, attention_mask = preprocess_input(tokenizer, system_prompt, initial_q
predicted_approach, _ = predict_approach(router_model, input_ids, attention_mask, device)

print(f"Router predicted approach: {predicted_approach}")
```
+
+## Citation
+
+If you use this in your work, please cite:
+
+```bibtex
+@software{optillm,
+  title = {Optillm: Optimizing inference proxy for LLMs},
+  author = {Asankhaya Sharma},
+  year = {2024},
+  publisher = {GitHub},
+  url = {https://github.com/codelion/optillm}
+}
+```