omrialmog committed
Commit 6511991 · Parent(s): 74219e3

Updated Readme.md

Files changed (1): README.md (+4 -1)
README.md CHANGED
@@ -17,10 +17,13 @@ tags:
 # Model Overview

 ## Description:
- The NVIDIA gpt-oss-120b Eagle model is the Eagle head of OpenAI’s gpt-oss-120b model, which is an auto-regressive language model that uses a mixture-of-experts (MoE) architecture with 32 billion activated parameters and 1 trillion total parameters. For more information, please check [here](https://huggingface.co/openai/gpt-oss-120b). The NVIDIA gpt-oss-120b Eagle3 model incorporates Eagle speculative decoding with [TensorRT Model Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer).
+ The NVIDIA gpt-oss-120b Eagle model is the Eagle head of OpenAI’s gpt-oss-120b model, which is an auto-regressive language model that uses a mixture-of-experts (MoE) architecture with 5 billion activated parameters and 120 billion total parameters. For more information, please check [here](https://huggingface.co/openai/gpt-oss-120b). The NVIDIA gpt-oss-120b Eagle3 model incorporates Eagle speculative decoding with [TensorRT Model Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer).

 This model is ready for commercial/non-commercial use. <br>

+ ### Note
+ For use cases with less than 8k context length, please consider using [gpt-oss-120b-Eagle3-v2](https://huggingface.co/nvidia/gpt-oss-120b-Eagle3-v2).
+
 ### License/Terms of Use:
 [nvidia-open-model-license](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/)
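Context for the change above: the Eagle3 checkpoint in this repo is a draft head, intended to be loaded alongside the base gpt-oss-120b model by an inference engine that implements EAGLE-3 speculative decoding. The snippet below is a minimal sketch of that pairing using vLLM's `speculative_config`; it assumes a recent vLLM release with EAGLE-3 and gpt-oss support, and the exact config keys, repo IDs, parallelism setting, and draft-token count are illustrative rather than taken from this model card.

```python
# Minimal sketch: pairing gpt-oss-120b with its Eagle3 draft head via vLLM's
# EAGLE-3 speculative decoding path. Assumes a recent vLLM release whose
# `speculative_config` accepts the keys below; check your version's docs.
from vllm import LLM, SamplingParams

llm = LLM(
    model="openai/gpt-oss-120b",                # target (base) model
    tensor_parallel_size=8,                     # adjust to your GPU count (assumption)
    speculative_config={
        "method": "eagle3",                     # EAGLE-3 draft-head decoding
        "model": "nvidia/gpt-oss-120b-Eagle3",  # this repo's Eagle3 draft head (assumed repo ID)
        "num_speculative_tokens": 3,            # draft tokens proposed per step
    },
)

outputs = llm.generate(
    ["Explain speculative decoding in one short paragraph."],
    SamplingParams(temperature=0.0, max_tokens=128),
)
print(outputs[0].outputs[0].text)
```

As a rough guide, `num_speculative_tokens` trades extra draft compute against acceptance rate; small values in the 2-4 range are a common starting point before tuning on your workload.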