Dhee-NxtGen-Qwen3-1.7B-v2 is a multilingual LLM series by DheeYantra and NxtGen Cloud Technologies, based on Qwen3-1.7B and built for Indian languages.
Dhee-NxtGen-Qwen3-Hindi-v2 is a large language model developed by DheeYantra in collaboration with NxtGen Cloud Technologies Pvt. Ltd.
It is based on the Qwen3 architecture and fine-tuned for assistant-style, function-calling, and reasoning-based conversational tasks in Hindi.
This model generates fluent and contextually accurate Hindi responses, suitable for building chatbots, reasoning systems, and multilingual AI assistants.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "dheeyantra/dhee-nxtgen-qwen3-hindi-v2"

# Load the tokenizer and model weights from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

# Example prompt in ChatML format.
# The user message is Hindi for: "Can you schedule an appointment for me?"
prompt = """<|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
क्या आप मेरे लिए एक अपॉइंटमेंट शेड्यूल कर सकते हैं?<|im_end|>
<|im_start|>assistant
"""

# Tokenize the prompt, generate up to 150 new tokens, and print the decoded text
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=150)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
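The hand-written prompt in the example above follows the ChatML layout, where each turn is wrapped in <|im_start|>{role} ... <|im_end|> markers and the assistant turn is left open. The following is a minimal sketch of assembling such a prompt from a list of messages. The helper name `build_chatml_prompt` is hypothetical and is not part of this model or of the transformers library; if the tokenizer ships a chat template, `tokenizer.apply_chat_template` is likely the more conventional route.

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Assemble a ChatML-style prompt string from role/content dicts.

    Hypothetical helper for illustration only; it mirrors the layout of
    the hand-written prompt in the example above.
    """
    parts = []
    for message in messages:
        # Each turn is wrapped as <|im_start|>{role}\n{content}<|im_end|>\n
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "क्या आप मेरे लिए एक अपॉइंटमेंट शेड्यूल कर सकते हैं?"},
]
prompt = build_chatml_prompt(messages)
```

The resulting `prompt` string can be passed to the tokenizer in place of the literal prompt, which makes multi-turn conversations easier to build than editing the template by hand.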
For high-throughput serving with vLLM, ensure the following environment:
Install dependencies:
pip install torch transformers vllm sentencepiece
Run vLLM server:
vllm serve dheeyantra/dhee-nxtgen-qwen3-hindi-v2 --host 0.0.0.0 --port 8000
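Once the server is running, it exposes vLLM's OpenAI-compatible HTTP API. The sketch below queries it using only the Python standard library. It assumes the server is reachable at http://localhost:8000 (matching the port above) and that the served model name equals the repository ID. `build_chat_request` and `ask` are hypothetical helper names, and the `max_tokens` value is an arbitrary illustrative choice.

```python
import json
import urllib.request

MODEL = "dheeyantra/dhee-nxtgen-qwen3-hindi-v2"
BASE_URL = "http://localhost:8000"  # assumption: server runs locally on port 8000


def build_chat_request(user_text, max_tokens=150):
    """Build the JSON payload for the /v1/chat/completions endpoint."""
    return {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_text},
        ],
        "max_tokens": max_tokens,
    }


def ask(user_text):
    """Send one chat request to the running server and return the reply text."""
    body = json.dumps(build_chat_request(user_text)).encode("utf-8")
    request = urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        reply = json.load(response)
    # The OpenAI-style response carries the text under choices[0].message.content
    return reply["choices"][0]["message"]["content"]


# Usage (requires the vLLM server to be running):
# print(ask("क्या आप मेरे लिए एक अपॉइंटमेंट शेड्यूल कर सकते हैं?"))
```

Because the endpoint applies the model's chat template on the server side, the client sends plain role/content messages rather than a hand-built ChatML string.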
Released under the Apache 2.0 License.
Developed by DheeYantra in collaboration with NxtGen Cloud Technologies Pvt. Ltd.