---
base_model:
- deepseek-ai/DeepSeek-V3-0324
pipeline_tag: text-generation
---

For inference with `sglang` and `kt-kernel`: https://lmsys.org/blog/2025-10-22-KTransformers/

This version is packed specifically for NUMA tensor parallel = 4.
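
Because the weights are packed for a fixed NUMA layout, it may help to check the host before loading them. The following is a minimal sketch, not part of `sglang` or `kt-kernel`: it assumes a Linux host that exposes NUMA nodes under `/sys/devices/system/node`, and it assumes that "NUMA tensor parallel = 4" is meant to match the number of NUMA nodes on the machine. The names `EXPECTED_NUMA_NODES` and `count_numa_nodes` are hypothetical and chosen only for illustration.

```python
import os
import re

# Assumption: the packed weights expect this many NUMA nodes
# (value taken from the model card; the 1:1 mapping is a guess).
EXPECTED_NUMA_NODES = 4


def count_numa_nodes(sysfs_root="/sys/devices/system/node"):
    """Count NUMA nodes by listing node<N> directories in Linux sysfs.

    Returns 0 when the directory cannot be read (for example on a
    non-Linux host or when the kernel exposes no NUMA information).
    """
    try:
        entries = os.listdir(sysfs_root)
    except OSError:
        return 0
    # Only names such as node0, node1, ... count; other sysfs entries
    # in that directory (online, possible, ...) are ignored.
    return sum(1 for name in entries if re.fullmatch(r"node\d+", name))


if __name__ == "__main__":
    found = count_numa_nodes()
    if found == EXPECTED_NUMA_NODES:
        print(f"OK: {found} NUMA nodes detected")
    else:
        print(
            f"Warning: {found} NUMA nodes detected, "
            f"expected {EXPECTED_NUMA_NODES}"
        )
```

If the count differs from 4, a build of the weights packed for a different NUMA layout is probably a better fit; the linked blog post is the authoritative source for how the runtime is configured.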