Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs Paper • 2310.16355 • Published Oct 25, 2023
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 37
Toward Inference-optimal Mixture-of-Expert Large Language Models Paper • 2404.02852 • Published Apr 3, 2024
LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch Paper • 2501.07124 • Published Jan 13
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17 • 49
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published about 1 month ago • 118
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published about 1 month ago • 118
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset Paper • 2309.11998 • Published Sep 21, 2023 • 25