Collections

Discover the best community collections!

Collections including paper arXiv:2506.20512
VisionLM
Collection by 3 days ago
OctoThinker-Llama-8B Family
What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training.
reasoning llm
Collection by Oct 9
Psychology
Collection by Oct 3
VisionLM
Collection by 3 days ago
reasoning llm
Collection by Oct 9
OctoThinker-Llama-8B Family
What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training.
Psychology
Collection by Oct 3