Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published Oct 20 • 66
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework Paper • 2510.04206 • Published Oct 5 • 2
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents Paper • 2508.14040 • Published Aug 19 • 3
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models Paper • 2412.11605 • Published Dec 16, 2024 • 18
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Paper • 2410.24024 • Published Oct 31, 2024 • 49
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Paper • 2411.02337 • Published Nov 4, 2024 • 36
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents Paper • 2408.06327 • Published Aug 12, 2024 • 17
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences Paper • 2306.07906 • Published Jun 13, 2023 • 13