TON Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models. kolerk/TON-3B-AITZ Image-Text-to-Text • 4B • Updated Jul 14 • 7 kolerk/TON-3B-CLEVR Image-Text-to-Text • 4B • Updated Jul 14 • 6 kolerk/TON-3B-Math Image-Text-to-Text • 4B • Updated Jul 14 • 6 kolerk/TON-7B-Math Image-Text-to-Text • 8B • Updated Jul 14 • 3
TON Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models. kolerk/TON-3B-AITZ Image-Text-to-Text • 4B • Updated Jul 14 • 7 kolerk/TON-3B-CLEVR Image-Text-to-Text • 4B • Updated Jul 14 • 6 kolerk/TON-3B-Math Image-Text-to-Text • 4B • Updated Jul 14 • 6 kolerk/TON-7B-Math Image-Text-to-Text • 8B • Updated Jul 14 • 3