Submitted by Tony Zhao 3 VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs Om AI Lab 2