@doladoo both versions 1 and 2. @merve from my testing, it didn't lose its multilingual skills; in fact, it made them much better. I tested both Qwen2.5-VL and OlmOCR, and the latter is the best on Arabic text.
If only this had come out last week! I spent the last week learning about and benchmarking all of these plus some extra models, and I wanna point out a correction. OlmOCR isn't an English-only model; in fact, it produced the best results across all VLM and non-VLM frameworks on my Arabic-language corpus.