Show HN: Qwen-2.5-32B is now the best open source OCR model

Last week was big for open source LLMs. We got:

- Qwen 2.5 VL (72b and 32b)

- Gemma-3 (27b)

- DeepSeek-v3-0324

And a couple of weeks ago we got the new mistral-ocr model. We updated our OCR benchmark to include the new models.

We evaluated 1,000 documents for JSON extraction accuracy. Major takeaways:

- Qwen 2.5 VL (72b and 32b) are by far the most impressive. Both landed right around 75% accuracy (equivalent to GPT-4o’s performance). Qwen 72b scored only 0.4% above 32b, which is within the margin of error.

- Both Qwen models passed mistral-ocr (72.2%), which is specifically trained for OCR.

- Gemma-3 (27B) only scored 42.9%. Particularly surprising given that its architecture is based on Gemini 2.0, which still tops the accuracy chart.
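
For a rough sense of what a field-level JSON extraction accuracy score can look like, here is a minimal Python sketch. It assumes accuracy is the share of ground-truth leaf fields that the prediction reproduces exactly. The function names (`flatten`, `json_accuracy`) and the sample invoice are hypothetical, and the scoring in the benchmark runner itself may differ.

```python
from typing import Any


def flatten(value: Any, prefix: str = "") -> dict[str, Any]:
    """Flatten nested dicts/lists into a {path: leaf_value} mapping."""
    if isinstance(value, dict):
        items = ((f"{prefix}.{k}" if prefix else str(k), v) for k, v in value.items())
    elif isinstance(value, list):
        items = ((f"{prefix}[{i}]", v) for i, v in enumerate(value))
    else:
        return {prefix: value}
    flat: dict[str, Any] = {}
    for path, child in items:
        flat.update(flatten(child, path))
    return flat


def json_accuracy(expected: Any, predicted: Any) -> float:
    """Assumed metric: 1 - (wrong or missing ground-truth fields / total ground-truth fields).

    Extra fields in the prediction are not penalized in this sketch.
    """
    truth = flatten(expected)
    guess = flatten(predicted)
    if not truth:
        return 1.0 if not guess else 0.0
    missing = object()  # sentinel so a missing path never equals a real value
    wrong = sum(1 for path, v in truth.items() if guess.get(path, missing) != v)
    return 1.0 - wrong / len(truth)


# Hypothetical ground truth and model output for one document.
expected = {
    "invoice": {"number": "INV-1", "total": 120.5},
    "items": [{"sku": "A"}, {"sku": "B"}],
}
predicted = {
    "invoice": {"number": "INV-1", "total": 125.0},
    "items": [{"sku": "A"}],
}

score = json_accuracy(expected, predicted)
print(f"{score:.2f}")  # 2 of 4 ground-truth fields match -> 0.50
```

Averaging a per-document score like this over all documents would give a single headline number per model, comparable to the percentages above.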

The data set and benchmark runner are fully open source. You can check out the code and reproduction steps here:

- https://getomni.ai/blog/benchmarking-open-source-models-for-...

- https://github.com/getomni-ai/benchmark

- https://huggingface.co/datasets/getomni-ai/ocr-benchmark

Comments URL: https://news.ycombinator.com/item?id=43549072

Points: 61

# Comments: 13

https://github.com/getomni-ai/benchmark/blob/main/README.md

Created 25d | Apr 1, 2025, 9:40:16 PM
