Character Encoding and UTF-8




Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual)

Hi HN,

I’ve been working on an OCR pipeline specifically optimized for machine learning dataset preparation. It’s designed to process complex academic materials — including math formulas, tables

Apr 5, 2025, 6:50:06 | Hacker News