Hey HN,
I'm Adithya, a 20-year-old dev from India. I have been working with GenAI for the past year, and I've found it really painful to deal with the many different forms of data out there and get the best representation of it for my AI applications.
That's why I built OmniParse—an open-source platform designed to handle any unstructured data and transform it into optimized, structured representations.
Key Features:
- Completely local processing: no external APIs
- Supports ~20 file types
- Converts documents, multimedia, and web pages to high-quality structured markdown
- Table extraction, image extraction/captioning, audio/video transcription, web page crawling
- Fits in a T4 GPU
- Easily deployable with Docker and Skypilot
- Colab friendly with an interactive UI powered by Gradio
Why OmniParse? I wanted a platform that could take any kind of data—documents, images, videos, audio files, web pages, and more—and make it clean and structured, ready for AI applications.
Check it out on GitHub: https://git.new/omniparse
Comments URL: https://news.ycombinator.com/item?id=40854733
Points: 25
# Comments: 4