Hey HN — We're Jacky and Mac from Trellis (https://runtrellis.com/). We’re building AI-powered ETL for unstructured data. Trellis transforms phone calls, PDFs, and chats into structured SQL format based on any schema you define in natural language. This helps data and ops teams automate manual data entry and run SQL queries on messy data.
There’s a demo video at " rel="nofollow">
Why we built this: At the Stanford AI lab where we met, we collaborated with many F500 data teams (including Amazon, Meta, and Standard Chartered), and repeatedly saw the same problem: 80% of enterprise data is unstructured, and traditional platforms can’t handle it. For example, a major commercial bank I work with couldn’t improve credit risk models because critical data was stuck in PDFs and emails.
We realized that our research from the AI lab could be turned into a solution with an abstraction layer that works as well for financial underwriting as it does for analysis of call center transcripts: an AI-powered ETL that takes in any unstructured data source and turns it into a schematically correct table.
Some interesting technical challenges we had to tackle along the way: (1) Supporting complex documents out of the box: We use LLM-based map-reduce to handle long documents and vision models for table and layout extraction. (2) Model Routing: We select the best model for each transformation to optimize cost and speed. For instance, in data extraction tasks, we could leverage simpler fine-tuned models that are specialized in returning structured JSONs of financial tables. (3) Data Validation and Schema Guarantees: We ensure accuracy with reference links and anomaly detection.
After launching Trellis, we’ve seen diverse use cases, especially in legacy industries where PDFs are treated as APIs. For example, financial services companies need to process complex documents like bonds and credit ratings into a structured format, and need to speed up underwriting and enable pass-through loan processing. Customer support and back-office operations need to accelerate onboarding by mapping documents across different schema and ERP systems, and ensure support agents follow SOPs (security questions, compliance disclosures, etc.). And many companies today want data preprocessing in ETL pipelines and data ingestion for RAG.
We’d love your feedback! Try it out at https://demo.runtrellis.com/. To save and track your large data transformations, you can visit our dashboard and create an account at https://dashboard.runtrellis.com/. If you’re interested in integrating with our APIs, our quick start docs are here: https://docs.runtrellis.com/docs/getting-started. If you have any specific use cases in mind, we’d be happy to do a custom integration and onboarding—anything for HN. :)
Excited to hear about your experience wrangling with unstructured data in the past, workflows you want to automate, and what data integration you would like to see.
Comments URL: https://news.ycombinator.com/item?id=41236273
Points: 43
# Comments: 17
Login to add comment
Other posts in this group
![Migraine is more than a headache – a rethink offers hope](https://www.cdn5.niftycent.com/a/1/V/5/b/v/j/migraine-is-more-than-a-headache-a-rethink-offers-hope.webp)
Article URL: https://www.nature.com/articles/d41586-025-00456-x
![File Pilot: A file explorer built for speed with a modern, robust interface](https://www.cdn5.niftycent.com/a/D/m/8/0/p/b/file-pilot-a-file-explorer-built-for-speed-with-a-modern-robust-interface.webp)
Article URL: https://filepilot.tech/
Comments URL: https://news.ycombinator.com/item?id=4309146
![One year after switching from Java to Go](https://www.cdn5.niftycent.com/a/D/Z/3/l/7/3/one-year-after-switching-from-java-to-go.webp)
Article URL: https://glasskube.dev/blog/from-java-to-go/
Comments URL: http
![South Korean regulator accuses DeepSeek of sharing user data with ByteDance](https://www.cdn5.niftycent.com/a/k/z/7/Y/w/6/south-korean-regulator-accuses-deepseek-of-sharing-user-data-with-bytedance.webp)
Article URL: https://www.bbc.com/news/articles/c4gex0x87g4o
Article URL: https://broot.ca/kafka-at-the-low-end.html
Comments URL: https: