Hi HN!
My latest side project is knowledge graph that maps the French culinary network using data extracted from restaurant reviews from LeFooding.com. The project uses LLMs to extract structured information from unstructured text.
Some technical aspects you may be interested in:
- Used structured generation to reliably parse unstructured text into a consistent schema
- Tested multiple models (Mistral-7B-v0.3, Llama3.2-3B, gpt4o-mini) for information extraction
- Created an interactive visualization using gephi-lite and Retina (WebGL)
- Built (with Claude) a simple Flask web app to clean and deduplicate the data
- Total cost for inferencing 2000 reviews with gpt4o-mini: less than 1€!
You can explore the visualization here: [Interactive Culinary Network](https://ouestware.gitlab.io/retina/1.0.0-beta.4/#/graph/?url...)
The code for the project is available on GitHub: - Main project: https://github.com/theophilec/foudinge - Data cleaning tool: https://github.com/theophilec/foudinge-scrub
Happy to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43242818
Points: 49
# Comments: 19
Login to add comment
Other posts in this group

Article URL: https://thenewstack.io/nvidia-finally-adds-native-python-support-to-cuda/
Comments URL:
For popular threads with a lot of comments, collapsing a top comment leads to high CPU usage and can cause browsers (I tried in Chrome and Vivaldi) to freeze up.
For example, this very popular t

Article URL: https://lyra.horse/css-clicker/
Comments URL: https://news.ycombinator.com

