Show HN: I created a PoC for live descriptions of the surroundings for the blind

The difference in cost between products that are developed as accessibility tools compared to consumer products is huge. One example is camera glasses where the accessibility product costs ~$3000 (Envision Glasses), and the consumer product costs ~$300 (Ray-Ban Meta).

In this case the Ray-Ban Meta is getting accessibility features. The functionality is promising according to reviews, but requires the user to say "Hey meta what am I looking at" every time a scene is to be described. The battery life seem underwhelming as well.

It would be nice to have an cheap and open source alternative to the currently available products, where the user gets fed information rather than continuously requesting it. This is where I got interested to see if I could create a solution using an ESP32 WiFi camera, and learn some arduino development in the process.

I managed to create a solution where the camera connects to the phone "personal hotspot", and publishes an image every 7 seconds to an online server, which then uses the gpt-4o-mini model to describe the image and update a web page, that is read back to the user using voice synthesis. The latency for this is less than 2 seconds, and is generally faster.

I am happy with the result and learnt a lot, but I think I will pause this project for now. At least until some shiny new tech emerges (cheaper open source camera glasses).


Comments URL: https://news.ycombinator.com/item?id=42593919

Points: 14

# Comments: 2

https://github.com/o40/seesay

Created 3mo | Jan 5, 2025, 1:20:06 AM


Login to add comment

Other posts in this group

Recommendation for non-DOS/Unix open source OS outside x86/X64

I know some interesting non-DOS/UNIX-based open source OS like HelenOS, MenuetOS, or Kolibri. They are X86/X64 only, though.

Are there similar things say for ARM/RISC/etc ?


Comments

Mar 26, 2025, 5:40:10 AM | Hacker news
Show HN: A website for sharing the "Good, Bad, and Why"s of urban spaces

Hello HN! We're a small team in Kyoto building a website called dédédé (https://dedede.de/en) that invites people to share the various positives,

Mar 26, 2025, 5:40:07 AM | Hacker news