Show HN: I made a website to semantically search ArXiv papers

As a grad student (and an ADHDer), I had trouble doing literature reviews systematically. To combat this, I made a website that finds similar papers based on the meaning of what I am searching for.

I used MixedBread's [1] embedding model to generate vectors from the abstracts. I store the vectors and search for similar ones using Milvus [2], and use Gradio [3] to serve the frontend. I update the vector database weekly by pulling the metadata dataset from Kaggle [4].
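A minimal sketch of the weekly refresh step, not the site's actual code. It assumes the Kaggle snapshot is a JSON Lines file whose records carry `id`, `abstract`, and `update_date` fields, and that only records updated in the last seven days need re-embedding. The post does not say how the update is done, so the date filter and the helper name `recent_abstracts` are illustrative.

```python
import io
import json
from datetime import date, timedelta


def recent_abstracts(lines, since):
    """Yield (arxiv_id, abstract) for records updated on or after `since`."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        # Assumed field names: "id", "abstract", "update_date" (ISO date string).
        if date.fromisoformat(record["update_date"]) >= since:
            # Collapse hard line breaks inside the abstract into single spaces.
            yield record["id"], " ".join(record["abstract"].split())


# Two made-up records standing in for the real snapshot file.
sample = io.StringIO(
    '{"id": "0000.00001", "abstract": "An old\\n abstract.", "update_date": "2020-01-01"}\n'
    '{"id": "0000.00002", "abstract": "A fresh\\n abstract.", "update_date": "2024-12-20"}\n'
)
cutoff = date(2024, 12, 25) - timedelta(days=7)
fresh = list(recent_abstracts(sample, cutoff))
```

With the real dataset, `sample` would be replaced by an open file handle, and each yielded abstract would be passed to the embedding model before being written to the vector store.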

To speed up the search process on my free Oracle instance, I binarise the embeddings and use Hamming distance as the distance metric.
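To illustrate the idea, here is a small pure-Python sketch of binarised search. It assumes each dimension is thresholded at zero (the post does not state the binarisation rule) and uses toy 8-dimensional vectors. The function names and paper ids are hypothetical, and the real site presumably delegates this work to Milvus.

```python
def binarise(vector):
    """Pack a float vector into an int: bit i is 1 when vector[i] > 0."""
    bits = 0
    for i, value in enumerate(vector):
        if value > 0:  # assumed threshold; other rules are possible
            bits |= 1 << i
    return bits


def hamming(a, b):
    """Count the bit positions where two packed codes differ."""
    return bin(a ^ b).count("1")


def search(query_code, codes, top_k=3):
    """Return the top_k (distance, paper_id) pairs closest to the query."""
    scored = [(hamming(query_code, code), paper_id) for paper_id, code in codes.items()]
    return sorted(scored)[:top_k]


# Toy "embeddings"; real ones would come from the embedding model.
corpus = {
    "paper-a": [0.9, -0.2, 0.4, -0.7, 0.1, -0.3, 0.8, -0.5],
    "paper-b": [-0.6, 0.3, -0.1, 0.5, -0.9, 0.2, -0.4, 0.7],
    "paper-c": [0.7, -0.1, 0.5, 0.6, 0.2, -0.8, 0.3, -0.2],
}
# Binarise the corpus once, up front, so each query only needs XOR and a bit count.
codes = {paper_id: binarise(vec) for paper_id, vec in corpus.items()}

query = [0.8, -0.3, 0.2, -0.6, 0.3, -0.1, 0.9, -0.4]
results = search(binarise(query), codes, top_k=2)
print(results)  # [(0, 'paper-a'), (1, 'paper-c')]
```

Each stored code takes one bit per dimension instead of a 32-bit float, and comparing two codes is a single XOR plus a bit count. That is the trade that makes this approach attractive on a small instance, at the cost of some ranking precision.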

I would love your feedback on the site :) Happy Holidays!

[1]: https://www.mixedbread.ai/docs/embeddings/mxbai-embed-large-...
[2]: https://milvus.io/
[3]: https://www.gradio.app/
[4]: https://www.kaggle.com/datasets/Cornell-University/arxiv

Comments URL: https://news.ycombinator.com/item?id=42507116

Points: 14

# Comments: 0

https://papermatch.mitanshu.tech/

Created 4mo | Dec 25, 2024, 10:10:08 AM

