Show HN: Tune Llama 3.1 on Google Cloud TPUs

Hey HN, we wanted to share our repo where we fine-tuned Llama 3.1 on Google TPUs. We’re building AI infra to fine-tune and serve LLMs on non-NVIDIA hardware (TPUs, Trainium, AMD GPUs).

The problem: Right now, 90% of LLM workloads run on NVIDIA GPUs, but there are equally powerful and more cost-effective alternatives out there. For example, training and serving Llama 3.1 on Google TPUs is about 30% cheaper than on NVIDIA GPUs.

But developer tooling for non-NVIDIA chipsets is lacking. We felt this pain ourselves. We initially tried using PyTorch XLA to train Llama 3.1 on TPUs, but it was rough: the XLA integration with PyTorch is clunky, libraries are missing (bitsandbytes didn't work), and the HuggingFace errors are cryptic.

We then took a different route and translated Llama 3.1 from PyTorch to JAX. Now, it’s running smoothly on TPUs! We still have challenges ahead, such as the lack of a good LoRA library in JAX, but this feels like the right path forward.
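For readers who haven't used LoRA, here is a minimal sketch of the idea in plain JAX. It is not taken from the felafax repo: the function names (`init_lora_params`, `lora_dense`, `loss_fn`), the toy shapes, and the `alpha` scaling value are all illustrative assumptions. It only shows a frozen weight plus a trainable low-rank update, with gradients taken for the low-rank factors alone.

```python
import jax
import jax.numpy as jnp


def init_lora_params(key, in_dim, out_dim, rank):
    """Create LoRA factors (hypothetical helper).

    A starts as small random values and B as zeros, so the low-rank
    update A @ B is exactly zero before any training step.
    """
    a = jax.random.normal(key, (in_dim, rank)) * 0.01
    b = jnp.zeros((rank, out_dim))
    return {"a": a, "b": b}


def lora_dense(frozen_w, lora, x, alpha=16.0):
    """Apply the frozen weight plus the scaled low-rank update.

    alpha=16.0 is an assumed default, not a value from the post.
    """
    rank = lora["a"].shape[1]
    return x @ frozen_w + (alpha / rank) * ((x @ lora["a"]) @ lora["b"])


def loss_fn(lora, frozen_w, x, y):
    """Mean squared error on a toy regression target."""
    pred = lora_dense(frozen_w, lora, x)
    return jnp.mean((pred - y) ** 2)


# Toy data: an 8 -> 4 dense layer with a rank-2 adapter.
key_w, key_lora, key_x = jax.random.split(jax.random.PRNGKey(0), 3)
frozen_w = jax.random.normal(key_w, (8, 4))
lora = init_lora_params(key_lora, in_dim=8, out_dim=4, rank=2)
x = jax.random.normal(key_x, (3, 8))
y = jnp.zeros((3, 4))

# jax.grad differentiates with respect to the first argument only,
# so frozen_w receives no gradient and stays untouched.
grads = jax.grad(loss_fn)(lora, frozen_w, x, y)
```

Because `jax.grad` differentiates only its first argument by default, keeping the adapter in its own pytree is enough to freeze the base weights; a real setup would also need to thread the adapters through every attention and MLP projection, which is the part a dedicated library would handle.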

Here's a demo (https://dub.sh/felafax-demo) of our managed solution.

Would love your thoughts on our repo and vision as we keep chugging along!

Comments URL: https://news.ycombinator.com/item?id=41512142

Points: 34

# Comments: 3

https://github.com/felafax/felafax

Posted 6mo ago | Sep 11, 2024, 19:50:06

