A Qualcomm expert breaks down some of the tools and techniques they use to fit GenAI models on a smartphone.
The post Fitting AI models in your pocket with quantization appeared first on Stack Overflow Blog.
https://stackoverflow.blog/2023/08/23/fitting-ai-models-in-your-pocket-with-quantization/
Connectez-vous pour ajouter un commentaire
Autres messages de ce groupe
![Why build your own vector DB? To process 25,000 images per second](https://www.cdn5.niftycent.com/a/D/y/Y/4/l/x/why-build-your-own-vector-db-to-process-25-000-images-per-second.webp)
Ben and Ryan chat with Babak Behzad, senior engineering manager at Verkada, about running a pipeline that vectorizes 25,000 images per second into a custom-built vector database. They discuss whether
![Investing in the Stack Exchange Network and the future of Stack Overflow](https://www.cdn5.niftycent.com/a/1/B/q/G/b/g/investing-in-the-stack-exchange-network-and-the-future-of-stack-overflow.webp)
Mark your calendars to learn more about Stack’s Future—Feb 26th. https://stackoverflow.blog/2025/02/06/investing-in-the-stack-exchange-network-and-the-future-of-stack-overflow/
![Will the web ever be the primary delivery system for 3D games?](https://www.cdn5.niftycent.com/a/k/M/r/7/0/r/will-the-web-ever-be-the-primary-delivery-system-for-3d-games.webp)
Jaime Torrealba, a frontend developer currently at Push Security, joins Ryan to talk about 3D graphics and web development. Their conversation ranges from the evolution of technologies like WebGL and
![Community Products Roadmap Update, January 2025](https://www.cdn5.niftycent.com/a/D/y/Y/Z/p/q/community-products-roadmap-update-january-2025.webp)
An update on recent launches and the upcoming roadmap. https://stackoverflow.blog/2025/02/03/community-products-roadmap-update-january-2025/
![Feature flags: Theory meets reality](https://www.cdn5.niftycent.com/a/k/A/r/0/6/2/feature-flags-theory-meets-reality.webp)
Ryan is joined by Fynn Glover (CEO) and Ben Papillon (CTO), cofounders of Schematic, for a conversation about managing feature flags in software development. They explore theoretical and practical app
![New year, new features: Level up your Stack Overflow for Teams in 2025](https://www.cdn5.niftycent.com/a/e/a/a/y/q/M/new-year-new-features-level-up-your-stack-overflow-for-teams-in-2025.webp)
The first release of the year is packed with features to make your knowledge-sharing community better. https://stackoverflow.blog/2025/01/29/new-year-new-features-level-up-your-stack-overflow-for-tea
![How engineering teams can thrive in 2025](https://www.cdn5.niftycent.com/a/e/7/v/G/4/v/how-engineering-teams-can-thrive-in-2025.webp)
New year, new approach. https://stackoverflow.blog/2025/01/28/how-engineering-teams-can-thrive-in-2025/