Imagine crafting high-quality sound effects or audio samples on your smartphone—no internet, no delays, just pure creativity at your fingertips. Thanks to a groundbreaking partnership between Stability AI and Arm, this futuristic vision is now a reality. Together, they’ve unlocked the power of generative AI for mobile devices, enabling offline audio generation that’s faster and more accessible than ever before.
This isn’t just a step forward—it’s a leap into the future of content creation. And it’s all happening on the devices you already own.
The Power of Offline Generative Audio
Generative AI has been transforming industries, from music production to video editing, but until now, it’s been tethered to the cloud. Stability AI and Arm have shattered that limitation by enabling Stable Audio Open, their cutting-edge text-to-audio model, to run entirely on Arm CPUs—without needing an internet connection.
This means creators can generate sound effects, audio samples, and production elements in seconds, directly on their smartphones. Whether you’re a filmmaker on location or a musician sketching ideas on the go, this breakthrough ensures your creative flow isn’t interrupted by connectivity issues.
30x Faster: The Tech Behind the Magic
Optimizing generative AI for mobile devices was no small feat. Initially, generating a single audio clip on an Arm CPU took a sluggish 240 seconds. But by leveraging Arm’s KleidiAI libraries and advanced software stack, Stability AI slashed that time to under 8 seconds for an 11-second clip—a staggering 30x improvement.
The secret sauce? Arm’s int8 matmul kernels and ExecuTorch via XNNPack, which streamline the model’s performance on Armv9 CPUs. This optimization not only speeds up generation but also makes the technology accessible to anyone with a compatible device, eliminating the need for expensive, high-end hardware.
What’s Next? A Future of On-Device Creativity
Audio is just the beginning. Stability AI and Arm are already working to bring their generative models for images, video, and 3D to mobile devices. This partnership is a pivotal step toward a future where high-quality media generation happens directly on your phone, transforming how creators work across all visual and auditory mediums.
The possibilities are endless: imagine editing a video with AI-generated effects, designing 3D models, or composing a soundtrack—all on your smartphone, all offline.
See It in Action at MWC Barcelona
Curious to see this tech in action? Head to MWC Barcelona on March 3rd, 2025, where Stability AI and Arm will showcase real-world applications of generative media at the edge. From rapid audio generation to seamless integration into creative workflows, this demo promises to be a glimpse into the future of mobile creativity.