Meta Newsroom:
Takeaways
- SAM Audio is the first unified AI model that can segment sound from complex audio mixtures using text, visual, and time span prompts.
- This technology has the potential to transform audio and video editing and drive innovation in areas like music, podcasting, television, film, scientific research, accessibility, and more.
- You can download SAM Audio or explore its capabilities in the Segment Anything Playground today.
This intuitive approach mirrors how people naturally engage with sound, making professional-grade audio separation more accessible and easier than ever before. SAM Audio has the potential to transform audio and video editing and drive innovation in areas like music, podcasting, television, film, scientific research, accessibility, and more.
Until now, audio segmentation and editing has been a fragmented space, with a variety of tools designed for single-purpose use cases. As a unified model, SAM Audio is the first to support use cases that match how people naturally think about audio, and achieves cutting-edge performance across diverse, real-world scenarios. SAM Audio supports three kinds of prompts:
- Text prompting: Type “dog barking” or “singing voice” to extract specific sounds.
- Visual prompting: Click on the person or object in the video that’s making a sound to isolate their audio.
- Span prompting: An industry first, this method lets you mark time segments where target audio occurs.
You can try SAM Audio in the Segment Anything Playground, our new platform that enables anyone to try our latest models. Starting today, people can select from our collection of audio and video assets or upload their own to explore the capabilities of SAM Audio. The model is also available for download.
We’re excited to bring audio to the Segment Anything collection of models and we believe SAM Audio is the all-around best audio separation model available. Learn more about SAM Audio and try it on the Segment Anything Playground today.
Source:
Our New SAM Audio Model Transforms Audio Editing
We're introducing SAM Audio, a state-of-the-art AI model that enables you to segment sound.









