
Open-source generative audio tools by Stability AI
Harmonai is a Stability AI lab releasing open-source generative audio tools to make music production more accessible. Its flagship model, Dance Diffusion, uses diffusion technology to generate novel audio samples, enable style transfer, and interpolate between sounds. All outputs under the base models are released under CC0 license for unrestricted use.
Generative audio model that creates novel sound samples using diffusion technology, producing 1-3 second audio clips
Transform existing audio recordings by applying the style and characteristics of trained models
Blend and morph between two audio files to create unique transitional sounds
Describe the type of sound you want using text prompts with CLIP-like text encoding
Train your own models on specific audio datasets for personalized sound generation
Run models directly in Google Colab notebooks without local GPU requirements
Developer API for building custom applications and integrations on top of Harmonai models
Generate unique, never-before-heard samples and textures for electronic music, film scores, and sound libraries
Quickly explore sonic possibilities by interpolating between sounds or applying style transfer to existing recordings
Build custom royalty-free sample packs using CC0 licensed outputs for commercial distribution or personal use
Study and experiment with generative audio models in an open-source environment for academic or commercial R&D
All sounds generated with base models are released under Creative Commons Zero for unrestricted commercial use

Open-source, AI-first business automation