MusicGen
5
About
A model designed for conditional music generation using a single-stage transformer that produces high-quality audio. It accepts text or melodic guides, offering flexible controls. Advanced techniques such as melody-guided generation and diffusion-based enhancements improve coherence and authenticity in stereophonic formats while efficiently managing long-term structures.
Percs
High-Quality Music
Text-to-Music
Melody-Guided
Stereo Support
Settings
BPM- Beats Per Minute. Set the tempo of a generation
Track duration- The duration of a generation
Model version- Different models make different sound.
Normalization Strategy- Strategy for normalizing audio
Temperature- The temperature of the model. Higher values make the model more creative and lower values make it more focused.
Top K- The number of top-ranked items to select from the output. Higher values result in more diverse outputs by considering more candidates.
Top P- Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.
Classifier Free Guidance- Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.