Stable Audio 2.5 – Audio to Audio
Audio-to-audio model by Stability AI (UK).
Restyles a source clip toward a target-sound prompt, with a strength slider that balances how much of the source is preserved against how freely the prompt reshapes it.
Practical specs for planning a generation in LORY. These details come from the model contract we use when routing a request.
Try Stable Audio 2.5 – Audio to Audio on LORY
Start with free welcome credits: no subscription is required, and we never ask for payment information during your trial. You pay only when you decide to top up.
Stable Audio 2.5
Text-to-audio generations for full-length music and SFX (up to ~3 minutes).
MiniMax Music 2.6
Full-track music generation with optional structured lyrics, vocal or instrumental output, and configurable audio settings.
ACE-Step – Text to Audio
Text-to-audio generations with optional structured lyrics and genre tags.
ElevenLabs Music
ElevenLabs Eleven Music (music_v1): full-track music generation with vocal or instrumental output, multilingual singing, and 44.1 kHz studio-quality audio.
Eleven Multilingual v2 – Text to Speech
High-quality multilingual text-to-speech by ElevenLabs with 21 preset voices, style control, and speed adjustment.
Chatterbox – Speech to Speech
Voice conversion from a source clip with an optional target voice reference.