Audio Models

Audio AI models on LORY

9 models available — compare each audio model's strengths, including ElevenLabs Music and MiniMax Music 2.6, then try the best fit in your own story project.

Try LORY free — no subscription

Start with free welcome credits — we never ask for payment info during your trial. Pay only when you decide to top up after your credits run out.

ACE Studio1 model
AudioText-to-Audio

ACE-Step – Text to Audio

Text-to-audio generations with optional structured lyrics and genre tags.

ACE Studio · ChinaView details
ElevenLabs3 models
AudioText-to-Speech

Eleven v3 – Text to Speech

ElevenLabs' most expressive TTS model — cinematic delivery, emotional range, and dramatic pacing. Ideal for trailers, narration, and character dialogue.

ElevenLabs · USView details
AudioText-to-Audio

ElevenLabs Music

ElevenLabs Eleven Music (music_v1) — full-track music generation with vocals or instrumental, multilingual singing, and 44.1 kHz studio-quality output.

ElevenLabs · USView details
AudioText-to-Speech

Eleven Multilingual v2 – Text to Speech

High-quality multilingual text-to-speech by ElevenLabs with 21 preset voices, style control, and speed adjustment.

ElevenLabs · USView details
MiniMax1 model
AudioText-to-Audio

MiniMax Music 2.6

Full-track music generation with optional structured lyrics, vocal or instrumental output, and configurable audio settings.

MiniMax · ChinaView details
Resemble AI2 models
AudioSpeech-to-Speech

Chatterbox – Speech to Speech

Voice conversion from a source clip with an optional target voice reference.

Resemble AI · CanadaView details
AudioText-to-Speech

Chatterbox Turbo – Text to Speech

Turbo text-to-speech with preset voices and optional 5-10s voice cloning.

Resemble AI · CanadaView details
Stability AI2 models
AudioText-to-Audio

Stable Audio 2.5

Text-to-audio generations for full-length music and SFX (up to ~3 minutes).

Stability AI · UKView details
AudioAudio-to-Audio

Stable Audio 2.5 – Audio to Audio

Audio-to-audio transformation with prompt-driven restyling and a strength control to preserve or replace the source.

Stability AI · UKView details