Audio Models

Audio AI models on LORY

9 models available — compare each audio model's strengths, including ElevenLabs Music and MiniMax Music 2.6, then try the best fit in your own story project.

Try LORY free — no subscription

Start with free welcome credits — we never ask for payment info during your trial. Pay only when you decide to top up after your credits run out.

Try LORY free How it works

ACE Studio1 model

AudioText-to-Audio

ACE-Step – Text to Audio

Text-to-audio generations with optional structured lyrics and genre tags.

ACE Studio · ChinaView details

ElevenLabs3 models

AudioText-to-Speech

Eleven v3 – Text to Speech

ElevenLabs' most expressive TTS model — cinematic delivery, emotional range, and dramatic pacing. Ideal for trailers, narration, and character dialogue.

ElevenLabs · USView details

AudioText-to-Audio

ElevenLabs Music

ElevenLabs Eleven Music (music_v1) — full-track music generation with vocals or instrumental, multilingual singing, and 44.1 kHz studio-quality output.

ElevenLabs · USView details

AudioText-to-Speech