Audio Models

Audio AI models on LORY

10 audio models available. Explore each model's strengths, then try it in your own story project.

Ace (1 model)
Audio: Text-to-Audio

ACE-Step (text-to-audio)

Song-focused text-to-audio model that works well with structured style tags and lyric guidance.

China

Chatterbox (2 models)
Audio: Speech-to-Speech

Chatterbox Speech-to-Speech

Voice conversion model for transforming source speech while preserving delivery rhythm.

Canada

Audio: Text-to-Speech

Chatterbox Turbo (TTS)

Fast, lightweight TTS model for rapid voiceover drafts and iterative narration passes.

Canada

ElevenLabs (3 models)
Audio: Text-to-Speech

ElevenLabs Multilingual v2 (TTS)

High-quality multilingual TTS suited for polished narration and character voice lines.

US

Audio: Text-to-Audio

ElevenLabs Music

ElevenLabs Eleven Music (music_v1) — full-track music generation direct from ElevenLabs with vocal or instrumental output, multilingual singing, and 44.1…

US

Audio: Text-to-Speech

ElevenLabs v3 (TTS)

ElevenLabs v3 direct — the most expressive TTS model for cinematic trailers, dramatic narration, and emotional dialogue.

US

Minimax (1 model)
Audio: Text-to-Audio

Minimax Music 2.6

Full-track music generator with structured lyrics support — vocal or instrumental, configurable sample rate/format/bitrate, and lyric structure tags…

China

Sonauto (1 model)
Audio: Text-to-Music

Sonauto v2 (Text-to-Music)

Music generation model optimized for prompt-driven song ideas with strong style control.

South Korea

Stable (2 models)
Audio: Text-to-Audio

Stable Audio 2.5

General-purpose text-to-audio model for longer-form ambient, score, and sound-design outputs.

UK

Audio: Audio-to-Audio

Stable Audio 2.5 (Audio-to-Audio)

Audio-to-audio transformation model — restyle a source clip with a target-sound prompt and a strength slider to balance source preservation vs. prompt…

UK