Model Browser

AI generation models on LORY

Browse 62 models across image, video, audio, and 3D from OpenAI, Google, ElevenLabs, ByteDance, Alibaba, and more, including Voice Text Overlay and Image Animation. Pick one to learn what it does best, then try it in your own story project.

Try LORY free — no subscription

Start with free welcome credits — we never ask for payment info during your trial. Pay only when you decide to top up after your credits run out.

Image models

27 models available

Image · Composite

Seedream 5.0 Lite – Composite

ByteDance Seedream 5.0 Lite multi-reference composition — blends up to 10 references with prompt-led restyling.

ByteDance · China

Image · Edit

Seedream 5.0 Lite – Edit

ByteDance Seedream 5.0 Lite single-image editing — prompt-led transformation with preserved composition.

ByteDance · China

Image · Composite

GPT Image 2 – Composite

Blend multiple reference images into a single cohesive scene with GPT Image 2. Inputs are processed at high fidelity.

OpenAI · US

Image · Edit

GPT Image 2 – Edit

Edit an image with GPT Image 2. Input images are processed at high fidelity for detailed, identity-aware revisions.

OpenAI · US

Image · Text-to-Image

GPT Image 2 – Generate

GPT Image 2 is a high-quality text-to-image model with strong prompt adherence, improved text rendering, and high-fidelity detail.

OpenAI · US

Image · Inpaint

GPT Image 2 – Inpaint

Mask-based inpainting with GPT Image 2 that preserves surrounding context and lighting.

OpenAI · US

Video models

23 models available

Video · Voice Text Overlay

Voice Text Overlay

Turn a voice clip's transcript into a transparent text overlay timed to the spoken words. Useful for subtitles, lyric-style captions, and on-screen dialogue treatments you want to place over other footage.

LORY · Netherlands

Video · Image Animation

Image Animation

Turn a still image into a moving shot with a chosen camera move. Useful for storyboard frames, concept stills, and reference images when you want simple motion from one image.

LORY · Netherlands

Video · Image-to-Video

Grok Imagine – Reference to Video

Build a single video shot from several reference images with Grok Imagine. It is a strong choice when you want to combine subject, setting, and style references in one clip, with synced audio included.

xAI · US

Video · Reference-to-Video

HappyHorse-1.0 – Reference to Video

Shape a single video shot from several reference images with HappyHorse-1.0. It works well when you want tighter control over characters, styling, or scene details while still generating one finished clip with synced audio.

Alibaba · China

Video · First/Last Frame to Video

Veo 3.1 Lite – First/Last Frame

Animate between a first and last frame with Veo 3.1 Lite and optional native audio.

Google · US

Video · Image-to-Video

Veo 3.1 Lite – Image to Video

Animate a single image into a short Veo 3.1 Lite clip with optional native audio.

Google · US

Audio models

9 models available

Audio · Text-to-Audio

ElevenLabs Music

ElevenLabs Eleven Music (music_v1) — full-track music generation with vocals or instrumental, multilingual singing, and 44.1 kHz studio-quality output.

ElevenLabs · US

Audio · Text-to-Audio

MiniMax Music 2.6

Full-track music generation with optional structured lyrics, vocal or instrumental output, and configurable audio settings.

MiniMax · China

Audio · Audio-to-Audio

Stable Audio 2.5 – Audio to Audio

Audio-to-audio transformation with prompt-driven restyling and a strength control to preserve or replace the source.

Stability AI · UK

Audio · Text-to-Speech

Eleven v3 – Text to Speech

ElevenLabs' most expressive TTS model — cinematic delivery, emotional range, and dramatic pacing. Ideal for trailers, narration, and character dialogue.

ElevenLabs · US

Audio · Text-to-Speech

Eleven Multilingual v2 – Text to Speech

High-quality multilingual text-to-speech by ElevenLabs with 21 preset voices, style control, and speed adjustment.

ElevenLabs · US

Audio · Text-to-Audio

ACE-Step – Text to Audio

3D models

3D Model · Image-to-3D

Tripo v2.5 – Image to 3D

Generate a high-quality 3D model from a single image.

Tripo AI · China

3D Model · Multi-view to 3D

Tripo v2.5 – Multiview to 3D

Generate high-quality 3D models from multi-view images (front, left, back, right).

Tripo AI · China

Image models

Seedream 5.0 Lite – Composite

Seedream 5.0 Lite – Edit

GPT Image 2 – Composite

GPT Image 2 – Edit

GPT Image 2 – Generate

GPT Image 2 – Inpaint

Video models

Voice Text Overlay

Image Animation

Grok Imagine – Reference to Video

HappyHorse-1.0 – Reference to Video

Veo 3.1 Lite – First/Last Frame

Veo 3.1 Lite – Image to Video

Audio models

ElevenLabs Music

MiniMax Music 2.6

Stable Audio 2.5 – Audio to Audio

Eleven v3 – Text to Speech

Eleven Multilingual v2 – Text to Speech

ACE-Step – Text to Audio

3D models

Meshy v6 – Image to 3D

Tripo v2.5 – Image to 3D

Tripo v2.5 – Multiview to 3D