Model Browser

AI generation models on LORY

Browse 62 models across image, video, audio, and 3D — from OpenAI, Google, ElevenLabs, Bytedance, Alibaba and more, including Voice Text Overlay and Image Animation. Pick one to learn what it does best, then try it in your own story project.

Try LORY free — no subscription

Start with free welcome credits — we never ask for payment info during your trial. Pay only when you decide to top up after your credits run out.

ImageComposite

Seedream 5.0 Lite – Composite

ByteDance Seedream 5.0 Lite multi-reference composition — blends up to 10 references with prompt-led restyling.

ByteDance · ChinaView details
ImageEdit

Seedream 5.0 Lite – Edit

ByteDance Seedream 5.0 Lite single-image editing — prompt-led transformation with preserved composition.

ByteDance · ChinaView details
ImageComposite

GPT Image 2 – Composite

Blend multiple reference images into a single cohesive scene with GPT Image 2. Inputs are processed at high fidelity.

OpenAI · USView details
ImageEdit

GPT Image 2 – Edit

Edit an image with GPT Image 2. Input images are processed at high fidelity for detailed, identity-aware revisions.

OpenAI · USView details
ImageText-to-Image

GPT Image 2 – Generate

GPT Image 2 is a high-quality text-to-image model with strong prompt adherence, improved text rendering, and high-fidelity detail.

OpenAI · USView details
ImageInpaint

GPT Image 2 – Inpaint

Mask-based inpainting with GPT Image 2 that preserves surrounding context and lighting.

OpenAI · USView details
VideoVoice Text Overlay

Voice Text Overlay

Turn a voice clip's transcript into a transparent text overlay timed to the spoken words. Useful for subtitles, lyric-style captions, and on-screen dialogue treatments you want to place over other footage.

LORY · NetherlandsView details
VideoImage Animation

Image Animation

Turn a still image into a moving shot with a chosen camera move. Useful for storyboard frames, concept stills, and reference images when you want simple motion from one image.

LORY · NetherlandsView details
VideoImage-to-Video

Grok Imagine – Reference to Video

Build a single video shot from several reference images with Grok Imagine. It is a strong choice when you want to combine subject, setting, and style references in one clip, with synced audio included.

xAI · USView details
VideoReference-to-Video

HappyHorse-1.0 – Reference to Video

Shape a single video shot from several reference images with Happy Horse 1.0. It works well when you want tighter control over characters, styling, or scene details while still generating one finished clip with synced audio.

Alibaba · ChinaView details
VideoFirst/Last Frame to Video

Veo 3.1 Lite – First/Last Frame

Animate between a first and last frame with Veo 3.1 Lite and optional native audio.

Google · USView details
VideoImage-to-Video

Veo 3.1 Lite – Image to Video

Animate a single image into a short Veo 3.1 Lite clip with optional native audio.

Google · USView details
AudioText-to-Audio

ElevenLabs Music

ElevenLabs Eleven Music (music_v1) — full-track music generation with vocals or instrumental, multilingual singing, and 44.1 kHz studio-quality output.

ElevenLabs · USView details
AudioText-to-Audio

MiniMax Music 2.6

Full-track music generation with optional structured lyrics, vocal or instrumental output, and configurable audio settings.

MiniMax · ChinaView details
AudioAudio-to-Audio

Stable Audio 2.5 – Audio to Audio

Audio-to-audio transformation with prompt-driven restyling and a strength control to preserve or replace the source.

Stability AI · UKView details
AudioText-to-Speech

Eleven v3 – Text to Speech

ElevenLabs' most expressive TTS model — cinematic delivery, emotional range, and dramatic pacing. Ideal for trailers, narration, and character dialogue.

ElevenLabs · USView details
AudioText-to-Speech

Eleven Multilingual v2 – Text to Speech

High-quality multilingual text-to-speech by ElevenLabs with 21 preset voices, style control, and speed adjustment.

ElevenLabs · USView details
AudioText-to-Audio

ACE-Step – Text to Audio

Text-to-audio generations with optional structured lyrics and genre tags.

ACE Studio · ChinaView details

3D Model models

3 models available

3D ModelImage-to-3D

Meshy v6 – Image to 3D

Generate a 3D model from a single image using Meshy v6. Supports PBR, quad/triangle topology, and optional texture prompts.

Meshy · USView details
3D ModelImage-to-3D

Tripo v2.5 – Image to 3D

Generate a high-quality 3D model from a single image

Tripo AI · ChinaView details
3D ModelMulti-view to 3D

Tripo v2.5 – Multiview to 3D

Generate high-quality 3D models from multi-view images (front, left, back, right)

Tripo AI · ChinaView details