Video Models

Video AI models on LORY

23 models available — compare each video model's strengths, including Voice Text Overlay and Image Animation, then try the best fit in your own story project.

Try LORY free — no subscription

Start with free welcome credits — we never ask for payment info during your trial. Pay only when you decide to top up after your credits run out.

Alibaba3 models
VideoReference-to-Video

HappyHorse-1.0 – Reference to Video

Shape a single video shot from several reference images with Happy Horse 1.0. It works well when you want tighter control over characters, styling, or scene details while still generating one finished clip with synced audio.

Alibaba · ChinaView details
VideoImage-to-Video

Wan 2.5 Preview – Image to Video

Image-to-video with 5s/10s outputs up to 1080p. Includes audio support.

Alibaba · ChinaView details
VideoImage-to-Video

Wan Pro – Image to Video

Wan 2.1 Pro image-to-video: ~6s 1080p @30fps from a source image.

Alibaba · ChinaView details
ByteDance1 model
VideoImage-to-Video

Seedance 1.5 Pro – Image to Video

Seedance image-to-video with start/end frame conditioning, camera control, and native audio.

ByteDance · ChinaView details
Google4 models
VideoReference-to-Video

Veo 3.1 – Reference to Video

Animate up to 3 reference images with Veo 3.1; audio on by default.

Google · USView details
VideoFirst/Last Frame to Video

Veo 3.1 Fast – First/Last Frame

Animate between first and last frame with Veo 3.1 Fast; audio on by default.

Google · USView details
VideoFirst/Last Frame to Video

Veo 3.1 Lite – First/Last Frame

Animate between a first and last frame with Veo 3.1 Lite and optional native audio.

Google · USView details
VideoImage-to-Video

Veo 3.1 Lite – Image to Video

Animate a single image into a short Veo 3.1 Lite clip with optional native audio.

Google · USView details
Kuaishou6 models
VideoImage-to-Video

Kling 3.0 Standard – Image to Video

Image-to-video with 3-15s durations, optional native audio, and multi-element support.

Kuaishou · ChinaView details
VideoReference-to-Video

Kling O1 – Reference Video to Video

Reference video to video that preserves camera language, motion, and optional source audio.

Kuaishou · ChinaView details
VideoReference-to-Video

Kling O3 – Reference to Video

Reference-to-video with multi-image elements and optional native audio.

Kuaishou · ChinaView details
VideoImage-to-Video

Kling 2.6 Pro – Image to Video + Audio

Higher fidelity image-to-video with optional native audio.

Kuaishou · ChinaView details
VideoReference-to-Video

Kling O1 – Reference to Video

Reference-to-video that keeps characters consistent across multiple guide images

Kuaishou · ChinaView details
VideoImage-to-Video

Kling 2.5 Turbo Pro – Image to Video

Fast 5–10s image-to-video with optional tail guidance

Kuaishou · ChinaView details
Lightricks3 models
VideoImage-to-Video

LTX-2.3 Pro – Image to Video

LTX-2.3 Pro image-to-video: higher fidelity with better motion stability and visual detail. Up to 10s, up to 4K, native audio. Best for final renders.

Lightricks · IsraelView details
VideoImage-to-Video

LTX-2.3 Fast – Image to Video

LTX-2.3 Fast image-to-video with native audio, up to 20s, and up to 4K resolution.

Lightricks · IsraelView details
VideoImage-to-Video

LTX-2 19B Distilled – Image to Video

Image-to-video with optional native audio; exposes frames/FPS/resolution controls.

Lightricks · IsraelView details
LORY2 models
VideoImage Animation

Image Animation

Turn a still image into a moving shot with a chosen camera move. Useful for storyboard frames, concept stills, and reference images when you want simple motion from one image.

LORY · NetherlandsView details
VideoVoice Text Overlay

Voice Text Overlay

Turn a voice clip's transcript into a transparent text overlay timed to the spoken words. Useful for subtitles, lyric-style captions, and on-screen dialogue treatments you want to place over other footage.

LORY · NetherlandsView details
MeiGen-AI1 model
VideoImage-to-Video

InfiniteTalk – Image + Audio

Talking avatar from an image and audio. Lip-syncs to supplied speech with facial animation.

MeiGen-AI · ChinaView details
ShengShu Technology1 model
VideoImage-to-Video

Vidu Q3 – Image to Video

Vidu Q3 image-to-video with optional end-frame transitions and native audio.

ShengShu Technology · ChinaView details
xAI2 models
VideoImage-to-Video

Grok Imagine – Image to Video

Image-to-video via xAI's Grok Imagine with native audio (1-15s, up to 720p).

xAI · USView details
VideoImage-to-Video

Grok Imagine – Reference to Video

Build a single video shot from several reference images with Grok Imagine. It is a strong choice when you want to combine subject, setting, and style references in one clip, with synced audio included.

xAI · USView details