Video Models
Video AI models on LORY
23 models available — compare each video model's strengths, including Voice Text Overlay and Image Animation, then try the best fit in your own story project.
Try LORY free — no subscription
Start with free welcome credits — we never ask for payment info during your trial. Pay only when you decide to top up after your credits run out.
HappyHorse-1.0 – Reference to Video
Shape a single video shot from several reference images with Happy Horse 1.0. It works well when you want tighter control over characters, styling, or scene details while still generating one finished clip with synced audio.
Wan 2.5 Preview – Image to Video
Image-to-video with 5s/10s outputs up to 1080p. Includes audio support.
Wan Pro – Image to Video
Wan 2.1 Pro image-to-video: ~6s 1080p @30fps from a source image.
Seedance 1.5 Pro – Image to Video
Seedance image-to-video with start/end frame conditioning, camera control, and native audio.
Veo 3.1 – Reference to Video
Animate up to 3 reference images with Veo 3.1; audio on by default.
Veo 3.1 Fast – First/Last Frame
Animate between first and last frame with Veo 3.1 Fast; audio on by default.
Veo 3.1 Lite – First/Last Frame
Animate between a first and last frame with Veo 3.1 Lite and optional native audio.
Veo 3.1 Lite – Image to Video
Animate a single image into a short Veo 3.1 Lite clip with optional native audio.
Kling 3.0 Standard – Image to Video
Image-to-video with 3-15s durations, optional native audio, and multi-element support.
Kling O1 – Reference Video to Video
Reference video to video that preserves camera language, motion, and optional source audio.
Kling O3 – Reference to Video
Reference-to-video with multi-image elements and optional native audio.
Kling 2.6 Pro – Image to Video + Audio
Higher fidelity image-to-video with optional native audio.
Kling O1 – Reference to Video
Reference-to-video that keeps characters consistent across multiple guide images
Kling 2.5 Turbo Pro – Image to Video
Fast 5–10s image-to-video with optional tail guidance
LTX-2.3 Pro – Image to Video
LTX-2.3 Pro image-to-video: higher fidelity with better motion stability and visual detail. Up to 10s, up to 4K, native audio. Best for final renders.
LTX-2.3 Fast – Image to Video
LTX-2.3 Fast image-to-video with native audio, up to 20s, and up to 4K resolution.
LTX-2 19B Distilled – Image to Video
Image-to-video with optional native audio; exposes frames/FPS/resolution controls.
Image Animation
Turn a still image into a moving shot with a chosen camera move. Useful for storyboard frames, concept stills, and reference images when you want simple motion from one image.
Voice Text Overlay
Turn a voice clip's transcript into a transparent text overlay timed to the spoken words. Useful for subtitles, lyric-style captions, and on-screen dialogue treatments you want to place over other footage.
InfiniteTalk – Image + Audio
Talking avatar from an image and audio. Lip-syncs to supplied speech with facial animation.
Vidu Q3 – Image to Video
Vidu Q3 image-to-video with optional end-frame transitions and native audio.
Grok Imagine – Image to Video
Image-to-video via xAI's Grok Imagine with native audio (1-15s, up to 720p).
Grok Imagine – Reference to Video
Build a single video shot from several reference images with Grok Imagine. It is a strong choice when you want to combine subject, setting, and style references in one clip, with synced audio included.