Voice Text Overlay
Voice Text Overlay model by LORY — Netherlands.
Turn a voice clip into a transparent subtitle overlay you can place over your video. The text stays timed to the spoken words, making it useful for captions, lyric-style text, and other on-screen dialogue treatments.
This is a LORY in-house overlay renderer. It uses speech timing from the voice clip to create a transparent video layer, so you can combine readable timed text with generated footage without baking captions directly into the base video.
Practical specs for planning a generation in LORY. These details come from the model contract we use when routing a request.
Try Voice Text Overlay on LORY
Start with free welcome credits — no subscription, and we never ask for payment info during your trial. Pay only when you decide to top up.
Image Animation
Turn a still image into a moving shot with a chosen camera move. Useful for storyboard frames, concept stills, and reference images when you want simple motion from one image.
Grok Imagine – Reference to Video
Build a single video shot from several reference images with Grok Imagine. It is a strong choice when you want to combine subject, setting, and style references in one clip, with synced audio included.
HappyHorse-1.0 – Reference to Video
Shape a single video shot from several reference images with Happy Horse 1.0. It works well when you want tighter control over characters, styling, or scene details while still generating one finished clip with synced audio.
Veo 3.1 Lite – First/Last Frame
Animate between a first and last frame with Veo 3.1 Lite and optional native audio.
Veo 3.1 Lite – Image to Video
Animate a single image into a short Veo 3.1 Lite clip with optional native audio.
LTX-2.3 Fast – Image to Video
LTX-2.3 Fast image-to-video with native audio, up to 20s, and up to 4K resolution.