VideoVoice Text Overlay

Voice Text Overlay

Voice Text Overlay model by LORY — Netherlands.

5/5 LORY rating
About this model

Turn a voice clip into a transparent subtitle overlay you can place over your video. The text stays timed to the spoken words, making it useful for captions, lyric-style text, and other on-screen dialogue treatments.

This is a LORY in-house overlay renderer. It uses speech timing from the voice clip to create a transparent video layer, so you can combine readable timed text with generated footage without baking captions directly into the base video.

CapabilityVoice Text Overlay
ProviderLORY
OriginNetherlands
OutputVideo
ModesGenerate
Model details

Practical specs for planning a generation in LORY. These details come from the model contract we use when routing a request.

OutputWEBM, WEBM · Transparent alpha output
Resolution720P, 1080P
Aspect ratios16:9, 9:16, 16:9 (Landscape), 9:16 (Portrait)
DurationUp to 10 minutes

Try Voice Text Overlay on LORY

Start with free welcome credits — no subscription, and we never ask for payment info during your trial. Pay only when you decide to top up.

Similar modelsMore video models you might like
VideoImage Animation

Image Animation

Turn a still image into a moving shot with a chosen camera move. Useful for storyboard frames, concept stills, and reference images when you want simple motion from one image.

LORY · NetherlandsView details
VideoImage-to-Video

Grok Imagine – Reference to Video

Build a single video shot from several reference images with Grok Imagine. It is a strong choice when you want to combine subject, setting, and style references in one clip, with synced audio included.

xAI · USView details
VideoReference-to-Video

HappyHorse-1.0 – Reference to Video

Shape a single video shot from several reference images with Happy Horse 1.0. It works well when you want tighter control over characters, styling, or scene details while still generating one finished clip with synced audio.

Alibaba · ChinaView details
VideoFirst/Last Frame to Video

Veo 3.1 Lite – First/Last Frame

Animate between a first and last frame with Veo 3.1 Lite and optional native audio.

Google · USView details
VideoImage-to-Video

Veo 3.1 Lite – Image to Video

Animate a single image into a short Veo 3.1 Lite clip with optional native audio.

Google · USView details
VideoImage-to-Video

LTX-2.3 Fast – Image to Video

LTX-2.3 Fast image-to-video with native audio, up to 20s, and up to 4K resolution.

Lightricks · IsraelView details
Try on LORY