VideoImage-to-Video

Infinitalk (Image + Audio)

Image-to-Video model by Infinitalk — US.

About this model

Talking-head video model that syncs speech to facial performance for presenter-style output.

Talking avatar from an image and audio. Lip-syncs to supplied speech with facial animation.

CapabilityImage-to-Video
ProviderInfinitalk
OriginUS
OutputVideo
ModesGenerate

Try Infinitalk (Image + Audio) on LORY

Sign in to start generating with this model in your own story project.

Similar modelsMore video models you might like
VideoImage-to-Video

Veo 3.1 Lite (Image to Video)

Veo 3.1 Lite image-to-video profile for short 4-8s clips with prompt-led motion and optional native audio.

USView details
VideoImage-to-Video

LTX-2.3 Fast (Image to Video)

LTX-2.3 Fast i2v with native audio, up to 20s duration, 4K resolution, and start+end frame transitions.

IsraelView details
VideoImage-to-Video

LTX-2.3 Pro (Image to Video)

LTX-2.3 Pro i2v: higher fidelity with better motion stability and visual detail. Up to 10s, 4K, native audio. Best for final renders.

IsraelView details
VideoImage-to-Video

Vidu Q3 (Image to Video)

Vidu Q3 image-to-video profile for broad stylistic range and rapid experimentation.

ChinaView details
VideoImage-to-Video

Seedance 1.5 Pro (Image to Video)

Seedance pro model for dynamic image-to-video shots with clean movement transitions.

ChinaView details
VideoImage-to-Video

Kling Video V3 (Standard I2V)

Kling V3 image-to-video profile with better temporal consistency and prompt adherence.

ChinaView details
Try on LORY