@dg845 dg845 commented Sep 5, 2025

What does this PR do?

This PR implements a pipeline for the InfiniteTalk audio-driven video generation model (paper, code, weights). InfiniteTalk is designed to generate videos of unbounded length and demonstrates state-of-the-art (SOTA) performance on video dubbing. It builds on the Wan 2.1-I2V-14B image-to-video model, with additional audio components.

Fixes #12239.

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yiyixuxu
@DN6
@supermeng

Development

Successfully merging this pull request may close these issues.

Support for InfiniteTalk