| Audio driven virtual human generation |
| Project Description |
|
We are preparing a product lanuch and need 3 virtual avatars to host the launch. We have the scripts first and then generate audio files with TTS technology. Then we generate videos with audios and images. To make long videos temporal consistent, we use the last frame of model output as input of next video.
|
| Video demo |