Yahoo Search Búsqueda en la Web

Resultado de búsqueda

  1. Hace 3 horas · The proposed system is end-to-end trainable and the learning is performed with six losses. This method can generate photo realistic, expressive and audio-synced talking faces while preserving the identity of the target person. This work proposes a discriminator network to impose audio-visual synchrony in the generated video.