Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Mingwang Xu¹* Hui Li¹* Qingkun Su¹* Hanlin Shang¹ Liwei Zhang¹ Ce Liu³

Jingdong Wang² Yao Yao⁴ Siyu Zhu¹

¹Fudan University ²Baidu Inc ³ETH Zurich ⁴Nanjing University

Social Risks and Mitigations

Audio-driven portrait image animation poses social risks, chief among them the potential misuse of realistic animated portraits to create deepfakes. Mitigating this risk requires clear ethical guidelines and responsible-use practices. Using individuals' images and voices also raises privacy and consent concerns, which call for transparent data usage policies, informed consent, and safeguards for privacy rights. By identifying these risks and implementing the corresponding mitigations, this research aims to support the responsible and ethical development of the technology.

Paper for fudan-generative-ai/hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation (arXiv 2406.08801, published Jun 13, 2024)