Stars
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Memory-Guided Diffusion for Expressive Talking Video Generation
HunyuanVideo: A Systematic Framework For Large Video Generation Model
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Instant voice cloning by MIT and MyShell.
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko,…
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
The source code and data of the paper "HIST: A Graph-based Framework for Stock Trend Forecasting via Mining Concept-Oriented Shared Information".
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Open-Sora: Democratizing Efficient Video Production for All
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
A simple, high-quality voice conversion tool focused on ease of use and performance.
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)