Stars
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
chinese speech pretrained models
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a se…
Official implementations for paper: Anydoor: zero-shot object-level image customization
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Official PyTorch implementation of "Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation", CVPRW 2022 (Oral.)
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
HaMeR: Reconstructing Hands in 3D with Transformers
[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)
SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse views
This system provides the facility to convert roughly hand-drawn humam face sketch image on canvas into a realistic face image by using image generative AI in a real-time.
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
yinyunie / BlenderProc-3DFront
Forked from DLR-RM/BlenderProcSupport BlenderProc2 with multi-GPU batch rendering and 3D visualization for 3D-Front
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24); Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (ECCV 2024)