Stars
[CVPR 2024] Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving
Argoverse 2: Next generation datasets for self-driving perception and forecasting.
ML Dataset Governance Policy for Autonomous Vehicle Datasets
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation
Official implementation for HybridDepth Model (WACV 2025, ISMAR 2024)
TensorFlow implementation of Semi-Supervised Monocular Depth Estimation with Left-Right Consistency Using Deep Neural Network.
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
Structure-Guided Ranking Loss for Single Image Depth Prediction
Code for "HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization"
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals
The official repository for Mobile AR Depth Estimation: Challenges & Prospects (HotMobile 2024)
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
A curated list of recent monocular depth estimation papers
🏠 PyTorch implementation of our ICCV 2021 paper: StructDepth: Leveraging the structural regularities for self-supervised indoor depth estimation
BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks
A Scalable Pipeline for Making Steerable Multi-Task Mid-Level Vision Datasets from 3D Scans [ICCV 2021]