Stars
[NeurIPS 2024] Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
A collection of papers on diffusion models for 3D generation.
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
Point-SAM: Official repository of "Point-SAM: Promptable 3D Segmentation Model for Point Clouds". We provide code for running our demo and links to download checkpoints.
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
OpenXRLab foundational library for XR-related algorithms
OpenXRLab Structure-from-Motion Toolbox and Benchmark
OpenXRLab Visual Localization Toolbox and Server
OpenXRLab Multi-view Motion Capture Toolbox and Benchmark
OpenXRLab Visual-inertial SLAM Toolbox and Benchmark
OpenXRLab Neural Radiance Field (NeRF) Toolbox and Benchmark
Release for Improved Denoising Diffusion Probabilistic Models
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
(TPAMI 2024) A Survey on Open Vocabulary Learning
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
OpenXRLab XRAPI is an open-source implementation of Google ARCore and Apple ARKit
LLM-grounded Diffusion (LMD): Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (TMLR 2024)
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
OpenXRLab Multi-Modal Motion Generation Toolbox and Benchmark
Fine-Grained Open Domain Image Animation with Motion Guidance
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Painter & SegGPT Series: Vision Foundation Models from BAAI
[ICCV 2019] Monocular depth estimation from a single image