-
University of Michigan
- Ann Arbor
- jinlinyi.github.io
Highlights
- Pro
Stars
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
[ArXiv] Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail
Official Code for Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
Efficiently Composable Data Augmentation on the GPU with Jax
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers
Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models
Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
[CVPR 2024 - Highlight] FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation
The official PyTorch implementation of Google's Gemma models
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
[CVPR 2024] Code for SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)
HaMeR: Reconstructing Hands in 3D with Transformers