Stars
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
A curated list of awesome Deep Stereo Matching resources
ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).
[IPCAI'2024 (IJCARS special issue)] Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Code for "ATISS: Autoregressive Transformers for Indoor Scene Synthesis", NeurIPS 2021
[CVPR 2024] Official implementation of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation>
GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models
Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
Official source code of Fast Point Transformer, CVPR 2022
[RSS 2022, Best System Paper Finalist] DextAIRity: Deformable Manipulation Can be a Breeze
Infinite Photorealistic Worlds using Procedural Generation
[ICRA 2023] LODE: Locally Conditioned Eikonal Implicit Scene Completion from Sparse LiDAR
Official implementation of the benchmarked 2D and 3D classification and 3D semantic segmentation models on PeRFception.
Official implementation of the paper: Behind the Scenes: Density Fields for Single View Reconstruction (CVPR 2023)
OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.
A playbook for systematically maximizing the performance of deep learning models.
materials/notes/links for discussions/collaborations
[ICRA 2023 & IROS 2023] Code release for Keypoint-GraspNet (KGN) and Keypoint-GraspNet-V2 (KGNv2)
3D VQ-VAE-2 for high-resolution CT scan synthesis
Cracking algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
A collection of MuJoCo models that I have downloaded / made small changes to for my own research purposes.
[TPAMI 2024 & CVPR 2022] Attention Concatenation Volume for Accurate and Efficient Stereo Matching
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
Match Selection and Refinement for Accurate Structure from Motion