HoloLens 2 Data Acquisition. Access HoloLens 2 Research Mode, Front Camera, Microphone, Head, Eye, Hand, and External USB-C A/V sensor data.

C++ 14 2 Updated Jan 3, 2025

A system to easily extract ground truth training data for different machine learning tasks from GTAV

C++ 97 11 Updated Oct 20, 2022

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

387 8 Updated Dec 8, 2024

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 330 15 Updated Nov 17, 2024

Official Repository of "UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos"

1 Updated Nov 18, 2024

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 646 34 Updated Dec 8, 2024

Web-based 3D visualization + Python

Python 918 55 Updated Jan 5, 2025

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 912 51 Updated Dec 26, 2024

Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.

Python 1,377 144 Updated Dec 16, 2024

The scanner app acquires RGB-D scans using iPhone LiDAR sensor and ARKit API, stores color, depth and IMU data on local memory and then uploads to PC for processing.

Swift 25 2 Updated Dec 18, 2024

A list of papers from my reading history: robotics, learning, vision.

296 13 Updated Dec 30, 2024

Official PyTorch implementation of the “A Unified Transformer Framework for Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection”. (TMM2023)

Python 295 49 Updated Apr 2, 2023

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,879 393 Updated Jan 3, 2025

A model combining 100DoH, Semantic-SAM, and EgoHOS for hand-object state classification, detection, and segmentation.

Python 3 Updated Jul 26, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 21,309 3,131 Updated Jan 4, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 5,670 474 Updated Dec 31, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,980 2,307 Updated Aug 12, 2024

Pytorch❤️ Keras 😋😋

Jupyter Notebook 1,841 240 Updated Oct 28, 2024

Ace interviews with AI practice. Our agent role-plays a personalized interview tailored to your background, listening and replying like a real interviewer. Train across personas for any situation.

Python 102 18 Updated Jun 9, 2024

Code for RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion [Arxiv 2024]

209 7 Updated Apr 10, 2024

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…

Python 18,704 2,431 Updated Sep 19, 2024

Large Action Model framework to develop AI Web Agents

Python 5,786 528 Updated Nov 17, 2024

Code for RoboFlamingo

Python 333 28 Updated May 8, 2024

LaTeX template for Wuhan University undergraduate theses (2019 cohort)

TeX 2 Updated May 25, 2023

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,920 1,011 Updated Jan 4, 2025

Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"

Python 43 Updated Dec 20, 2024

my blog

HTML 6 Updated Jan 4, 2025

Project and dataset webpage:

Python 246 68 Updated Oct 12, 2023