michaelyuancb

Michael Yuan michaelyuancb

Now Intern@MoonshotAI; Graduate@IIIS, Tsinghua University; EmbodiedAI & Agent; Simple+Elegant leads to AGI

17 followers · 6 following

Tsinghua University
Beijing, China
michaelyuancb.github.io

Achievements

Highlights

Lists (1)

Sort

excellent_tool

1 repository

Stars

jdibenes / hl2da

HoloLens 2 Data Acquisition. Access HoloLens 2 Research Mode, Front Camera, Microphone, Head, Eye, Hand, and External USB-C A/V sensor data.

C++ 14 2 Updated Jan 3, 2025

David0tt / DeepGTAV

A system to easily extract ground truth training data for different machine learning tasks from GTAV

C++ 97 11 Updated Oct 20, 2022

mega-sam / mega-sam

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

387 8 Updated Dec 8, 2024

hustvl / EVF-SAM

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 330 15 Updated Nov 17, 2024

michaelyuancb / unihoi

Official Repository of "UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos"

1 Updated Nov 18, 2024

microsoft / MoGe

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 646 34 Updated Dec 8, 2024

nerfstudio-project / viser

Web-based 3D visualization + Python

Python 918 55 Updated Jan 5, 2025

Junyi42 / monst3r

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 912 51 Updated Dec 26, 2024

yatengLG / ISAT_with_segment_anything

Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

Python 1,377 144 Updated Dec 16, 2024

xiongyiheng / ARKit-Scanner

The scanner app acquires RGB-D scans using iPhone LiDAR sensor and ARKit API, stores color, depth and IMU data on local memory and then uploads to PC for processing.

Swift 25 2 Updated Dec 18, 2024

YanjieZe / Paper-List

A paper list of my history reading. Robotics, Learning, Vision.

296 13 Updated Dec 30, 2024

suyukun666 / UFO

Official PyTorch implementation of the “A Unified Transformer Framework for Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection”. (TMM2023)

Python 295 49 Updated Apr 2, 2023

TEN-framework / TEN-Agent

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,879 393 Updated Jan 3, 2025

michaelyuancb / ego_hoi_model

A model combined 100DoH, Semantic-SAM and EgoHOS for hand-object state classification, detection, segmentation.

Python 3 Updated Jul 26, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 21,309 3,131 Updated Jan 4, 2025

THUDM / GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,670 474 Updated Dec 31, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,980 2,307 Updated Aug 12, 2024

lyhue1991 / torchkeras

Pytorch❤️ Keras 😋😋

Jupyter Notebook 1,841 240 Updated Oct 28, 2024

tejpshah / interview-pilot-ai

Ace interviews with AI practice. Our agent role-plays personalized interview tailored to your background, listening and replying like a real interviewer. Train across personas for any situation.

Python 102 18 Updated Jun 9, 2024

jaidevshriram / realmdreamer

Code for RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion [Arxiv 2024]

209 7 Updated Apr 10, 2024

stitionai / devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…

Python 18,704 2,431 Updated Sep 19, 2024