deeptimhe
🎯
Focusing
  • Microsoft Research Asia
  • Beijing, China


Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 280 5 Updated Jan 5, 2025

World's First Large-scale High-quality Robotic Manipulation Benchmark

Python 985 70 Updated Jan 3, 2025

Friends of OLMo and their links.

228 14 Updated Dec 15, 2024

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 6,005 380 Updated Dec 27, 2024

A family of versatile, state-of-the-art video tokenizers.

Python 311 19 Updated Jan 4, 2025

A deep learning library for video understanding research.

Python 3,355 414 Updated Nov 26, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,062 533 Updated Jan 2, 2025

Official repository of In-Context LoRA for Diffusion Transformers

1,436 74 Updated Dec 20, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,972 514 Updated Jan 3, 2025

Implementation of the proposed LVMAE, from the paper "Extending Video Masked Autoencoders to 128 frames", in PyTorch

Python 46 1 Updated Nov 25, 2024

Official repository for LTX-Video

Python 2,383 181 Updated Jan 3, 2025

SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 187 5 Updated Dec 29, 2024

The Fastest Deep Reinforcement Learning Library

C++ 703 26 Updated Dec 20, 2024

Apache ECharts is a powerful, interactive charting and data visualization library for the browser

TypeScript 61,528 19,670 Updated Jan 2, 2025

Official inference repo for FLUX.1 models

Python 19,124 1,351 Updated Dec 31, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,671 114 Updated Dec 6, 2024

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

Jupyter Notebook 404 19 Updated Dec 12, 2024

A suite of image and video neural tokenizers

Python 1,056 28 Updated Dec 23, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 1,803 93 Updated Jan 5, 2025

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

18,601 2,570 Updated Jan 2, 2025

Inference script for Oasis 500M

Python 1,682 145 Updated Nov 8, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 2,524 201 Updated Dec 24, 2024

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 253 9 Updated Dec 4, 2024

Implementation of CamTrol: Training-free Camera Control for Video Generation

Python 6 1 Updated Sep 13, 2024

The best OSS video generation models

Python 2,631 266 Updated Dec 18, 2024

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 912 51 Updated Dec 26, 2024
Python 18 Updated Sep 25, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,043 268 Updated Dec 19, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,364 226 Updated Dec 31, 2024