limbo0000
🛫 Working from home

Organizations

@decisionforce

Python 3 Updated Nov 18, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 5,733 400 Updated Dec 13, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,207 180 Updated Dec 12, 2024

AlphaFold 3 inference pipeline.

Python 5,554 652 Updated Dec 13, 2024

A suite of image and video neural tokenizers

Python 966 23 Updated Nov 13, 2024

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 278 25 Updated Dec 28, 2023

Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Python 57 5 Updated Oct 11, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 222 9 Updated Sep 9, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,304 223 Updated Dec 12, 2024

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 877 45 Updated Dec 11, 2024

A native PyTorch Library for large model training

Python 2,754 222 Updated Dec 14, 2024

Long context evaluation for large language models

Python 192 15 Updated Dec 9, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,572 222 Updated Dec 4, 2024

A continuously updated collection of DCTLs (DaVinci Color Transform Language) designed to enhance and educate on workflows using ARRI LogC3, Gen5 and Cineon in DaVinci Resolve. This collection offe…

C 27 1 Updated Dec 1, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 933 86 Updated Dec 10, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,751 169 Updated Sep 25, 2024

[ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image

20 1 Updated Jul 19, 2024

Grounding Image Matching in 3D with MASt3R

Python 1,408 111 Updated Oct 12, 2024

[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 122 4 Updated Nov 14, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 696 36 Updated Aug 5, 2024

Kolors Team

Python 3,999 289 Updated Nov 13, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 770 30 Updated Dec 4, 2024

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 591 29 Updated Nov 20, 2024

AuraSR: GAN-based Super-Resolution for real-world

Python 414 34 Updated Nov 13, 2024

Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Data

Python 134 7 Updated Oct 7, 2024

[arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 99 Updated Jun 12, 2024

[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

Python 167 8 Updated Sep 27, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,396 58 Updated Aug 15, 2024

ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors

Python 199 6 Updated Dec 3, 2024

Transformer-Mamba Diffusion Models

Python 93 6 Updated Jun 30, 2024