Skip to content
View ifzhang's full-sized avatar
🐶
Focusing
🐶
Focusing

Organizations

@hustvl

Block or report ifzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,587 98 Updated Jan 23, 2025

[arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 215 4 Updated Jan 16, 2025

ReNeg: Learning Negative Embedding with Reward Guidance

Python 27 Updated Jan 2, 2025

Liquid: Language Models are Scalable Multi-modal Generators

61 Updated Dec 12, 2024

Code for "StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models", Arxiv 2024

78 2 Updated Dec 31, 2024

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 819 40 Updated Jan 28, 2025

The official implementation of "[MASK] is All You Need"

104 4 Updated Dec 10, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,866 79 Updated Jan 2, 2025

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 902 35 Updated Jan 21, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,961 641 Updated Jan 24, 2025

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 972 59 Updated Dec 31, 2024

Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 414 20 Updated Jan 20, 2025

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 688 47 Updated Oct 1, 2024

Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Python 280 13 Updated Dec 26, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,472 982 Updated Jan 22, 2025

Multimodal Models in Real World

Jupyter Notebook 431 19 Updated Oct 28, 2024

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 272 7 Updated Jul 9, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,523 66 Updated Aug 15, 2024

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 645 49 Updated Dec 12, 2024

[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention

Python 107 1 Updated Jun 17, 2024

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Python 118 3 Updated Nov 26, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,480 424 Updated Jan 12, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,189 212 Updated Nov 22, 2024

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 694 58 Updated Jul 7, 2024

A method that can match the 3D point cloud sub-map generated by the robot during the SLAM process with the 2D map.

Python 15 3 Updated Oct 4, 2022
HTML 7 1 Updated Aug 26, 2023

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Python 210 13 Updated Aug 15, 2024

[ICRA'2024] Rethinking Imitation-based Planner for Autonomous Driving

Python 231 17 Updated Jul 11, 2024

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 2,412 199 Updated Oct 27, 2024
Next