Yifu Zhang ifzhang

🐶

Focusing

787 followers · 115 following

Stars

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,587 98 Updated Jan 23, 2025

hustvl / LightningDiT

[arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 215 4 Updated Jan 16, 2025

LemonTwoL / ReNeg

ReNeg: Learning Negative Embedding with Reward Guidance

Python 27 Updated Jan 2, 2025

FoundationVision / Liquid

Liquid: Language Models are Scalable Multi-modal Generators

61 Updated Dec 12, 2024

zju3dv / street_crafter

Code for "StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models", Arxiv 2024

78 2 Updated Dec 31, 2024

sihyun-yu / REPA

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 819 40 Updated Jan 28, 2025

CompVis / mask

The official implementation of "[MASK] is All You Need"

104 4 Updated Dec 10, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,866 79 Updated Jan 2, 2025

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 902 35 Updated Jan 21, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,961 641 Updated Jan 24, 2025

zju3dv / street_gaussians

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 972 59 Updated Dec 31, 2024

hustvl / DiffusionDrive

Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 414 20 Updated Jan 20, 2025

NVlabs / DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 688 47 Updated Oct 1, 2024

hustvl / Senna

Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Python 280 13 Updated Dec 26, 2024

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,472 982 Updated Jan 22, 2025

AILab-CVC / SEED-X

Multimodal Models in Real World

Jupyter Notebook 431 19 Updated Oct 28, 2024

FoundationVision / OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 272 7 Updated Jul 9, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,523 66 Updated Aug 15, 2024

zympsyche / BevWorld

104 5 Updated Jul 9, 2024

OpenDriveLab / Vista

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 645 49 Updated Dec 12, 2024

hustvl / ViG

[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention

Python 107 1 Updated Jun 17, 2024

hustvl / DiG

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Python 118 3 Updated Nov 26, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,480 424 Updated Jan 12, 2025

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,189 212 Updated Nov 22, 2024

opendilab / LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 694 58 Updated Jul 7, 2024

fengdelin / FloorplanNet

A method that can match the 3D point cloud sub-map generated by the robot during the SLAM process with the 2D map.

Python 15 3 Updated Oct 4, 2022

STAR-Center / osmAG

HTML 7 1 Updated Aug 26, 2023

wenyuqing / panacea

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Python 210 13 Updated Aug 15, 2024

jchengai / planTF

[ICRA'2024] Rethinking Imitation-based Planner for Autonomous Driving

Python 231 17 Updated Jul 11, 2024

hustvl / 4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 2,412 199 Updated Oct 27, 2024
