Yi Jiang iFighting

🎯

Focusing

Large Language Model & Generative Models

463 followers · 254 following

WFH
HangZhou
https://enjoyyi.github.io/
@Enjoy_Yi

Stars

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 604 31 Updated Sep 27, 2024

microsoft / VidTok

A family of versatile and state-of-the-art video tokenizers.

Python 311 19 Updated Jan 4, 2025

Everlyn-Labs / Wasserstein-VQ

Python 221 41 Updated Oct 11, 2024

baaivision / NOVA

NOVA: Autoregressive Video Generation without Vector Quantization

Python 288 8 Updated Jan 3, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,688 65 Updated Jan 2, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 21,684 1,724 Updated Jan 6, 2025

FoundationVision / Liquid

Liquid: Language Models are Scalable Multi-modal Generators

55 Updated Dec 12, 2024

zju3dv / street_gaussians

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 945 58 Updated Dec 31, 2024

ByteFlow-AI / TokenFlow

🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 211 1 Updated Dec 28, 2024

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 723 20 Updated Dec 30, 2024

yandex-research / switti

The code and models for the paper: Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Jupyter Notebook 147 13 Updated Dec 29, 2024

lxa9867 / ImageFolder

XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation

Python 168 Updated Dec 10, 2024

czg1225 / CoDe

CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Python 75 1 Updated Nov 28, 2024

LargeWorldModel / ElasticTok

ElasticTok: Adaptive Tokenization for Image and Video

Python 42 Updated Nov 4, 2024

ChaofanTao / Autoregressive-Models-in-Vision-Survey

The paper collections for the autoregressive models in vision.

343 12 Updated Dec 27, 2024

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,671 114 Updated Dec 6, 2024

facebookresearch / MovieGenBench

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

357 21 Updated Dec 18, 2024

mit-han-lab / hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 398 19 Updated Oct 16, 2024

YangLing0818 / buffer-of-thought-llm

[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Python 572 54 Updated Jan 3, 2025

CrossmodalGroup / DynamicVectorQuantization

Official PyTorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"

Python 163 6 Updated Jul 23, 2023

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 19,122 1,351 Updated Dec 31, 2024

lxa9867 / ControlVAR

This is the official implementation for ControlVAR.

Python 79 3 Updated Dec 10, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,298 4,598 Updated Jan 4, 2025

bytedance / tarsier

Tarsier: a family of large-scale video-language models designed to generate high-quality video descriptions, with good general video understanding capability.

Python 171 10 Updated Dec 25, 2024

daixiangzi / VAR-CLIP

Implements VAR+CLIP for text-to-image (T2I) generation

Python 106 2 Updated Dec 30, 2024

FoundationVision / vaex

🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

Python 69 5 Updated Jun 23, 2024

FoundationVision / OmniTokenizer

[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 279 7 Updated Jul 9, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,438 57 Updated Aug 15, 2024

mira-space / MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 377 10 Updated Sep 2, 2024

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 591 61 Updated Jun 7, 2024
