encounter1997

🎯

Focusing

Wen Wang encounter1997

🎯

Focusing

164 followers · 552 following

China
@encounter19972

Achievements

Highlights

Organizations

Lists (5)

Sort

Starred repositories

KwaiVGI / GameFactory

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 170 2 Updated Jan 15, 2025

yunlong10 / Awesome-AI4Animation

🔥🔥🔥 This repository includes latest papers, projects and datasets on GenAI for Cel-Animation.

63 2 Updated Jan 21, 2025

MiniMax-AI / MiniMax-01

Python 1,781 114 Updated Jan 16, 2025

ali-vilab / MangaNinjia

Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"

Python 358 27 Updated Jan 20, 2025

NovaSky-AI / SkyThought

Sky-T1: Train your own O1 preview model within $450

Python 1,983 209 Updated Jan 20, 2025

hao-ai-lab / FastVideo

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 892 50 Updated Jan 20, 2025

PKU-YuanGroup / LLaVA-CoT

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,754 64 Updated Jan 8, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,127 431 Updated Jan 9, 2025

magic-research / Sa2VA

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 689 48 Updated Jan 20, 2025

OpenMOSS / MOSS

An open-source tool-augmented conversational language model from Fudan University

Python 12,019 1,147 Updated Jul 13, 2024

DAMO-NLP-SG / multimodal_textbook

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 122 13 Updated Jan 18, 2025

pixeli99 / TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

Python 75 4 Updated Jun 26, 2024

hustvl / LightningDiT

[arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 210 3 Updated Jan 16, 2025

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 20,591 1,809 Updated Jan 15, 2025

berkeley-hipie / segllm

Code release for "SegLLM: Multi-round Reasoning Segmentation"

Python 56 4 Updated Jan 10, 2025

ant-research / edicho

[Arxiv 2024] Edicho: Consistent Image Editing in the Wild

95 1 Updated Jan 14, 2025

Yuanshi9815 / OminiControl

A minimal and universal controller for FLUX.1.

Python 1,109 71 Updated Jan 17, 2025

deepseek-ai / DeepSeek-V3

Python 20,139 1,654 Updated Jan 7, 2025

a-r-r-o-w / finetrainers

Memory-optimized training scripts for video models based on Diffusers

Python 756 80 Updated Jan 17, 2025

yzhang2016 / video-generation-survey

A reading list of video generation

480 34 Updated Jan 20, 2025

ant-research / DepthLab

Official implementation of "DepthLab: From Partial to Complete"

Python 411 25 Updated Jan 16, 2025

LMM101 / Awesome-Multimodal-Next-Token-Prediction

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

308 8 Updated Jan 17, 2025

kijai / ComfyUI-FramerWrapper

Python 91 1 Updated Dec 20, 2024

ant-research / LeviTor

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 113 4 Updated Dec 20, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 23,086 1,908 Updated Jan 20, 2025

sihyun-yu / REPA

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 796 39 Updated Dec 17, 2024

ant-research / AniDoc

Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 454 29 Updated Dec 31, 2024

baaivision / NOVA

NOVA: Autoregressive Video Generation without Vector Quantization

Python 318 8 Updated Jan 20, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,289 860 Updated Jan 20, 2025

ant-research / lumos

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Wen Wang encounter1997

Highlights

Organizations

Lists (5)

DE-DETRs

Detection

DETR-baselines

FP-DETR

SFA

Starred repositories

low-light-image

deep-image-prior

gcn

cvpr2019