Skip to content
View encounter1997's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@aim-uofa @baaivision @ant-research

Block or report encounter1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 170 2 Updated Jan 15, 2025

🔥🔥🔥 This repository includes latest papers, projects and datasets on GenAI for Cel-Animation.

63 2 Updated Jan 21, 2025
Python 1,781 114 Updated Jan 16, 2025

Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"

Python 358 27 Updated Jan 20, 2025

Sky-T1: Train your own O1 preview model within $450

Python 1,983 209 Updated Jan 20, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 892 50 Updated Jan 20, 2025

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,754 64 Updated Jan 8, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,127 431 Updated Jan 9, 2025

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 689 48 Updated Jan 20, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,019 1,147 Updated Jul 13, 2024

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 122 13 Updated Jan 18, 2025

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

Python 75 4 Updated Jun 26, 2024

[arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 210 3 Updated Jan 16, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 20,591 1,809 Updated Jan 15, 2025

Code release for "SegLLM: Multi-round Reasoning Segmentation"

Python 56 4 Updated Jan 10, 2025

[Arxiv 2024] Edicho: Consistent Image Editing in the Wild

95 1 Updated Jan 14, 2025

A minimal and universal controller for FLUX.1.

Python 1,109 71 Updated Jan 17, 2025

Memory-optimized training scripts for video models based on Diffusers

Python 756 80 Updated Jan 17, 2025

A reading list of video generation

480 34 Updated Jan 20, 2025

Official implementation of "DepthLab: From Partial to Complete"

Python 411 25 Updated Jan 16, 2025

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

308 8 Updated Jan 17, 2025
Python 91 1 Updated Dec 20, 2024

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 113 4 Updated Dec 20, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 23,086 1,908 Updated Jan 20, 2025

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 796 39 Updated Dec 17, 2024

Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 454 29 Updated Dec 31, 2024

NOVA: Autoregressive Video Generation without Vector Quantization

Python 318 8 Updated Jan 20, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,289 860 Updated Jan 20, 2025

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Python 27 Updated Jan 9, 2025
Next