Stars
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"
Official repository of In-Context LoRA for Diffusion Transformers
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Deezer source separation library including pretrained models.
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
ControlNet++: All-in-one ControlNet for image generation and editing!
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Fast and complete guided filter implementation for OpenCV
🔥🔥🔥 Latest Papers, Code and Datasets on Vid-LLMs.
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
The collection of awesome papers on alignment of diffusion models.
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool.
Building large language models that understand social etiquette | Covers tutorials on prompt engineering, RAG, Agent, and LLM fine-tuning
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
DynamicPose, a simple and robust framework for animating human images.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official inference repo for FLUX.1 models
Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA