Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 132 4 Updated Nov 8, 2024

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

493 20 Updated Mar 21, 2024

Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"

Python 324 19 Updated Jan 16, 2025

🤖 AI Gateway | AI Native API Gateway

Go 3,760 548 Updated Jan 19, 2025

Official repository of In-Context LoRA for Diffusion Transformers

1,492 76 Updated Dec 20, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,710 309 Updated Jan 8, 2025

Deezer source separation library including pretrained models.

Python 26,232 2,875 Updated Oct 29, 2024

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 337 24 Updated Aug 12, 2024

ControlNet++: All-in-one ControlNet for image generation and editing!

Python 1,839 46 Updated Sep 30, 2024

Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.

Python 467 20 Updated Jan 12, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,120 468 Updated Nov 6, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,725 219 Updated Sep 8, 2024

Fast and complete guided filter implementation for OpenCV

C++ 358 113 Updated Jun 1, 2020

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,832 90 Updated Jan 15, 2025

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 682 46 Updated Oct 1, 2024

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,113 40 Updated Nov 6, 2024

ControlNet inpainting for FLUX.1

Python 18 3 Updated Sep 7, 2024

Bring portraits to life!

Python 13,680 1,461 Updated Jan 1, 2025

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 834 40 Updated Jun 27, 2024

The collection of awesome papers on alignment of diffusion models.

74 1 Updated Jan 20, 2025

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool.

Python 14,337 1,504 Updated Nov 20, 2024

Building large language models that understand social etiquette | Covers tutorials on prompt engineering, RAG, agents, and LLM fine-tuning

Python 1,082 78 Updated Jan 18, 2025

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Python 593 23 Updated May 27, 2024

DynamicPose, a simple and robust framework for animating human images.

Python 62 5 Updated Sep 11, 2024

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,361 964 Updated Jan 20, 2025

Official inference repo for FLUX.1 models

Python 19,605 1,374 Updated Jan 9, 2025
Python 18 Updated Jan 6, 2025

Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,490 75 Updated Sep 25, 2024