Stars
an empirical study on few-shot counting using segment anything (SAM)
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything
OpenMMLab Detection Toolbox and Benchmark
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Open-source and strong foundation image recognition models.
Auto detecting, masking and inpainting with detection model.
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
State-of-the-art 2D and 3D Face Analysis Project
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
Face Pose: Estimate pose (Yaw, Roll, Pitch) of a face using two extremely simple, efficient and accurate methods.
[AAAI 25] SegFace: Face Segmentation of Long-tail classes
A minimal and universal controller for FLUX.1.
Code for ''MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation''