CV
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Ultra Fast Deep Lane Detection With Hybrid Anchor Driven Ordinal Classification (TPAMI 2022)
YOLOPv2: Better, Faster, Stronger for Panoptic driving Perception
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)
A curated list of awesome neural radiance fields papers
A paper list of object detection using deep learning.
SOTA Re-identification Methods and Toolbox
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …
Real-Time Object Detection, Tracking, Blurring and Counting using YOLOv8
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
This is a collection of our NAS and Vision Transformer work.
StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
CoTracker is a model for tracking any point (pixel) on a video.
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”
Code of ICCV 2023 paper titled General Image-to-Image Translation with One-Shot Image Guidance
UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[ICCV 2023] StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型