Stars
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
A comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing to generate detailed descriptions of video content. This tool extracts key fr…
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
A collection list of AIGC detection related papers.
[ECCV2022] PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification
[CVPR2024]Day-Night Cross-domain Vehicle Re-identification
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
[ICCV-2021] TransReID: Transformer-based Object Re-Identification
Deep Learning for Person Re-identification: A Survey and Outlook
Unsupervised Pre-training for Person Re-identification (LUPerson)
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
Collection of public available person re-identification datasets
A General-purpose Person Re-identification Task with Instructions
⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…
[ECCV 2024] Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
One-paper-one-short-contribution-summary of all latest image/burst/video Denoising papers with code & citation published in top conference and journal.
Collection of popular and reproducible video denoising works.
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
DaVinci toolkit aims at high-quality multimedia content creation which plays an important role in modern work and life. The targeted features can include both low-level image and video enhancement …