Starred repositories
Visualizing the attention of vision-language models
ζΊε¨ε¦δΉ οΌε¨εΏεοΌPPTθ―Ύδ»Ά
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models''
ICCV2023 - Parallax-Tolerant Unsupervised Deep Image Stitching (UDIS++)
Seam-guided local alignment and stitching for large parallax images
Official implementation of AAAI 2025 paper: Object-level Geometric Structure Preserving for Natural Image Stitching
[CVPR2024]: RecDiffusion: Rectangling for Image Stitching with Diffusion Models
Parallax-tolerant Image Stitching via Segmentation-guided Multi-homography Warping
This is an open-source implementation of paper: Real-time Incremental UAV Image Mosaicing based on Monocular SLAM.
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. π·
An algorithm that handles large-scale aerial photo co-registration, based on SURF, RANSAC and PyTorch autograd.
A python program to automate stitching of ariel images with overlapping areas captured by UAV
"Structure-Aware Sparse-View X-ray 3D Reconstruction" (CVPR 2024)
SpecGaussian with latent features: A high-quality modeling of the view-dependent appearance for 3D Gaussian Splatting
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
[AAAI2024]Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking
Resources for Multiple Object Tracking (MOT)
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. π₯ π₯ π₯
YOLOv5 π in PyTorch > ONNX > CoreML > TFLite
πΆ A curated list of Tiny Object Detection papers and related resources.
Language-Driven Semantic Segmentation
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
An easy-to-use Python framework to generate adversarial jailbreak prompts.