Stars
Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", arXiv 2025.
[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.
PyTorch code and models for the DINOv2 self-supervised learning method.
A curated list of radar datasets, detection, tracking and fusion
A global resource download orchestration system for building your home download center.
[ECCV 2024] A Simple and Effective 3D DETR in Point Clouds
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation (CVPR 2022)
Curb Detection Framework Based on LiDAR Point Cloud Segmentation
A simple project to beat the boss in Black Myth: Wukong, using YOLOv8 to detect boss movement and a script to react to certain detections.
We write your reusable computer vision tools. 💜
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
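✯ A directly accessible TV/radio logo library and related tools project ✯ 🔕 Permanently free, directly accessible, fully open source, continuously improved channel logos, IPv4/IPv6 dual-stack access supported 🔕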
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
The devkit of the nuScenes dataset.
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Official implementation of the ICCV 2023 paper 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking.
✨ Local and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
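China's first full-stack course on occupancy grid networks, "From BEV to Occupancy Network: Algorithm Principles and Engineering Practice", including on-device deployment. Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code). Online course homepage: http://111.229.117.200:8100/ (built independently by the author)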
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
A Pythonic framework to simplify AI service building
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥