Stars
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Inpaint anything using Segment Anything and inpainting models.
Official implementations for paper: Anydoor: zero-shot object-level image customization
High-Resolution Image Synthesis with Latent Diffusion Models
Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception
This code is used to get images from google maps given a GPS region or a center GPS point and a Zoom level.
Implementation of https://srush.github.io/annotated-s4
Official implementation of our TIV'23 paper: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders
Writing AI Conference Papers: A Handbook for Beginners
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
[NeurIPS 2024] Official code of ”LION: Linear Group RNN for 3D Object Detection in Point Clouds“
Qt SerialPort-BLE-UDP-TCP-WebSocket-Modbus-CAN Assistant.
real time face swap and one-click video deepfake with only a single image
[CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
This repository shares the documentation and development kit of the View of Delft automotive dataset.
[CVPR 2023 Highlight] LaserMix for Semi-Supervised LiDAR Semantic Segmentation
2023年,最新音视频学习资料整理,项目(调试可用),ffmpeg命令手册,文章,编解码论文,视频讲解,面试题全套资料
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
Utilities intended for use with Llama models.
Linux kernel module for Thrustmaster T300RS, T248 and (experimental) TX, T128, T-GT II and TS-XW wheels
Google谷歌、Wikipedia维基百科、谷歌学术镜像2024最新 新增各种镜像站
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
[CVPR 2024] A world model for autonomous driving.
3D LiDAR Mapping in Dynamic Environments using a 4D Implicit Neural Representation (CVPR 2024)