Stars
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
[TPAMI] RoboBEV: Towards Robust Bird's Eye View Perception under Common Corruption and Domain Shift
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
training a vision transformer based model to detect violence in real life videos
3D ResNets for Action Recognition (CVPR 2018)
Video classification tools using 3D ResNet
[ICCV'23] Official implementation of CRN: Camera Radar Net for Accurate, Robust, Efficient 3D Perception
《파이토치 트랜스포머를 활용한 자연어 처리와 컴퓨터비전 심층학습》 예제 코드