Starred repositories
Famous Vision Language Models and Their Architectures
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
An open source implementation of CLIP.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Awesome Incremental Learning
Probing the representations of Vision Transformers.
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials.
✨✨Latest Advances on Multimodal Large Language Models
This repository contains demos I made with the Transformers library by HuggingFace.
PyTorch implementation of SwAV https://arxiv.org/abs/2006.09882
PyTorch implementation of MoCo v3 https://arxiv.org/abs/2104.02057
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A curated (most recent) list of resources for Learning with Noisy Labels
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
A hyperspherical face recognition library based on PyTorch
[CVPR 2022] Official implementation of the paper "Uformer: A General U-Shaped Transformer for Image Restoration".
External Prior Guided Internal Prior Learning for Real-World Noisy Image Denoising. IEEE Transactions on Image Processing, 2018.
A collection of papers and code from CVPR 2023/2022 on low-level vision