Stars
[CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.
[ICCV 2023] Implementation of the paper “Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation”
The paper collections for the autoregressive models in vision.
Official PyTorch code for our paper: Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need
Official repository for EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation (MICCAI 2024)
The summary of code and paper for few-shot learning in fine-grained recognition
Code and models for the Medical Image Analysis (MIA) 2023 paper "Segment Anything Model for Medical Images?".
LibFewShot: A Comprehensive Library for Few-shot Learning. TPAMI 2023.
[CVPR2024 Highlight] Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[MICCAI 2023] This repository includes the official implementation of our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
This is a repository for the ICLR 2023 accepted paper "Medical Image Understanding with Pretrained Vision Language Models: A Comprehensive Study".
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
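Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; analysis and self-explanation of projects in Python, C++, and other languages; translation and summarization of PDF/LaTeX papers; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…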
Takagi and Nishimoto, CVPR 2023
Iterative Interaction Training for Segmentation Editing Networks
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
[MedIA 2022] WORD: A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT image
Reviving Iterative Training with Mask Guidance for Interactive Segmentation
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
🧙 A web app to generate template code for machine learning
Personal social distancing detector using Python, a TensorFlow model, and OpenCV
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.