Stars
Code for the paper "Improved Techniques for Training GANs"
Inception Score for GANs in Pytorch
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Large World Model -- Modeling Text and Video with Million-Token Context
Accepted in CVPR 2023
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
📄 Awesome CV is a LaTeX template for your outstanding job application
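ChatGPT Chinese guide 🔥: a Chinese prompt-tuning guide, instruction guide, application development guide, and curated resource list. Use ChatGPT better and push your productivity up, up, up! 🚀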
🆓 List of free ChatGPT mirror sites, continuously updated.
This repo includes a curated collection of prompts for using ChatGPT more effectively.
An opinionated list of awesome Python frameworks, libraries, software and resources.
PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modalities or diseases.
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
[ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
Data set for the IEEE TGRS paper "Mutual Attention Inception Network for Remote Sensing Visual Question Answering"
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.
The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
SAM-Med3D: An Efficient General-purpose Promptable Segmentation Model for 3D Volumetric Medical Images
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training