
Official repo for MM-REACT

Python 939 69 Updated Jan 31, 2024

Tracking and collecting papers/projects/others related to Segment Anything.

1,554 133 Updated Aug 16, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,862 4,167 Updated Dec 14, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,180 392 Updated Aug 7, 2024

A state-of-the-art open visual language model | multimodal pretrained model

Python 6,179 420 Updated May 29, 2024

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

9,556 2,084 Updated Dec 11, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,364 177 Updated Nov 27, 2024

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Python 366 29 Updated Dec 10, 2024

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,420 208 Updated Apr 3, 2024

[Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"

Python 217 11 Updated Nov 15, 2024

The official code for building the PMC-OA dataset

Python 31 7 Updated Jul 16, 2024

[NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine

Python 65 1 Updated Sep 26, 2024

The official codes for "PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents"

Python 207 12 Updated Aug 30, 2024

PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modalities or diseases.

Python 180 11 Updated Dec 6, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 2,022 129 Updated Dec 3, 2024

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…

Python 475 36 Updated Apr 21, 2024

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

499 14 Updated Dec 7, 2024

Multi-Aspect Vision Language Pretraining - CVPR2024

Python 67 1 Updated Aug 20, 2024

Official implementation of SAM-Med2D

Jupyter Notebook 900 85 Updated Jun 18, 2024

Medical Multimodal LLMs

Python 260 22 Updated Sep 12, 2024

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 370 156 Updated Dec 13, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,123 44,544 Updated Dec 13, 2024

The official codes for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"

Python 615 55 Updated Jul 8, 2024

M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

Python 215 13 Updated Sep 30, 2024

[CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning

Jupyter Notebook 54 7 Updated Aug 2, 2024

The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".

Python 354 33 Updated Nov 10, 2024

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,620 204 Updated Aug 13, 2024

The official repository of the paper 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'

Python 17 Updated Nov 5, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,811 203 Updated May 20, 2024