-
University of Amsterdam
- Amsterdam
- https://github.com/thongnt99
- @thongnt99
Highlights
- Pro
Stars
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]
An Open-sourced Knowledgable Large Language Model Framework.
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
A method to increase the speed and lower the memory footprint of existing vision transformers.
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Scalable training for dense retrieval models.
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
🦜🔗 Build context-aware reasoning applications
LLM training code for Databricks foundation models
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Hackable and optimized Transformers building blocks, supporting a composable construction.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
An open-source implementation for training LLaVA-NeXT.
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
Residual Quantization with Implicit Neural Codebooks
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
State-of-the-Art Text Embeddings
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)