4IK1d

Starred repositories

HillZhang1999 / MuCGEC

Open-source release of the MuCGEC Chinese error-correction dataset and SOTA text-correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"

Python 520 65 Updated Jun 9, 2023

zjunlp / OmniThink

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Python 404 51 Updated Feb 24, 2025

MiniMax-AI / MiniMax-01

Python 2,275 161 Updated Feb 24, 2025

danilop / multimodal-chat

A multimodal chat interface with many tools.

Python 117 16 Updated Feb 28, 2025

AriaUI / Aria-UI

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 328 32 Updated Feb 8, 2025

illuin-tech / colpali

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,552 130 Updated Feb 27, 2025

illuin-tech / vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Python 179 20 Updated Feb 26, 2025

mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

799 20 Updated Jul 31, 2024

voyage-ai / voyage-multimodal-3

16 Updated Nov 8, 2024

Fly2flies / Cross-modal-retrieval

Media computing practice assignment: image-text cross-modal retrieval

Python 38 11 Updated Dec 4, 2020

RhapsodyAILab / Awesome-MiniCPMV-Projects

11 1 Updated Aug 19, 2024

tablegpt / tablegpt-agent

A pre-built agent for TableGPT2.

Python 510 44 Updated Feb 24, 2025

Dao-AILab / fast-hadamard-transform

Fast Hadamard transform in CUDA, with a PyTorch interface

C 150 21 Updated May 24, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 1,871 224 Updated Mar 4, 2025

cvdfoundation / open-images-dataset

Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.

1,022 158 Updated May 4, 2020

vinchu / xiaohongshu-3

Forked from hkcityu/xiaohongshu

Xiaohongshu API for retrieving Xiaohongshu post content, comments, and other information

12 1 Updated Mar 29, 2019

liulu1550 / MoreAPI

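MoreAPI is an unofficial RESTful API platform for video platforms such as Douyin, Lemon8, Xiaohongshu, and Kuaishou. It supports parsing for Douyin videos, Xiaohongshu, Kuaishou, YouTube, and Bilibili.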

Python 86 4 Updated Feb 22, 2025

mushan0x0 / AI0x0.com

A general-purpose, multimodal, multi-model floating desktop assistant app for system-wide AI querying and generation

3,849 411 Updated Feb 6, 2025

Kiteflyingee / academic_prompts

Commonly used prompts for academic work

38 4 Updated Apr 12, 2023

ZJU-M3 / TableGPT-techreport

The report of a fine-tuned GPT model unifying tables, natural language, and commands.

108 7 Updated Nov 26, 2023

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,030 431 Updated Nov 21, 2024

huggingface / smollm

Everything about the SmolLM2 and SmolVLM family of models

Python 1,971 111 Updated Feb 20, 2025

RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,478 101 Updated Feb 20, 2025

facebookresearch / SpinQuant

Code repo for the paper "SpinQuant: LLM quantization with learned rotations"

Python 219 29 Updated Feb 14, 2025

openvinotoolkit / nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

Python 979 247 Updated Mar 3, 2025

Cuberick-Orion / Bi-Blip4CIR

The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning (WACV 2024)

Python 30 3 Updated Feb 7, 2024

davidnvq / grit

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Python 188 30 Updated May 9, 2023

jakespringer / echo-embeddings

Python 141 7 Updated Apr 17, 2024

silicx / LoRS_Distill

Code for our ICML'24 paper on multimodal dataset distillation

Python 35 2 Updated Oct 11, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,925 347 Updated Aug 7, 2024
