[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The official repository of "Video assistant towards large language model makes everything easy"
🏦 Bank written-exam and interview experience and study materials (help you pass the bank interview and get an amazing bank offer!)
(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.
hzcxq / funNLP
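Forked from fighting41love/funNLP
Chinese and English sensitive words, language detection, Chinese and foreign mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …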
Massively Parallel Deep Reinforcement Learning. 🔥
CnSTD: A Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition"(CVPR 2021)
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
Google Research
Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)
All-in-one Toolbox for Computer Vision Research.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Google AI 2018 BERT pytorch implementation
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Dense Unsupervised Learning for Video Segmentation (NeurIPS*2021)
Code for Off The Beaten Sidewalk paper (
Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation