Stars
Command-line program to download videos from YouTube.com and other video sites
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
A natural language interface for computers
Scrapy, a fast high-level web crawling & scraping framework for Python.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
PyTorch Tutorial for Deep Learning Researchers
OpenMMLab Detection Toolbox and Benchmark
⚡ A Fast, Extensible Progress Bar for Python and CLI
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Datasets, Transforms and Models specific to Computer Vision
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
pix2code: Generating Code from a Graphical User Interface Screenshot
Hackable and optimized Transformers building blocks, supporting a composable construction.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Accessible large language models via k-bit quantization for PyTorch.
Official repo for consistency models.
A Unified Toolkit for Deep Learning Based Document Image Analysis