Stars
Run Mixtral-8x7B models in Colab or consumer desktops
Extracts the compiled portion of the DeepSolo model's code
Lightweight version of Detectron2's config package, stripped of all superfluous requirements
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
agoryuno / DeepSolo
Forked from ViTAE-Transformer/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Text …
An ONNX exporter for the DeepSolo scene text recognition model
🔊 Text-Prompted Generative Audio Model
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multi…
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
We write your reusable computer vision tools. 💜
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Fast and memory-efficient exact attention
TorchCFM: a Conditional Flow Matching library
Vector (and Scalar) Quantization, in Pytorch
Firefox in a docker container with a control API
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
A Flask service to allow API access to ChatGPT in a browser
A multi-voice TTS system trained with an emphasis on quality
The landscape of biomedical research
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.