OCR software, free and offline
Open source no-code system for text annotation and building of text
CLIP, Predict the most relevant text snippet given an image
Spark-TTS Inference Code
A simple, high-quality voice conversion tool focused on ease of use
Code for running inference and finetuning with SAM 3 model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.1 models
PersonaPlex code
SOTA Open Source TTS
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Edit PDF files with Nano Banana
A Powerful Native Multimodal Model for Image Generation
Official inference repo for FLUX.2 models
An Open Source text-to-speech system built by inverting Whisper
Audiocraft is a library for audio processing and generation
Check code for common misspellings
Offline inference engine for art, real-time voice conversations
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
NLP Cloud serves high performance pre-trained or custom models for NER
Statusline plugin for vim with prompts for several other applications
Sample code and notebooks for Generative AI on Google Cloud
Code for the paper "Evaluating Large Language Models Trained on Code"
High-Resolution Image Synthesis with Latent Diffusion Models
Offline Text To Speech synthesis for python