Open source no-code system for text annotation and building of text
OpenCompass is an LLM evaluation platform
Foundational model for human-like, expressive TTS
A generative speech model for daily dialogue
Train a 26M-parameter GPT from scratch in just 2h
Generate audiobooks from e-books
HY-Motion model for 3D character animation generation
AI-powered video clipping and highlight generation
Multilingual speech recognition and audio understanding model
A feature rich discord Modmail bot
Qwen2.5-VL is the multimodal large language model series
A Pragmatic VLA Foundation Model
A python tool that uses GPT-4, FFmpeg, and OpenCV
The best ChatGPT that $100 can buy
SOTA discrete acoustic codec models with 40/75 tokens per second
Implementation of the Surya Foundation Model for Heliophysics
Large Multimodal Models for Video Understanding and Editing
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Singing voice change based on whisper, lora for singing voice clone
Distributed training framework for TensorFlow, Keras, PyTorch, etc.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Get notified when your training ends