Starred repositories
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Fast audio super-resolution from 16 kHz to 48 kHz.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
(Pattern Recognition) PyTorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
PyTorch implementation of "Grandmaster-Level Chess Without Search" and "Stop Regressing: Training Value Functions via Classification for Scalable Deep RL"
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
Audio Denoiser System based on NVIDIA CleanUNet
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Multilingual Voice Understanding Model
Hands-on tutorial and automation stack for an operations-ready DigitalOcean Kubernetes (DOKS) cluster.
Example DigitalOcean Kubernetes workload with service exposed through a DO load-balancer.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
An MCP server that lets LLMs gain context about shadcn ui component structure, usage, and installation; compatible with React, Svelte 5, Vue, and React Native

