-
fastembed
generating vector embeddings, reranking locally
-
qts_cli
Command-line tools for Qwen3 TTS synthesis and WAV output
-
large
Rust LLM inference implementation
-
async-dashscope
client for DashScope API
-
qts
Qwen3 TTS inference (GGUF + GGML); Rust API for host apps and gdext
-
qwen3-asr-rs
Pure Rust implementation of Qwen3 ASR (Automatic Speech Recognition) with libtorch and MLX backends
-
candle-pipelines
intuitive pipelines for local LLM inference in Rust, powered by Candle. Inspired by Python's Transformers library.
-
cuttle
A large language model inference engine in Rust
-
car-inference
Local model inference for CAR — Candle backend with Qwen3 models