-
Trip
- Shanghai
-
11:06
(UTC +08:00) - [email protected]
- @shylockasr
- https://www.meta-speech.com
Lists (3)
Sort Name ascending (A-Z)
Stars
Talk to any LLM with hands-free voice interaction, voice interruption, Live2D taking face, and long-term memory running locally across platforms
Conversion between Traditional and Simplified Chinese
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
The codebase of our paper "Improving the Training of Rectified Flows"
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Train a 1B LLM with 1T tokens from scratch by personal
来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
https://hrl.boyuai.com/
Code for 'Textless Speech-to-Speech Translation With Limited Parallel Data'
CMMLU: Measuring massive multitask language understanding in Chinese
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
A trainable PyTorch reproduction of AlphaFold 3.
🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案,支持 OpenAI,Midjourney,Claude,讯飞星火,Stable Diffusion,DALL·E,ChatGLM,通义千问,腾讯混元,360 智脑,百川 AI,火山方舟,新必应,Gemini,Moonshot …
Speech, Language, Audio, Music Processing with Large Language Model