- Inner Mongolia University, China
- Hohhot
- ttslr.github.io
- @RuiLiu60711141
Starred repositories
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Python client for Baidu Yun / Baidu Netdisk (Personal Cloud Storage)
AudioLDM: Generate speech, sound effects, music and beyond, with text.
🌈 A collection of Beamer-style slide templates, in both PowerPoint and Keynote formats.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Audio generation using diffusion models, in PyTorch.
A timeline of the latest AI models for audio generation, starting in 2023!
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
⚡️ A practical visualization library for tabular analysis.
Rembg is a tool to remove image backgrounds
A collection of datasets for the purpose of emotion recognition/detection in speech.
PyTorch implementation of some attentions for Deep Learning Researchers.
🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project. It supports multi-turn ChatGPT conversations and may be the first open-source smart speaker project to support brain-computer interaction.
⛷ Lightweight Markdown app to help you write great sentences.
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Deep Performer: Score-to-audio music performance synthesis
A curated list of the latest breakthroughs in AI (in 2021) by release date with a clear video explanation, link to a more in-depth article, and code.
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
An implementation of Additive Attention
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Deep Speaker: an End-to-End Neural Speaker Embedding System.
🚀 AI拟声 (AI voice cloning): Clone a voice in 5 seconds to generate arbitrary speech in real-time
Thorsten-Voice: A free-to-use, offline, high-quality German TTS voice that should be available for every project without any licensing struggles.