ttslr

💭

I may be slow to respond.

RuiLiu ttslr

💭

I may be slow to respond.

微信公众号：IMU语音理解与生成实验室（IMU-S2Lab）

163 followers · 334 following

Inner Mongolia University, China
Hohhot
ttslr.github.io
@RuiLiu60711141

Achievements

Starred repositories

atcbosselut / comet-commonsense

Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction" https://arxiv.org/abs/1906.05317

Python 672 124 Updated Nov 17, 2022

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,713 113 Updated Dec 13, 2024

cyanbx / Prompt-Singer

Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).

Python 88 12 Updated Oct 18, 2024

interactiveaudiolab / emphases

Crowdsourced and Automatic Speech Prominence Estimation

Python 14 2 Updated Apr 12, 2024

facebookresearch / facestar

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Python 104 7 Updated Mar 29, 2022

fusiming3 / MARS

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

84 2 Updated Jul 16, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 7,973 1,006 Updated Dec 14, 2024

b04901014 / FT-w2v2-ser

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

Python 144 33 Updated Oct 26, 2021

datawhalechina / leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,033 2,926 Updated Dec 13, 2024

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,550 209 Updated Aug 1, 2024

run-llama / voice-chat-pdf

Use OpenAI's realtime API for a chatting with your documents

JavaScript 283 55 Updated Oct 6, 2024

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 1,869 300 Updated Dec 13, 2024

ZebangCheng / Emotion-LLaMA

Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning

Python 142 15 Updated Dec 11, 2024

NKU-HLT / KNN-CTC

[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

C++ 28 3 Updated Mar 20, 2024

kaixindelele / ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,570 1,940 Updated Apr 4, 2024

nishiwen1214 / ChatReviewer

ChatReviewer: 使用ChatGPT分析论文优缺点，提出改进建议

Python 1,307 115 Updated Nov 22, 2024

BenoitWang / Speech_Emotion_Diarization

Python 60 7 Updated Sep 13, 2024

Haoqiu-Yan / PerceptiveAgent

Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))

Python 37 1 Updated Aug 6, 2024

NEU-DataMining / awesome-affective-computing

A comprehensive overview of affective computing research in the era of large language models (LLMs).

16 2 Updated Aug 7, 2024

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,317 87 Updated Jul 22, 2024

Sreyan88 / LipGER

Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition

Python 14 1 Updated Jul 16, 2024

zhouzhouyang520 / IAMM

Python 7 Updated Nov 12, 2024

doubibobo / SKEAFN

The source code for the paper titled "Sentiment Knowledge Enhanced Attention Fusion Network (SKEAFN)".

Python 23 3 Updated Aug 17, 2023

JoeYing1019 / SDIF-DA

[ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"

Python 8 2 Updated Jul 6, 2024

ictnlp / SiLLM

SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a traditional SiMT model for policy-decision to achieve SiMT th…

Python 15 2 Updated Feb 22, 2024

tarepan / vocos-official

Forked from gemelo-ai/vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Jupyter Notebook 3 2 Updated Aug 21, 2023

RuiLiu ttslr

Starred repositories

audio-deepfake-detection

wavernn

perceptual-losses