Skip to content
View ttslr's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report ttslr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction" https://arxiv.org/abs/1906.05317

Python 672 124 Updated Nov 17, 2022

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,713 113 Updated Dec 13, 2024

Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).

Python 88 12 Updated Oct 18, 2024

Crowdsourced and Automatic Speech Prominence Estimation

Python 14 2 Updated Apr 12, 2024

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Python 104 7 Updated Mar 29, 2022

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

84 2 Updated Jul 16, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 7,973 1,006 Updated Dec 14, 2024

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

Python 144 33 Updated Oct 26, 2021

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,033 2,926 Updated Dec 13, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,550 209 Updated Aug 1, 2024

Use OpenAI's realtime API for a chatting with your documents

JavaScript 283 55 Updated Oct 6, 2024

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 1,869 300 Updated Dec 13, 2024

Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning

Python 142 15 Updated Dec 11, 2024

[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

C++ 28 3 Updated Mar 20, 2024

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,570 1,940 Updated Apr 4, 2024

ChatReviewer: 使用ChatGPT分析论文优缺点,提出改进建议

Python 1,307 115 Updated Nov 22, 2024

Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))

Python 37 1 Updated Aug 6, 2024

A comprehensive overview of affective computing research in the era of large language models (LLMs).

16 2 Updated Aug 7, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,317 87 Updated Jul 22, 2024

Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition

Python 14 1 Updated Jul 16, 2024
Python 7 Updated Nov 12, 2024

The source code for the paper titled "Sentiment Knowledge Enhanced Attention Fusion Network (SKEAFN)".

Python 23 3 Updated Aug 17, 2023

[ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"

Python 8 2 Updated Jul 6, 2024

SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a traditional SiMT model for policy-decision to achieve SiMT th…

Python 15 2 Updated Feb 22, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Jupyter Notebook 3 2 Updated Aug 21, 2023

Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset

Python 25 1 Updated Sep 1, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,569 461 Updated Nov 21, 2024

Reliable Conflictive Multi-view Learning

Python 71 6 Updated Mar 24, 2024

Source code for the paper 'Audio Captioning Transformer'

Jupyter Notebook 51 3 Updated Jan 18, 2022
Next