Skip to content
View huyanxin's full-sized avatar

Highlights

  • Pro

Block or report huyanxin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 182 11 Updated Aug 25, 2024

Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.

Python 32 8 Updated Mar 22, 2021
Python 7,165 561 Updated Jan 14, 2025

Target Speaker Extraction Toolkit

Python 139 16 Updated Nov 6, 2024

[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement

Python 35 1 Updated Dec 2, 2024

Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).

Python 17 7 Updated Aug 13, 2024

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,791 178 Updated Dec 29, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,324 192 Updated Aug 11, 2024

10W首中文歌词数据库

458 76 Updated Jun 13, 2021

Generate synthetic wind noise signals based on a wind speed profile.

Python 23 5 Updated Apr 23, 2024

LLM training in simple, raw C/CUDA

Cuda 25,070 2,861 Updated Oct 2, 2024

Stable Diffusion web UI

Python 146,033 27,378 Updated Dec 28, 2024

Synthesizes a room impulse response using a ray tracing simulation engine.

C 12 3 Updated Mar 22, 2017

Graph Neural Networks for Sound Source Localization

Jupyter Notebook 14 5 Updated Oct 31, 2023

Pitch detection and pitch tracking, voicing unvoicing detection (VAD),基音检测

MATLAB 93 21 Updated Apr 21, 2022

A python algorithm to change the pitch of the voice in real time

Python 13 1 Updated Dec 13, 2020

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Python 59 4 Updated Apr 4, 2024
Cuda 108 30 Updated Apr 11, 2024

An optimized neural network operator library for chips base on Xuantie CPU.

C 88 38 Updated Jun 26, 2024

" Music Style Transfer with Time-Varying Inversion of Diffusion Models"

Jupyter Notebook 37 4 Updated Jul 23, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,255 120 Updated Jul 11, 2024

Official implementation of Self-Remixing

Python 13 Updated Feb 3, 2024

多个SVC/TTS的C++推理库

C 1,030 123 Updated Oct 19, 2024

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

4,052 1,003 Updated Mar 27, 2024

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Python 776 118 Updated Dec 19, 2024

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,740 263 Updated Sep 15, 2024

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python 249 44 Updated Jan 1, 2025

Fast Independent Vector Extraction: Code and data to reproduce the results from the paper.

Python 22 12 Updated May 7, 2020
Next