Stars
Musicpy is a music programming language in Python, designed for writing music with a very handy syntax built on music theory and algorithms.
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
The latent diffusion model for text-to-music generation.
Multi-task and multi-track music transcription for everyone
Supplementary material of "Deep Unsupervised Drum Transcription", ISMIR 2019
Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription
Automatic Drum Transcription software project.
Multilingual Voice Understanding Model
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, with support for more models possible in the future. It also supports multiple data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.
Omniscient Mozart: able to transcribe everything in the music, including vocals, drums, chords, beats, instruments, and more.
GUI for a Vocal Remover that uses Deep Neural Networks.
Model for MDX23 music separation contest
Fine-tune the Whisper speech recognition model to support training with or without timestamp data, as well as training without speech data. Accelerates inference and supports Web deployment.
High-Resolution Violin Transcription using Weak Labels
Port of OpenAI's Whisper model in C/C++
Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)
Isolate vocals, drums, bass, and other instrumental stems from any song
ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)
PyTorch implementation of an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
High-Resolution Image Synthesis with Latent Diffusion Models
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Speech enhancement / speech separation / sound source localization