RTranslator Public
Forked from niedev/RTranslator
RTranslator is the world's first open source real-time translation app.
C++ Apache License 2.0 Updated Jun 19, 2024
nnAudio Public
Forked from KinWaiCheuk/nnAudio
Audio processing using PyTorch 1D convolution networks
Python MIT License Updated Feb 13, 2024
fish-speech Public
Forked from fishaudio/fish-speech
Brand new TTS solution
Python BSD 3-Clause "New" or "Revised" License Updated Jan 19, 2024
aimoneyhunter Public
Forked from bleedline/aimoneyhunter
A collection of AI side-hustle ideas that teaches you how to use AI for side projects and earn extra income.
Updated Dec 15, 2023
INTERSPEECH-2023-Papers Public
Forked from DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code …
MIT License Updated Dec 5, 2023
3D-TransUNet Public
Forked from Beckschen/3D-TransUNet
This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"
Python Apache License 2.0 Updated Oct 17, 2023
DecryptPrompt Public
Forked from DSXiangLi/DecryptPrompt
A summary of Prompt and LLM papers, open-source data and models, and AIGC applications
Updated Aug 31, 2023
FCH-TTS Public
Forked from atomicoo/FCH-TTS
A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far).
Python MIT License Updated Mar 25, 2023
SiamTrackers Public
Forked from HonglinChu/SiamTrackers
(2020-2022) The PyTorch version of SiamFC, SiamRPN, DaSiamRPN, UpdateNet, SiamDW, SiamRPN++, SiamMask, SiamFC++, SiamCAR, SiamBAN, Ocean, LightTrack, TrTr, NanoTrack; Visual object tracking based on…
Python Apache License 2.0 Updated Mar 6, 2023
asv-subtools Public
Forked from Snowdar/asv-subtools
Open Source Tools for Speaker Recognition
Python Apache License 2.0 Updated Dec 30, 2022
Attention_Backend_for_ASV Public
Forked from nii-yamagishilab/Attention_Backend_for_ASV
Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances
Python BSD 3-Clause "New" or "Revised" License Updated Oct 27, 2022
ASRT_SpeechRecognition Public
Forked from nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
Python GNU General Public License v3.0 Updated Sep 26, 2022
pykaldi Public
Forked from pykaldi/pykaldi
A Python wrapper for Kaldi
Python Apache License 2.0 Updated Sep 18, 2022
qtrader Public
Forked from josephchenhk/qtrader
A Light Event-Driven Algorithmic Trading Engine
Python Updated Sep 7, 2022
audiomentations Public
Forked from iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Python MIT License Updated Aug 28, 2022
voicefilter Public
Forked from maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Python Updated Aug 24, 2022
VoiceprintRecognition-Tensorflow Public
Forked from yeyupiaoling/VoiceprintRecognition-Tensorflow
Voiceprint recognition implemented with TensorFlow
Python Apache License 2.0 Updated Jun 22, 2022
DeepFilterNet Public
Forked from Rikorose/DeepFilterNet
Noise suppression using deep filtering
Python Other Updated Jun 20, 2022
awesome-keyword-spotting Public
Forked from zycv/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
MIT License Updated May 23, 2022
w2v2-speaker Public
Forked from nikvaessen/w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
Python MIT License Updated May 10, 2022
SpeechAlgorithms Public
Forked from Ryuk17/SpeechAlgorithms
Speech Algorithms Collections
C Apache License 2.0 Updated Mar 21, 2022