-
whisperX Public
Forked from m-bain/whisperXWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Python BSD 2-Clause "Simplified" License UpdatedJan 30, 2025 -
-
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Python Apache License 2.0 UpdatedOct 22, 2024 -
omegaconf Public
Forked from omry/omegaconfFlexible Python configuration system. The last one you will ever need.
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 14, 2024 -
audiotools Public
Forked from descriptinc/audiotoolsObject-oriented handling of audio data, with GPU-powered augmentations, and more.
Python MIT License UpdatedAug 13, 2024 -
pyannote-audio Public
Forked from pyannote/pyannote-audioNeural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Jupyter Notebook MIT License UpdatedJun 20, 2024 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedJun 20, 2024 -
demucs Public
Forked from facebookresearch/demucsCode for the paper Hybrid Spectrogram and Waveform Source Separation
Python MIT License UpdatedJun 14, 2024 -
CLAP Public
Forked from LAION-AI/CLAPContrastive Language-Audio Pretraining
Python Creative Commons Zero v1.0 Universal UpdatedApr 30, 2024 -
fadtk Public
Forked from microsoft/fadtkA simple library for Fréchet Audio Distance (FAD) calculation
Python MIT License UpdatedApr 13, 2024 -
-
frechet-audio-distance Public
Forked from gudgud96/frechet-audio-distanceA lightweight library for Frechet Audio Distance calculation.
Python MIT License UpdatedJan 23, 2024 -
keyfinder-py Public
Forked from evanpurkhiser/keyfinder-pyBasic python 3 bindings for libkeyfinder
C++ GNU General Public License v3.0 UpdatedJan 15, 2024 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 14, 2023 -
causal-conv1d Public
Forked from Dao-AILab/causal-conv1dCausal depthwise conv1d in CUDA, with a PyTorch interface
Cuda BSD 3-Clause "New" or "Revised" License UpdatedDec 7, 2023 -
sota-music-tagging-models Public
Forked from minzwon/sota-music-tagging-modelsPython MIT License UpdatedNov 11, 2023 -
descript-audio-codec Public
Forked from descriptinc/descript-audio-codecState-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Python MIT License UpdatedSep 13, 2023 -
google-research-python3-fad Public
Forked from google-research/google-researchGoogle Research (modified to support fad calculation from AudioCraft training code)
Jupyter Notebook Apache License 2.0 UpdatedAug 29, 2023