Starred repositories

Latte: Cross-framework Python Package for Evaluation of Latent-based Generative Models

Python 36 3 Updated Feb 23, 2023

A simple and elegant Jekyll theme for an academic personal homepage

CSS 706 579 Updated Dec 18, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 11,737 11,435 Updated Jan 3, 2025

[ICLR'24] Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Python 4 Updated Aug 21, 2024

Multi-view-AE: An extensive collection of multi-modal autoencoders implemented in a modular, scikit-learn style framework.

Python 46 5 Updated Aug 1, 2024

Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering

Python 46 8 Updated Mar 26, 2023

Variational auto-encoders for audio

Python 115 20 Updated May 20, 2020

Audiogen Codec

Python 129 11 Updated Jul 9, 2024

Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

Python 120 9 Updated Nov 9, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 311 31 Updated Sep 29, 2024

Python packaging and dependency management made easy

Python 32,140 2,294 Updated Jan 5, 2025

This repo implements a Stable Diffusion model in PyTorch with all the essential components.

Python 161 35 Updated Nov 24, 2024

This repo implements Denoising Diffusion Probabilistic Models (DDPM) in Pytorch

Python 90 16 Updated Nov 25, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,783 228 Updated Jan 3, 2025

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,297 103 Updated Sep 24, 2023

AudioLDM training, finetuning, evaluation and inference.

Python 224 44 Updated Dec 13, 2024

Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge

Python 16 4 Updated Feb 10, 2023

[OBSOLETE] Plugin that adds OAuth2 login support to yt-dlp's YouTube extractors

Python 256 37 Updated Oct 29, 2024

A feature-rich command-line audio/video downloader

Python 95,449 7,488 Updated Dec 26, 2024

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Python 170 19 Updated May 29, 2024

Implementation of SpatialCodec.

Python 55 3 Updated Sep 23, 2023

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 244 23 Updated Mar 20, 2024

Stereo, Binaural, Surround -- The more the better

Python 1 Updated Dec 1, 2023

Learning audio concepts from natural language supervision

Python 515 39 Updated Sep 18, 2024

Contrastive Language-Audio Pretraining

Python 1,483 148 Updated Nov 21, 2024

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 784 96 Updated Sep 30, 2021
Jupyter Notebook 32 4 Updated Aug 11, 2024
Python 178 9 Updated Feb 14, 2024