Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Reset Tasks
Multimodal
Visual Question Answering
Video-Text-to-Text
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
81
Full-text search
Edit filters
Sort: Trending
Active filters:
audio-to-audio
Clear all
omegalabsinc/omega-multimodal
Preview
•
Updated
1 minute ago
•
63.1k
•
31
litagin/ehehe-corpus
Viewer
•
Updated
Mar 31
•
16.4k
•
85
•
12
Matthijs/cmu-arctic-xvectors
Viewer
•
Updated
Feb 7, 2023
•
7.93k
•
19.2k
•
39
foldl/midi
Viewer
•
Updated
Jul 13, 2023
•
20.5k
•
110
•
6
PetraAI/PetraAI
Updated
Sep 14, 2023
•
306
•
20
Cnam-LMSSC/vibravox
Viewer
•
Updated
Nov 6
•
26.7k
•
40.5k
•
17
jhu-clsp/seamless-align
Preview
•
Updated
Jun 2
•
235
•
10
davidchan/anim400k
Updated
Jun 21
•
79
•
32
jhu-clsp/seamless-align-expressive
Updated
Feb 22
•
68
•
4
MomoyamaSawa/Voice-KusanagiNene
Viewer
•
Updated
Jan 29
•
3.89k
•
244
•
11
jaCappella/jaCappella
Updated
Feb 8
•
43
•
3
projecte-aina/commonvoice_benchmark_catalan_accents
Updated
May 16
•
222
•
3
herwoww/arabic_xvector_embeddings
Viewer
•
Updated
May 13
•
305
•
239
•
5
jspr/symbolic-jazz-standards
Viewer
•
Updated
Mar 4
•
709
•
49
•
3
IVLLab/MultiDialog
Updated
Aug 29
•
1.08k
•
12
espnet/ace-opencpop-segments
Viewer
•
Updated
Jul 16
•
106k
•
381
•
3
espnet/ace-kising-segments
Viewer
•
Updated
Sep 9
•
29k
•
210
•
4
projecte-aina/annotated_catalan_common_voice_v17
Updated
Nov 7
•
97
•
1
Kit-Lemonfoot/LemonfootVoiceDatasets
Updated
May 2
•
57
•
3
MushanW/GLOBE
Viewer
•
Updated
19 days ago
•
582k
•
718
•
27
MushanW/ESLTTS
Viewer
•
Updated
Jun 22
•
41.8k
•
154
•
1
espnet/mms_ulab_v2
Viewer
•
Updated
Jul 2
•
201k
•
1.27k
•
15
CanCLID/zoengjyutgaai_saamgwokjinji
Viewer
•
Updated
7 days ago
•
39.2k
•
4.43k
•
8
benjamin-paine/freesound-laion-640k
Viewer
•
Updated
Sep 7
•
506k
•
879
•
6
benjamin-paine/free-music-archive-medium
Viewer
•
Updated
Sep 7
•
24.8k
•
163
•
4
benjamin-paine/free-music-archive-large
Viewer
•
Updated
Sep 7
•
105k
•
211
•
6
benjamin-paine/free-music-archive-full
Viewer
•
Updated
Sep 15
•
107k
•
447
•
5
WaveGenAI/youtube-cc-by-music
Viewer
•
Updated
Oct 29
•
316k
•
85
•
8
JacobLinCool/VoiceBank-DEMAND-16k
Viewer
•
Updated
Oct 26
•
12.4k
•
419
•
1
longmaodata/Cantonese-ASR
Preview
•
Updated
22 days ago
•
40
•
1
Previous
1
2
3
Next