Datasets related to FSD

Datasets created within the Freesound Annotator platform can be downloaded from the Freesound Datasets Zenodo community page.

Freesound Dataset (FSD): The main dataset that we are collecting. It is currently under development, and we hope to make the first release in mid-2019. FSD will be a general-purpose dataset consisting of tens of thousands of audio clips from Freesound, organised using the AudioSet Ontology.
FSDKaggle2018: A dataset containing 11k audio clips and 18 hours of training data, unequally distributed across 41 classes of the AudioSet Ontology. It was collected for DCASE Challenge 2018 Task 2, which was run as the Kaggle competition Freesound General-Purpose Audio Tagging Challenge. It is described in our DCASE 2018 paper.
FSDnoisy18k: A dataset collected to foster the investigation of label noise in sound event classification. It contains 42.5 hours of audio across 20 sound classes, including a small amount of manually labeled data and a larger quantity of real-world noisy data. It is described on its companion site and in our ICASSP 2019 paper.