MACS (Multi-Annotator Captioned Soundscapes)

Introduced by Martin-Morato et al. in Diversity and bias in audio captioning datasets

This is a dataset containing audio captions and corresponding audio tags for a number of 3930 audio files of the TAU Urban Acoustic Scenes 2019 development dataset (airport, public square, and park). The files were annotated using a web-based tool. Each file is annotated by multiple annotators that provided tags and a one-sentence description of the audio content.

Papers


Paper Code Results Date Stars

Dataset Loaders


Tasks


Similar Datasets


License


  • Other (Non commercial)

Modalities


Languages