Some of my colleagues have just released a huge dataset of sounds! More than 500 sound classes annotated from more than 2M videos. Check it out!
A sound vocabulary and dataset. AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2084320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of ...
36 plus ones
Shared publicly•View activity
Add a comment...