Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes
The DALI dataset is a large dataset of time-aligned symbolic vocal melody notations (notes) and lyrics at four levels of granularity. DALI contains 5358 songs in its first version and 7756 in its second. In this article, we present the dataset, explain the tools developed to work with the data, and detail the approach