arXiv cs.LG
SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling
arXiv:2506.14293v4 Announce Type: replace-cross Abstract: We present Sleeping-DISCO 9M, a large-scale pre-training dataset for music and song. To the best of our knowledge, there is no open-source, high-quality dataset representing popular and well-known songs for generative music modeling tasks such as text-music, mus