MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration - Citation Graph | Papersgraph