What Makes Training Multi-Modal Classification Networks Hard? - Citation Graph | Papersgraph