Discourse Parsing in Videos: A Multi-modal Approach - Citation Graph | Papersgraph