3260 papers • 126 benchmarks • 313 datasets
Abstractive text summarization by utilizing information from multiple modalities.
(Image credit: Papersgraph)
These leaderboards are used to track progress in multimodal-abstractive-text-summarization-8
Use these libraries to find multimodal-abstractive-text-summarization-8 models and implementations
No subtasks available.
A new model for Multimodal Abstractive Text Summarization that utilizes information from all three modalities – text, audio and video – in a multimodal video and presents a sequence-to-sequence trimodal hierarchical attention-based model that overcomes challenges by letting the model pay more attention to the text modality.
FinDSum, the first large-scale dataset for long text and multi-table summarization, is proposed and a set of evaluation metrics to assess the usage of numerical information in produced summaries is proposed.
Adding a benchmark result helps the community track progress.