3260 papers • 126 benchmarks • 313 datasets
Data summarization is a central problem in machine learning: given a large dataset, compute a small summary that faithfully represents the whole. Source: How to Solve Fair k-Center in Massive Data Models
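As a concrete illustration of summarization in the k-center setting the source paper mentions, here is a minimal sketch of Gonzalez's classic greedy 2-approximation, which repeatedly adds the point farthest from the current summary. This is a standard baseline, not the fair or massive-data algorithm from the paper itself.

```python
import numpy as np

def greedy_k_center(points, k, seed=0):
    """Gonzalez's greedy 2-approximation for k-center:
    repeatedly pick the point farthest from the current summary."""
    rng = np.random.default_rng(seed)
    n = len(points)
    centers = [int(rng.integers(n))]
    # distance from every point to its nearest chosen center so far
    dists = np.linalg.norm(points - points[centers[0]], axis=1)
    for _ in range(k - 1):
        nxt = int(np.argmax(dists))          # farthest point joins the summary
        centers.append(nxt)
        dists = np.minimum(dists, np.linalg.norm(points - points[nxt], axis=1))
    return centers

# three well-separated clusters; k=3 should pick one point from each
pts = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0], [10.0, 0.0]])
print(greedy_k_center(pts, 3))
```

Each iteration costs O(n), so the whole summary is computed in O(nk) time, which is why greedy k-center is a common starting point for massive-data variants.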
This work proposes to distill both images and their labels simultaneously, assigning each synthetic sample a "soft" label (a distribution over labels), and demonstrates that text distillation outperforms other methods across multiple datasets.
A fast and accurate data selection method is presented, in which the selected samples are optimized to span the subspace of all the data; it has linear complexity in the number of data points and no parameters to tune.
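One common way to select samples that span the data's subspace is column subset selection via QR with column pivoting. The sketch below is an illustration of that general idea, not the paper's exact algorithm; the function name `select_spanning_rows` is mine.

```python
import numpy as np
from scipy.linalg import qr

def select_spanning_rows(X, k):
    """Pick k rows of X that approximately span its row space,
    via QR with column pivoting applied to X^T. Pivoting greedily
    chooses the column (here: row of X) with the largest residual
    norm, a standard surrogate for subspace-spanning selection."""
    _, _, piv = qr(X.T, pivoting=True)
    return piv[:k]

# two duplicated directions; k=2 should recover one row per direction
X = np.array([[2., 0., 0.], [2., 0., 0.], [0., 1., 0.], [0., 1., 0.]])
print(select_spanning_rows(X, 2))
```

Pivoted QR costs O(ndk)-ish work for k pivots on an n-by-d matrix, which is why linear-time methods like the one summarized above are attractive at scale.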
This work introduces a more robust and flexible meta-learning algorithm for distillation, as well as an effective first-order strategy based on convex optimization layers, and shows it to be more effective than the prior image-based approach to dataset distillation.
A new Hermite-series-based sequential estimator for the Spearman rank correlation coefficient is described, and an exponentially weighted estimator is introduced, which allows the local nonparametric correlation of a bivariate data stream to be tracked.
Simulation studies and tests on real data show the Gauss-Hermite-based algorithms to be competitive with a leading existing algorithm, and they provide a solution to online distribution function and online quantile function estimation on data streams.
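To make "online quantile estimation on a data stream" concrete, here is a minimal Robbins-Monro-style quantile tracker: it nudges the estimate up or down by a small step depending on which side of it each new sample falls. This is a generic illustration of one-pass quantile tracking, not the Hermite-series method summarized above.

```python
import random

def track_quantile(stream, tau, lr=0.01, q0=0.0):
    """One-pass stochastic approximation of the tau-quantile:
    at the true quantile, a sample falls at or below q with
    probability tau, so the expected update lr*(tau - 1{x<=q})
    is zero and the estimate hovers there."""
    q = q0
    for x in stream:
        q += lr * (tau - (1.0 if x <= q else 0.0))
    return q

random.seed(0)
# median of Uniform(0, 1) samples should settle near 0.5
print(track_quantile((random.random() for _ in range(50_000)), 0.5))
```

The step size `lr` trades tracking speed against noise in the estimate, the same locality trade-off the exponentially weighted estimator above makes for correlation.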
This work formalizes the standard table summarization problem, which deals with tables conforming to a single predefined schema, and proposes a mixed hierarchical-attention-based encoder-decoder model that leverages the structure of the tables in addition to their content.
Experimental results on a real-world dataset and an image dataset show that the diversity of the samples produced under fairness constraints is not far from the unconstrained case, and a theoretical explanation for this is provided.
This paper proposes a novel online algorithm that can compute the nonparametric correlations 10 to 1,000 times faster than the corresponding batch algorithm, and it can compute them based either on all past observations or on fixed-size sliding windows.
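For context on what the online algorithm speeds up, here is the naive sliding-window baseline it is compared against: buffer the last `window` pairs and recompute Spearman's rho from scratch on every update. The class name `SlidingSpearman` is mine; the point is the per-update recomputation cost that the proposed algorithm avoids.

```python
from collections import deque
from scipy.stats import spearmanr

class SlidingSpearman:
    """Naive fixed-size sliding-window Spearman estimator:
    keep the last `window` (x, y) pairs and recompute the rank
    correlation in full each time a pair arrives. An online
    algorithm instead updates the estimate incrementally."""
    def __init__(self, window):
        self.xs = deque(maxlen=window)
        self.ys = deque(maxlen=window)

    def update(self, x, y):
        self.xs.append(x)
        self.ys.append(y)
        if len(self.xs) < 3:
            return float("nan")   # rho undefined for fewer than 3 pairs
        rho, _ = spearmanr(self.xs, self.ys)
        return rho

sw = SlidingSpearman(window=5)
for i in range(1, 11):            # y is a monotone function of x
    rho = sw.update(i, i * i)
print(rho)
```

Each update here re-sorts the window, so the cost grows with window size; that is exactly the overhead that makes an incremental formulation 10 to 1,000 times faster.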