3260 papers • 126 benchmarks • 313 datasets
Automated feature engineering improves upon the traditional approach to feature engineering by automatically extracting useful and meaningful features from a set of related data tables with a framework that can be applied to any problem.
(Image credit: Papersgraph)
These leaderboards are used to track progress in automated-feature-engineering-6
No benchmarks available.
Use these libraries to find automated-feature-engineering-6 models and implementations
No datasets available.
No subtasks available.
This work tests auto-sklearn, TPOT, auto_ml, and H2O's AutoML solution against a compiled set of regression and classification datasets sourced from OpenML and finds that auto-Sklearn performs the best across classification datasets and TPOT performs thebest across regression datasets.
This paper implements an open source Tree-based Pipeline Optimization Tool (TPOT) in Python and shows that TPOT can design machine learning pipelines that provide a significant improvement over a basic machine learning analysis while requiring little to no input nor prior knowledge from the user.
This paper proposes Cardea, an extensible open-source automated machine learning framework encapsulating common prediction problems in the health domain and allows users to build predictive models with their own data.
A novel model architecture is suggested that combines three feature sets for visual content and motion to predict importance scores, and improves state-of-the-art results for SumMe, while being on par with the state of the art for TVSum dataset.
This review seeks to motivate a more expansive perspective on what constitutes an automated/autonomous ML system, alongside consideration of how best to consolidate those elements, and develops a conceptual framework to illustrate one possible way of fusing high-level mechanisms into an autonomous ML system.
This work introduces Layered TPOT, a modification to TPOT which aims to create pipelines equally good as the original, but in significantly less time, using a modified evolutionary algorithm.
Adding a benchmark result helps the community track progress.