Journal article, Information Systems, 2019

Boosting decision stumps for dynamic feature selection on data streams

Abstract

Feature selection aims to identify which features of a dataset are relevant to the learning task. It is also widely used to improve computation times, reduce computational requirements, lessen the impact of the curse of dimensionality, and enhance the generalization rates of classifiers. In data streams, classifiers benefit from all of the above and, more importantly, from tracking the relevant subset of features, which may drift over time. In this paper, we propose a novel dynamic feature selection method for data streams called Adaptive Boosting for Feature Selection (ABFS). ABFS chains decision stumps and drift detectors and, as a result, identifies with reasonable success which features are relevant to the learning task as the stream progresses. In addition to the proposed algorithm, we bring feature selection-specific metrics from batch learning to streaming scenarios. We then evaluate ABFS according to these metrics in both synthetic and real-world scenarios. As a result, ABFS improves the classification rates of different types of learners and eventually enhances computational resource usage.
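
To make the idea of selecting features by boosting decision stumps concrete, the following is a minimal sketch and not the authors' ABFS implementation. It assumes a batch, AdaBoost-style simplification over one fixed window of labelled instances with labels in {+1, -1}, and it omits the drift detectors that ABFS chains with the stumps. All function names (`stump_predict`, `best_stump`, `select_features`) and the toy data are hypothetical.

```python
import math


def stump_predict(row, feature, threshold, polarity):
    """Predict `polarity` if row[feature] > threshold, otherwise -polarity."""
    return polarity if row[feature] > threshold else -polarity


def best_stump(X, y, weights):
    """Exhaustively search for the stump with the lowest weighted error.

    Returns a tuple (error, feature, threshold, polarity).
    """
    best = None
    for feature in range(len(X[0])):
        for threshold in sorted({row[feature] for row in X}):
            for polarity in (1, -1):
                error = sum(
                    w for row, label, w in zip(X, y, weights)
                    if stump_predict(row, feature, threshold, polarity) != label
                )
                if best is None or error < best[0]:
                    best = (error, feature, threshold, polarity)
    return best


def select_features(X, y, max_rounds=5):
    """Return the indices of the features chosen by successive boosted stumps."""
    n = len(X)
    weights = [1.0 / n] * n          # start from uniform instance weights
    selected = []
    for _ in range(max_rounds):
        error, feature, threshold, polarity = best_stump(X, y, weights)
        if error >= 0.5:             # no stump beats chance: stop
            break
        if feature not in selected:
            selected.append(feature)
        if error == 0:               # perfect stump: nothing left to reweight
            break
        # AdaBoost update: raise the weight of misclassified instances
        alpha = 0.5 * math.log((1.0 - error) / error)
        weights = [
            w * math.exp(-alpha * label * stump_predict(row, feature, threshold, polarity))
            for row, label, w in zip(X, y, weights)
        ]
        total = sum(weights)
        weights = [w / total for w in weights]
    return selected


# Toy window with three features (assumed data, for illustration only).
WINDOW = [
    [0.1, 0.9, 0.3],
    [0.8, 0.2, 0.4],
    [0.4, 0.7, 0.9],
    [0.9, 0.1, 0.8],
    [0.2, 0.3, 0.1],
    [0.7, 0.8, 0.6],
]
LABELS_BEFORE = [1, -1, 1, -1, -1, 1]   # label is +1 when feature 1 > 0.5
LABELS_AFTER = [-1, -1, 1, 1, -1, 1]    # label is +1 when feature 2 > 0.5

print(select_features(WINDOW, LABELS_BEFORE))  # [1]
print(select_features(WINDOW, LABELS_AFTER))   # [2]
```

The two calls mimic a feature drift: the same instances are relabelled so that a different feature becomes relevant, and re-running the selection picks up the change. In a streaming setting, a drift detector would be one way to decide when such a re-selection is needed, instead of re-running it on every window.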
Main file
is_boosting_rev2.pdf (792.02 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02339508, version 1 (31-01-2024)

Cite

Jean Paul Barddal, Fabrício Enembreck, Heitor Murilo Gomes, Albert Bifet, Bernhard Pfahringer. Boosting decision stumps for dynamic feature selection on data streams. Information Systems, 2019, 83, pp.13-29. ⟨10.1016/j.is.2019.02.003⟩. ⟨hal-02339508⟩