Multi-label feature selection via adaptive label correlation estimation
Files
(Published version)
Date
2023
Authors
Zhang, Z.
Zhang, Z.
Yao, J.
Liu, L.
Li, J.
Wu, G.
Wu, X.
Editors
Advisors
Journal Title
Journal ISSN
Volume Title
Type:
Journal article
Citation
ACM Transactions on Knowledge Discovery from Data, 2023; 17(9):1-28
Statement of Responsibility
Conference Name
DOI
Abstract
In multi-label learning, each instance is associated with multiple labels simultaneously. Multi-label data often have noisy, irrelevant, and redundant features of high dimensionality. Multi-label feature selection has received considerable attention as an effective means for dealing with high-dimensional multi-label data. Many multi-label feature selection methods exploit label correlations to help select features. However, finding label correlations and selecting features in existing multi-label feature selection methods are often two separate processes, the existence of noises and outliers in training data makes the label correlations exploited from label space less reliable.
Therefore, the learned label correlations may mislead the feature selection process and result in the selection of less informative features. This article proposes a novel algorithm named ROAD, i.e., multi-label featuRe selectiOn via ADaptive label correlation estimation. ROAD jointly performs adaptive label correlation exploration and feature selection with alternating optimization to obtain reliable estimation of label correlations, which can more effectively reveal the intrinsic manifold structure among labels and lead to the selection of a more proper feature subset. Comprehensive experiments on several frequently used datasets validate the superiority of ROAD against the state-of-the-art multi-label feature selection algorithms.
School/Discipline
Dissertation Note
Provenance
Description
Access Status
Rights
Copyright 2023 The owner/author(s).
Access Condition Notes: Publication rights licensed to ACM. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.