Unsupervised feature selection using weighted principal components

Seoung Bum Kim, Panaya Rattakorn

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

Feature selection has received considerable attention in various areas as a way to select informative features and to simplify the statistical model through dimensional reduction. One of the most widely used methods for dimensional reduction includes principal component analysis (PCA). Despite its popularity, PCA suffers from a lack of interpretability of the original feature because the reduced dimensions are linear combinations of a large number of original features. Traditionally, two or three dimensional loading plots provide information to identify important original features in the first few principal component dimensions. However, the interpretation of what constitutes a loading plot is frequently subjective, particularly when large numbers of features are involved. In this study, we propose an unsupervised feature selection method that combines weighted principal components (PCs) with a thresholding algorithm. The weighted PC is obtained by the weighted sum of the first k PCs of interest. Each of the k loading values in the weighted PC reflects the contribution of each individual feature. We also propose a thresholding algorithm that identifies the significant features. Our experimental results with both the simulated and real datasets demonstrated the effectiveness of the proposed unsupervised feature selection method.

Original languageEnglish
Pages (from-to)5704-5710
Number of pages7
JournalExpert Systems With Applications
Volume38
Issue number5
DOIs
Publication statusPublished - 2011 May

Keywords

  • Data mining
  • Feature selection
  • Principal component analysis
  • Unsupervised learning

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'Unsupervised feature selection using weighted principal components'. Together they form a unique fingerprint.

  • Cite this