Two-stage framework for visualization of clustered high dimensional data

Jaegul Choo, Shawn Bohn, Haesun Park

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

31 Citations (Scopus)

Abstract

In this paper, we discuss dimension reduction methods for 2D visualization of high dimensional clustered data. We propose a two-stage framework for visualizing such data based on dimension reduction methods. In the first stage, we obtain the reduced dimensional data by applying a supervised dimension reduction method, such as linear discriminant analysis, which preserves the original cluster structure in terms of its own optimization criterion. The resulting optimal reduced dimension depends on the optimization criterion and is often larger than 2. In the second stage, the dimension is further reduced to 2 for visualization purposes by another dimension reduction method, such as principal component analysis. The role of the second stage is to minimize the loss of information due to reducing the dimension all the way to 2. Using this framework, we propose several two-stage methods and present their theoretical characteristics as well as experimental comparisons on both artificial and real-world text data sets.
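The two stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes regularized linear discriminant analysis for the first stage and principal component analysis for the second, and the function names (`lda_stage`, `pca_stage`, `two_stage_2d`) and the `reg` parameter are hypothetical names chosen for this example. It also assumes at least three clusters, so that the first stage yields two or more dimensions.

```python
import numpy as np


def lda_stage(X, labels, reg=1e-3):
    """Stage 1 (assumed: regularized LDA). Returns data in at most k-1 dimensions."""
    classes = np.unique(labels)
    d = X.shape[1]
    mean_all = X.mean(axis=0)
    Sw = np.zeros((d, d))  # within-cluster scatter
    Sb = np.zeros((d, d))  # between-cluster scatter
    for c in classes:
        Xc = X[labels == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)
        diff = (mc - mean_all)[:, None]
        Sb += Xc.shape[0] * (diff @ diff.T)
    # Regularize so that Sw is positive definite even when it is singular.
    Sw += reg * np.eye(d)
    # Solve Sb g = lambda Sw g as a symmetric problem via Cholesky (Sw = L L^T).
    L = np.linalg.cholesky(Sw)
    Linv = np.linalg.inv(L)
    vals, vecs = np.linalg.eigh(Linv @ Sb @ Linv.T)
    top = np.argsort(vals)[::-1][: len(classes) - 1]
    G = Linv.T @ vecs[:, top]  # map eigenvectors back to the original space
    return X @ G


def pca_stage(Y, n_components=2):
    """Stage 2 (assumed: PCA). Projects centered data onto its top principal axes."""
    Yc = Y - Y.mean(axis=0)
    _, _, Vt = np.linalg.svd(Yc, full_matrices=False)
    return Yc @ Vt[:n_components].T


def two_stage_2d(X, labels, reg=1e-3):
    """Apply stage 1, then reduce its output to 2 dimensions for plotting."""
    return pca_stage(lda_stage(X, labels, reg), n_components=2)
```

With `k` clusters, `lda_stage` returns at most `k - 1` columns, which illustrates why the first-stage dimension is often larger than 2 and a second reduction is needed before plotting.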

Original language: English
Title of host publication: VAST 09 - IEEE Symposium on Visual Analytics Science and Technology, Proceedings
Pages: 67-74
Number of pages: 8
DOIs: https://doi.org/10.1109/VAST.2009.5332629
Publication status: Published - 2009
Event: VAST 09 - IEEE Symposium on Visual Analytics Science and Technology - Atlantic City, NJ, United States
Duration: 2009 Oct 12 to 2009 Oct 13

Publication series

Name: VAST 09 - IEEE Symposium on Visual Analytics Science and Technology, Proceedings

Conference

Conference: VAST 09 - IEEE Symposium on Visual Analytics Science and Technology
Country: United States
City: Atlantic City, NJ
Period: 09/10/12 to 09/10/13

Keywords

  • 2D projection
  • Clustered data
  • Dimension reduction
  • Generalized singular value decomposition
  • Linear discriminant analysis
  • Orthogonal centroid method
  • Principal component analysis
  • Regularization

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Information Systems


Cite this

Choo, J., Bohn, S., & Park, H. (2009). Two-stage framework for visualization of clustered high dimensional data. In VAST 09 - IEEE Symposium on Visual Analytics Science and Technology, Proceedings (pp. 67-74). [5332629] (VAST 09 - IEEE Symposium on Visual Analytics Science and Technology, Proceedings). https://doi.org/10.1109/VAST.2009.5332629