An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data

Jaegul Choo, Hanseung Lee, Zhicheng Liu, John Stasko, Haesun Park

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Citations (Scopus)

Abstract

Many of the modern data sets such as text and image data can be represented in high-dimensional vector spaces and have benefited from computational methods that utilize advanced computational methods. Visual analytics approaches have contributed greatly to data understanding and analysis due to their capability of leveraging humans' ability for quick visual perception. However, visual analytics targeting large-scale data such as text and image data has been challenging due to the limited screen space in terms of both the numbers of data points and features to represent. Among various computational methods supporting visual analytics, dimension reduction and clustering have played essential roles by reducing these numbers in an intelligent way to visually manageable sizes. Given numerous dimension reduction and clustering methods available, however, the decision on the choice of algorithms and their parameters becomes difficult. In this paper, we present an interactive visual testbed system for dimension reduction and clustering in a large-scale high-dimensional data analysis. The testbed system enables users to apply various dimension reduction and clustering methods with different settings, visually compare the results from different algorithmic methods to obtain rich knowledge for the data and tasks at hand, and eventually choose the most appropriate path for a collection of algorithms and parameters. Using various data sets such as documents, images, and others that are already encoded in vectors, we demonstrate how the testbed system can support these tasks.

Original languageEnglish
Title of host publicationProceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013
DOIs
Publication statusPublished - 2013 Apr 10
Externally publishedYes
EventVisualization and Data Analysis 2013 - Burlingame, CA, United States
Duration: 2013 Feb 42013 Feb 6

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume8654
ISSN (Print)0277-786X

Conference

ConferenceVisualization and Data Analysis 2013
CountryUnited States
CityBurlingame, CA
Period13/2/413/2/6

Fingerprint

Dimension Reduction
High-dimensional Data
Testbeds
Testbed
Computational methods
Clustering
Visual Analytics
Computational Methods
vector spaces
visual perception
Vector spaces
Reduction Method
Clustering Methods
Visual Perception
Vision
Vector space
Data analysis
High-dimensional
Choose
Path

Keywords

  • clustering
  • dimension reduction
  • high-dimensional data
  • visual knowledge discovery

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Cite this

Choo, J., Lee, H., Liu, Z., Stasko, J., & Park, H. (2013). An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data. In Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013 [865402] (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 8654). https://doi.org/10.1117/12.2007316

An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data. / Choo, Jaegul; Lee, Hanseung; Liu, Zhicheng; Stasko, John; Park, Haesun.

Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013. 2013. 865402 (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 8654).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Choo, J, Lee, H, Liu, Z, Stasko, J & Park, H 2013, An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data. in Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013., 865402, Proceedings of SPIE - The International Society for Optical Engineering, vol. 8654, Visualization and Data Analysis 2013, Burlingame, CA, United States, 13/2/4. https://doi.org/10.1117/12.2007316
Choo J, Lee H, Liu Z, Stasko J, Park H. An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data. In Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013. 2013. 865402. (Proceedings of SPIE - The International Society for Optical Engineering). https://doi.org/10.1117/12.2007316
Choo, Jaegul ; Lee, Hanseung ; Liu, Zhicheng ; Stasko, John ; Park, Haesun. / An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data. Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013. 2013. (Proceedings of SPIE - The International Society for Optical Engineering).
@inproceedings{3bb6ad1c38984a1aac34d28f96177404,
title = "An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data",
abstract = "Many of the modern data sets such as text and image data can be represented in high-dimensional vector spaces and have benefited from computational methods that utilize advanced computational methods. Visual analytics approaches have contributed greatly to data understanding and analysis due to their capability of leveraging humans' ability for quick visual perception. However, visual analytics targeting large-scale data such as text and image data has been challenging due to the limited screen space in terms of both the numbers of data points and features to represent. Among various computational methods supporting visual analytics, dimension reduction and clustering have played essential roles by reducing these numbers in an intelligent way to visually manageable sizes. Given numerous dimension reduction and clustering methods available, however, the decision on the choice of algorithms and their parameters becomes difficult. In this paper, we present an interactive visual testbed system for dimension reduction and clustering in a large-scale high-dimensional data analysis. The testbed system enables users to apply various dimension reduction and clustering methods with different settings, visually compare the results from different algorithmic methods to obtain rich knowledge for the data and tasks at hand, and eventually choose the most appropriate path for a collection of algorithms and parameters. Using various data sets such as documents, images, and others that are already encoded in vectors, we demonstrate how the testbed system can support these tasks.",
keywords = "clustering, dimension reduction, high-dimensional data, visual knowledge discovery",
author = "Jaegul Choo and Hanseung Lee and Zhicheng Liu and John Stasko and Haesun Park",
year = "2013",
month = "4",
day = "10",
doi = "10.1117/12.2007316",
language = "English",
isbn = "9780819494276",
series = "Proceedings of SPIE - The International Society for Optical Engineering",
booktitle = "Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013",

}

TY - GEN

T1 - An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data

AU - Choo, Jaegul

AU - Lee, Hanseung

AU - Liu, Zhicheng

AU - Stasko, John

AU - Park, Haesun

PY - 2013/4/10

Y1 - 2013/4/10

N2 - Many of the modern data sets such as text and image data can be represented in high-dimensional vector spaces and have benefited from computational methods that utilize advanced computational methods. Visual analytics approaches have contributed greatly to data understanding and analysis due to their capability of leveraging humans' ability for quick visual perception. However, visual analytics targeting large-scale data such as text and image data has been challenging due to the limited screen space in terms of both the numbers of data points and features to represent. Among various computational methods supporting visual analytics, dimension reduction and clustering have played essential roles by reducing these numbers in an intelligent way to visually manageable sizes. Given numerous dimension reduction and clustering methods available, however, the decision on the choice of algorithms and their parameters becomes difficult. In this paper, we present an interactive visual testbed system for dimension reduction and clustering in a large-scale high-dimensional data analysis. The testbed system enables users to apply various dimension reduction and clustering methods with different settings, visually compare the results from different algorithmic methods to obtain rich knowledge for the data and tasks at hand, and eventually choose the most appropriate path for a collection of algorithms and parameters. Using various data sets such as documents, images, and others that are already encoded in vectors, we demonstrate how the testbed system can support these tasks.

AB - Many of the modern data sets such as text and image data can be represented in high-dimensional vector spaces and have benefited from computational methods that utilize advanced computational methods. Visual analytics approaches have contributed greatly to data understanding and analysis due to their capability of leveraging humans' ability for quick visual perception. However, visual analytics targeting large-scale data such as text and image data has been challenging due to the limited screen space in terms of both the numbers of data points and features to represent. Among various computational methods supporting visual analytics, dimension reduction and clustering have played essential roles by reducing these numbers in an intelligent way to visually manageable sizes. Given numerous dimension reduction and clustering methods available, however, the decision on the choice of algorithms and their parameters becomes difficult. In this paper, we present an interactive visual testbed system for dimension reduction and clustering in a large-scale high-dimensional data analysis. The testbed system enables users to apply various dimension reduction and clustering methods with different settings, visually compare the results from different algorithmic methods to obtain rich knowledge for the data and tasks at hand, and eventually choose the most appropriate path for a collection of algorithms and parameters. Using various data sets such as documents, images, and others that are already encoded in vectors, we demonstrate how the testbed system can support these tasks.

KW - clustering

KW - dimension reduction

KW - high-dimensional data

KW - visual knowledge discovery

UR - http://www.scopus.com/inward/record.url?scp=84875856709&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84875856709&partnerID=8YFLogxK

U2 - 10.1117/12.2007316

DO - 10.1117/12.2007316

M3 - Conference contribution

AN - SCOPUS:84875856709

SN - 9780819494276

T3 - Proceedings of SPIE - The International Society for Optical Engineering

BT - Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2013

ER -