Fast sort of floating-point data for data engineering

Changsoo Kim, Sungroh Yoon, Dongseung Kim

Research output: Contribution to journalArticle

Abstract

In this paper, a novel external sort algorithm that improves the speedup of the sorting of floating-point numbers has been described. Our algorithm decreases the computation time significantly by applying integer arithmetic on floating-point data in the IEEE-754 standard or similar formats. We conducted experiments with synthetic data on a 32-processor Linux cluster; in the case of the internal sort alone, the Giga-byte sorting achieved approximately fivefold speedups. Furthermore, the sorting achieved twofold or greater improvements over the typical parallel sort method, network of workstations (NOW)-sort. Moreover, the sorting scheme performance is independent of the computing platform. Thus, our sorting method can be successfully applied to binary search, data mining, numerical simulations, and graphics.

Original languageEnglish
Pages (from-to)50-54
Number of pages5
JournalAdvances in Engineering Software
Volume42
Issue number1-2
DOIs
Publication statusPublished - 2011

Keywords

  • Engineering simulation
  • External sort
  • Floating-point arithmetic
  • Message passing interface
  • Parallel sort
  • Workstation cluster

ASJC Scopus subject areas

  • Software
  • Engineering(all)

Fingerprint Dive into the research topics of 'Fast sort of floating-point data for data engineering'. Together they form a unique fingerprint.

  • Cite this