DQN-based OpenCL workload partition for performance optimization

Sanghyun Park, Taeweon Suh

Research output: Contribution to journalArticle

Abstract

This paper proposes a deep Q network (DQN)-based method for the workload partition problem in OpenCL. The DQN, a reinforcement learning algorithm, optimizes the workload partition for each processing unit by the self-training, based on the accumulated performance data on the computing environment. Our experiments reveal that the DQN-based partition provides the performance improvement by up to 62.2% and 6.9% in JPEG decoding, compared to the LuxMark-based and target-based partitions, respectively. The DQN is able to capture the low-level contention in slave devices such as caches and memory, and the communication bottleneck between devices, and reflect it to the workload partition ratio.

Original languageEnglish
JournalJournal of Supercomputing
DOIs
Publication statusPublished - 2019 Jan 1

Keywords

  • DQN
  • OpenCL
  • Workload partition

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Information Systems
  • Hardware and Architecture

Fingerprint Dive into the research topics of 'DQN-based OpenCL workload partition for performance optimization'. Together they form a unique fingerprint.

  • Cite this