Enhanced application of principal component analysis in machine learning for imputation of missing traffic data

Yoon Young Choi, Heeseung Shon, Young Ji Byon, Dong Kyu Kim, Seungmo Kang

Research output: Contribution to journalArticle

Abstract

Missing value imputation approaches have been widely used to support and maintain the quality of traffic data. Although the spatiotemporal dependency-based approaches can improve the imputation performance for large and continuous missing patterns, additionally considering traffic states can lead to more reliable results. In order to improve the imputation performances further, a section-based approach is also needed. This study proposes a novel approach for identifying traffic-states of different spots of road sections that comprise, namely, a section-based traffic state (SBTS), and determining their spatiotemporal dependencies customized for each SBTS, for missing value imputations. A principal component analysis (PCA) was employed, and angles obtained from the first principal component were used to identify the SBTSs. The pre-processing was combined with a support vector machine for developing the imputation model. It was found that the segmentation of the SBTS using the angles and considering the spatiotemporal dependency for each state by the proposed approach outperformed other existing models.

Original languageEnglish
Article number2149
JournalApplied Sciences (Switzerland)
Volume9
Issue number10
DOIs
Publication statusPublished - 2019 May 1

Fingerprint

machine learning
principal components analysis
Principal component analysis
traffic
Learning systems
Support vector machines
Processing
preprocessing
roads

Keywords

  • Machine learning
  • Missing value imputation
  • Principal component analysis
  • Support vector machine

ASJC Scopus subject areas

  • Materials Science(all)
  • Instrumentation
  • Engineering(all)
  • Process Chemistry and Technology
  • Computer Science Applications
  • Fluid Flow and Transfer Processes

Cite this

Enhanced application of principal component analysis in machine learning for imputation of missing traffic data. / Choi, Yoon Young; Shon, Heeseung; Byon, Young Ji; Kim, Dong Kyu; Kang, Seungmo.

In: Applied Sciences (Switzerland), Vol. 9, No. 10, 2149, 01.05.2019.

Research output: Contribution to journalArticle

@article{7c2a2d3d4a8b4bf1bf919f5824659648,
title = "Enhanced application of principal component analysis in machine learning for imputation of missing traffic data",
abstract = "Missing value imputation approaches have been widely used to support and maintain the quality of traffic data. Although the spatiotemporal dependency-based approaches can improve the imputation performance for large and continuous missing patterns, additionally considering traffic states can lead to more reliable results. In order to improve the imputation performances further, a section-based approach is also needed. This study proposes a novel approach for identifying traffic-states of different spots of road sections that comprise, namely, a section-based traffic state (SBTS), and determining their spatiotemporal dependencies customized for each SBTS, for missing value imputations. A principal component analysis (PCA) was employed, and angles obtained from the first principal component were used to identify the SBTSs. The pre-processing was combined with a support vector machine for developing the imputation model. It was found that the segmentation of the SBTS using the angles and considering the spatiotemporal dependency for each state by the proposed approach outperformed other existing models.",
keywords = "Machine learning, Missing value imputation, Principal component analysis, Support vector machine",
author = "Choi, {Yoon Young} and Heeseung Shon and Byon, {Young Ji} and Kim, {Dong Kyu} and Seungmo Kang",
year = "2019",
month = "5",
day = "1",
doi = "10.3390/app9102149",
language = "English",
volume = "9",
journal = "Applied Sciences (Switzerland)",
issn = "2076-3417",
publisher = "Multidisciplinary Digital Publishing Institute",
number = "10",

}

TY - JOUR

T1 - Enhanced application of principal component analysis in machine learning for imputation of missing traffic data

AU - Choi, Yoon Young

AU - Shon, Heeseung

AU - Byon, Young Ji

AU - Kim, Dong Kyu

AU - Kang, Seungmo

PY - 2019/5/1

Y1 - 2019/5/1

N2 - Missing value imputation approaches have been widely used to support and maintain the quality of traffic data. Although the spatiotemporal dependency-based approaches can improve the imputation performance for large and continuous missing patterns, additionally considering traffic states can lead to more reliable results. In order to improve the imputation performances further, a section-based approach is also needed. This study proposes a novel approach for identifying traffic-states of different spots of road sections that comprise, namely, a section-based traffic state (SBTS), and determining their spatiotemporal dependencies customized for each SBTS, for missing value imputations. A principal component analysis (PCA) was employed, and angles obtained from the first principal component were used to identify the SBTSs. The pre-processing was combined with a support vector machine for developing the imputation model. It was found that the segmentation of the SBTS using the angles and considering the spatiotemporal dependency for each state by the proposed approach outperformed other existing models.

AB - Missing value imputation approaches have been widely used to support and maintain the quality of traffic data. Although the spatiotemporal dependency-based approaches can improve the imputation performance for large and continuous missing patterns, additionally considering traffic states can lead to more reliable results. In order to improve the imputation performances further, a section-based approach is also needed. This study proposes a novel approach for identifying traffic-states of different spots of road sections that comprise, namely, a section-based traffic state (SBTS), and determining their spatiotemporal dependencies customized for each SBTS, for missing value imputations. A principal component analysis (PCA) was employed, and angles obtained from the first principal component were used to identify the SBTSs. The pre-processing was combined with a support vector machine for developing the imputation model. It was found that the segmentation of the SBTS using the angles and considering the spatiotemporal dependency for each state by the proposed approach outperformed other existing models.

KW - Machine learning

KW - Missing value imputation

KW - Principal component analysis

KW - Support vector machine

UR - http://www.scopus.com/inward/record.url?scp=85066625910&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85066625910&partnerID=8YFLogxK

U2 - 10.3390/app9102149

DO - 10.3390/app9102149

M3 - Article

VL - 9

JO - Applied Sciences (Switzerland)

JF - Applied Sciences (Switzerland)

SN - 2076-3417

IS - 10

M1 - 2149

ER -