VPL-based Big Data Analysis System: UDAS

Hyunjin Choi, Jangwon Gim, Young Duk Seo, Doo Kwon Baik

Research output: Contribution to journalArticle

Abstract

Over the past five years, research on big data analysis has been actively conducted, and many services have been developed to find valuable data. However, low quality of raw data and data loss problem during data analysis make it difficult to perform accurate data analysis. With the enormous generation of both unstructured and structured data, refinement of data is becoming increasingly difficult. As a result, data refinement plays an important role in data analysis. In addition, as part of efforts to ensure research reproducibility, the importance of reuse of researcher data and research methods is increasing; however, the research on systems supporting such roles have not been conducted sufficiently. Therefore, in this paper, we propose a big data analysis system named the unified data analytics suite (UDAS) that focuses on data refinement. UDAS performs data refinement based on the big data platform and ensures the reusability and reproducibility of refinement and analysis through the visual programming language interface. It also recommends open source and visualization libraries to users for statistical analysis. The qualitative evaluation of UDAS using the functional evaluation factor of the big data analysis platform demonstrated that the average satisfaction of the users is significantly high.

Original languageEnglish
JournalIEEE Access
DOIs
Publication statusAccepted/In press - 2018 Jul 19

Fingerprint

Reusability
Computer programming languages
Big data
Statistical methods
Visualization

Keywords

  • Big Data
  • Clouds
  • Data analysis
  • Data analysis
  • Data refinement
  • Data visualization
  • Data visualization
  • R
  • Reproducibility of results
  • Statistical analysis
  • Synthetic aperture sonar
  • Tools

ASJC Scopus subject areas

  • Computer Science(all)
  • Materials Science(all)
  • Engineering(all)

Cite this

VPL-based Big Data Analysis System : UDAS. / Choi, Hyunjin; Gim, Jangwon; Seo, Young Duk; Baik, Doo Kwon.

In: IEEE Access, 19.07.2018.

Research output: Contribution to journalArticle

Choi, Hyunjin ; Gim, Jangwon ; Seo, Young Duk ; Baik, Doo Kwon. / VPL-based Big Data Analysis System : UDAS. In: IEEE Access. 2018.
@article{691b7a49dc9244f0b9a4e73f3b4ab063,
title = "VPL-based Big Data Analysis System: UDAS",
abstract = "Over the past five years, research on big data analysis has been actively conducted, and many services have been developed to find valuable data. However, low quality of raw data and data loss problem during data analysis make it difficult to perform accurate data analysis. With the enormous generation of both unstructured and structured data, refinement of data is becoming increasingly difficult. As a result, data refinement plays an important role in data analysis. In addition, as part of efforts to ensure research reproducibility, the importance of reuse of researcher data and research methods is increasing; however, the research on systems supporting such roles have not been conducted sufficiently. Therefore, in this paper, we propose a big data analysis system named the unified data analytics suite (UDAS) that focuses on data refinement. UDAS performs data refinement based on the big data platform and ensures the reusability and reproducibility of refinement and analysis through the visual programming language interface. It also recommends open source and visualization libraries to users for statistical analysis. The qualitative evaluation of UDAS using the functional evaluation factor of the big data analysis platform demonstrated that the average satisfaction of the users is significantly high.",
keywords = "Big Data, Clouds, Data analysis, Data analysis, Data refinement, Data visualization, Data visualization, R, Reproducibility of results, Statistical analysis, Synthetic aperture sonar, Tools",
author = "Hyunjin Choi and Jangwon Gim and Seo, {Young Duk} and Baik, {Doo Kwon}",
year = "2018",
month = "7",
day = "19",
doi = "10.1109/ACCESS.2018.2857845",
language = "English",
journal = "IEEE Access",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - VPL-based Big Data Analysis System

T2 - UDAS

AU - Choi, Hyunjin

AU - Gim, Jangwon

AU - Seo, Young Duk

AU - Baik, Doo Kwon

PY - 2018/7/19

Y1 - 2018/7/19

N2 - Over the past five years, research on big data analysis has been actively conducted, and many services have been developed to find valuable data. However, low quality of raw data and data loss problem during data analysis make it difficult to perform accurate data analysis. With the enormous generation of both unstructured and structured data, refinement of data is becoming increasingly difficult. As a result, data refinement plays an important role in data analysis. In addition, as part of efforts to ensure research reproducibility, the importance of reuse of researcher data and research methods is increasing; however, the research on systems supporting such roles have not been conducted sufficiently. Therefore, in this paper, we propose a big data analysis system named the unified data analytics suite (UDAS) that focuses on data refinement. UDAS performs data refinement based on the big data platform and ensures the reusability and reproducibility of refinement and analysis through the visual programming language interface. It also recommends open source and visualization libraries to users for statistical analysis. The qualitative evaluation of UDAS using the functional evaluation factor of the big data analysis platform demonstrated that the average satisfaction of the users is significantly high.

AB - Over the past five years, research on big data analysis has been actively conducted, and many services have been developed to find valuable data. However, low quality of raw data and data loss problem during data analysis make it difficult to perform accurate data analysis. With the enormous generation of both unstructured and structured data, refinement of data is becoming increasingly difficult. As a result, data refinement plays an important role in data analysis. In addition, as part of efforts to ensure research reproducibility, the importance of reuse of researcher data and research methods is increasing; however, the research on systems supporting such roles have not been conducted sufficiently. Therefore, in this paper, we propose a big data analysis system named the unified data analytics suite (UDAS) that focuses on data refinement. UDAS performs data refinement based on the big data platform and ensures the reusability and reproducibility of refinement and analysis through the visual programming language interface. It also recommends open source and visualization libraries to users for statistical analysis. The qualitative evaluation of UDAS using the functional evaluation factor of the big data analysis platform demonstrated that the average satisfaction of the users is significantly high.

KW - Big Data

KW - Clouds

KW - Data analysis

KW - Data analysis

KW - Data refinement

KW - Data visualization

KW - Data visualization

KW - R

KW - Reproducibility of results

KW - Statistical analysis

KW - Synthetic aperture sonar

KW - Tools

UR - http://www.scopus.com/inward/record.url?scp=85050407551&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85050407551&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2018.2857845

DO - 10.1109/ACCESS.2018.2857845

M3 - Article

AN - SCOPUS:85050407551

JO - IEEE Access

JF - IEEE Access

SN - 2169-3536

ER -