VPL-based big data analysis system: UDAS

Hyunjin Choi, Jangwon Gim, Young Duk Seo, Doo Kwon Baik

Research output: Contribution to journalArticlepeer-review

Abstract

Over the past five years, research on big data analysis has been actively conducted, and many services have been developed to find valuable data. However, low quality of raw data and data loss problem during data analysis make it difficult to perform accurate data analysis. With the enormous generation of both unstructured and structured data, refinement of data is becoming increasingly difficult. As a result, data refinement plays an important role in data analysis. In addition, as part of efforts to ensure research reproducibility, the importance of reuse of researcher data and research methods is increasing; however, the research on systems supporting such roles has not been conducted sufficiently. Therefore, in this paper, we propose a big data analysis system named the unified data analytics suite (UDAS) that focuses on data refinement. UDAS performs data refinement based on the big data platform and ensures the reusability and reproducibility of refinement and analysis through the visual programming language interface. It also recommends open source and visualization libraries to users for statistical analysis. The qualitative evaluation of UDAS using the functional evaluation factor of the big data analysis platform demonstrated that the average satisfaction of the users is significantly high.

Original languageEnglish
Article number8416666
Pages (from-to)40883-40897
Number of pages15
JournalIEEE Access
Volume6
DOIs
Publication statusPublished - 2018 Jul 19

Keywords

  • Clouds
  • Data analysis
  • Data refinement
  • Data visualization
  • R
  • Reproducibility of results

ASJC Scopus subject areas

  • Computer Science(all)
  • Materials Science(all)
  • Engineering(all)

Fingerprint Dive into the research topics of 'VPL-based big data analysis system: UDAS'. Together they form a unique fingerprint.

Cite this