Incremental data integration based on hierarchical metadata registry with data visibility

Dongwon Jeong, Doo Kwon Baik

Research output: Contribution to journal › Article

4 Citations (Scopus)

Abstract

Data integration based on metadata has been studied extensively. However, existing approaches incur a high cost to build an initial guideline. The main reason is that previous research has not seriously considered the properties of the corresponding domain, such as the data level and the user level. When the available cost is restricted, it is difficult in practice to create a standardized guideline for the entire data set, so a set of data to be integrated must be selected first. However, most databases have no statistical information that could be used to select such a set according to its usability. In this paper, we propose the LOG (localization-based global metadata registry) methodology, which builds a guideline and integrates databases progressively while considering the domain properties. The key idea is that the priorities of the databases to be integrated are determined by their relationship to the domain properties. We also present an implementation, applying the methodology to actual databases at the Korea Institute of Science and Technology Information, which builds and manages a considerable number of science and technology databases in Korea. LOG provides an incremental method for building a metadata registry and supports a progressive data integration mechanism over existing distributed databases. It is especially effective and efficient for creating a standard guideline when the available cost is restricted.
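The prioritization idea in the abstract can be illustrated with a small sketch. This is not the LOG methodology itself, whose scoring and selection rules are not given in the abstract. It is a plain greedy selection under a cost budget, and every name in it (Database, data_level, user_level, cost, priority, plan_increment, and the sample database names) is a hypothetical chosen for illustration, as is the equal-weight scoring rule.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Database:
    name: str
    data_level: float  # hypothetical score in [0, 1] for the data-level property
    user_level: float  # hypothetical score in [0, 1] for the user-level property
    cost: float        # hypothetical cost of standardizing this database's metadata


def priority(db: Database) -> float:
    # Assumed scoring rule: equal-weight average of the two domain properties.
    return (db.data_level + db.user_level) / 2


def plan_increment(candidates: list[Database], budget: float) -> list[Database]:
    """Greedily pick the highest-priority databases that still fit the budget."""
    selected = []
    remaining = budget
    for db in sorted(candidates, key=priority, reverse=True):
        if db.cost <= remaining:
            selected.append(db)
            remaining -= db.cost
    return selected


# Hypothetical candidates; the names and numbers are made up for illustration.
candidates = [
    Database("papers", data_level=0.9, user_level=0.8, cost=5.0),
    Database("patents", data_level=0.6, user_level=0.7, cost=4.0),
    Database("reports", data_level=0.4, user_level=0.3, cost=2.0),
]

first_round = plan_increment(candidates, budget=7.0)
print([db.name for db in first_round])  # ['papers', 'reports']
```

Databases left out of one round (here the hypothetical "patents") would simply be reconsidered when more budget becomes available, which is one way to read the incremental aspect described above.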

Original language: English
Pages (from-to): 147-181
Number of pages: 35
Journal: Information Sciences
Volume: 162
Issue number: 3-4
Publication status: Published - 2004 Jun 4

Keywords

  • Data visibility
  • Hierarchical MDR
  • Incremental data integration
  • MDR
  • Metadata registry

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Statistics, Probability and Uncertainty
  • Electrical and Electronic Engineering
  • Statistics and Probability
