A framework to preserve the privacy of electronic health data streams

Soohyung Kim, Min Kyoung Sung, Yon Dohn Chung

Research output: Contribution to journalArticle

22 Citations (Scopus)

Abstract

The anonymization of health data streams is important to protect these data against potential privacy breaches. A large number of research studies aiming at offering privacy in the context of data streams has been recently conducted. However, the techniques that have been proposed in these studies generate a significant delay during the anonymization process, since they concentrate on applying existing privacy models (e.g., k-anonymity and l-diversity) to batches of data extracted from data streams in a period of time. In this paper, we present delay-free anonymization, a framework for preserving the privacy of electronic health data streams. Unlike existing works, our method does not generate an accumulation delay, since input streams are anonymized immediately with counterfeit values. We further devise late validation for increasing the data utility of the anonymization results and managing the counterfeit values. Through experiments, we show the efficiency and effectiveness of the proposed method for the real-time release of data streams.

Original languageEnglish
Pages (from-to)95-106
Number of pages12
JournalJournal of Biomedical Informatics
Volume50
DOIs
Publication statusPublished - 2014 Jan 1

Fingerprint

Privacy
Health
Experiments
Research
Data Anonymization

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this

A framework to preserve the privacy of electronic health data streams. / Kim, Soohyung; Sung, Min Kyoung; Chung, Yon Dohn.

In: Journal of Biomedical Informatics, Vol. 50, 01.01.2014, p. 95-106.

Research output: Contribution to journalArticle

@article{65103681298941a086d8dd8d1f7ec8b6,
title = "A framework to preserve the privacy of electronic health data streams",
abstract = "The anonymization of health data streams is important to protect these data against potential privacy breaches. A large number of research studies aiming at offering privacy in the context of data streams has been recently conducted. However, the techniques that have been proposed in these studies generate a significant delay during the anonymization process, since they concentrate on applying existing privacy models (e.g., k-anonymity and l-diversity) to batches of data extracted from data streams in a period of time. In this paper, we present delay-free anonymization, a framework for preserving the privacy of electronic health data streams. Unlike existing works, our method does not generate an accumulation delay, since input streams are anonymized immediately with counterfeit values. We further devise late validation for increasing the data utility of the anonymization results and managing the counterfeit values. Through experiments, we show the efficiency and effectiveness of the proposed method for the real-time release of data streams.",
keywords = "Anonymization, Health data stream, Privacy",
author = "Soohyung Kim and Sung, {Min Kyoung} and Chung, {Yon Dohn}",
year = "2014",
month = "1",
day = "1",
doi = "10.1016/j.jbi.2014.03.015",
language = "English",
volume = "50",
pages = "95--106",
journal = "Journal of Biomedical Informatics",
issn = "1532-0464",
publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - A framework to preserve the privacy of electronic health data streams

AU - Kim, Soohyung

AU - Sung, Min Kyoung

AU - Chung, Yon Dohn

PY - 2014/1/1

Y1 - 2014/1/1

N2 - The anonymization of health data streams is important to protect these data against potential privacy breaches. A large number of research studies aiming at offering privacy in the context of data streams has been recently conducted. However, the techniques that have been proposed in these studies generate a significant delay during the anonymization process, since they concentrate on applying existing privacy models (e.g., k-anonymity and l-diversity) to batches of data extracted from data streams in a period of time. In this paper, we present delay-free anonymization, a framework for preserving the privacy of electronic health data streams. Unlike existing works, our method does not generate an accumulation delay, since input streams are anonymized immediately with counterfeit values. We further devise late validation for increasing the data utility of the anonymization results and managing the counterfeit values. Through experiments, we show the efficiency and effectiveness of the proposed method for the real-time release of data streams.

AB - The anonymization of health data streams is important to protect these data against potential privacy breaches. A large number of research studies aiming at offering privacy in the context of data streams has been recently conducted. However, the techniques that have been proposed in these studies generate a significant delay during the anonymization process, since they concentrate on applying existing privacy models (e.g., k-anonymity and l-diversity) to batches of data extracted from data streams in a period of time. In this paper, we present delay-free anonymization, a framework for preserving the privacy of electronic health data streams. Unlike existing works, our method does not generate an accumulation delay, since input streams are anonymized immediately with counterfeit values. We further devise late validation for increasing the data utility of the anonymization results and managing the counterfeit values. Through experiments, we show the efficiency and effectiveness of the proposed method for the real-time release of data streams.

KW - Anonymization

KW - Health data stream

KW - Privacy

UR - http://www.scopus.com/inward/record.url?scp=84905270810&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84905270810&partnerID=8YFLogxK

U2 - 10.1016/j.jbi.2014.03.015

DO - 10.1016/j.jbi.2014.03.015

M3 - Article

C2 - 24704716

AN - SCOPUS:84905270810

VL - 50

SP - 95

EP - 106

JO - Journal of Biomedical Informatics

JF - Journal of Biomedical Informatics

SN - 1532-0464

ER -