Re-identification of medical records by optimum quasi-identifiers

Yong Ju Lee, Kyung Ho Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Recently, medical records are shared to online for a purpose of medical research and expert opinion. There is a problem with sharing the medical records. If someone knows the subject of the record by using various methods, it can result in an invasion of the patient's privacy. De-identification techniques are applicable to address the problem, however, de-identified data has a risk of re-identification. For this reason, if de-identification techniques are not sufficient, it may increase a risk of re-identification. On the contrary, if the techniques are too excessive, data utility may be damaged. Meanwhile, de-identified data can be re-identified from inference using background knowledge. The objective of this paper is to analyze the probability of re-identification according to inferable quasi-identifiers. We analyzed factors, inferable quasi-identifiers, which can be inferred from background knowledge. Then, we estimated the probability of re-identification from taking advantage of the factors. As a result, we determined the effect of the re-identification according to the type and the range of inferable quasi-identifiers. This paper contributes to a decision on de-identification target and level for protecting patient's privacy through a comparative analysis of the probability of re-identification according to the type and the range of inference.

Original languageEnglish
Title of host publication19th International Conference on Advanced Communications Technology
Subtitle of host publicationOpening Era of Smart Society, ICACT 2017 - Proceeding
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages428-435
Number of pages8
ISBN (Electronic)9788996865094
DOIs
Publication statusPublished - 2017 Mar 29
Event19th International Conference on Advanced Communications Technology, ICACT 2017 - Pyeongchang, Korea, Republic of
Duration: 2017 Feb 192017 Feb 22

Other

Other19th International Conference on Advanced Communications Technology, ICACT 2017
CountryKorea, Republic of
CityPyeongchang
Period17/2/1917/2/22

Keywords

  • De-identification
  • Medical records
  • Privacy
  • Quasi-identifier
  • Re-identification

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Lee, Y. J., & Lee, K. H. (2017). Re-identification of medical records by optimum quasi-identifiers. In 19th International Conference on Advanced Communications Technology: Opening Era of Smart Society, ICACT 2017 - Proceeding (pp. 428-435). [7890125] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.23919/ICACT.2017.7890125

Re-identification of medical records by optimum quasi-identifiers. / Lee, Yong Ju; Lee, Kyung Ho.

19th International Conference on Advanced Communications Technology: Opening Era of Smart Society, ICACT 2017 - Proceeding. Institute of Electrical and Electronics Engineers Inc., 2017. p. 428-435 7890125.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lee, YJ & Lee, KH 2017, Re-identification of medical records by optimum quasi-identifiers. in 19th International Conference on Advanced Communications Technology: Opening Era of Smart Society, ICACT 2017 - Proceeding., 7890125, Institute of Electrical and Electronics Engineers Inc., pp. 428-435, 19th International Conference on Advanced Communications Technology, ICACT 2017, Pyeongchang, Korea, Republic of, 17/2/19. https://doi.org/10.23919/ICACT.2017.7890125
Lee YJ, Lee KH. Re-identification of medical records by optimum quasi-identifiers. In 19th International Conference on Advanced Communications Technology: Opening Era of Smart Society, ICACT 2017 - Proceeding. Institute of Electrical and Electronics Engineers Inc. 2017. p. 428-435. 7890125 https://doi.org/10.23919/ICACT.2017.7890125
Lee, Yong Ju ; Lee, Kyung Ho. / Re-identification of medical records by optimum quasi-identifiers. 19th International Conference on Advanced Communications Technology: Opening Era of Smart Society, ICACT 2017 - Proceeding. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 428-435
@inproceedings{48393441cd1f4547ba0f5e03d2acd03a,
title = "Re-identification of medical records by optimum quasi-identifiers",
abstract = "Recently, medical records are shared to online for a purpose of medical research and expert opinion. There is a problem with sharing the medical records. If someone knows the subject of the record by using various methods, it can result in an invasion of the patient's privacy. De-identification techniques are applicable to address the problem, however, de-identified data has a risk of re-identification. For this reason, if de-identification techniques are not sufficient, it may increase a risk of re-identification. On the contrary, if the techniques are too excessive, data utility may be damaged. Meanwhile, de-identified data can be re-identified from inference using background knowledge. The objective of this paper is to analyze the probability of re-identification according to inferable quasi-identifiers. We analyzed factors, inferable quasi-identifiers, which can be inferred from background knowledge. Then, we estimated the probability of re-identification from taking advantage of the factors. As a result, we determined the effect of the re-identification according to the type and the range of inferable quasi-identifiers. This paper contributes to a decision on de-identification target and level for protecting patient's privacy through a comparative analysis of the probability of re-identification according to the type and the range of inference.",
keywords = "De-identification, Medical records, Privacy, Quasi-identifier, Re-identification",
author = "Lee, {Yong Ju} and Lee, {Kyung Ho}",
year = "2017",
month = "3",
day = "29",
doi = "10.23919/ICACT.2017.7890125",
language = "English",
pages = "428--435",
booktitle = "19th International Conference on Advanced Communications Technology",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Re-identification of medical records by optimum quasi-identifiers

AU - Lee, Yong Ju

AU - Lee, Kyung Ho

PY - 2017/3/29

Y1 - 2017/3/29

N2 - Recently, medical records are shared to online for a purpose of medical research and expert opinion. There is a problem with sharing the medical records. If someone knows the subject of the record by using various methods, it can result in an invasion of the patient's privacy. De-identification techniques are applicable to address the problem, however, de-identified data has a risk of re-identification. For this reason, if de-identification techniques are not sufficient, it may increase a risk of re-identification. On the contrary, if the techniques are too excessive, data utility may be damaged. Meanwhile, de-identified data can be re-identified from inference using background knowledge. The objective of this paper is to analyze the probability of re-identification according to inferable quasi-identifiers. We analyzed factors, inferable quasi-identifiers, which can be inferred from background knowledge. Then, we estimated the probability of re-identification from taking advantage of the factors. As a result, we determined the effect of the re-identification according to the type and the range of inferable quasi-identifiers. This paper contributes to a decision on de-identification target and level for protecting patient's privacy through a comparative analysis of the probability of re-identification according to the type and the range of inference.

AB - Recently, medical records are shared to online for a purpose of medical research and expert opinion. There is a problem with sharing the medical records. If someone knows the subject of the record by using various methods, it can result in an invasion of the patient's privacy. De-identification techniques are applicable to address the problem, however, de-identified data has a risk of re-identification. For this reason, if de-identification techniques are not sufficient, it may increase a risk of re-identification. On the contrary, if the techniques are too excessive, data utility may be damaged. Meanwhile, de-identified data can be re-identified from inference using background knowledge. The objective of this paper is to analyze the probability of re-identification according to inferable quasi-identifiers. We analyzed factors, inferable quasi-identifiers, which can be inferred from background knowledge. Then, we estimated the probability of re-identification from taking advantage of the factors. As a result, we determined the effect of the re-identification according to the type and the range of inferable quasi-identifiers. This paper contributes to a decision on de-identification target and level for protecting patient's privacy through a comparative analysis of the probability of re-identification according to the type and the range of inference.

KW - De-identification

KW - Medical records

KW - Privacy

KW - Quasi-identifier

KW - Re-identification

UR - http://www.scopus.com/inward/record.url?scp=85018501972&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85018501972&partnerID=8YFLogxK

U2 - 10.23919/ICACT.2017.7890125

DO - 10.23919/ICACT.2017.7890125

M3 - Conference contribution

SP - 428

EP - 435

BT - 19th International Conference on Advanced Communications Technology

PB - Institute of Electrical and Electronics Engineers Inc.

ER -