An evaluation of sampling design for estimating an epidemiologic volume of diabetes and for assessing present status of its control in Korea

Ji Sung Lee, Jaiyong Kim, Sei-Hyun Baik, Ie Byung Park, Juneyoung Lee

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Objectives : An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation. Methods : We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a patient-hospital profile that had 3,676,164 cases, and then to a patient profile that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices. Results : Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04% (95% prediction range, 49.83 - 64.24%) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into metropolitan/city/county and the types of hospital into tertiary/secondary/primary/clinic with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3%. Conclusions : The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.

Original languageEnglish
Pages (from-to)135-142
Number of pages8
JournalJournal of Preventive Medicine and Public Health
Volume42
Issue number2
DOIs
Publication statusPublished - 2009 Mar 1

Fingerprint

Korea
County Hospitals
Urban Hospitals
Health Insurance
Insurance
Sample Size
Patient Selection
Prescriptions
Databases
Therapeutics

Keywords

  • Diabetes mellitus
  • Epidemiologic indices
  • Insurance claim review data
  • Simulation
  • Stratified two-stage cluster sampling

ASJC Scopus subject areas

  • Public Health, Environmental and Occupational Health

Cite this

@article{66d4269456ef40429174f07ad0b2bc67,
title = "An evaluation of sampling design for estimating an epidemiologic volume of diabetes and for assessing present status of its control in Korea",
abstract = "Objectives : An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation. Methods : We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a patient-hospital profile that had 3,676,164 cases, and then to a patient profile that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices. Results : Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04{\%} (95{\%} prediction range, 49.83 - 64.24{\%}) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into metropolitan/city/county and the types of hospital into tertiary/secondary/primary/clinic with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3{\%}. Conclusions : The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.",
keywords = "Diabetes mellitus, Epidemiologic indices, Insurance claim review data, Simulation, Stratified two-stage cluster sampling",
author = "Lee, {Ji Sung} and Jaiyong Kim and Sei-Hyun Baik and Park, {Ie Byung} and Juneyoung Lee",
year = "2009",
month = "3",
day = "1",
doi = "10.3961/jpmph.2009.42.2.135",
language = "English",
volume = "42",
pages = "135--142",
journal = "Journal of Preventive Medicine and Public Health",
issn = "1975-8375",
publisher = "Korean Society for Preventive Medicine",
number = "2",

}

TY - JOUR

T1 - An evaluation of sampling design for estimating an epidemiologic volume of diabetes and for assessing present status of its control in Korea

AU - Lee, Ji Sung

AU - Kim, Jaiyong

AU - Baik, Sei-Hyun

AU - Park, Ie Byung

AU - Lee, Juneyoung

PY - 2009/3/1

Y1 - 2009/3/1

N2 - Objectives : An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation. Methods : We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a patient-hospital profile that had 3,676,164 cases, and then to a patient profile that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices. Results : Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04% (95% prediction range, 49.83 - 64.24%) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into metropolitan/city/county and the types of hospital into tertiary/secondary/primary/clinic with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3%. Conclusions : The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.

AB - Objectives : An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation. Methods : We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a patient-hospital profile that had 3,676,164 cases, and then to a patient profile that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices. Results : Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04% (95% prediction range, 49.83 - 64.24%) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into metropolitan/city/county and the types of hospital into tertiary/secondary/primary/clinic with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3%. Conclusions : The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.

KW - Diabetes mellitus

KW - Epidemiologic indices

KW - Insurance claim review data

KW - Simulation

KW - Stratified two-stage cluster sampling

UR - http://www.scopus.com/inward/record.url?scp=67649550659&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67649550659&partnerID=8YFLogxK

U2 - 10.3961/jpmph.2009.42.2.135

DO - 10.3961/jpmph.2009.42.2.135

M3 - Article

VL - 42

SP - 135

EP - 142

JO - Journal of Preventive Medicine and Public Health

JF - Journal of Preventive Medicine and Public Health

SN - 1975-8375

IS - 2

ER -