Association analysis in item response datasets

Eun Young Kwak, Hyeoncheol Kim

Research output: Contribution to journalArticle

Abstract

Association rule mining is a data mining technique used to find frequent patterns in a huge dataset. In this paper, we address the issues of its application to item response datasets, which is generally high multidimensional. The primary disadvantage about mining association rules in a high multidimensional dataset is the huge number of patterns that are discovered, most of which are trivial or uninteresting. In this paper, we introduce a new measure called suprisal that estimates the informativeness of transactional instances and attributes. Our approach to the item association analysis includes elimination of noisy and uninformative data using the surprisal first, and then generation of association rules of good quality. Experimental results on real datasets of national-level tests for Korean high school student show that the surprisal-based pruning improves quality of association rules in item response datasets significantly.

Original languageEnglish
Pages (from-to)913-920
Number of pages8
JournalWSEAS Transactions on Computers
Volume6
Issue number6
Publication statusPublished - 2007 Jun 1

Fingerprint

Association rules
Data mining
Students

Keywords

  • Association rule
  • Data mining
  • Interestingness measure
  • Item response analysis

ASJC Scopus subject areas

  • Computer Science (miscellaneous)

Cite this

Association analysis in item response datasets. / Kwak, Eun Young; Kim, Hyeoncheol.

In: WSEAS Transactions on Computers, Vol. 6, No. 6, 01.06.2007, p. 913-920.

Research output: Contribution to journalArticle

@article{74bc82cf2b914924803da785d2bfd946,
title = "Association analysis in item response datasets",
abstract = "Association rule mining is a data mining technique used to find frequent patterns in a huge dataset. In this paper, we address the issues of its application to item response datasets, which is generally high multidimensional. The primary disadvantage about mining association rules in a high multidimensional dataset is the huge number of patterns that are discovered, most of which are trivial or uninteresting. In this paper, we introduce a new measure called suprisal that estimates the informativeness of transactional instances and attributes. Our approach to the item association analysis includes elimination of noisy and uninformative data using the surprisal first, and then generation of association rules of good quality. Experimental results on real datasets of national-level tests for Korean high school student show that the surprisal-based pruning improves quality of association rules in item response datasets significantly.",
keywords = "Association rule, Data mining, Interestingness measure, Item response analysis",
author = "Kwak, {Eun Young} and Hyeoncheol Kim",
year = "2007",
month = "6",
day = "1",
language = "English",
volume = "6",
pages = "913--920",
journal = "WSEAS Transactions on Computers",
issn = "1109-2750",
publisher = "World Scientific and Engineering Academy and Society",
number = "6",

}

TY - JOUR

T1 - Association analysis in item response datasets

AU - Kwak, Eun Young

AU - Kim, Hyeoncheol

PY - 2007/6/1

Y1 - 2007/6/1

N2 - Association rule mining is a data mining technique used to find frequent patterns in a huge dataset. In this paper, we address the issues of its application to item response datasets, which is generally high multidimensional. The primary disadvantage about mining association rules in a high multidimensional dataset is the huge number of patterns that are discovered, most of which are trivial or uninteresting. In this paper, we introduce a new measure called suprisal that estimates the informativeness of transactional instances and attributes. Our approach to the item association analysis includes elimination of noisy and uninformative data using the surprisal first, and then generation of association rules of good quality. Experimental results on real datasets of national-level tests for Korean high school student show that the surprisal-based pruning improves quality of association rules in item response datasets significantly.

AB - Association rule mining is a data mining technique used to find frequent patterns in a huge dataset. In this paper, we address the issues of its application to item response datasets, which is generally high multidimensional. The primary disadvantage about mining association rules in a high multidimensional dataset is the huge number of patterns that are discovered, most of which are trivial or uninteresting. In this paper, we introduce a new measure called suprisal that estimates the informativeness of transactional instances and attributes. Our approach to the item association analysis includes elimination of noisy and uninformative data using the surprisal first, and then generation of association rules of good quality. Experimental results on real datasets of national-level tests for Korean high school student show that the surprisal-based pruning improves quality of association rules in item response datasets significantly.

KW - Association rule

KW - Data mining

KW - Interestingness measure

KW - Item response analysis

UR - http://www.scopus.com/inward/record.url?scp=34347384455&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34347384455&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:34347384455

VL - 6

SP - 913

EP - 920

JO - WSEAS Transactions on Computers

JF - WSEAS Transactions on Computers

SN - 1109-2750

IS - 6

ER -