Association analysis in item response datasets

Eun Young Kwak, Hyeoncheol Kim

Research output: Contribution to journalArticlepeer-review

Abstract

Association rule mining is a data mining technique used to find frequent patterns in a huge dataset. In this paper, we address the issues of its application to item response datasets, which is generally high multidimensional. The primary disadvantage about mining association rules in a high multidimensional dataset is the huge number of patterns that are discovered, most of which are trivial or uninteresting. In this paper, we introduce a new measure called suprisal that estimates the informativeness of transactional instances and attributes. Our approach to the item association analysis includes elimination of noisy and uninformative data using the surprisal first, and then generation of association rules of good quality. Experimental results on real datasets of national-level tests for Korean high school student show that the surprisal-based pruning improves quality of association rules in item response datasets significantly.

Original languageEnglish
Pages (from-to)913-920
Number of pages8
JournalWSEAS Transactions on Computers
Volume6
Issue number6
Publication statusPublished - 2007 Jun

Keywords

  • Association rule
  • Data mining
  • Interestingness measure
  • Item response analysis

ASJC Scopus subject areas

  • Computer Science(all)

Fingerprint Dive into the research topics of 'Association analysis in item response datasets'. Together they form a unique fingerprint.

Cite this