TY - GEN
T1 - Improved association rule mining by modified trimming
AU - Hwang, Wontae
AU - Kim, Dongseung
PY - 2006
Y1 - 2006
N2 - This paper presents a new association mining algorithm that uses two phase sampling for shortening the execution time at the cost of precision of the mining result. Previous FAST (Finding Association by Sampling Technique) algorithm has the weakness in that it only considered the frequent 1-itemsets in trimming/growing, thus, it did not have ways of considering mulit-itemsets including 2-itemsets. The new algorithm reflects the multi-itemsets in sampling transactions. It improves the mining results by adjusting the counts of both missing itemsets and false itemsets. Experimentally on a representative synthetic database, the accuracy of 2-itemsets reaches 0.68 compared to 0.46 while it maintains the same quality.
AB - This paper presents a new association mining algorithm that uses two phase sampling for shortening the execution time at the cost of precision of the mining result. Previous FAST (Finding Association by Sampling Technique) algorithm has the weakness in that it only considered the frequent 1-itemsets in trimming/growing, thus, it did not have ways of considering mulit-itemsets including 2-itemsets. The new algorithm reflects the multi-itemsets in sampling transactions. It improves the mining results by adjusting the counts of both missing itemsets and false itemsets. Experimentally on a representative synthetic database, the accuracy of 2-itemsets reaches 0.68 compared to 0.46 while it maintains the same quality.
UR - http://www.scopus.com/inward/record.url?scp=34547374462&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547374462&partnerID=8YFLogxK
U2 - 10.1109/CIT.2006.101
DO - 10.1109/CIT.2006.101
M3 - Conference contribution
AN - SCOPUS:34547374462
SN - 076952687X
SN - 9780769526874
T3 - Proceedings - Sixth IEEE International Conference on Computer and Information Technology, CIT 2006
SP - 24
EP - 28
BT - Proceedings - Sixth IEEE International Conference on Computer and Information Technology, CIT 2006
PB - IEEE Computer Society
T2 - 6th IEEE International Conference on Computer and Information Technology, CIT 2006
Y2 - 20 September 2006 through 22 September 2006
ER -