TY - GEN
T1 - Reducing last level cache pollution in NUMA multicore systems for improving cache performance
AU - An, Deukhyeon
AU - Kim, Jeehong
AU - Han, Junghyun
AU - Eom, Young Ik
PY - 2012
Y1 - 2012
N2 - Non-uniform memory architecture (NUMA) system has numerous nodes with shared last level cache (LLC). Their shared LLC has brought many benefits in the cache utilization. However, LLC can be seriously polluted by tasks that cause huge I/O traffic for a long time since inclusive cache architecture of LLC replaces valid cache line by back-invalidate. Many research on the page coloring, partitioning, and pollute buffer mechanism handled this cache pollution. But, there are no scheduling approaches considering I/O-intensive tasks in NUMA systems. To address the above problem, OS scheduling that reduces cache pollution is highly needed in NUMA systems. In this paper, we propose a software-based mechanism that reduces shared LLC miss in NUMA systems. Our mechanism includes I/O traffic measurement and devil conscious scheduling. The experimental results show that LLC miss rate can be reduced up to 37.6%, and our approach improves execution time to 1.48%.
AB - Non-uniform memory architecture (NUMA) system has numerous nodes with shared last level cache (LLC). Their shared LLC has brought many benefits in the cache utilization. However, LLC can be seriously polluted by tasks that cause huge I/O traffic for a long time since inclusive cache architecture of LLC replaces valid cache line by back-invalidate. Many research on the page coloring, partitioning, and pollute buffer mechanism handled this cache pollution. But, there are no scheduling approaches considering I/O-intensive tasks in NUMA systems. To address the above problem, OS scheduling that reduces cache pollution is highly needed in NUMA systems. In this paper, we propose a software-based mechanism that reduces shared LLC miss in NUMA systems. Our mechanism includes I/O traffic measurement and devil conscious scheduling. The experimental results show that LLC miss rate can be reduced up to 37.6%, and our approach improves execution time to 1.48%.
KW - Cache Performance
KW - Cache Pollution
KW - I/O Intensive Task
KW - Last Level Cache
KW - NUMA Scheduling
KW - Task Characteristics
UR - http://www.scopus.com/inward/record.url?scp=84863952156&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863952156&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-31137-6_21
DO - 10.1007/978-3-642-31137-6_21
M3 - Conference contribution
AN - SCOPUS:84863952156
SN - 9783642311369
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 272
EP - 282
BT - Computational Science and Its Applications - 12th International Conference, ICCSA 2012, Proceedings
T2 - 12th International Conference on Computational Science and Its Applications, ICCSA 2012
Y2 - 18 June 2012 through 21 June 2012
ER -