A fine-grained scheduling strategy for improving the performance of parallel frequent itemsets mining

Chao Chin Wu, Lien Fu Lai, Liang Tsung Huang, Syun Sheng Jhan, Chung Lu

研究成果: Article同行評審

4 引文 斯高帕斯(Scopus)

摘要

We propose a scheduling strategy in this paper to address the load imbalance problem of the distributed parallel apriori (DPA) algorithm published recently. We use fine grained tasks that are derived by dividing the tasks defined by DPA into smaller subtasks. The subtasks will be scheduled by a dynamic self-scheduling scheme for better load balance. Furthermore, we propose two different methods for data transmission from the master to workers. The first one broadcasts all the frequent k-itemsets to all work nodes while the second one transmits only the required data to each individual work node. Experimental results demonstrate the proposed two approaches both outperform DPA. The first one is more suitable for small datasets and the second one provides steadier performance improvement no matter which self-scheduling scheme is adopted.

原文English
頁(從 - 到)264-274
頁數11
期刊International Journal of Computational Science and Engineering
6
發行號4
DOIs
出版狀態Published - 2011 十二月

All Science Journal Classification (ASJC) codes

  • Software
  • Modelling and Simulation
  • Hardware and Architecture
  • Computational Mathematics
  • Computational Theory and Mathematics

指紋 深入研究「A fine-grained scheduling strategy for improving the performance of parallel frequent itemsets mining」主題。共同形成了獨特的指紋。

引用此