2. About me
• Education
• NCU (MIS)、NCCU (CS)
• Work Experience
• Telecom big data Innovation
• AI projects
• Retail marketing technology
• User Group
• TW Spark User Group
• TW Hadoop User Group
• Taiwan Data Engineer Association Director
• Research
• Big Data/ ML/ AIOT/ AI Columnist
2
26. 總結
• 探討 Apriori 的算法用於平行計算等領域
• https://www.researchgate.net/publication/316749396_Parallel_Impl
ementation_of_Apriori_Algorithm_Based_on_MapReduce
26
Searching frequent patterns in transactional databases is considered as one of the most
important data mining problems and Apriori is one of the typical algorithms for this task.
Developing fast and efficient algorithms that can handle large volumes of data becomes
a challenging task due to the large databases. In this paper, we implement a parallel
Apriori algorithm based on MapReduce, which is a framework for processing huge
datasets on certain kinds of distributable problems using a large number of computers
(nodes). The experimental results demonstrate that the proposed algorithm can scale
well and efficiently process large datasets on commodity hardware.
本文並未介紹 MapReduce 概念,同學可自行研究