2. About me
• Education
• NCU (MIS)、NCCU (CS)
• Work Experience
• Telecom big data Innovation
• AI projects
• Retail marketing technology
• User Group
• TW Spark User Group
• TW Hadoop User Group
• Taiwan Data Engineer Association Director
• Research
• Big Data/ ML/ AIOT/ AI Columnist
2
30. 總結
• 但 Apriori的算法擴展性較好,可以用於平行計算等領域
• https://www.researchgate.net/publication/316749396_Parallel_Impl
ementation_of_Apriori_Algorithm_Based_on_MapReduce
31
Searching frequent patterns in transactional databases is considered as one of the most
important data mining problems and Apriori is one of the typical algorithms for this task.
Developing fast and efficient algorithms that can handle large volumes of data becomes
a challenging task due to the large databases. In this paper, we implement a parallel
Apriori algorithm based on MapReduce, which is a framework for processing huge
datasets on certain kinds of distributable problems using a large number of computers
(nodes). The experimental results demonstrate that the proposed algorithm can scale
well and efficiently process large datasets on commodity hardware.
35. 延伸閱讀
• Frequent episode mining
• data streams in continuously and is stored as a sequence of time-stamped
events
• https://zimmermanna.users.greyc.fr/papers/journals/ida2014lirias-
version.pdf
36