2. About me
• Education
• NCU (MIS)、NCCU (CS)
• Work Experience
• Telecom big data Innovation
• AI projects
• Retail marketing technology
• User Group
• TW Spark User Group
• TW Hadoop User Group
• Taiwan Data Engineer Association Director
• Research
• Big Data/ ML/ AIOT/ AI Columnist
2
3. Tutorial
Content
3
The Orange 3 introduction
Getting started unsupervised learning with
Orange3
Home work
What is the unsupervised learning/ K-Means
5. Orange Data Mining
• How to use Orange 3
• https://www.youtube.com/@OrangeDataMining
5
6. Download Orange for Windows
• A Python 3 data mining library with GUI.
• https://orangedatamining.com/screenshots/
• Widget catalogs
• Orange Data Mining - Widget catalog
6
13. Basic Orange3
• Use File/Datasets widget and display dataset with Data-Table and
Scatter plot
13
14. Supervised learning vs. Unsupervised learning
• Supervised learning: discover patterns in the data that relate data
attributes with a target (class) attribute.
• These patterns are then utilized to predict the values of the target attribute in
future data instances.
• Unsupervised learning: The data have no target attribute.
• We want to explore the data to find some intrinsic structures in them.
• Classic unsupervised learning algorithm
• Clustering algorithms
• Association rules
14
16. Clustering (K-means algorithm)
• Steps
• Define number K (cluster groups)
• Randomize the centroids of each cluster, calculating the summary
of distances of each data point to the centroids.
• Move the centroids and re-calculating the summary of distances,
until the summary of distances are in the convergency.
• https://www.youtube.com/watch?v=5I3Ei69I40s
16
23. Homework
• Use your own dataset to find the best number K cluster and
explain each cluster statistic information
• Use Groceries data.csv for association rules and demo it.
• https://orangedatamining.com/blog/2016/04/25/association-rules-
in-orange/
23