SlideShare a Scribd company logo
1 of 16
Clear Lines Consulting · clear-lines.com 5/20/2013 · 1
F# Coding Dojo
A gentle introduction to Machine
Learning with F#
Clear Lines Consulting · clear-lines.com 5/20/2013 · 2
The goal tonight
» Take a Kaggle data science contest
» Write some code and have fun
» Write a classifier, from scratch, using F#
» Learn some Machine Learning concepts
» Stretch goal: send results to Kaggle
Clear Lines Consulting · clear-lines.com 5/20/2013 · 3
What you may need to know
Clear Lines Consulting · clear-lines.com 5/20/2013 · 4
Kaggle Digit Recognizer contest
» Full description on Kaggle.com
» Dataset: hand-written digits (0, 1, … , 9)
» Goal = automatically recognize digits
» Training sample = 50,000 examples
» Contest: predict 20,000 “unknown” digits
Clear Lines Consulting · clear-lines.com 5/20/2013 · 5
The data “looks like that”
1
Clear Lines Consulting · clear-lines.com 5/20/2013 · 6
Real data
» 28 x 28 pixels
» Grayscale: each pixel 0 (white) to 255 (black)
» Flattened: one record = Number + 784 Pixels
» CSV file
Clear Lines Consulting · clear-lines.com 5/20/2013 · 7
Illustration (simplified data)
Pixels (real: 784 fields, from 0 to 255)Actual Number
1,0,0,255,0,0,255,255,0,0,0,255,0,0,0,255,0
Clear Lines Consulting · clear-lines.com 5/20/2013 · 8
What’s a Classifier?
» “Give me an unknown data point, and I will
predict what class it belongs to”
» In this case, classes = 0, 1, 2, … 9
» Unknown data point = scanned digit, without
the class it belongs to
Clear Lines Consulting · clear-lines.com 5/20/2013 · 9
The KNN Classifier
» KNN = K-Nearest-Neighbors algorithm
» Given an unknown subject to classify,
» Look up all the known examples,
» Find the K closest examples,
» Take a majority vote,
» Predict what the majority says
Clear Lines Consulting · clear-lines.com 5/20/2013 · 10
Illustration: 1 nearest neighbor
1
0
?
Sample Unknown
Which item from the sample
is nearest / closest to the Unknown
item we want to predict?
Suppose we have just 2 examples in the sample,
and want to predict the class of Unknown
Clear Lines Consulting · clear-lines.com 5/20/2013 · 11
What does “close” mean?
» To define “close” we need a distance
» We can use the distance between images as a
measure for “close”
» Other distances can be used as well
» Note: Square root not important here
Clear Lines Consulting · clear-lines.com 5/20/2013 · 12
Illustration: 1 nearest neighbor
1
0
?
Sample Unknown
X
1
X
X
X
X
X
X
X
X
0
Differences
Let’s compute the distance
between Unknown and our
two examples…
Clear Lines Consulting · clear-lines.com 5/20/2013 · 13
Illustration: 1 nearest neighbor
1
0
?
Sample
Unknown
1
0
?

    
(255-0)2
(255-0)2
(255-0)2 (0-255)2 Etc… Distance = 721
Distance = 255
Clear Lines Consulting · clear-lines.com 5/20/2013 · 14
Illustration: 1 nearest neighbor
1
0
?
SampleUnknown The first example is closest
to our Unknown candidate:
we predict that Unknown
has the same Number, 1
Clear Lines Consulting · clear-lines.com 5/20/2013 · 15
Questions?
Clear Lines Consulting · clear-lines.com 5/20/2013 · 16
Let’s start coding!
» Code 1-nearest-neighbor classifier
» “Guided script” available at:
» Bit.ly/FSharp-ML-Dojo
» https://gist.github.com/mathias-
brandewinder/5558573

More Related Content

Similar to FSharp and Machine Learning Dojo

TE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptx
TE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptxTE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptx
TE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptxAbhijeetDhanrajSalve
 
BSSML16 L3. Clusters and Anomaly Detection
BSSML16 L3. Clusters and Anomaly DetectionBSSML16 L3. Clusters and Anomaly Detection
BSSML16 L3. Clusters and Anomaly DetectionBigML, Inc
 
VSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly DetectionVSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly DetectionBigML, Inc
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareTigerGraph
 
VSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet AllocationVSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet AllocationBigML, Inc
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionGirish Gore
 
A Novel Target Marketing Approach based on Influence Maximization
A Novel Target Marketing Approach based on Influence MaximizationA Novel Target Marketing Approach based on Influence Maximization
A Novel Target Marketing Approach based on Influence MaximizationSurendra Gadwal
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyAlon Bochman, CFA
 
Improve ML Predictions using Graph Analytics (today!)
Improve ML Predictions using Graph Analytics (today!)Improve ML Predictions using Graph Analytics (today!)
Improve ML Predictions using Graph Analytics (today!)Neo4j
 
Tutorial on Theory and Application of Generative Adversarial Networks
Tutorial on Theory and Application of Generative Adversarial NetworksTutorial on Theory and Application of Generative Adversarial Networks
Tutorial on Theory and Application of Generative Adversarial NetworksMLReview
 
DutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLDutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLBigML, Inc
 
Kaggle Days Brussels - Alberto Danese
Kaggle Days Brussels - Alberto DaneseKaggle Days Brussels - Alberto Danese
Kaggle Days Brussels - Alberto DaneseAlberto Danese
 
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )Pramit Choudhary
 
DutchMLSchool. Clusters and Anomalies
DutchMLSchool. Clusters and AnomaliesDutchMLSchool. Clusters and Anomalies
DutchMLSchool. Clusters and AnomaliesBigML, Inc
 
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...Lucidworks
 
MLSD18. Basic Transformations - BigML
MLSD18. Basic Transformations - BigMLMLSD18. Basic Transformations - BigML
MLSD18. Basic Transformations - BigMLBigML, Inc
 
Main principles of Data Science and Machine Learning
Main principles of Data Science and Machine LearningMain principles of Data Science and Machine Learning
Main principles of Data Science and Machine LearningNikolay Karelin
 

Similar to FSharp and Machine Learning Dojo (20)

TE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptx
TE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptxTE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptx
TE_B_10_INTERNSHIP_PPT_ANIKET_BHAVSAR.pptx
 
BSSML16 L3. Clusters and Anomaly Detection
BSSML16 L3. Clusters and Anomaly DetectionBSSML16 L3. Clusters and Anomaly Detection
BSSML16 L3. Clusters and Anomaly Detection
 
VSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly DetectionVSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly Detection
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA Hardware
 
VSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet AllocationVSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet Allocation
 
LR2. Summary Day 2
LR2. Summary Day 2LR2. Summary Day 2
LR2. Summary Day 2
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
 
A Novel Target Marketing Approach based on Influence Maximization
A Novel Target Marketing Approach based on Influence MaximizationA Novel Target Marketing Approach based on Influence Maximization
A Novel Target Marketing Approach based on Influence Maximization
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case Study
 
Improve ML Predictions using Graph Analytics (today!)
Improve ML Predictions using Graph Analytics (today!)Improve ML Predictions using Graph Analytics (today!)
Improve ML Predictions using Graph Analytics (today!)
 
Tutorial on Theory and Application of Generative Adversarial Networks
Tutorial on Theory and Application of Generative Adversarial NetworksTutorial on Theory and Application of Generative Adversarial Networks
Tutorial on Theory and Application of Generative Adversarial Networks
 
DutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLDutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End ML
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
L13. Cluster Analysis
L13. Cluster AnalysisL13. Cluster Analysis
L13. Cluster Analysis
 
Kaggle Days Brussels - Alberto Danese
Kaggle Days Brussels - Alberto DaneseKaggle Days Brussels - Alberto Danese
Kaggle Days Brussels - Alberto Danese
 
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
 
DutchMLSchool. Clusters and Anomalies
DutchMLSchool. Clusters and AnomaliesDutchMLSchool. Clusters and Anomalies
DutchMLSchool. Clusters and Anomalies
 
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
 
MLSD18. Basic Transformations - BigML
MLSD18. Basic Transformations - BigMLMLSD18. Basic Transformations - BigML
MLSD18. Basic Transformations - BigML
 
Main principles of Data Science and Machine Learning
Main principles of Data Science and Machine LearningMain principles of Data Science and Machine Learning
Main principles of Data Science and Machine Learning
 

Recently uploaded

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Recently uploaded (20)

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

FSharp and Machine Learning Dojo

  • 1. Clear Lines Consulting · clear-lines.com 5/20/2013 · 1 F# Coding Dojo A gentle introduction to Machine Learning with F#
  • 2. Clear Lines Consulting · clear-lines.com 5/20/2013 · 2 The goal tonight » Take a Kaggle data science contest » Write some code and have fun » Write a classifier, from scratch, using F# » Learn some Machine Learning concepts » Stretch goal: send results to Kaggle
  • 3. Clear Lines Consulting · clear-lines.com 5/20/2013 · 3 What you may need to know
  • 4. Clear Lines Consulting · clear-lines.com 5/20/2013 · 4 Kaggle Digit Recognizer contest » Full description on Kaggle.com » Dataset: hand-written digits (0, 1, … , 9) » Goal = automatically recognize digits » Training sample = 50,000 examples » Contest: predict 20,000 “unknown” digits
  • 5. Clear Lines Consulting · clear-lines.com 5/20/2013 · 5 The data “looks like that” 1
  • 6. Clear Lines Consulting · clear-lines.com 5/20/2013 · 6 Real data » 28 x 28 pixels » Grayscale: each pixel 0 (white) to 255 (black) » Flattened: one record = Number + 784 Pixels » CSV file
  • 7. Clear Lines Consulting · clear-lines.com 5/20/2013 · 7 Illustration (simplified data) Pixels (real: 784 fields, from 0 to 255)Actual Number 1,0,0,255,0,0,255,255,0,0,0,255,0,0,0,255,0
  • 8. Clear Lines Consulting · clear-lines.com 5/20/2013 · 8 What’s a Classifier? » “Give me an unknown data point, and I will predict what class it belongs to” » In this case, classes = 0, 1, 2, … 9 » Unknown data point = scanned digit, without the class it belongs to
  • 9. Clear Lines Consulting · clear-lines.com 5/20/2013 · 9 The KNN Classifier » KNN = K-Nearest-Neighbors algorithm » Given an unknown subject to classify, » Look up all the known examples, » Find the K closest examples, » Take a majority vote, » Predict what the majority says
  • 10. Clear Lines Consulting · clear-lines.com 5/20/2013 · 10 Illustration: 1 nearest neighbor 1 0 ? Sample Unknown Which item from the sample is nearest / closest to the Unknown item we want to predict? Suppose we have just 2 examples in the sample, and want to predict the class of Unknown
  • 11. Clear Lines Consulting · clear-lines.com 5/20/2013 · 11 What does “close” mean? » To define “close” we need a distance » We can use the distance between images as a measure for “close” » Other distances can be used as well » Note: Square root not important here
  • 12. Clear Lines Consulting · clear-lines.com 5/20/2013 · 12 Illustration: 1 nearest neighbor 1 0 ? Sample Unknown X 1 X X X X X X X X 0 Differences Let’s compute the distance between Unknown and our two examples…
  • 13. Clear Lines Consulting · clear-lines.com 5/20/2013 · 13 Illustration: 1 nearest neighbor 1 0 ? Sample Unknown 1 0 ?       (255-0)2 (255-0)2 (255-0)2 (0-255)2 Etc… Distance = 721 Distance = 255
  • 14. Clear Lines Consulting · clear-lines.com 5/20/2013 · 14 Illustration: 1 nearest neighbor 1 0 ? SampleUnknown The first example is closest to our Unknown candidate: we predict that Unknown has the same Number, 1
  • 15. Clear Lines Consulting · clear-lines.com 5/20/2013 · 15 Questions?
  • 16. Clear Lines Consulting · clear-lines.com 5/20/2013 · 16 Let’s start coding! » Code 1-nearest-neighbor classifier » “Guided script” available at: » Bit.ly/FSharp-ML-Dojo » https://gist.github.com/mathias- brandewinder/5558573