SlideShare a Scribd company logo
1 of 46
Download to read offline
Data Science Company 
Machine Learning in Practice 
An InfoFarm Seminar 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Data 
Science 
Big 
Data 
Identifying, extracting and using data of all types 
and origins; exploring, correlating and using it in new 
and innovative ways in order to extract meaning 
and business value from it.
2 Data Scientists 4 Big Data 
Consultants 
1 Infrastructure 
Specialist 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Java 
PHP 
E-Commerce 
Mobile 
Web 
Development
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Agenda 
• 13:00 What is Machine Learning? 
• 13:30 Techniques 
• 14:30 Tools 
• 15:00 Practical examples 
• 16:00 Wrap up 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
What is Machine Learning? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Magic?
Machine Learning is a subfield of 
computer science and statistics that deals 
with systems that can learn from data, 
instead of follow explicitly programmed 
instructions. 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Machine Learning vs Data Science vs Big Data 
• You don’t need Big Data to leverage the 
benefits of machine learning, but more 
learning data makes a better machine 
• Data Science can help you to get the most 
out of Machine Learning 
• Machine Learning can help you to get the 
most out of Data Science 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Terminology 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Terminology 
Weight (g) Wingspan (cm) Webbed feet? Back color Species 
1000.1 125.0 No Brown Buteo jamaicenis 
3000.7 200.0 No Gray Sagittarius serpentarius 
3300.0 220.3 No Gray Sagittarius serpentarius 
4100.0 136.0 Yes Black Gavia Immer 
3.0 11.0 No Green Colothorax lucifer 
570.0 75.0 No Black Campephilus principalic 
• Features / attributes 
• Instance / data point 
• Label / target variable 
• Factorial versus Numeric versus Binary data
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Learning 
• Supervised Learning 
• Unsupervised Learning
Techniques 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Machine 
Learning 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Clustering 
Classification 
Association 
Rules 
Regression 
Information 
extraction
Classification 
• Predict a category for a given instance 
• Mostly supervised learning. 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Algorithms 
– Naïve Bayes 
– Support Vector Machine 
– Decision Trees 
– Neural Networks
Classification: Use Cases 
• Incoming mail redirection 
• Sentiment analysis 
• Order picking optimization 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Clustering 
• Try to find clusters in unstructured data 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Unsupervised learning 
• Algorithms: K-Means
Clustering: Use cases 
• Customer profiling 
• Grouping of shopping items 
• Recommendation systems 
• Fraud detection 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Association Rule Learning 
• Find interesting relations 
• Find frequent occurring patterns 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Algorithms 
– Apriori 
– Singular Value Decomposition 
– FP-growth
Association Rule Learning: Use Cases 
• Recommendations 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Data exploration 
• Find connections between unrelated 
events 
• Frequent pattern mining
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Regression 
• Prediction of a quantity 
• Algorithms: 
– Linear regression 
– Logistic regression
Regression: Use Cases 
• Order Quantity Prediction 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Lag analysis 
• Trend estimation
Information Extraction 
• Extract variables out of unstructured data 
like text. 
• Named Entity Extraction 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye 
Tools
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Apache Mahout 
Pro Contra 
Relatively stable Poor documentation 
Build on Hadoop – Scales well Mahout is currently migrating from 
Apache Hadoop to Apache Spark. 
Development is slow and Apache Spark 
already built a machine learning library of 
their own… Instant legacy? 
Command-line access for most algorithms Kind of slow for smaller use cases 
All important algorithms are available
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Weka 
Pro Contra 
A lot of algorithms are available Not ‘Big Data’ ready 
Graphical user interface for prototyping 
and experimenting 
Requires custom data format – ARRF-files 
Available as a Java library Optimized for academic use cases
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Apache Spark: MLLib 
Pro Contra 
Based on Apache Spark – Very, fast and 
scalable 
Based on Apache Spark – Requires 
knowledge of Spark and Scala 
Very fast development cycle, new features 
are rolling out every couple of months 
Relatively new, so a small choice of 
algorithms. But the essential ones are 
there. 
New and refreshing API, easy integration 
with other components of Apache Spark.
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
R 
Pro Contra 
A lot of algorithms are available Can run on Hadoop/Spark, but requires a 
lot of knowledge from both platforms 
Well documented Must learn a new language 
Lot’s of existing packages, that are easily 
available
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Noteworthy 
Java 
• DeepLearning4J 
• Mallet 
• MOA 
Python 
• NLTK 
• Theano 
• PyBrain 
• SciKit-Learn 
Lua 
• Torch 
General 
• LibSVM 
• LibLinear
Integration with Software Development 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Development Cycle 
Collect Analyze Extract Train Test Use
Feature extraction 
• Describe an instance to be 
used in an algorithm 
• Recognize hand-written digits 
by converting the images to 
lines of 1’s and 0’s 
00000000000001000000000000000000 
00000000000001110000000000000000 
00000000000011110000000000000000 
00000000001111100000000000000000 
00000000001111000000000000000000 
00000000000111100000000000000000 
00000000001111100000000000000000 
00000000011111000000000000000000 
00000000011110000000000000000000 
00000000111110000000000000000000 
00000000011111000000000000000000 
00000000111111000000000000000000 
00000000111110000000000000000000 
00000000111100000000000000000000 
00000000011110000000000000000000 
00000000111110000111000000000000 
00000001111111111111111100000000 
00000001111111111111111110000000 
00000001111111111111111110000000 
00000000111111111111111111100000 
00000001111111110000011111100000 
00000001111100000000000111100000 
00000000111100000000000111100000 
00000000011110000000000011110000 
00000000011111000000000011110000 
00000000011111100000001111110000 
00000000011111111111111111110000 
00000000011111111111111111100000 
00000000000111111111111111100000 
00000000000011111111111111100000 
00000000000000111111111000000000 
00000000000000001111110000000000 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Training an algorithm 
1. Collect you’re data as a collection of 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
instances 
2. Split you’re data set into a training set 
and a testing set 
3. Train the algorithm with the training set 
4. Validate the results using the test set
Runtime model 
• During training most algorithms generate a 
mathematical runtime model. 
• Model should be updated on a regular 
basis 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
A / B Testing 
• Slow integration in the main system. 
• If the machine is certain (enough) the 
machine can take over 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Hands-on 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Demo 
• K-Nearest Neighbour Classifier 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Clustering using Weka 
• Named-Entity Extraction 
• Classification of tweets
What’s in it for you? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Benefits of using machine learning 
• Automate repetitive tasks 
• Can be a solution for problems that are 
difficult to automate 
• Gain insights about your business 
• Optimize business decisions by using the 
opinion of the computer 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Questions? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Wrap-up 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye

More Related Content

What's hot

Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Edureka!
 
Machine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | EdurekaMachine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | EdurekaEdureka!
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning BasicsSuresh Arora
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics Akanksha Bali
 
Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overviewprih_yah
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachinePulse
 
Introduction to Machine Learning
Introduction to Machine Learning   Introduction to Machine Learning
Introduction to Machine Learning snehal_152
 
Machine Learning
Machine LearningMachine Learning
Machine LearningRahul Kumar
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learningbutest
 
Machine learning
Machine learningMachine learning
Machine learningeonx_32
 
Introduction to XGBoost
Introduction to XGBoostIntroduction to XGBoost
Introduction to XGBoostJoonyoung Yi
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning AlgorithmsHichem Felouat
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning TutorialAmr Rashed
 
Machine learning Algorithms
Machine learning AlgorithmsMachine learning Algorithms
Machine learning AlgorithmsWalaa Hamdy Assy
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Edureka!
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
 

What's hot (20)

Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | EdurekaMachine Learning in 10 Minutes | What is Machine Learning? | Edureka
Machine Learning in 10 Minutes | What is Machine Learning? | Edureka
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics
 
Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overview
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
Introduction to Machine Learning
Introduction to Machine Learning   Introduction to Machine Learning
Introduction to Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to XGBoost
Introduction to XGBoostIntroduction to XGBoost
Introduction to XGBoost
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning Algorithms
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
 
Machine learning Algorithms
Machine learning AlgorithmsMachine learning Algorithms
Machine learning Algorithms
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
 

Viewers also liked

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.butest
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learningbigdatasyd
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningLars Marius Garshol
 
Machine Learning and Data Mining: 10 Introduction to Classification
Machine Learning and Data Mining: 10 Introduction to ClassificationMachine Learning and Data Mining: 10 Introduction to Classification
Machine Learning and Data Mining: 10 Introduction to ClassificationPier Luca Lanzi
 
林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning台灣資料科學年會
 
Introduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep LearningIntroduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep LearningTerry Taewoong Um
 
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerceRetail Detail OmniChannel Congress 2015 - Data Science for e-commerce
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerceInfoFarm
 
Boosting big data with apache spark
Boosting big data with apache sparkBoosting big data with apache spark
Boosting big data with apache sparkInfoFarm
 
Data Driven Decisions seminar
Data Driven Decisions seminarData Driven Decisions seminar
Data Driven Decisions seminarInfoFarm
 
An explanation of machine learning for business
An explanation of machine learning for businessAn explanation of machine learning for business
An explanation of machine learning for businessClement Levallois
 
Big Data with Apache Hadoop
Big Data with Apache HadoopBig Data with Apache Hadoop
Big Data with Apache HadoopInfoFarm
 
First impressions of SparkR: our own machine learning algorithm
First impressions of SparkR: our own machine learning algorithmFirst impressions of SparkR: our own machine learning algorithm
First impressions of SparkR: our own machine learning algorithmInfoFarm
 
Fikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine LearningFikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine LearningSukru Hasdemir
 

Viewers also liked (20)

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning and Data Mining: 10 Introduction to Classification
Machine Learning and Data Mining: 10 Introduction to ClassificationMachine Learning and Data Mining: 10 Introduction to Classification
Machine Learning and Data Mining: 10 Introduction to Classification
 
林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning
 
Introduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep LearningIntroduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep Learning
 
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerceRetail Detail OmniChannel Congress 2015 - Data Science for e-commerce
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
 
Boosting big data with apache spark
Boosting big data with apache sparkBoosting big data with apache spark
Boosting big data with apache spark
 
Data Driven Decisions seminar
Data Driven Decisions seminarData Driven Decisions seminar
Data Driven Decisions seminar
 
An explanation of machine learning for business
An explanation of machine learning for businessAn explanation of machine learning for business
An explanation of machine learning for business
 
Big Data with Apache Hadoop
Big Data with Apache HadoopBig Data with Apache Hadoop
Big Data with Apache Hadoop
 
First impressions of SparkR: our own machine learning algorithm
First impressions of SparkR: our own machine learning algorithmFirst impressions of SparkR: our own machine learning algorithm
First impressions of SparkR: our own machine learning algorithm
 
Fikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine LearningFikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine Learning
 

Similar to Machine learning

Real Time Big Data
Real Time Big DataReal Time Big Data
Real Time Big DataInfoFarm
 
Automated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and TrackingAutomated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and TrackingDatabricks
 
SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)
SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)
SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)Laura Chiticariu
 
Strategies for Processing and Explaining Distributed Queries on Linked Data
Strategies for Processing and Explaining Distributed Queries on Linked DataStrategies for Processing and Explaining Distributed Queries on Linked Data
Strategies for Processing and Explaining Distributed Queries on Linked DataRakebul Hasan
 
Tips to get the most out of OpenERP
Tips to get the most out of OpenERPTips to get the most out of OpenERP
Tips to get the most out of OpenERPAudaxis
 
Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...
Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...
Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...Odoo
 
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementAn AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementDatabricks
 
Mortal analytics - Covid-19 and the problem of data quality
Mortal analytics - Covid-19 and the problem of data qualityMortal analytics - Covid-19 and the problem of data quality
Mortal analytics - Covid-19 and the problem of data qualityLars Albertsson
 
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSPYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSQuantUniversity
 
MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...
MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...
MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...MITRE - ATT&CKcon
 
Elasticsearch Performance Testing and Scaling @ Signal
Elasticsearch Performance Testing and Scaling @ SignalElasticsearch Performance Testing and Scaling @ Signal
Elasticsearch Performance Testing and Scaling @ SignalJoachim Draeger
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixC4Media
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079ibankuk
 
Don't build a data science team
Don't build a data science teamDon't build a data science team
Don't build a data science teamLars Albertsson
 
How I Learned to Stop Worrying and Love Building Data Products
How I Learned to Stop Worrying and Love Building Data ProductsHow I Learned to Stop Worrying and Love Building Data Products
How I Learned to Stop Worrying and Love Building Data ProductsAlejandro Correa Bahnsen, PhD
 

Similar to Machine learning (20)

Real Time Big Data
Real Time Big DataReal Time Big Data
Real Time Big Data
 
Automated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and TrackingAutomated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and Tracking
 
SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)
SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)
SystemT: Declarative Information Extraction (invited talk at MIT CSAIL)
 
Strategies for Processing and Explaining Distributed Queries on Linked Data
Strategies for Processing and Explaining Distributed Queries on Linked DataStrategies for Processing and Explaining Distributed Queries on Linked Data
Strategies for Processing and Explaining Distributed Queries on Linked Data
 
Big data
Big dataBig data
Big data
 
Tips to get the most out of OpenERP
Tips to get the most out of OpenERPTips to get the most out of OpenERP
Tips to get the most out of OpenERP
 
Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...
Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...
Tips to get the most out of OpenERP. Jean Luc Delsaute & Coralie Girardet, Au...
 
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementAn AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
 
Mortal analytics - Covid-19 and the problem of data quality
Mortal analytics - Covid-19 and the problem of data qualityMortal analytics - Covid-19 and the problem of data quality
Mortal analytics - Covid-19 and the problem of data quality
 
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSPYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
 
MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...
MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...
MITRE ATT&CKcon 2018: Hunters ATT&CKing with the Data, Roberto Rodriguez, Spe...
 
Elasticsearch Performance Testing and Scaling @ Signal
Elasticsearch Performance Testing and Scaling @ SignalElasticsearch Performance Testing and Scaling @ Signal
Elasticsearch Performance Testing and Scaling @ Signal
 
ML master class
ML master classML master class
ML master class
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFix
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
 
Don't build a data science team
Don't build a data science teamDon't build a data science team
Don't build a data science team
 
How I Learned to Stop Worrying and Love Building Data Products
How I Learned to Stop Worrying and Love Building Data ProductsHow I Learned to Stop Worrying and Love Building Data Products
How I Learned to Stop Worrying and Love Building Data Products
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
 

Recently uploaded

Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAshyamraj55
 
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IES VE
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7DianaGray10
 
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfJamie (Taka) Wang
 
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...UbiTrack UK
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6DianaGray10
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding TeamAdam Moalla
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Brian Pichman
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdfPedro Manuel
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URLRuncy Oommen
 
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfIaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfDaniel Santiago Silva Capera
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaborationbruanjhuli
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureEric D. Schabell
 
Computer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsComputer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsSeth Reyes
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationIES VE
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfinfogdgmi
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopBachir Benyammi
 
Introduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxIntroduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxMatsuo Lab
 

Recently uploaded (20)

Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
 
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7
 
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
 
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdf
 
201610817 - edge part1
201610817 - edge part1201610817 - edge part1
201610817 - edge part1
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URL
 
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfIaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability Adventure
 
Computer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and HazardsComputer 10: Lesson 10 - Online Crimes and Hazards
Computer 10: Lesson 10 - Online Crimes and Hazards
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdf
 
20150722 - AGV
20150722 - AGV20150722 - AGV
20150722 - AGV
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
 
Introduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxIntroduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptx
 

Machine learning

  • 1. Data Science Company Machine Learning in Practice An InfoFarm Seminar Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 2. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Data Science Big Data Identifying, extracting and using data of all types and origins; exploring, correlating and using it in new and innovative ways in order to extract meaning and business value from it.
  • 3. 2 Data Scientists 4 Big Data Consultants 1 Infrastructure Specialist Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 4. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Java PHP E-Commerce Mobile Web Development
  • 5. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 6. Agenda • 13:00 What is Machine Learning? • 13:30 Techniques • 14:30 Tools • 15:00 Practical examples • 16:00 Wrap up Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 7. What is Machine Learning? Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 8. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Magic?
  • 9. Machine Learning is a subfield of computer science and statistics that deals with systems that can learn from data, instead of follow explicitly programmed instructions. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 10. Machine Learning vs Data Science vs Big Data • You don’t need Big Data to leverage the benefits of machine learning, but more learning data makes a better machine • Data Science can help you to get the most out of Machine Learning • Machine Learning can help you to get the most out of Data Science Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 11. Terminology Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 12. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Terminology Weight (g) Wingspan (cm) Webbed feet? Back color Species 1000.1 125.0 No Brown Buteo jamaicenis 3000.7 200.0 No Gray Sagittarius serpentarius 3300.0 220.3 No Gray Sagittarius serpentarius 4100.0 136.0 Yes Black Gavia Immer 3.0 11.0 No Green Colothorax lucifer 570.0 75.0 No Black Campephilus principalic • Features / attributes • Instance / data point • Label / target variable • Factorial versus Numeric versus Binary data
  • 13. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Learning • Supervised Learning • Unsupervised Learning
  • 14. Techniques Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 15. Machine Learning Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Clustering Classification Association Rules Regression Information extraction
  • 16. Classification • Predict a category for a given instance • Mostly supervised learning. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Algorithms – Naïve Bayes – Support Vector Machine – Decision Trees – Neural Networks
  • 17. Classification: Use Cases • Incoming mail redirection • Sentiment analysis • Order picking optimization Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 18. Clustering • Try to find clusters in unstructured data Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Unsupervised learning • Algorithms: K-Means
  • 19. Clustering: Use cases • Customer profiling • Grouping of shopping items • Recommendation systems • Fraud detection Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 20. Association Rule Learning • Find interesting relations • Find frequent occurring patterns Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Algorithms – Apriori – Singular Value Decomposition – FP-growth
  • 21. Association Rule Learning: Use Cases • Recommendations Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Data exploration • Find connections between unrelated events • Frequent pattern mining
  • 22. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Regression • Prediction of a quantity • Algorithms: – Linear regression – Logistic regression
  • 23. Regression: Use Cases • Order Quantity Prediction Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Lag analysis • Trend estimation
  • 24. Information Extraction • Extract variables out of unstructured data like text. • Named Entity Extraction Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 25. Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye Tools
  • 26. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 27. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Apache Mahout Pro Contra Relatively stable Poor documentation Build on Hadoop – Scales well Mahout is currently migrating from Apache Hadoop to Apache Spark. Development is slow and Apache Spark already built a machine learning library of their own… Instant legacy? Command-line access for most algorithms Kind of slow for smaller use cases All important algorithms are available
  • 28. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 29. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Weka Pro Contra A lot of algorithms are available Not ‘Big Data’ ready Graphical user interface for prototyping and experimenting Requires custom data format – ARRF-files Available as a Java library Optimized for academic use cases
  • 30. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 31. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Apache Spark: MLLib Pro Contra Based on Apache Spark – Very, fast and scalable Based on Apache Spark – Requires knowledge of Spark and Scala Very fast development cycle, new features are rolling out every couple of months Relatively new, so a small choice of algorithms. But the essential ones are there. New and refreshing API, easy integration with other components of Apache Spark.
  • 32. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 33. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be R Pro Contra A lot of algorithms are available Can run on Hadoop/Spark, but requires a lot of knowledge from both platforms Well documented Must learn a new language Lot’s of existing packages, that are easily available
  • 34. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Noteworthy Java • DeepLearning4J • Mallet • MOA Python • NLTK • Theano • PyBrain • SciKit-Learn Lua • Torch General • LibSVM • LibLinear
  • 35. Integration with Software Development Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 36. Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be Development Cycle Collect Analyze Extract Train Test Use
  • 37. Feature extraction • Describe an instance to be used in an algorithm • Recognize hand-written digits by converting the images to lines of 1’s and 0’s 00000000000001000000000000000000 00000000000001110000000000000000 00000000000011110000000000000000 00000000001111100000000000000000 00000000001111000000000000000000 00000000000111100000000000000000 00000000001111100000000000000000 00000000011111000000000000000000 00000000011110000000000000000000 00000000111110000000000000000000 00000000011111000000000000000000 00000000111111000000000000000000 00000000111110000000000000000000 00000000111100000000000000000000 00000000011110000000000000000000 00000000111110000111000000000000 00000001111111111111111100000000 00000001111111111111111110000000 00000001111111111111111110000000 00000000111111111111111111100000 00000001111111110000011111100000 00000001111100000000000111100000 00000000111100000000000111100000 00000000011110000000000011110000 00000000011111000000000011110000 00000000011111100000001111110000 00000000011111111111111111110000 00000000011111111111111111100000 00000000000111111111111111100000 00000000000011111111111111100000 00000000000000111111111000000000 00000000000000001111110000000000 Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 38. Training an algorithm 1. Collect you’re data as a collection of Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be instances 2. Split you’re data set into a training set and a testing set 3. Train the algorithm with the training set 4. Validate the results using the test set
  • 39. Runtime model • During training most algorithms generate a mathematical runtime model. • Model should be updated on a regular basis Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 40. A / B Testing • Slow integration in the main system. • If the machine is certain (enough) the machine can take over Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 41. Hands-on Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 42. Demo • K-Nearest Neighbour Classifier Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be • Clustering using Weka • Named-Entity Extraction • Classification of tweets
  • 43. What’s in it for you? Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 44. Benefits of using machine learning • Automate repetitive tasks • Can be a solution for problems that are difficult to automate • Gain insights about your business • Optimize business decisions by using the opinion of the computer Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
  • 45. Questions? Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
  • 46. Wrap-up Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye