SlideShare a Scribd company logo
1 of 18
Machine Learning
michel.bruley@teradata.com

Extract from various presentations: University of Nebraska, Scott,
Freund, Domingo, Hong, …

www.decideo.fr/bruley
What is learning?


“Learning is making useful changes in our minds”
Marvin Minsky



“Learning is constructing or modifying
representations of what is being experienced”
Ryszard Michalski



“Learning denotes changes in a system that ...
enable a system to do the same task more efficiently
the next time”
Herbert Simon

www.decideo.fr/bruley

2
What is Machine Learning?






Definition
– A program learns from experience E with respect to some class of tasks
T and performance measure P, if its performance at task T, as
measured by P, improves with experience E
Learning systems are not directly programmed to solve a problem, instead
develop own program based on
– examples of how they should behave
– from trial-and-error experience trying to solve the problem
Another definition
– For the purposes of computer, machine learning should really be
viewed as a set of techniques for leveraging data
– Machine Learning algorithms discover the relationships between the
variables of a system (input, output and hidden) from direct samples of
the system
– These algorithms originate from many fields (Statistics, mathematics,
theoretical computer science, physics, neuroscience, etc.)

www.decideo.fr/bruley
Machine Learning: Data Driven Modeling
Traditional programming
Data
Program

Computer

Output

Machine Learning
Data
Computer
Output

www.decideo.fr/bruley

Program
Magic?
No, more like gardening


Seeds = Algorithms



Nutrients = Data



Gardener = You



Plants = Programs

“The goal of machine learning is to
build computer system that can adapt
and learn from their experience.”
Tom Dietterich

www.decideo.fr/bruley
The black-box approach

 Statistical

A

models are not generators, they are predictors

predictor is a function from observation X to action Z

 After

action is taken, outcome Y is observed which implies
loss L (a real valued number)

 Goal:

find a predictor with small loss (in expectation, with
high probability, cumulative, …)

www.decideo.fr/bruley
Main software components

A predictor

A learner

x

z

Training examples
x1,y1 , x2 ,y2 ,, xm ,ym

We assume the predictor will be applied to
examples similar to those on which it was trained

www.decideo.fr/bruley
Learning in a system

Learning System
Training
Examples

predictor

Target System
Sensor Data

Action

feedback
www.decideo.fr/bruley
Types of Learning
 Supervised

(inductive) learning
– Training data includes desired outputs

 Unsupervised

learning
– Training data does not include desired outputs

 Semi-supervised

learning
– Training data includes a few desired outputs

 Reinforcement

learning
– Rewards from sequence of actions

www.decideo.fr/bruley
Supervised Learning

Given: Training examples

x1 , f x1

, x2 , f x2

,..., x P , f x P

for some unknown function (system) y

f x

Find f x
Predict

www.decideo.fr/bruley

y

f x

Where x

is not in training set
Main class of learning problems
Learning scenarios differ according to the available
information in training examples
 Supervised:

correct output available
– Classification: 1-of-N output (speech recognition, object
recognition, medical diagnosis)
– Regression: real-valued output (predicting market prices,
temperature)

 Unsupervised:

no feedback, need to construct measure of

good output
– Clustering : Clustering refers to techniques to segmenting
data into coherent “clusters.”
 Reinforcement:

www.decideo.fr/bruley

scalar feedback, possibly temporally delayed
And more …


Time series analysis



Dimension reduction



Model selection



Generic methods



Graphical models

www.decideo.fr/bruley
Why do we need learning?

 Computers

–
–
–
–
 For

need functions that map highly variable data:
Speech recognition: Audio signal -> words
Image analysis: Video signal -> objects
Bio-Informatics: Micro-array Images -> gene function
Data Mining: Transaction logs -> customer classification
accuracy, functions must be tuned to fit the data source

 For

real-time processing, function computation has to be
very fast

www.decideo.fr/bruley
A very small set of uses of ML


Vision
– Object recognition, Hand writing recognition, Emotion
labeling, Surveillance, …



Sound
– Speech recognition, music genre classification, …

 Text

– Document labeling, Part of speech tagging,
Summarization, …


Finance
– Algorithmic trading, …



Medical, Biological, Chemical, and on, and on, …

www.decideo.fr/bruley
Example: Face Recognition

15
www.decideo.fr/bruley
Recognition: Combinations of Components

www.decideo.fr/bruley
Machine learning in Big Data Infrastructure

www.decideo.fr/bruley
Teradata set of Technology
Aster/Teradata
Hadoop Connectors

Data transformation
& batch processing
• Image processing
• Search indexes
• Graph (PYMK)
• MapReduce

Batch data transformations for
engineering groups using HDFS +
MapReduce
www.decideo.fr/bruley

Aster/Teradata
Bi-Directional Connector

Analytic Platform for data
discovery
• nPath Pattern/Path
• Clickstream analysis
• A/B site testing
• Data Sciences discovery
• SQL-MapReduce

Interactive MapReduce
analytics for the enterprise using
MapReduce Analytics &
SQL-MapReduce

Integrated Data
Warehouse
• Exec Dashboards
• Adhoc/OLAP
• Complex SQL
• SQL

Integration with structured data,
operational intelligence, scalable
distribution of analytics
18

More Related Content

Viewers also liked

13 genetic algorithms
13 genetic algorithms13 genetic algorithms
13 genetic algorithmsNidul Sinha
 
Big Data and Marketing Attribution
Big Data and Marketing AttributionBig Data and Marketing Attribution
Big Data and Marketing AttributionMichel Bruley
 
Big Data and Social CRM
Big Data and Social CRMBig Data and Social CRM
Big Data and Social CRMMichel Bruley
 
Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…
Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…
Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…Michel Bruley
 
Irfm mini guide de mauvaise conduite
Irfm mini guide de mauvaise  conduiteIrfm mini guide de mauvaise  conduite
Irfm mini guide de mauvaise conduiteMichel Bruley
 
Big Data and Product Affinity
Big Data and Product Affinity Big Data and Product Affinity
Big Data and Product Affinity Michel Bruley
 
Artificial intelligence
Artificial intelligence Artificial intelligence
Artificial intelligence Jagadeesh Kumar
 
검색어 대중도, 연결망 분석 - 21021899 김수빈
검색어 대중도, 연결망 분석 - 21021899 김수빈검색어 대중도, 연결망 분석 - 21021899 김수빈
검색어 대중도, 연결망 분석 - 21021899 김수빈Webometrics Class
 
Machine Learning in the age of Big Data
Machine Learning in the age of Big DataMachine Learning in the age of Big Data
Machine Learning in the age of Big DataDaniel Sârbe
 
3 problem-solving-
3 problem-solving-3 problem-solving-
3 problem-solving-Mhd Sb
 
Big data, new epistemologies and paradigm shifts
Big data, new epistemologies and paradigm shiftsBig data, new epistemologies and paradigm shifts
Big data, new epistemologies and paradigm shiftsrobkitchin
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceMhd Sb
 
2-Agents- Artificial Intelligence
2-Agents- Artificial Intelligence2-Agents- Artificial Intelligence
2-Agents- Artificial IntelligenceMhd Sb
 
6 games
6 games6 games
6 gamesMhd Sb
 
20120924134035 빅데이터시대,ai의새로운의미와가치
20120924134035 빅데이터시대,ai의새로운의미와가치20120924134035 빅데이터시대,ai의새로운의미와가치
20120924134035 빅데이터시대,ai의새로운의미와가치Webometrics Class
 
Big Data y Minería de datos
Big Data y Minería de datos Big Data y Minería de datos
Big Data y Minería de datos Luis Joyanes
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language ProcessingMichel Bruley
 
Big Data and the Next Best Offer
Big Data and the Next Best OfferBig Data and the Next Best Offer
Big Data and the Next Best OfferMichel Bruley
 

Viewers also liked (20)

13 genetic algorithms
13 genetic algorithms13 genetic algorithms
13 genetic algorithms
 
Big Data and Marketing Attribution
Big Data and Marketing AttributionBig Data and Marketing Attribution
Big Data and Marketing Attribution
 
Big Data and Social CRM
Big Data and Social CRMBig Data and Social CRM
Big Data and Social CRM
 
01 intro1
01 intro101 intro1
01 intro1
 
Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…
Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…
Big Data and GeoMarketing, Geolocation, Geotargeting, Geomatic,…
 
Irfm mini guide de mauvaise conduite
Irfm mini guide de mauvaise  conduiteIrfm mini guide de mauvaise  conduite
Irfm mini guide de mauvaise conduite
 
Big Data and Product Affinity
Big Data and Product Affinity Big Data and Product Affinity
Big Data and Product Affinity
 
Artificial intelligence
Artificial intelligence Artificial intelligence
Artificial intelligence
 
검색어 대중도, 연결망 분석 - 21021899 김수빈
검색어 대중도, 연결망 분석 - 21021899 김수빈검색어 대중도, 연결망 분석 - 21021899 김수빈
검색어 대중도, 연결망 분석 - 21021899 김수빈
 
Machine Learning in the age of Big Data
Machine Learning in the age of Big DataMachine Learning in the age of Big Data
Machine Learning in the age of Big Data
 
3 problem-solving-
3 problem-solving-3 problem-solving-
3 problem-solving-
 
Big data, new epistemologies and paradigm shifts
Big data, new epistemologies and paradigm shiftsBig data, new epistemologies and paradigm shifts
Big data, new epistemologies and paradigm shifts
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
2-Agents- Artificial Intelligence
2-Agents- Artificial Intelligence2-Agents- Artificial Intelligence
2-Agents- Artificial Intelligence
 
6 games
6 games6 games
6 games
 
5 csp
5 csp5 csp
5 csp
 
20120924134035 빅데이터시대,ai의새로운의미와가치
20120924134035 빅데이터시대,ai의새로운의미와가치20120924134035 빅데이터시대,ai의새로운의미와가치
20120924134035 빅데이터시대,ai의새로운의미와가치
 
Big Data y Minería de datos
Big Data y Minería de datos Big Data y Minería de datos
Big Data y Minería de datos
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
Big Data and the Next Best Offer
Big Data and the Next Best OfferBig Data and the Next Best Offer
Big Data and the Next Best Offer
 

Similar to Big Data and Machine Learning

Unit I and II Machine Learning MCA CREC.pptx
Unit I and II Machine Learning MCA CREC.pptxUnit I and II Machine Learning MCA CREC.pptx
Unit I and II Machine Learning MCA CREC.pptxtrishipaul
 
Machine learning at b.e.s.t. summer university
Machine learning  at b.e.s.t. summer universityMachine learning  at b.e.s.t. summer university
Machine learning at b.e.s.t. summer universityLászló Kovács
 
Chapter01.ppt
Chapter01.pptChapter01.ppt
Chapter01.pptbutest
 
LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.pptbutest
 
Machine learning presentation (razi)
Machine learning presentation (razi)Machine learning presentation (razi)
Machine learning presentation (razi)Rizwan Shaukat
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning BasicsSuresh Arora
 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning LandscapeEng Teong Cheah
 
Available Research Topics in Machine Learning
Available Research Topics in Machine LearningAvailable Research Topics in Machine Learning
Available Research Topics in Machine LearningTechsparks
 
Machine Learning Ch 1.ppt
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.pptARVIND SARDAR
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationAnkit Gupta
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine LearningKnoldus Inc.
 
A Machine Learning Primer,
A Machine Learning Primer,A Machine Learning Primer,
A Machine Learning Primer,Eirini Ntoutsi
 
Machine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsMachine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsShrutika Oswal
 
Eick/Alpaydin Introduction
Eick/Alpaydin IntroductionEick/Alpaydin Introduction
Eick/Alpaydin Introductionbutest
 

Similar to Big Data and Machine Learning (20)

Machine learning
Machine learningMachine learning
Machine learning
 
Unit I and II Machine Learning MCA CREC.pptx
Unit I and II Machine Learning MCA CREC.pptxUnit I and II Machine Learning MCA CREC.pptx
Unit I and II Machine Learning MCA CREC.pptx
 
Machine learning at b.e.s.t. summer university
Machine learning  at b.e.s.t. summer universityMachine learning  at b.e.s.t. summer university
Machine learning at b.e.s.t. summer university
 
module 6 (1).ppt
module 6 (1).pptmodule 6 (1).ppt
module 6 (1).ppt
 
LEC-6.ppt
LEC-6.pptLEC-6.ppt
LEC-6.ppt
 
Chapter01.ppt
Chapter01.pptChapter01.ppt
Chapter01.ppt
 
recent.pptx
recent.pptxrecent.pptx
recent.pptx
 
LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.ppt
 
Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptx
 
Machine learning presentation (razi)
Machine learning presentation (razi)Machine learning presentation (razi)
Machine learning presentation (razi)
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning Landscape
 
Available Research Topics in Machine Learning
Available Research Topics in Machine LearningAvailable Research Topics in Machine Learning
Available Research Topics in Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning Ch 1.ppt
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.ppt
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine Learning
 
A Machine Learning Primer,
A Machine Learning Primer,A Machine Learning Primer,
A Machine Learning Primer,
 
Machine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsMachine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domains
 
Eick/Alpaydin Introduction
Eick/Alpaydin IntroductionEick/Alpaydin Introduction
Eick/Alpaydin Introduction
 

More from Michel Bruley

Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)Michel Bruley
 
Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?Michel Bruley
 
La chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdfLa chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdfMichel Bruley
 
Synthèse sur Neuville.pdf
Synthèse sur Neuville.pdfSynthèse sur Neuville.pdf
Synthèse sur Neuville.pdfMichel Bruley
 
Propos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdfPropos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdfMichel Bruley
 
Propos sur les Big Data.pdf
Propos sur les Big Data.pdfPropos sur les Big Data.pdf
Propos sur les Big Data.pdfMichel Bruley
 
Georges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'OrientGeorges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'OrientMichel Bruley
 
Poc banking industry - Churn
Poc banking industry - ChurnPoc banking industry - Churn
Poc banking industry - ChurnMichel Bruley
 
Big Data POC in communication industry
Big Data POC in communication industryBig Data POC in communication industry
Big Data POC in communication industryMichel Bruley
 
Photos de famille 1895 1966
Photos de famille 1895   1966Photos de famille 1895   1966
Photos de famille 1895 1966Michel Bruley
 
Compilation d'autres textes de famille
Compilation d'autres textes de familleCompilation d'autres textes de famille
Compilation d'autres textes de familleMichel Bruley
 
Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)Michel Bruley
 
Recette de la dinde au whisky
Recette de la dinde au whiskyRecette de la dinde au whisky
Recette de la dinde au whiskyMichel Bruley
 
Les 2 guerres de René Puig
Les 2 guerres de René PuigLes 2 guerres de René Puig
Les 2 guerres de René PuigMichel Bruley
 
Une societe se_presente
Une societe se_presenteUne societe se_presente
Une societe se_presenteMichel Bruley
 
Dossiers noirs va 4191
Dossiers noirs va 4191Dossiers noirs va 4191
Dossiers noirs va 4191Michel Bruley
 
Estissac et thuisy 2017
Estissac et thuisy   2017Estissac et thuisy   2017
Estissac et thuisy 2017Michel Bruley
 
Guerre, captivité & evasion de jb v3
Guerre, captivité & evasion de jb   v3Guerre, captivité & evasion de jb   v3
Guerre, captivité & evasion de jb v3Michel Bruley
 

More from Michel Bruley (20)

Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)
 
Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?
 
La chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdfLa chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdf
 
Synthèse sur Neuville.pdf
Synthèse sur Neuville.pdfSynthèse sur Neuville.pdf
Synthèse sur Neuville.pdf
 
Propos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdfPropos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdf
 
Propos sur les Big Data.pdf
Propos sur les Big Data.pdfPropos sur les Big Data.pdf
Propos sur les Big Data.pdf
 
Sun tzu
Sun tzuSun tzu
Sun tzu
 
Georges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'OrientGeorges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
 
Poc banking industry - Churn
Poc banking industry - ChurnPoc banking industry - Churn
Poc banking industry - Churn
 
Big Data POC in communication industry
Big Data POC in communication industryBig Data POC in communication industry
Big Data POC in communication industry
 
Photos de famille 1895 1966
Photos de famille 1895   1966Photos de famille 1895   1966
Photos de famille 1895 1966
 
Compilation d'autres textes de famille
Compilation d'autres textes de familleCompilation d'autres textes de famille
Compilation d'autres textes de famille
 
J'aime BRULEY
J'aime BRULEYJ'aime BRULEY
J'aime BRULEY
 
Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)
 
Recette de la dinde au whisky
Recette de la dinde au whiskyRecette de la dinde au whisky
Recette de la dinde au whisky
 
Les 2 guerres de René Puig
Les 2 guerres de René PuigLes 2 guerres de René Puig
Les 2 guerres de René Puig
 
Une societe se_presente
Une societe se_presenteUne societe se_presente
Une societe se_presente
 
Dossiers noirs va 4191
Dossiers noirs va 4191Dossiers noirs va 4191
Dossiers noirs va 4191
 
Estissac et thuisy 2017
Estissac et thuisy   2017Estissac et thuisy   2017
Estissac et thuisy 2017
 
Guerre, captivité & evasion de jb v3
Guerre, captivité & evasion de jb   v3Guerre, captivité & evasion de jb   v3
Guerre, captivité & evasion de jb v3
 

Recently uploaded

It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayNZSG
 
DEPED Work From Home WORKWEEK-PLAN.docx
DEPED Work From Home  WORKWEEK-PLAN.docxDEPED Work From Home  WORKWEEK-PLAN.docx
DEPED Work From Home WORKWEEK-PLAN.docxRodelinaLaud
 
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Delhi Call girls
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesDipal Arora
 
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdfCatalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdfOrient Homes
 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdfOrient Homes
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyEthan lee
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024christinemoorman
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Creating Low-Code Loan Applications using the Trisotech Mortgage Feature Set
Creating Low-Code Loan Applications using the Trisotech Mortgage Feature SetCreating Low-Code Loan Applications using the Trisotech Mortgage Feature Set
Creating Low-Code Loan Applications using the Trisotech Mortgage Feature SetDenis Gagné
 
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Tina Ji
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLSeo
 

Recently uploaded (20)

It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 May
 
DEPED Work From Home WORKWEEK-PLAN.docx
DEPED Work From Home  WORKWEEK-PLAN.docxDEPED Work From Home  WORKWEEK-PLAN.docx
DEPED Work From Home WORKWEEK-PLAN.docx
 
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
 
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdfCatalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdf
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
Forklift Operations: Safety through Cartoons
Forklift Operations: Safety through CartoonsForklift Operations: Safety through Cartoons
Forklift Operations: Safety through Cartoons
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Creating Low-Code Loan Applications using the Trisotech Mortgage Feature Set
Creating Low-Code Loan Applications using the Trisotech Mortgage Feature SetCreating Low-Code Loan Applications using the Trisotech Mortgage Feature Set
Creating Low-Code Loan Applications using the Trisotech Mortgage Feature Set
 
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
 

Big Data and Machine Learning

  • 1. Machine Learning michel.bruley@teradata.com Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong, … www.decideo.fr/bruley
  • 2. What is learning?  “Learning is making useful changes in our minds” Marvin Minsky  “Learning is constructing or modifying representations of what is being experienced” Ryszard Michalski  “Learning denotes changes in a system that ... enable a system to do the same task more efficiently the next time” Herbert Simon www.decideo.fr/bruley 2
  • 3. What is Machine Learning?    Definition – A program learns from experience E with respect to some class of tasks T and performance measure P, if its performance at task T, as measured by P, improves with experience E Learning systems are not directly programmed to solve a problem, instead develop own program based on – examples of how they should behave – from trial-and-error experience trying to solve the problem Another definition – For the purposes of computer, machine learning should really be viewed as a set of techniques for leveraging data – Machine Learning algorithms discover the relationships between the variables of a system (input, output and hidden) from direct samples of the system – These algorithms originate from many fields (Statistics, mathematics, theoretical computer science, physics, neuroscience, etc.) www.decideo.fr/bruley
  • 4. Machine Learning: Data Driven Modeling Traditional programming Data Program Computer Output Machine Learning Data Computer Output www.decideo.fr/bruley Program
  • 5. Magic? No, more like gardening  Seeds = Algorithms  Nutrients = Data  Gardener = You  Plants = Programs “The goal of machine learning is to build computer system that can adapt and learn from their experience.” Tom Dietterich www.decideo.fr/bruley
  • 6. The black-box approach  Statistical A models are not generators, they are predictors predictor is a function from observation X to action Z  After action is taken, outcome Y is observed which implies loss L (a real valued number)  Goal: find a predictor with small loss (in expectation, with high probability, cumulative, …) www.decideo.fr/bruley
  • 7. Main software components A predictor A learner x z Training examples x1,y1 , x2 ,y2 ,, xm ,ym We assume the predictor will be applied to examples similar to those on which it was trained www.decideo.fr/bruley
  • 8. Learning in a system Learning System Training Examples predictor Target System Sensor Data Action feedback www.decideo.fr/bruley
  • 9. Types of Learning  Supervised (inductive) learning – Training data includes desired outputs  Unsupervised learning – Training data does not include desired outputs  Semi-supervised learning – Training data includes a few desired outputs  Reinforcement learning – Rewards from sequence of actions www.decideo.fr/bruley
  • 10. Supervised Learning Given: Training examples x1 , f x1 , x2 , f x2 ,..., x P , f x P for some unknown function (system) y f x Find f x Predict www.decideo.fr/bruley y f x Where x is not in training set
  • 11. Main class of learning problems Learning scenarios differ according to the available information in training examples  Supervised: correct output available – Classification: 1-of-N output (speech recognition, object recognition, medical diagnosis) – Regression: real-valued output (predicting market prices, temperature)  Unsupervised: no feedback, need to construct measure of good output – Clustering : Clustering refers to techniques to segmenting data into coherent “clusters.”  Reinforcement: www.decideo.fr/bruley scalar feedback, possibly temporally delayed
  • 12. And more …  Time series analysis  Dimension reduction  Model selection  Generic methods  Graphical models www.decideo.fr/bruley
  • 13. Why do we need learning?  Computers – – – –  For need functions that map highly variable data: Speech recognition: Audio signal -> words Image analysis: Video signal -> objects Bio-Informatics: Micro-array Images -> gene function Data Mining: Transaction logs -> customer classification accuracy, functions must be tuned to fit the data source  For real-time processing, function computation has to be very fast www.decideo.fr/bruley
  • 14. A very small set of uses of ML  Vision – Object recognition, Hand writing recognition, Emotion labeling, Surveillance, …  Sound – Speech recognition, music genre classification, …  Text – Document labeling, Part of speech tagging, Summarization, …  Finance – Algorithmic trading, …  Medical, Biological, Chemical, and on, and on, … www.decideo.fr/bruley
  • 16. Recognition: Combinations of Components www.decideo.fr/bruley
  • 17. Machine learning in Big Data Infrastructure www.decideo.fr/bruley
  • 18. Teradata set of Technology Aster/Teradata Hadoop Connectors Data transformation & batch processing • Image processing • Search indexes • Graph (PYMK) • MapReduce Batch data transformations for engineering groups using HDFS + MapReduce www.decideo.fr/bruley Aster/Teradata Bi-Directional Connector Analytic Platform for data discovery • nPath Pattern/Path • Clickstream analysis • A/B site testing • Data Sciences discovery • SQL-MapReduce Interactive MapReduce analytics for the enterprise using MapReduce Analytics & SQL-MapReduce Integrated Data Warehouse • Exec Dashboards • Adhoc/OLAP • Complex SQL • SQL Integration with structured data, operational intelligence, scalable distribution of analytics 18