BigML Inc IJCAI-15 1
The Past, Present, and Future of Machine Learning APIs
May 2015
petersen@bigml.com
Machine Learning
“a field of study that gives computers the
ability to learn without being explicitly
programmed”
Professor Arthur Samuel, 1959
•The world's first self-learning program was a checkers-
playing program developed for IBM by Professor Arthur
Samuel in 1952.
•Thomas J. Watson Sr., the founder and President of IBM,
predicted that Samuel’s checkers public demonstration
would raise the price of IBM stock 15 points. It did.
Brief History
Timeline, 1950 to 2020 (second axis: Interpretability, from + to -):
•Perceptron: Rosenblatt, 1957; Minsky, 1969
•Decision Trees: Quinlan, 1979 (ID3); Breiman, 1984 (CART)
•Support Vector Machines: Vapnik, 1963; Cortes & Vapnik, 1995
•Neural Networks: Fukushima, 1989 (ANN)
•Boosting: Schapire, 1989 (Boosting); Schapire, 1995 (Adaboost)
•Ensembles: Breiman, 1994 (Bagging); Breiman, 2001 (Random Forests)
•Deep Learning: Hinton, 2006
Focus over time, 1950 to 2020 (trend: AUTOMATION):
•New algorithms & Theory
•Parameter estimation & Scalability
•Automated Representation & Composability
•Applicability & Deployability
1st Machine Learning Workshop: Pittsburgh, PA, 1980
Smarter Apps?
•Years after the data deluge, why don’t we see more smart apps?
•Real-world Machine Learning is more than choosing an algorithm.
•Scaling Machine Learning is hard.
•Current tools weren’t designed for developers. They require a Ph.D. and are complex, error prone, expensive, etc.
The Stages of an ML app
State the problem
Data Wrangling
Feature Engineering
Learning
Deploying
Predicting
Measuring Impact
Machine Learning is only as good as the impact it makes on the real world
Machine Learning That Matters, Kiri Wagstaff, 2012
Scaling Machine Learning
Learning (Training): DATA -> MODEL
Predicting (Scoring): NEW DATA -> PREDICTIONS
•Value of data is often time sensitive - how long can you wait?
•Consider: having 1M users, needing to create a model for each one, and then running 10 predictions for each one a day (10M predictions per day)
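Working through the numbers in this scenario gives a feel for the load. The sketch below is only back-of-the-envelope arithmetic; the training time per model and the number of workers are hypothetical values chosen for illustration, not figures from the talk.

```python
# Scenario from the slide: 1M users, one model each, 10 predictions per user per day.
USERS = 1_000_000
PREDICTIONS_PER_USER_PER_DAY = 10
SECONDS_PER_DAY = 24 * 60 * 60

models_to_train = USERS
predictions_per_day = USERS * PREDICTIONS_PER_USER_PER_DAY
predictions_per_second = predictions_per_day / SECONDS_PER_DAY


def training_hours(seconds_per_model: float, workers: int) -> float:
    """Wall-clock hours to train every model, assuming perfect parallelism."""
    return models_to_train * seconds_per_model / workers / 3600


print(predictions_per_day)
print(round(predictions_per_second, 1))
# Hypothetical: 1 second per model on 100 parallel workers.
print(round(training_hours(seconds_per_model=1.0, workers=100), 2))
```

Even with optimistic assumptions, retraining a model per user takes hours, and the prediction side needs a sustained rate of over a hundred requests per second.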
Legacy ML Tools
•By scientists (with a Ph.D.) for scientists (with a Ph.D.)
•Excess of algorithms
•Single-threaded, desktop apps for small datasets
•Overcomplicated for common people
•Oversimplified for real world problems
•Poorly engineered for real world use or high scale
[Timeline of tools by release year: 1993, 1997, 1997, 2004, 2007, 2008, 2013; grouped as PRE-HADOOP and POST-HADOOP]
•Commercial tools (SPSS, SAS) not only inherit the same
issues but are also overpriced
The Paradox of Choice
Do we need hundreds of classifiers?
REST APIs
History of APIs, 2000 to 2004:
•REST, Roy Fielding
•XML, 2000
•XML, 2000
•XML, 2002
•REST, 2004
Timeline, 2010 to 2015:
•Hadoop and Big Data Craziness
•Machine Learning APIs
•Watson wins Jeopardy
Anomalies
Isolation Forest:
•Grow a random decision tree until each instance is in its own leaf.
•Instances that are “easy” to isolate end up at a shallow Depth; instances that are “hard” to isolate end up deeper.
•Now repeat the process several times and use the average Depth to compute an anomaly score: 0 (similar) -> 1 (dissimilar).
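The idea above can be sketched in a few lines of plain Python. This is a minimal illustration, not BigML's implementation: instead of storing trees, it follows a single point down a fresh random tree each time, and it maps the average depth to a 0-to-1 score with the normalization commonly used for isolation forests, which is an assumption here since the slide does not give a formula. All function names are hypothetical.

```python
import math
import random

EULER_GAMMA = 0.5772156649


def isolation_depth(point, data, rng, max_depth=50):
    """Number of random splits needed before `point` is alone in its leaf."""
    rows = list(data)
    depth = 0
    while len(rows) > 1 and depth < max_depth:
        # Only features on which the remaining rows still differ can be split.
        splittable = [f for f in range(len(point))
                      if min(r[f] for r in rows) < max(r[f] for r in rows)]
        if not splittable:
            break
        feature = rng.choice(splittable)
        low = min(r[feature] for r in rows)
        high = max(r[feature] for r in rows)
        split = rng.uniform(low, high)
        # Keep only the rows that fall on the same side of the split as `point`.
        side = point[feature] < split
        rows = [r for r in rows if (r[feature] < split) == side]
        depth += 1
    return depth


def expected_depth(n):
    """Average depth of an unsuccessful search in a binary tree of n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    return 2.0 * (math.log(n - 1) + EULER_GAMMA) - 2.0 * (n - 1) / n


def anomaly_score(point, data, trees=100, seed=0):
    """Score near 1 means easy to isolate (anomalous), lower means ordinary."""
    if len(data) < 2:
        return 0.0
    rng = random.Random(seed)
    mean_depth = sum(isolation_depth(point, data, rng) for _ in range(trees)) / trees
    return 2.0 ** (-mean_depth / expected_depth(len(data)))


# Usage: a tight cluster plus one far-away point.
_rng = random.Random(42)
cluster = [(_rng.gauss(0, 1), _rng.gauss(0, 1)) for _ in range(50)]
outlier = (10.0, 10.0)
data = cluster + [outlier]
print(anomaly_score(outlier, data), anomaly_score(cluster[0], data))
```

The far-away point is usually cut off by the first one or two splits, so its average depth is small and its score is high, while points inside the cluster need many more splits.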
Anomaly Detection
Source -> Dataset -> Anomaly Detector
Anomaly Detector -> Batch anomaly score -> Dataset with scores -> Filter -> Dataset filtered
Anomaly Detector -> Real-Time scores
Anomaly Detection at the prompt
# BigML credentials and base URLs
export BIGML_USERNAME=ijcai
export BIGML_API_KEY=aa3140519eacc1e9c034f8c973d976e35fffdemo
export BIGML_AUTH="username=$BIGML_USERNAME;api_key=$BIGML_API_KEY"
export BIGML_DOMAIN=bigml.io
export BIGML_URL=https://$BIGML_DOMAIN
export DEV_BIGML_URL=$BIGML_URL/dev
RESOURCES="source dataset sample model cluster anomaly ensemble evaluation
prediction centroid anomalyscore batchprediction batchcentroid
batchanomalyscore project"
# Export one URL variable per resource, for production and development mode
for RESOURCE in $RESOURCES; do
  VARIABLE=$(echo $RESOURCE | tr '[a-z]' '[A-Z]')
  export ${VARIABLE}="$BIGML_URL/$RESOURCE?$BIGML_AUTH"
  export DEV_${RESOURCE}="$DEV_BIGML_URL/$RESOURCE?$BIGML_AUTH"
done
HTTPie: a CLI, cURL-like tool for humans
https://github.com/jakubroztocil/httpie
jq: sed for JSON data
http://stedolan.github.io/jq/
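For readers who prefer Python to shell variables, the same table of resource URLs can be built with a small helper. This is a sketch that mirrors the URL pattern shown in the exports above; the function name `resource_urls` and the placeholder key are hypothetical.

```python
RESOURCES = [
    "source", "dataset", "sample", "model", "cluster", "anomaly", "ensemble",
    "evaluation", "prediction", "centroid", "anomalyscore", "batchprediction",
    "batchcentroid", "batchanomalyscore", "project",
]


def resource_urls(username, api_key, domain="bigml.io", dev=False):
    """Map each upper-cased resource name to its authenticated URL."""
    auth = f"username={username};api_key={api_key}"
    base = f"https://{domain}" + ("/dev" if dev else "")
    return {name.upper(): f"{base}/{name}?{auth}" for name in RESOURCES}


# "my-api-key" is a placeholder, not a real credential.
urls = resource_urls("ijcai", "my-api-key")
dev_urls = resource_urls("ijcai", "my-api-key", dev=True)
print(urls["ANOMALY"])
print(dev_urls["ANOMALY"])
```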
Anomaly Detection in Python
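#!/usr/bin/env python
# -*- coding: utf-8 -*-
from bigml.api import BigML
from bigml.anomaly import Anomaly
# Credentials are read from BIGML_USERNAME and BIGML_API_KEY
api = BigML()
APPLE = "https://s3.amazonaws.com/bigml-public/csv/nasdaq_aapl.csv"
# Each create_* call is asynchronous; api.ok() waits until the resource is ready
source = api.create_source(APPLE, {'name': 'IJCAI'})
api.ok(source)
dataset = api.create_dataset(source)
api.ok(dataset)
anomaly = api.create_anomaly(dataset)
api.ok(anomaly)
# Build a local copy of the detector and score a new instance without calling the API
local_anomaly = Anomaly(anomaly)
local_anomaly.anomaly_score({"Open": 275, "High": 300, "Low": 250})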
• http://bigml.readthedocs.org/en/latest/#anomaly-detector
• http://bigml.readthedocs.org/en/latest/#local-anomaly-detector
• http://bigml.readthedocs.org/en/latest/#local-anomaly-scores
• https://github.com/bigmlcom/python
Anomaly Detection in BigMLer
APPLE=https://s3.amazonaws.com/bigml-public/csv/nasdaq_aapl.csv
bigmler anomaly --train $APPLE --name IJCAI
• http://bigmler.readthedocs.org/en/latest/#anomaly-subcommand
• https://github.com/bigmlcom/bigmler
•Machine Learning (or Predictive) APIs can:
•Abstract the inherent complexity of ML algorithms
•Manage the heavy infrastructure needed to learn from
data and make predictions at scale. No additional servers
to provision or manage
•Easily close the gap between model training and scoring
•Be built for developers and provide full flow automation
•Add traceability and repeatability to ML tasks
Machine Learning APIs
Democratization
Immediately available, anyone can try it for free!!!
Exportability vs Transparency
Exportability
•yes: Models are exportable to predict outside the platform
•no: Predicting only available via the same platform
Transparency
•yes: White-box modeling
•no: Black-box modeling
Grid cell labels: B>A, N/A
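To make the two properties concrete, the sketch below treats a model as exportable and white-box at once: a small decision tree serialized as JSON that can be evaluated locally and that can list the rules behind each prediction. The JSON layout and the field names are hypothetical and do not correspond to any particular platform's export format.

```python
import json

# Hypothetical exported model: a tiny decision tree serialized as JSON.
EXPORTED_MODEL = json.loads("""
{"field": "age", "threshold": 30,
 "left": {"prediction": "no"},
 "right": {"field": "open_credit_lines", "threshold": 3,
           "left": {"prediction": "no"},
           "right": {"prediction": "yes"}}}
""")


def predict(node, row):
    """Exportability: score a row locally, with no call to the platform."""
    while "prediction" not in node:
        branch = "left" if row[node["field"]] <= node["threshold"] else "right"
        node = node[branch]
    return node["prediction"]


def explain(node, row):
    """Transparency: return the rules that led to the prediction."""
    rules = []
    while "prediction" not in node:
        goes_left = row[node["field"]] <= node["threshold"]
        operator = "<=" if goes_left else ">"
        rules.append(f"{node['field']} {operator} {node['threshold']}")
        node = node["left" if goes_left else "right"]
    return rules


customer = {"age": 45, "open_credit_lines": 5}
print(predict(EXPORTED_MODEL, customer))
print(explain(EXPORTED_MODEL, customer))
```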
Composability
Enhancing your cloud applications with Artificial Intelligence
API-first
Comparing ML APIs
• # Algorithms
• Training speed
• Prediction speed
• Performance
• Ease-of-Use
• Deployability
• Scalability
• API-first?
• API design
• Documentation
• UI (Dashboard, Studio, Console)
• SDKs
• Automation
• Time-to-productivity
• Importability
• Exportability
• Transparency
• Dependency
• Price
Recent tools with too many aspects to compare and too few
benchmarks so far
Simplicity
1. Select: classification or regression
2. Select: two-class or multi-class
3. Select: algorithm
vs
and infer the task based on the type and distribution of the objective field
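A rough illustration of inferring the task from the objective field is shown below. The rule used here (numeric values mean regression, anything else means classification, with the number of distinct classes deciding two-class versus multi-class) is a deliberately simple assumption for the sketch, not the rule any specific service applies; `infer_task` is a hypothetical name.

```python
from collections import Counter
from numbers import Real


def infer_task(objective_values):
    """Guess the learning task from the values of the objective field."""
    values = [v for v in objective_values if v is not None]
    if not values:
        raise ValueError("objective field has no values")
    # Booleans are treated as categories, not numbers.
    numeric = all(isinstance(v, Real) and not isinstance(v, bool) for v in values)
    if numeric:
        return "regression"
    classes = Counter(values)
    if len(classes) <= 2:
        return "two-class classification"
    return "multi-class classification"


print(infer_task([12.5, 30.0, 7]))
print(infer_task(["yes", "no", "yes", None]))
print(infer_task(["red", "green", "blue"]))
```

With a rule like this, the user only points at the objective field and the three manual selections collapse into one.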
Specialization
General tasks: Classification, Regression, Cluster Analysis, Anomaly Detection, Other…
Specific Data -> Specialized API:
•Specific Data Transformations and Feature Engineering
•Specific Modeling Strategy
•Specific Predicting Strategy
•Specific Evaluations
Specialized APIs: Language Identification, Sentiment Analysis, Age Guessing, Mood Guessing, Many Others…
Programmability
• Future: Remote Execution / Mobile Code
• Today: Cloud Client Computing
Standardization?
Classification, Regression, Cluster Analysis, Anomaly Detection, Other… -> Standard ML API
The SQL of Machine Learning?
Machine Learning Layer
•Machine Learning is becoming a new abstraction layer of
the computing infrastructure.
•An application developer expects to have access to a
machine learning platform.
Tushar Chandra, Google
Born to learn
from django.db import models

class Customer(models.Model):
    name = models.CharField(max_length=30)
    age = models.PositiveIntegerField()
    monthly_income = models.FloatField(blank=True, null=True)
    dependents = models.PositiveIntegerField(default=0)
    open_credit_lines = models.PositiveIntegerField(default=0)
    # predictable=True is the envisioned extension, not an existing Django field option
    delinquent = models.BooleanField(predictable=True)
•Predictions will be embedded into data models
•Development frameworks will increasingly abstract modeling
and predicting strategies
•New applications designed and implemented from scratch
will take advantage of machine learning from day 0
“As machine learning leaves the lab and goes into practice, it
will threaten white-collar, knowledge-worker jobs just as
machines, automation and assembly lines destroyed factory
jobs in the 19th and 20th centuries.”
The Economist, February 1, 2014
Leaving the lab
BigML Inc IJCAI-15 30

More Related Content

Similar to Past, present and future of predictive APIs - Poul Petersen

The Past, Present, and Future of Machine Learning APIs
The Past, Present, and Future of Machine Learning APIsThe Past, Present, and Future of Machine Learning APIs
The Past, Present, and Future of Machine Learning APIs
BigML, Inc
 
D365 power platform-user-group-deck-v02
D365 power platform-user-group-deck-v02D365 power platform-user-group-deck-v02
D365 power platform-user-group-deck-v02
Boonthawee Tangsoonthornthum
 
Sir 20200115 v8
Sir 20200115 v8Sir 20200115 v8
Sir 20200115 v8
ISSIP
 
Ibm and innovation overview 20150326 v15 short
Ibm and innovation overview 20150326 v15 shortIbm and innovation overview 20150326 v15 short
Ibm and innovation overview 20150326 v15 short
ISSIP
 
Constraint Programming - An Alternative Approach to Heuristics in Scheduling
Constraint Programming - An Alternative Approach to Heuristics in SchedulingConstraint Programming - An Alternative Approach to Heuristics in Scheduling
Constraint Programming - An Alternative Approach to Heuristics in Scheduling
Eray Cakici
 
Intel 20180608 v2
Intel 20180608 v2Intel 20180608 v2
Intel 20180608 v2
ISSIP
 
LJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdfLJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdf
EmilyJiang23
 
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
Andrew Ly
 
20210128 jim spohrer ai house_fund v4
20210128 jim spohrer ai house_fund v420210128 jim spohrer ai house_fund v4
20210128 jim spohrer ai house_fund v4
ISSIP
 
[DSC Adria 23] Tarry Singh Building High dencity startup.pdf
[DSC Adria 23] Tarry Singh Building High dencity startup.pdf[DSC Adria 23] Tarry Singh Building High dencity startup.pdf
[DSC Adria 23] Tarry Singh Building High dencity startup.pdf
DataScienceConferenc1
 
Ibm db2update2019 machine learning and db2 ai
Ibm db2update2019 machine learning and db2 aiIbm db2update2019 machine learning and db2 ai
Ibm db2update2019 machine learning and db2 ai
Gustav Lundström
 
Ypo 20190131 v1
Ypo 20190131 v1 Ypo 20190131 v1
Ypo 20190131 v1
ISSIP
 
OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar
Ganesan Narayanasamy
 
InTTrust -IBM Artificial Intelligence Event
InTTrust -IBM Artificial Intelligence  EventInTTrust -IBM Artificial Intelligence  Event
InTTrust -IBM Artificial Intelligence Event
Michail Pagiatakis
 
Northwestern 20181004 v9
Northwestern 20181004 v9Northwestern 20181004 v9
Northwestern 20181004 v9
ISSIP
 
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioIBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
Svetlana Levitan, PhD
 
Machine learning model to production
Machine learning model to productionMachine learning model to production
Machine learning model to production
Georg Heiler
 
IBM Power Systems Update 2Q17
IBM Power Systems Update 2Q17IBM Power Systems Update 2Q17
IBM Power Systems Update 2Q17
David Spurway
 
Machine Learning Infrastructure
Machine Learning InfrastructureMachine Learning Infrastructure
Machine Learning Infrastructure
SigOpt
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning Software
Justin Basilico
 

Similar to Past, present and future of predictive APIs - Poul Petersen (20)

The Past, Present, and Future of Machine Learning APIs
The Past, Present, and Future of Machine Learning APIsThe Past, Present, and Future of Machine Learning APIs
The Past, Present, and Future of Machine Learning APIs
 
D365 power platform-user-group-deck-v02
D365 power platform-user-group-deck-v02D365 power platform-user-group-deck-v02
D365 power platform-user-group-deck-v02
 
Sir 20200115 v8
Sir 20200115 v8Sir 20200115 v8
Sir 20200115 v8
 
Ibm and innovation overview 20150326 v15 short
Ibm and innovation overview 20150326 v15 shortIbm and innovation overview 20150326 v15 short
Ibm and innovation overview 20150326 v15 short
 
Constraint Programming - An Alternative Approach to Heuristics in Scheduling
Constraint Programming - An Alternative Approach to Heuristics in SchedulingConstraint Programming - An Alternative Approach to Heuristics in Scheduling
Constraint Programming - An Alternative Approach to Heuristics in Scheduling
 
Intel 20180608 v2
Intel 20180608 v2Intel 20180608 v2
Intel 20180608 v2
 
LJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdfLJC-Unconference-2023-Keynote.pdf
LJC-Unconference-2023-Keynote.pdf
 
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
 
20210128 jim spohrer ai house_fund v4
20210128 jim spohrer ai house_fund v420210128 jim spohrer ai house_fund v4
20210128 jim spohrer ai house_fund v4
 
[DSC Adria 23] Tarry Singh Building High dencity startup.pdf
[DSC Adria 23] Tarry Singh Building High dencity startup.pdf[DSC Adria 23] Tarry Singh Building High dencity startup.pdf
[DSC Adria 23] Tarry Singh Building High dencity startup.pdf
 
Ibm db2update2019 machine learning and db2 ai
Ibm db2update2019 machine learning and db2 aiIbm db2update2019 machine learning and db2 ai
Ibm db2update2019 machine learning and db2 ai
 
Ypo 20190131 v1
Ypo 20190131 v1 Ypo 20190131 v1
Ypo 20190131 v1
 
OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar
 
InTTrust -IBM Artificial Intelligence Event
InTTrust -IBM Artificial Intelligence  EventInTTrust -IBM Artificial Intelligence  Event
InTTrust -IBM Artificial Intelligence Event
 
Northwestern 20181004 v9
Northwestern 20181004 v9Northwestern 20181004 v9
Northwestern 20181004 v9
 
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioIBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
 
Machine learning model to production
Machine learning model to productionMachine learning model to production
Machine learning model to production
 
IBM Power Systems Update 2Q17
IBM Power Systems Update 2Q17IBM Power Systems Update 2Q17
IBM Power Systems Update 2Q17
 
Machine Learning Infrastructure
Machine Learning InfrastructureMachine Learning Infrastructure
Machine Learning Infrastructure
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning Software
 

More from PAPIs.io

Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
PAPIs.io
 
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
PAPIs.io
 
Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...
PAPIs.io
 
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
PAPIs.io
 
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
PAPIs.io
 
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
PAPIs.io
 
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
PAPIs.io
 
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
PAPIs.io
 
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
PAPIs.io
 
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
PAPIs.io
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectReal-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
PAPIs.io
 
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
PAPIs.io
 
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
PAPIs.io
 
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs ConnectDemystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
PAPIs.io
 
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs ConnectPredictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
PAPIs.io
 
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs ConnectMicrodecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
PAPIs.io
 
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
PAPIs.io
 
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
PAPIs.io
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectHow to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
PAPIs.io
 
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
PAPIs.io
 

More from PAPIs.io (20)

Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
 
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
 
Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...Extracting information from images using deep learning and transfer learning ...
Extracting information from images using deep learning and transfer learning ...
 
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
 
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
 
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
 
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
 
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
 
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
 
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs ConnectReal-world applications of AI - Daniel Hulme @ PAPIs Connect
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
 
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
 
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
 
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs ConnectDemystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
 
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs ConnectPredictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
 
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs ConnectMicrodecision making in financial services - Greg Lamp @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
 
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
 
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs ConnectHow to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
 
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
 

Recently uploaded

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Past, present and future of predictive APIs - Poul Petersen

  • 1. BigML Inc, IJCAI-15. The Past, Present, and Future of Machine Learning APIs. May 2015. petersen@bigml.com
  • 2. Machine Learning: “a field of study that gives computers the ability to learn without being explicitly programmed” (Professor Arthur Samuel, 1959)
      • The world's first self-learning program was a checkers-playing program developed for IBM by Professor Arthur Samuel in 1952.
      • Thomas J. Watson Sr., the founder and President of IBM, predicted that Samuel’s public checkers demonstration would raise the price of IBM stock 15 points. It did.
  • 3. Brief History (timeline figure, 1950 to 2020; vertical axis: interpretability, from + to -)
      • Decision Trees: Quinlan, 1979 (ID3); Breiman, 1984 (CART)
      • Ensembles: Breiman, 1994 (Bagging); Breiman, 2001 (Random Forests)
      • Boosting: Schapire, 1989 (Boosting); Schapire, 1995 (Adaboost)
      • Support Vector Machines: Vapnik, 1963; Cortes & Vapnik, 1995
      • Perceptron: Rosenblatt, 1957; Minsky, 1969
      • Neural Networks: Fukushima, 1989 (ANN)
      • Deep Learning: Hinton, 2006
  • 4. Focus of the field over time (timeline figure, 1950 to 2020): New algorithms & Theory; Parameter estimation & Scalability; Automated Representation & Composability; Applicability & Deployability. Overall trend: AUTOMATION. Marked event: 1st Machine Learning Workshop, Pittsburgh, PA, 1980.
  • 5. Smarter Apps?
      • Years after the data deluge, why don’t we see more smart apps?
      • Real-world Machine Learning is more than choosing an algorithm.
      • Scaling Machine Learning is hard.
      • Current tools weren’t designed for developers (they require a Ph.D., are complex, error prone, expensive, etc.).
  • 6. The Stages of a ML app: State the problem; Data Wrangling; Feature Engineering; Learning; Deploying; Predicting; Measuring Impact. Machine Learning is only as good as the impact it makes on the real world (Machine Learning That Matters, Kiri Wagstaff, 2012).
  • 7. Scaling Machine Learning
      • Value of data is often time sensitive: how long can you wait?
      • Consider: having 1M users, needing to create a model for each one, and then running 10 predictions a day for each one (10M predictions a day).
      • Diagram: Learning (Training): DATA -> MODEL. Predicting (Scoring): NEW DATA -> PREDICTIONS.
  • 8. Legacy ML Tools
      • By scientists (with a Ph.D.) for scientists (with a Ph.D.)
      • Excess of algorithms
      • Single-threaded, desktop apps for small datasets
      • Overcomplicated for common people
      • Oversimplified for real world problems
      • Poorly engineered for real world use or high scale
      • Commercial tools (SPSS, SAS) not only inherit the same issues but are also overpriced
      • Figure: tool timeline from 1993 to 2013, split into PRE-HADOOP and POST-HADOOP
  • 9. The Paradox of Choice: Do we need hundreds of classifiers?
  • 10. REST APIs: History of APIs (timeline figure, 2000 to 2004). REST, Roy Fielding. Labeled entries: XML, 2000; XML, 2000; XML, 2002; REST, 2004.
  • 11. Timeline figure, 2010 to 2015: Hadoop and Big Data Craziness; Watson wins Jeopardy; Machine Learning APIs
  • 12. Anomalies. Isolation Forest: grow a random decision tree until each instance is in its own leaf. Instances that are “easy” to isolate end up at a shallow depth; instances that are “hard” to isolate end up at a large depth. Now repeat the process several times and use the average depth to compute an anomaly score: 0 (similar) -> 1 (dissimilar).
  • 13. Anomaly Detection (workflow diagram). Components: Source; Dataset; Anomaly Detector; Batch anomaly score; Dataset with scores; Filter; Dataset filtered; Real-Time scores.
  • 14. Anomaly Detection at the prompt
      # Credentials and base URLs (demo values from the talk)
      export BIGML_USERNAME=ijcai
      export BIGML_API_KEY=aa3140519eacc1e9c034f8c973d976e35fffdemo
      export BIGML_AUTH="username=$BIGML_USERNAME;api_key=$BIGML_API_KEY"
      export BIGML_DOMAIN=bigml.io
      export BIGML_URL=https://$BIGML_DOMAIN
      export DEV_BIGML_URL=$BIGML_URL/dev
      RESOURCES="source dataset sample model cluster anomaly ensemble evaluation prediction centroid anomalyscore batchprediction batchcentroid batchanomalyscore project"
      # Export one authenticated URL per resource type, e.g. $SOURCE, $DATASET
      for RESOURCE in $RESOURCES; do
        VARIABLE=$(echo $RESOURCE | tr '[a-z]' '[A-Z]')
        export ${VARIABLE}="$BIGML_URL/$RESOURCE?$BIGML_AUTH"
        export DEV_${RESOURCE}="$DEV_BIGML_URL/$RESOURCE?$BIGML_AUTH"
      done
      • HTTPie: a CLI, cURL-like tool for humans (https://github.com/jakubroztocil/httpie)
      • jq: sed for JSON data (http://stedolan.github.io/jq/)
  • 15. Anomaly Detection in Python
      #!/usr/bin/env python
      # -*- coding: utf-8 -*-
      from bigml.api import BigML
      from bigml.anomaly import Anomaly

      # Credentials are read from BIGML_USERNAME and BIGML_API_KEY
      api = BigML()
      APPLE = "https://s3.amazonaws.com/bigml-public/csv/nasdaq_aapl.csv"
      # Each create_* call is asynchronous; api.ok() waits until the resource is ready
      source = api.create_source(APPLE, {'name': 'IJCAI'})
      api.ok(source)
      dataset = api.create_dataset(source)
      api.ok(dataset)
      anomaly = api.create_anomaly(dataset)
      api.ok(anomaly)
      # Score new instances locally, without calling the API
      local_anomaly = Anomaly(anomaly)
      local_anomaly.anomaly_score({"Open": 275, "High": 300, "Low": 250})
      • http://bigml.readthedocs.org/en/latest/#anomaly-detector
      • http://bigml.readthedocs.org/en/latest/#local-anomaly-detector
      • http://bigml.readthedocs.org/en/latest/#local-anomaly-scores
      • https://github.com/bigmlcom/python
  • 16. Anomaly Detection in BigMLer
      APPLE=https://s3.amazonaws.com/bigml-public/csv/nasdaq_aapl.csv
      bigmler anomaly --train $APPLE --name IJCAI
      • http://bigmler.readthedocs.org/en/latest/#anomaly-subcommand
      • https://github.com/bigmlcom/bigmler
  • 17. Machine Learning APIs. Machine Learning (or Predictive) APIs can:
      • Abstract the inherent complexity of ML algorithms
      • Manage the heavy infrastructure needed to learn from data and make predictions at scale, with no additional servers to provision or manage
      • Easily close the gap between model training and scoring
      • Be built for developers and provide full flow automation
      • Add traceability and repeatability to ML tasks
  • 18. Democratization: immediately available, anyone can try it for free!
  • 19. Exportability vs Transparency (2x2 matrix figure)
      • Exportability: yes (models are exportable to predict outside the platform) or no (predicting only available via the same platform)
      • Transparency: yes (white-box modeling) or no (black-box modeling)
      • Cell labels in the figure: B>A; N/A
  • 20. Composability: enhancing your cloud applications with Artificial Intelligence
  • 21. API-first
  • 22. Comparing ML APIs. Recent tools with too many aspects to compare and too few benchmarks so far:
      • Number of algorithms • Training speed • Prediction speed • Performance • Ease-of-Use • Deployability • Scalability
      • API-first? • API design • Documentation • UI (Dashboard, Studio, Console) • SDKs • Automation • Time-to-productivity
      • Importability • Exportability • Transparency • Dependency • Price
  • 23. Simplicity vs: 1. Select: classification or regression; 2. Select: two-class or multi-class; 3. Select: algorithm; and infer the task based on the type and distribution of the objective field
  • 24. Specialization
      • General tasks: Classification; Regression; Cluster Analysis; Anomaly Detection; Other…
      • Specialized API: Specific Data; Specific Data Transformations and Feature Engineering; Specific Modeling Strategy; Specific Predicting Strategy; Specific Evaluations
      • Examples: Language Identification; Sentiment Analysis; Age Guessing; Mood Guessing; Many Others…
  • 25. Programmability
      • Today: Cloud Client Computing
      • Future: Remote Execution / Mobile Code
  • 26. Standardization? A Standard ML API covering Classification, Regression, Cluster Analysis, Anomaly Detection, Other…: the SQL of Machine Learning?
  • 27. Machine Learning Layer (Tushar Chandra, Google)
      • Machine Learning is becoming a new abstraction layer of the computing infrastructure.
      • An application developer expects to have access to a machine learning platform.
  • 28. Born to learn
      from django.db import models

      class Customer(models.Model):
          name = models.CharField(max_length=30)
          age = models.PositiveIntegerField()
          monthly_income = models.FloatField(blank=True, null=True)
          dependents = models.PositiveIntegerField(default=0)
          open_credit_lines = models.PositiveIntegerField(default=0)
          # predictable=True is the talk's envisioned option, not an existing Django field argument
          delinquent = models.BooleanField(predictable=True)
      • Predictions will be embedded into data models
      • Development frameworks will increasingly abstract modeling and predicting strategies
      • New applications designed and implemented from scratch will take advantage of machine learning from day 0
  • 29. Leaving the lab: “As machine learning leaves the lab and goes into practice, it will threaten white-collar, knowledge-worker jobs just as machines, automation and assembly lines destroyed factory jobs in the 19th and 20th centuries.” (The Economist, February 1, 2014)
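
The isolation idea from slide 12 can be sketched in a few lines of plain Python. This is a minimal illustration, not BigML's implementation: the function names `isolation_depth` and `anomaly_score` are hypothetical, no subsampling is done, and the mapping from average depth to a 0-to-1 score uses the normalization from the original Isolation Forest paper, which is an assumption about how the slide's score is computed.

```python
import math
import random


def isolation_depth(point, data, rng, depth=0, max_depth=50):
    """Depth at which `point` is alone in its leaf of one random tree."""
    if len(data) <= 1 or depth >= max_depth:
        return depth
    # Only features that still vary within this node can be split on.
    splittable = [
        f for f in range(len(point))
        if min(row[f] for row in data) < max(row[f] for row in data)
    ]
    if not splittable:
        return depth
    feature = rng.choice(splittable)
    values = [row[feature] for row in data]
    split = rng.uniform(min(values), max(values))
    # Follow only the branch that contains `point`.
    if point[feature] < split:
        side = [row for row in data if row[feature] < split]
    else:
        side = [row for row in data if row[feature] >= split]
    return isolation_depth(point, side, rng, depth + 1, max_depth)


def anomaly_score(point, data, n_trees=100, seed=0):
    """Average the depth over many random trees and map it into (0, 1]."""
    n = len(data)
    if n < 3:
        raise ValueError("need at least 3 instances")
    rng = random.Random(seed)
    mean_depth = sum(
        isolation_depth(point, data, rng) for _ in range(n_trees)
    ) / n_trees
    # Expected depth for n instances (assumed normalization, see lead-in).
    expected = 2 * (math.log(n - 1) + 0.5772156649) - 2 * (n - 1) / n
    return 2 ** (-mean_depth / expected)


if __name__ == "__main__":
    # A tight cluster plus one far-away instance (illustrative data only).
    cluster = [(x * 0.1, y * 0.1) for x in range(5) for y in range(5)]
    data = cluster + [(10.0, 10.0)]
    print("outlier:", anomaly_score((10.0, 10.0), data))
    print("inlier: ", anomaly_score((0.2, 0.2), data))
```

Instances that are cut off after one or two random splits get a low average depth and therefore a score close to 1, while instances buried inside a dense region need many splits and score lower.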