DN 2017 | AIaaS - Artificial Intelligence off the Shelf | Jana Kludas | um*

AIaaS - Artificial Intelligence off the Shelf
by Jana Kludas, Data Scientist @ *um
Data Natives 2017

Outline
• Introduction
• Low level vs. High level AI
• Market overview of AIaaS
• Usage Examples
• Services as Black Box
• Conclusions
27.11.2017 The unbelievable Machine Company

Definition and Scope of AI
"The study of the computations that make it possible to perceive, reason, and
act" (Winston, 1992)
• Reasoning, Problem solving
• Knowledge representation
• Planning
• Learning (i.e. from examples/experience)
• Natural Language Processing
• Perception (Vision and other Sensors)
• Acting and Manipulating
• Social Intelligence
• General Intelligence

Everything-as-a-Service (X-a-a-S)
Software-aaS
Platform-aaS
Infrastructure-aaS
Cloud providers offer simple access to, and on-demand usage of, computational resources (which can
be almost anything!)
Pros
- focus on the core business
- transparent costs through pay-as-you-use
- reduced development time and investment risk
- increased strategic flexibility
Cons
- dependency on the service provider and on a working, fast data connection
- offering limited to standard solutions (standardization instead of innovation)
- reduced security of data and transactions

Artificial-Intelligence-as-a-Service (AIaaS)
Pros
• significantly improves time-to-value
• fast testing of new approaches to your problems
• no big investments in hardware or software required
… but there are also some downsides, which are discussed in the following slides

Low level AI
Classic Machine Learning algorithms
– Classification, Regression, Clustering
– Bayesian Networks
– Reinforcement Learning, Representation Learning
– Genetic Algorithms
• Solutions or algorithms for classes of problems
• processing pipeline: data collection and pre-processing, training, parameter
optimization, testing
LL AIaaS:
→ requires expert knowledge
→ allows creation of innovative solutions for unsolved, non-standardized problems
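The processing pipeline listed above (pre-processing, training, parameter optimization, testing) can be sketched in a few lines. This is a minimal illustration in plain Python, assuming a single input feature and a one-feature ridge regression; the function names `fit_ridge`, `mae` and `run_pipeline`, the 60/20/20 split and the candidate penalties are assumptions made for this sketch, not part of the talk.

```python
from statistics import mean


def fit_ridge(xs, ys, lam):
    """Training: closed-form one-feature ridge regression (intercept not penalized)."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    w = sxy / (sxx + lam)          # the L2 penalty shrinks the slope towards 0
    return w, my - w * mx          # (slope, intercept)


def mae(model, xs, ys):
    """Mean absolute error of a (slope, intercept) model."""
    w, b = model
    return mean(abs(w * x + b - y) for x, y in zip(xs, ys))


def run_pipeline(data, lambdas=(0.0, 0.1, 1.0, 10.0)):
    """data: list of (x, y) pairs that has already been collected and cleaned."""
    n = len(data)
    # Pre-processing: split into training, validation and test parts (60/20/20).
    train = data[: n * 6 // 10]
    valid = data[n * 6 // 10 : n * 8 // 10]
    test = data[n * 8 // 10 :]
    tx, ty = zip(*train)
    vx, vy = zip(*valid)
    sx, sy = zip(*test)
    # Parameter optimization: pick the penalty with the lowest validation error.
    best = min(lambdas, key=lambda lam: mae(fit_ridge(tx, ty, lam), vx, vy))
    # Testing: report the error on data never used for training or tuning.
    return best, mae(fit_ridge(tx, ty, best), sx, sy)
```

Each of these steps is a decision the user of a low level service has to make, which is where the required expert knowledge comes in.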

High level AI
Deep learning with Artificial Neural Networks (ANNs)
– Convolutional neural network (CNN)
– Autoencoder
– Recurrent neural network (RNN)
– Long Short-Term Memory (LSTM)
– Generative Adversarial Network (GAN)
• problem-oriented solutions/algorithms, e.g. face recognition, text-to-speech
• solve a standardized problem
HL AIaaS:
• has a simple interface
→ easy to handle for non-AI experts

AIaaS providers
… and more (http://www.butleranalytics.com/20-machine-learning-service-platforms/), Apr. 2017

AI Services: Low level and High level
MS Azure
  Low level: ML Studio
  - Anomaly Detection
  - Classification (binary/multi class)
  - Clustering
  - Statistical Functions
  - Text Analytics
  - Computer Vision
  High level: Cognitive Services
  - Vision
  - Speech
  - Knowledge
  - Language
  - Search
AWS
  Low level: AWS Machine Learning
  - Algorithms: Regression, classification (binary/multiclass)
  High level:
  - Amazon Lex: natural language understanding (NLU), automatic speech recognition (ASR)
  - Amazon Rekognition: visual search and image recognition
  - Amazon Polly: text-to-speech (TTS)
  - AWS Deep Learning AMI: custom AI models
Google Cloud
  Low level: Large Scale Machine Learning Service
  - Custom models from regression models to image classification based on deep learning
  High level:
  - Google Cloud Job Discovery
  - Google Cloud Video Intelligence
  - Google Cloud Vision
  - Google Cloud Speech
  - Google Natural Language
  - Google Cloud Translation
IBM Watson
  Low level: IBM Data Science Experience
  - Spark ML algorithms
  - RStudio
  - Deep Learning libraries
  High level: Watson Developer
  - Conversation
  - Knowledge (Discovery, NLU, Document Conversion)
  - Vision
  - Speech
  - Language (Translator, Classifier, Retrieve & Rank)
  - Empathy (Personality Insights, Tone Analyzer)

Stability and Availability of the Services
Lively growth market:
• waves of selection and consolidation across the market that yield winners
and losers
• new and improved services emerge regularly
• old ones are updated (including their APIs) or shut down, even within a platform
Searching for the best service can be daunting
Using the services in production systems is challenging
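One common way to limit the impact of API changes and shutdowns on a production system is to hide each provider behind a thin adapter with a common signature. The sketch below is only an illustration of that idea: `LabelService`, the adapter signature (image bytes in, list of labels out) and the provider names used with it are hypothetical and do not correspond to any real provider SDK.

```python
from typing import Callable, Dict, List

# Hypothetical adapter signature: raw image bytes in, list of labels out.
Adapter = Callable[[bytes], List[str]]


class LabelService:
    """Tries the configured providers in order and returns the first answer."""

    def __init__(self, adapters: Dict[str, Adapter], order: List[str]):
        self.adapters = adapters
        self.order = order

    def labels(self, image: bytes) -> List[str]:
        errors = []
        for name in self.order:
            try:
                return self.adapters[name](image)
            except Exception as exc:  # provider down, API changed, quota hit, ...
                errors.append(f"{name}: {exc}")
        raise RuntimeError("all providers failed: " + "; ".join(errors))
```

With this layout, a changed or discontinued service only requires rewriting one adapter function; the rest of the system keeps calling `labels()`.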

Usage of High Level AI

Usage of Low Level AI
                    Casual bikers                   Registered bikers
                    Validation Err  Testing Err     Validation Err  Testing Err
Python 3 (Sklearn)  16.70 (0.42)    11.71 (0.06)    70.39 (0.25)    92.86 (0.21)
Dataiku (standard)  24.33 (0.45)    17.28 (0.01)    89.20 (0.30)    106.80 (0.18)
Dataiku (manual)    24.47 (0.45)    17.16 (0.06)    92.47 (0.27)    93.35 (0.20)
Azure ML Studio     33.62 (0.002)   23.20 (-0.58)   97.32 (0.26)    104.23 (0.15)
Bike Rental Data (UCI) – number of bike rentals in Washington D.C. for casual and registered
users; 2 years of data
Features: weather, season, holidays, weekdays, time
Testing Data: December 2012
Training Data: Rest
Metrics: median or mean absolute error, with R² in parentheses
Model: Ridge Regression (L2 Norm)
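For reference, the metrics named above can be written out as follows. This is a plain-Python sketch of the standard definitions; the function names are chosen for this example (they mirror common naming but are defined locally). Note that R² is computed as 1 - SS_res / SS_tot, so it becomes negative whenever a model predicts worse than simply using the mean of the true values, which is how a negative R² can appear in a results table.

```python
from statistics import mean, median


def mean_absolute_error(y_true, y_pred):
    """Average of the absolute prediction errors."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def median_absolute_error(y_true, y_pred):
    """Median of the absolute prediction errors; robust against single outliers."""
    return median(abs(t - p) for t, p in zip(y_true, y_pred))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    m = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - m) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


y_true = [1, 2, 3, 4]
y_pred = [1, 2, 3, 8]          # one large miss
print(mean_absolute_error(y_true, y_pred))
print(median_absolute_error(y_true, y_pred))
print(r2_score(y_true, y_pred))
```

In this toy example a single large miss barely affects the median absolute error but pushes R² below zero, so the choice of metric changes how the same predictions are judged.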

Usage of Low Level AI
                    Casual bikers                   Registered bikers
                    Validation Err  Testing Err     Validation Err  Testing Err
Python 3 (Sklearn)  37.15           23.90           128.09          136.59
Dataiku (standard)  36.53           23.33           124.00          138.80
Dataiku (manual)    36.98           22.65           129.00          137.28
Azure ML Studio     49.95           29.44           130.54          140.84
AWS                 32.58           20.93           91.86           105.21
Bike Rental Data (UCI) – number of bike rentals in Washington D.C. for casual and registered
users; 2 years of data
Features: weather, season, holidays, weekdays, time
Testing Data: December 2012
Training Data: Rest
Metric: root mean square error (RMSE) – only metric offered by AWS
Model: Ridge Regression (L2 Norm)
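The evaluation setup described above (hold out December 2012 for testing, train on the rest, score with RMSE) can be sketched as follows. The row layout, a list of dicts with a `date` field, and the function names are assumptions made for this example. Because RMSE squares the errors before averaging, it is never smaller than the mean absolute error on the same predictions, so RMSE values are not directly comparable with absolute-error values.

```python
from datetime import date
from math import sqrt


def rmse(y_true, y_pred):
    """Root mean square error: large misses weigh more than with absolute error."""
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def split_holdout_month(rows, year, month):
    """Time-based split: the given month becomes test data, all other rows training data."""
    def in_month(row):
        return (row["date"].year, row["date"].month) == (year, month)

    train = [r for r in rows if not in_month(r)]
    test = [r for r in rows if in_month(r)]
    return train, test


# Hypothetical rows; only the "date" field is required by the split.
rows = [
    {"date": date(2012, 11, 30), "count": 5},
    {"date": date(2012, 12, 1), "count": 7},
]
train, test = split_holdout_month(rows, 2012, 12)
```

Holding out the last month instead of a random sample keeps the test data strictly later than the training data, which matches how such a model would be used.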

User Interfaces

Services as Black Box

What’s behind the services?
• mostly not open source
• often not even clear which algorithm is implemented or which parameters are
used
Like any other software, implementations can be buggy
Systematic tests recommended!
• different aspects of AIaaS hinder comparability of services between providers
– no versioning (except: Algorithmia)
– HL AI: specialized tasks, different output formats
– LL AI: Limitations of what is (easily) possible in the environment

What’s the data behind the services? (HL AI)
Important: most algorithms are “learning-by-example”
• What is the model trained on?
– Data biases
– Corner cases
– Missing examples
• Adversarial attacks
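Since the training data of a high level service is usually unknown, one practical probe for data biases and missing examples is to measure accuracy separately for each group of inputs that matters to the application. The following is a minimal sketch; the record format `(group, expected, predicted)` and the function name are assumptions made for this example.

```python
from collections import defaultdict


def accuracy_by_group(records):
    """records: iterable of (group, expected, predicted) triples."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for group, expected, predicted in records:
        totals[group] += 1
        hits[group] += int(expected == predicted)
    return {group: hits[group] / totals[group] for group in totals}
```

A clearly lower accuracy for one group suggests that such inputs were rare or absent in whatever data the service was trained on.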

The Black Box in the Black Box
General Black Box AI problem
– White Box AI: see the talk by my colleague Ulf Schöneberg at Predictive Analytics
World this Tuesday, 14.11.
General Data Protection Regulation (GDPR) - goes into effect on May 25, 2018
– defines and strengthens data protection for consumers and harmonizes data security
rules within the EU
– controls on data processing and consumer profiling
– right to an explanation
– prevent discriminatory effects based on racial or ethnic origin, political opinion, religion or
beliefs, trade union membership, genetic or health status or sexual orientation

Conclusions
