SlideShare a Scribd company logo
1 of 57
Download to read offline
Open Source @ IBM
Open source
AI/Machine Learning
2018 / © 2018 IBM Corporation 1
Luciano Resende
Data Science Platform Architect
About me - Luciano Resende
2
Data Science Platform Architect – IBM – CODAIT
• Have been contributing to open source at ASF for over 10 years
• Currently contributing to : Jupyter Notebook ecosystem, Apache Bahir, Apache
Toree, Apache Spark among other projects related to AI/ML platforms
lresende@apache.org
https://www.linkedin.com/in/lresende
@lresende1975
https://github.com/lresende
Open Source @ IBM
Center for Open Source Data & AI Technologies (CODAIT)
Model Asset eXchange (MAX)
Fabric for Deep Learning (FfDL)
Jupyter Enterprise Gateway
Q&A
Agenda
32018 / © 2018 IBM Corporation
4
Learn
Open Source @ IBM
Program touches
78,000
IBMers annually
Consume
Virtually all
IBM products
contain some
open source
• 40,363 pkgs
Per Year
Contribute
• >62K OS Certs
per year
• ~10K IBM
commits per
month
Connect
> 1000
active IBM
Contributors
Working in key OS
projects
2018 / © 2018 IBM Corporation
Open Source participation and usage is simpler than ever
5
Open Source is essential to Developer Advocacy
IBM generated open source innovation
• 137 Code Open (dWO) projects w/1000+ Github projects
• 4 graduates: Node-Red, OpenWhisk, SystemML,
Blockchain fabric to full open governance in the last year
• developer.ibm.com/code/open/code/
Community
• IBM focused on 18 strategic communities
• Drive open governance in “Centers of Gravity”
• IBM Leaders drive key technologies and assure freedom
of action
The IBM OS Way is now open sourced
• Training, Recognition, Tooling
• Organization, Consuming, Contributing
2018 / © 2018 IBM Corporation
Center for Open Source
Data and AI Technologies
(CODAIT)
62018 / © 2018 IBM Corporation
7
IBM’s history of strong AI leadership
1997: Deep Blue
• Deep Blue became the first machine to beat a world chess
champion in tournament play
2011: Jeopardy!
• Watson beat two top
Jeopardy! champions
1968, 2001: A Space Odyssey
• IBM was a technical
advisor
• HAL is “the latest in
machine intelligence”
2018: Open Tech, AI & emerging
standards
• New IBM centers of gravity for AI
• OS projects increasing exponentially
• Emerging global standards in AI
2018 / © 2018 IBM Corporation
Center for Open Source
Data and AI Technologies
CODAIT
codait.org
2018 / © 2018 IBM Corporation
codait (French)
= coder/coded
https://m.interglot.com/fr/en/codait
CODAIT aims to make AI solutions
dramatically easier to create, deploy,
and manage in the enterprise
Relaunch of the Spark Technology
Center (STC) to reflect expanded
mission
8
CODAIT by the numb3rs
CODAIT
codait.org
2018 / © 2018 IBM Corporation
codait (French)
= coder/coded
https://m.interglot.com/fr/en/codait
The team contributes to over 10 open source projects. These
projects include - Spark, Tensorflow, Keras, SystemML, Arrow,
Bahir, Toree, Livy, Zeppelin, R4ML, Stocator, Jupyter Enterprise
Gateway
17 committers and many contributors in Apache projects- Spark,
Arrow, systemML, Bahir, Toree, Livy
Over 980 JIRAs and 50,000 lines of code committed to Apache
Spark itself, and Over 65,000 LoC into SystemML
• Established IBM as the number 1 contributor to Spark
Machine Learning in Spark 2.0 release
Over 25 product lines within IBM leveraging Apache Spark in
some form or another. CODAIT engineers have interacted and
interlocked with many of them.
Speakers at over 100 conferences, MeetUps, un-conferences
etc.
9
Spark code contribution growth by
week
Center for Open Source
Data and AI Technologies
2018 / © 2018 IBM Corporation
codait (French)
= coder/coded
https://m.interglot.com/fr/en/codaitCode - Build and improve practical frameworks
to enable more developers to realize immediate
value (e.g. FfDL, Tensorflow Jupyter, Spark)
Content – Showcase solutions to complex and
real world AI problems
Community – Bring developers and data
scientists to engage with IBM (e.g. MAX)
Improving Enterprise AI lifecycle in Open Source
Gather
Data
Analyze
Data
Machine
Learning
Deep
Learning
Deploy
Model
Maintain
Model
Python
Data Science
Stack
Fabric for
Deep Learning
(FfDL)
Mleap +
PFA
Scikit-LearnPandas
Apache
Spark
Apache
Spark
Jupyter
Model
Asset
eXchange
Keras +
Tensorflow
CODAIT
codait.org
10
Model Asset eXchange
112018 / © 2018 IBM Corporation
Enabling domain experts to
use deep learning in the
enterprise
CODAIT: Enabling End-to-End AI in the Enterprise
122018 / © 2018 IBM Corporation
Gather
Data
Analyze
Data
Machine
Learning
Deep
Learning
Deploy
Model
Maintain
Model
Python
Data Science
Stack
Fabric for
Deep Learning
(FfDL)
Mleap +
PFA
Scikit-LearnPandas
Apache
Spark
Apache
Spark
Jupyter
Model
Asset
eXchange
Keras +
Tensorflow
Making AI as Ubiquitous
as the Telephone
132018 / © 2018 IBM Corporation
Q: What is deep learning?
A: Machine learning using
deep neural networks.
142018 / © 2018 IBM Corporation
InceptionV3 Convolutional Neural Net
(A “medium-sized” deep learning model)
Image Source:
https://github.com/tensorflow/models/blob/master/research/inception/g
3doc/inception_v3_architecture.png
Characteristics of Deep
Learning (1)
15
State-of-the-Art prediction
quality in many domains
– Image classification
– Machine translation
– Facial recognition
– Time series prediction
– Many more
2018 / © 2018 IBM Corporation
Characteristics of Deep
Learning (2)
16
Large, complex models
– Model size generally determined by “how big a
model can you fit on your device?”
2018 / © 2018 IBM Corporation
Each box ≈ between
32 and 768 linear
regression models
Characteristics of Deep
Learning (3)
17
Poorly understood today
…even by experts
– Why do the models converge?
– Why do the models converge with low loss?
– Why do the models generalize?
2018 / © 2018 IBM Corporation
Focus of this Talk
18
Incorporating well-
understood deep learning
models into enterprise
applications.
2018 / © 2018 IBM Corporation
192018 / © 2018 IBM Corporation
Sounds easy!
“cat”
The Components of a Deep
Learning Model
202018 / © 2018 IBM Corporation
Dense
(3×8)
Dense
(8×6)
Input
(3)
Output
(2)Dense
(6×4)
Dense
(4×2)
Neural Network
Graph
Weights
(not to scale)
Driver Program
Example: Get an Image Classifier
21
Step 1: Find a suitable neural
network graph.
– Need to read some papers
2018 / © 2018 IBM Corporation
Example: Get an Image Classifier
22
Step 2: Find code to generate
the neural network graph
2018 / © 2018 IBM Corporation
TensorFlow code to build ResNet50 neural network graph
Example: Get an Image Classifier
23
Step 3: Find some pre-trained
weights for your graph
2018 / © 2018 IBM Corporation
Caffe2 ResNet50 model weights
Example: Get an Image Classifier
24
Step 4: Find example code
that performs model
inference
2018 / © 2018 IBM Corporation
TensorFlow code for training and batch inference on ResNet50
Example: Get an Image Classifier
25
Step 5: Write your own code to
perform model inference on one
image at a time
Step 6: Package your inference
code, graph creation code, and pre-
trained weights together
Step 7: Deploy your package
2018 / © 2018 IBM Corporation
Model Marketplaces
26
Collections of well-
understood deep learning
models
Provide a central place to find
known-good implementations
of these models
2018 / © 2018 IBM Corporation
IBM Model Asset eXchange
MAX is a one-stop shop open source
ecosystem for data scientists and AI
developers to share and consume models that
use machine learning engines, such
as TensorFlow, PyTorch and Caffe2.
It also provides a standard approach to
classify, annotate, and deploy these models
for prediction and inferencing.
MAX
https://developer.ibm.com/
code/exchanges/models/
2018 / © 2018 IBM Corporation 27
282018 / © 2018 IBM Corporation
Demo!
https://developer.ibm.com/code/exchanges/models/
Summary
29
Free, open-source models.
Wide variety of domains.
Multiple deep learning frameworks.
Vetted and tested code and IP.
Build and deploy a web service in 30
seconds.
Start training on Watson Studio in
minutes.
2018 / © 2018 IBM Corporation
MAX: Future Plans
30
Many more models
– Train with Watson Studio/DLaaS
– Run inference on IBM infrastructure
Revamped website
Integration with Watson Catalog
IBMer-uploaded models
More IBM Code code patterns showing usage
2018 / © 2018 IBM Corporation
https://developer.ibm.com/code/exchanges/models/
But if you can’t wait
MAX Models at DockerHub
2018 / © 2018 IBM Corporation
MAX models are exposed as Docker
containers, and published to
DockerHub under CODAIT
organization.
31
https://hub.docker.com/u/codait/dashboard/
MAX and Container Services
K8 Deployment Descriptor
apiVersion: v1
kind: Pod
metadata:
name: image-caption-generator
namespace: default
labels:
app: image-caption-generator
spec:
restartPolicy: Always
containers:
- env:
name: image-caption-generator
image: codait/max-image-caption-generator
---
apiVersion: v1
kind: Service
metadata:
labels:
app: image-caption-generator
component: image-caption-generator
name: image-caption-generator
spec:
ports:
- name: http
port: 5000
targetPort: 5000
selector:
app: image-caption-generator
sessionAffinity: None
type: NodePort2018 / © 2018 IBM Corporation
MAX models require a Kubernetes
deployment descriptor to enable easy
deployment in IBM Cloud Container
Services.
32
Kubectl apply –f image-caption-generator.yaml
IBM Cloud Container Service
2018 / © 2018 IBM Corporation
IBM Cloud CLI
IBM Cloud Container Service plug-in
Kubernetes CLI (kubectl)
Useful Commands
Pointing Kubectl to IBM Cloud Container Service
export KUBECONFIG=/Users/lresende/.bluemix/plugins/container-
service/clusters/lresende-kubernetes/kube-config-hou02-lresende-kubernetes.yml
Accessing Kubernetes dashboard via kubectl proxy
kubectl config view -o jsonpath='{.users[0].user.auth-provider.config.id-token}'
kubectl proxy
http://localhost:8001/ui
Deploying application
kubectl apply –f image-caption-generator.yml
Accessing application
bx cs workers lresende-kubernetes
kubectl describe service image-caption-generator
curl -F "image=@assets/surfing.jpg" -X POST
http://184.172.242.55:32229/model/predict
curl -F "image=@/Users/lresende/Pictures/375337.jpg" -X POST
http://184.172.242.55:32229/model/predict
curl -F "image=@/Users/lresende/Pictures/362809.jpg" -X POST
http://184.172.242.55:32229/model/predict
Click to edit Master title style
FfDL
Fabric for Deep Learning
2018 / © 2018 IBM Corporation 34
FfDL provides a scalable,
resilient, and fault tolerant
deep-learning framework
Fabric for Deep Learning
https://github.com/IBM/FfDL
2018 / © 2018 IBM Corporation
FfDL provides a scalable, resilient, and fault
tolerant deep-learning framework
FfDL Github Page
https://github.com/IBM/FfDL
FfDL dwOpen Page
https://developer.ibm.com/code/open/projects/fa
bric-for-deep-learning-ffdl/
FfDL Announcement Blog
http://developer.ibm.com/code/2018/03/20/fabri
c-for-deep-learning
FfDL Technical Architecture Blog
http://developer.ibm.com/code/2018/03/20/dem
ocratize-ai-with-fabric-for-deep-learning
Deep Learning as a Service within Watson Studio
https://www.ibm.com/cloud/deep-learning
Research paper: “Scalable Multi-Framework
Management of Deep Learning Training Jobs”
http://learningsys.org/nips17/assets/papers/pape
r_29.pdf
• Fabric for Deep Learning or FfDL (pronounced as ‘fiddle’) is an open source
project which aims at making Deep Learning easily accessible to the
people it matters the most i.e. Data Scientists, and AI developers.
• FfDL Provides a consistent way to deploy, train and visualize Deep
Learning jobs across multiple frameworks like TensorFlow, Caffe, PyTorch,
Keras etc.
• FfDL is being developed in close collaboration with IBM Research and IBM
Watson. It forms the core of Watson`s Deep Learning service in open
source.
FfDL
35
Fabric for Deep Learning
https://github.com/IBM/FfDL
FfDL is built using Microservices architecture
on Kubernetes
• FfDL platform uses a microservices architecture to offer
resilience, scalability, multi-tenancy, and security without
modifying the deep learning frameworks, and with no or minimal
changes to model code.
• FfDL control plane microservices are deployed as pods on
Kubernetes to manage this cluster of GPU- and CPU-enabled
machines effectively
• Tested Platforms: Minikube, IBM Cloud Public, IBM Cloud
Private, GPUs using both Kubernetes feature gate Accelerators
and NVidia device plugins
362018 / © 2018 IBM Corporation
Try FfDL/DLaaS
https://ibm.biz/BdZtab
source code
training
definition
Auto-allocation means infrastructure is used only when needed
Kubernetes container
training
artifacts
compute cluster
NVIDIA Tesla K80, P100, V100
Cloud Object Storage
Training assets are
managed and tracked.
Access to elastic compute leveraging Kubernetes
372018 / © 2018 IBM Corporation
NVIDIA GPUs
Kubernetes
container orchestration
training runs
containers
server cluster
dataset
Cloud Object Storage
Model training distributed across containers
382018 / © 2018 IBM Corporation
39
FfDL: Architecture
2018 / © 2018 IBM Corporation
40
https://arxiv.org/abs/1709.05871
FfDL: Research Papers
2018 / © 2018 IBM Corporation
Click to edit Master title style
Jupyter
Enterprise
Gateway
2018 / © 2018 IBM Corporation 41
Provides multi-tenant,
scalable and secure remote
Jupyter Notebook kernels
Jupyter Notebooks
Overview
42© 2018 IBM Corporation
Jupyter Notebooks
© 2018 IBM Corporation 43
Notebooks are interactive
computational
environments, in which
you can combine code
execution, rich text,
mathematics, plots and
rich media.
Jupyter Notebooks
© 2018 IBM Corporation 44
• Notebook UI runs on the browser
• The Notebook Server serves the
’Notebooks’
• Kernels interpret/execute cell contents
– Are responsible for code execution
– Abstracts different languages
Building a
Data Science
Analytical Platform
45© 2018 IBM Corporation
Building an Data Science Platform
© 2018 IBM Corporation
Large pool of shared computing resources
• Enterprise Cloud, Public Cloud or Hybrid
• Data in the cloud (Data Lakes/Object Storage)
Distributed Consumers
• Notebooks running local (users laptop)
or as a service (e.g. Jupyter Hub)
Different Resource Utilization Patterns
• High number of idle resources
Vanilla Jupyter Notebooks
© 2018 IBM Corporation
Gather
Data
Analyze
Data
Machine
Learning
Deep
Learning
Deploy
Model
Maintain
Model
Python
Data Science
Stack
Fabric for
Deep Learning
(FfDL)
Mleap +
PFA
Scikit-LearnPandas
Apache
Spark
Apache
Spark
Jupyter
Model
Asset
eXchange
Keras +
Tensorflow
47
8 8 8 8
0
10
20
30
40
50
60
70
80
4 Nodes 8 Nodes 12 Nodes 16 NodesMaxKernels(4GBHeap)
Cluster Size (32GB Nodes)
MAXIMUM NUMBER OF
SIMULTANEOUS KERNELS
Kernel
Kernel
Kernel
Kernel
Limitations of Jupyter Notebook Stack
• Security limitations
• Single user sharing the same privileges
• Users can see and control each other process
using Jupyter administrative utilities
• Scalability limitations
• Jupyter Kernels running as local process
• Resources are limited by what is available
on the one single node that runs all Kernels
and associated Spark drivers
Kernel
Jupyter Enterprise
Gateway
© 2018 IBM Corporation
Jupyter Enterprise Gateway at IBM Code
https://developer.ibm.com/code/openprojects/jupyter-enterprise-gateway/
Jupyter Enterprise Gateway source code at GitHub
https://github.com/jupyter-incubator/enterprise_gateway
Jupyter Enterprise Gateway Documentation
http://jupyter-enterprise-gateway.readthedocs.io/en/latest/
Supported Kernels
Supported Platforms
49
A lightweight, multi-tenant, scalable
and secure gateway that enables
Jupyter Notebooks to share resources
across an Apache Spark or Kubernetes
cluster for Enterprise/Cloud use cases
Spectrum Conductor
+
Jupyter Enterprise Gateway
© 2018 IBM Corporation
Gather
Data
Analyze
Data
Machine
Learning
Deep
Learning
Deploy
Model
Maintain
Model
Python
Data Science
Stack
Fabric for
Deep Learning
(FfDL)
Mleap +
PFA
Scikit-LearnPandas
Apache
Spark
Apache
Spark
Jupyter
Model
Asset
eXchange
Keras +
Tensorflow
50
16
32
48
64
0
10
20
30
40
50
60
70
80
4 Nodes 8 Nodes 12 Nodes 16 NodesMaxKernels(4GBHeap)
Cluster Size (32GB Nodes)
MAXIMUM NUMBER OF
SIMULTANEOUS KERNELS
Kernel
Kernel
KernelKernel
Optimized Resource Allocation
– Utilize resources on all cluster nodes by running kernels as Spark
applications in YARN Cluster Mode.
– Pluggable architecture to enable support for additional Resource Managers
Enhanced Security
– End-to-End secure communications
• Secure socket communications
• Encrypted HTTP communication using SSL
Multiuser support with user impersonation
– Enhance security and sandboxing by enabling user impersonation when
running kernels (using Kerberos).
– Individual HDFS home folder for each notebook user.
– Use the same user ID for notebook and batch jobs.
KernelKernel
Kernel
Jupyter Enterprise Gateway – YARN
© 2018 IBM Corporation 51
YARN Cluster
YARN
Workers
Gateway Node
Jupyter Enterprise Gateway
• Multitenancy
• Remote kernel lifecycle management via process proxies
Spark Executors
Spark Executors
Spark Executors
Yarn Container
Jupyter Kernel
Spark Driver
Impersonation:
Alice’s kernel runs
under Alice’s user ID.
Spark Executors
Spark Executors
Spark Executors
Yarn Container
Jupyter Kernel
Spark Driver
SecurityLayer
nb2kg
nb2kg
Spark Executors
Spark Executors
Spark Executors
Yarn Container
Jupyter Kernel
Spark Driver
Bob
Alice
Enterprise Gateway & Kubernetes
© 2018 IBM Corporation
Supported Platforms
Kernel
Kernel
Kernel
Kernel
Before Jupyter Enterprise Gateway …
• Scalability limitations
• Resources are limited and the amount
required to all kernels needs to be allocated
during Notebook Server pod creation.
• Resources are limited by what is available
on the one single node that runs all Kernels
and associated Spark drivers
Kernel
KernelKernel
Jupyter Enterprise Gateway - Kubernetes
© 2018 IBM Corporation 53
Container images defined in kernelspec
Community image
Kernel
Spark on K8
Kernel
Distributed
File
System
Vanilla Kernels
Spark based kernels
Gateway
nb2kg
nb2kg
Summary
58© 2018 IBM Corporation
Summary
© 2018 IBM Corporation 59
• Model Asset Exchange
• Curated set of models ready to use or embedded in your
application or solution
• Fabric for Deep Learning
• Provides a consistent way for AI developers and
Data Scientists to train their models
• Jupyter Enterprise Gateway
• Enables your Jupyter Notebook stack to scale in
order to build Machine Learning and AI Models
more resource effectively
MAX
https://developer.ibm.com/
code/exchanges/models/
60© 2018 IBM Corporation
61May 17, 2018 / © 2018 IBM Corporation
62© 2018 IBM Corporation

More Related Content

What's hot

What is Weak (Narrow) AI? Here Are 8 Practical Examples
What is Weak (Narrow) AI? Here Are 8 Practical Examples What is Weak (Narrow) AI? Here Are 8 Practical Examples
What is Weak (Narrow) AI? Here Are 8 Practical Examples Bernard Marr
 
Artificial intelligence in anesthesiology by dr tushar chokshi
Artificial intelligence in anesthesiology by dr tushar chokshi Artificial intelligence in anesthesiology by dr tushar chokshi
Artificial intelligence in anesthesiology by dr tushar chokshi dr tushar chokshi
 
How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...
How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...
How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...Bernard Marr
 
The 10 Best AI And Data Science Undergraduate Courses For 2021
The 10 Best AI And Data Science Undergraduate Courses For 2021The 10 Best AI And Data Science Undergraduate Courses For 2021
The 10 Best AI And Data Science Undergraduate Courses For 2021Bernard Marr
 
How To Get Started With Your AI Journey
How To Get Started With Your AI JourneyHow To Get Started With Your AI Journey
How To Get Started With Your AI JourneyBernard Marr
 
Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys
Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys
Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys Bernard Marr
 
The Amazing Ways Artificial Intelligence Is Transforming The Music Industry
The Amazing Ways Artificial Intelligence Is Transforming The Music IndustryThe Amazing Ways Artificial Intelligence Is Transforming The Music Industry
The Amazing Ways Artificial Intelligence Is Transforming The Music IndustryBernard Marr
 
9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...
9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...
9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...Bernard Marr
 
How can artificial intelligence be used in e learning
How can artificial intelligence be used in e learning How can artificial intelligence be used in e learning
How can artificial intelligence be used in e learning GlobalTechCouncil
 
How AI Creates Synthetic Speech
How AI Creates Synthetic SpeechHow AI Creates Synthetic Speech
How AI Creates Synthetic SpeechBernard Marr
 
Comm tech final
Comm tech finalComm tech final
Comm tech finalrlmorr09
 
Industry 4.0 – Tech Trends Driving Innovation in Manufacturing
Industry 4.0 – Tech Trends Driving Innovation in ManufacturingIndustry 4.0 – Tech Trends Driving Innovation in Manufacturing
Industry 4.0 – Tech Trends Driving Innovation in ManufacturingBernard Marr
 
3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...
3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...
3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...Bernard Marr
 
3 Steps To Tackle The Problem Of Bias In Artificial Intelligence
3 Steps To Tackle The Problem Of Bias In Artificial Intelligence3 Steps To Tackle The Problem Of Bias In Artificial Intelligence
3 Steps To Tackle The Problem Of Bias In Artificial IntelligenceBernard Marr
 
The Most Amazing Artificial Intelligence Milestones So Far
The Most Amazing Artificial Intelligence Milestones So FarThe Most Amazing Artificial Intelligence Milestones So Far
The Most Amazing Artificial Intelligence Milestones So FarBernard Marr
 
Blockchain revolution
Blockchain revolutionBlockchain revolution
Blockchain revolutionfahad ali
 
AI in the Cloud Contact Centre
AI in the Cloud Contact Centre AI in the Cloud Contact Centre
AI in the Cloud Contact Centre Bernard Marr
 
The Amazing Ways eBay Is Using Artificial Intelligence To Boost Business Success
The Amazing Ways eBay Is Using Artificial Intelligence To Boost Business SuccessThe Amazing Ways eBay Is Using Artificial Intelligence To Boost Business Success
The Amazing Ways eBay Is Using Artificial Intelligence To Boost Business SuccessBernard Marr
 
The 4 Biggest Trends In Big Data and Analytics Right For 2021
The 4 Biggest Trends In Big Data and Analytics Right For 2021The 4 Biggest Trends In Big Data and Analytics Right For 2021
The 4 Biggest Trends In Big Data and Analytics Right For 2021Bernard Marr
 

What's hot (20)

What is Weak (Narrow) AI? Here Are 8 Practical Examples
What is Weak (Narrow) AI? Here Are 8 Practical Examples What is Weak (Narrow) AI? Here Are 8 Practical Examples
What is Weak (Narrow) AI? Here Are 8 Practical Examples
 
Artificial intelligence in anesthesiology by dr tushar chokshi
Artificial intelligence in anesthesiology by dr tushar chokshi Artificial intelligence in anesthesiology by dr tushar chokshi
Artificial intelligence in anesthesiology by dr tushar chokshi
 
How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...
How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...
How To Solve AI’s Bias Problem, Create Emotional AIs, And Democratize AI With...
 
Annual Day at iLabs
Annual Day at iLabsAnnual Day at iLabs
Annual Day at iLabs
 
The 10 Best AI And Data Science Undergraduate Courses For 2021
The 10 Best AI And Data Science Undergraduate Courses For 2021The 10 Best AI And Data Science Undergraduate Courses For 2021
The 10 Best AI And Data Science Undergraduate Courses For 2021
 
How To Get Started With Your AI Journey
How To Get Started With Your AI JourneyHow To Get Started With Your AI Journey
How To Get Started With Your AI Journey
 
Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys
Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys
Future Of Intimacy: Sex Bots, Virtual Reality, And Smart Sex Toys
 
The Amazing Ways Artificial Intelligence Is Transforming The Music Industry
The Amazing Ways Artificial Intelligence Is Transforming The Music IndustryThe Amazing Ways Artificial Intelligence Is Transforming The Music Industry
The Amazing Ways Artificial Intelligence Is Transforming The Music Industry
 
9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...
9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...
9 Soft Skills Every Employee Will Need In The Age Of Artificial Intelligence ...
 
How can artificial intelligence be used in e learning
How can artificial intelligence be used in e learning How can artificial intelligence be used in e learning
How can artificial intelligence be used in e learning
 
How AI Creates Synthetic Speech
How AI Creates Synthetic SpeechHow AI Creates Synthetic Speech
How AI Creates Synthetic Speech
 
Comm tech final
Comm tech finalComm tech final
Comm tech final
 
Industry 4.0 – Tech Trends Driving Innovation in Manufacturing
Industry 4.0 – Tech Trends Driving Innovation in ManufacturingIndustry 4.0 – Tech Trends Driving Innovation in Manufacturing
Industry 4.0 – Tech Trends Driving Innovation in Manufacturing
 
3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...
3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...
3 Important Ways Artificial Intelligence Will Transform Your Business And Tur...
 
3 Steps To Tackle The Problem Of Bias In Artificial Intelligence
3 Steps To Tackle The Problem Of Bias In Artificial Intelligence3 Steps To Tackle The Problem Of Bias In Artificial Intelligence
3 Steps To Tackle The Problem Of Bias In Artificial Intelligence
 
The Most Amazing Artificial Intelligence Milestones So Far
The Most Amazing Artificial Intelligence Milestones So FarThe Most Amazing Artificial Intelligence Milestones So Far
The Most Amazing Artificial Intelligence Milestones So Far
 
Blockchain revolution
Blockchain revolutionBlockchain revolution
Blockchain revolution
 
AI in the Cloud Contact Centre
AI in the Cloud Contact Centre AI in the Cloud Contact Centre
AI in the Cloud Contact Centre
 
The Amazing Ways eBay Is Using Artificial Intelligence To Boost Business Success
The Amazing Ways eBay Is Using Artificial Intelligence To Boost Business SuccessThe Amazing Ways eBay Is Using Artificial Intelligence To Boost Business Success
The Amazing Ways eBay Is Using Artificial Intelligence To Boost Business Success
 
The 4 Biggest Trends In Big Data and Analytics Right For 2021
The 4 Biggest Trends In Big Data and Analytics Right For 2021The 4 Biggest Trends In Big Data and Analytics Right For 2021
The 4 Biggest Trends In Big Data and Analytics Right For 2021
 

Similar to Open Source AI - News and examples

Inteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for CodeInteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for CodeLuciano Resende
 
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...Luciano Resende
 
IBM Developer Model Asset eXchange
IBM Developer Model Asset eXchangeIBM Developer Model Asset eXchange
IBM Developer Model Asset eXchangeNick Pentreath
 
Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3DataWorks Summit
 
How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...Antje Barth
 
Ai pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooksAi pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooksLuciano Resende
 
Using LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowUsing LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowDataWorks Summit
 
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...Alok Singh
 
Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...
Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...
Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...Codemotion
 
Intel 20180608 v2
Intel 20180608 v2Intel 20180608 v2
Intel 20180608 v2ISSIP
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRBWilliam Poos
 
Artificial Intelligence and Machine Learning with the Oracle Data Science Cloud
Artificial Intelligence and Machine Learning with the Oracle Data Science CloudArtificial Intelligence and Machine Learning with the Oracle Data Science Cloud
Artificial Intelligence and Machine Learning with the Oracle Data Science CloudJuarez Junior
 
IBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for EveryoneIBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for EveryoneNick Pentreath
 
Scaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownScaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownDatabricks
 
Scaling up deep learning by scaling down
Scaling up deep learning by scaling downScaling up deep learning by scaling down
Scaling up deep learning by scaling downNick Pentreath
 
Introduction to pyspark new
Introduction to pyspark newIntroduction to pyspark new
Introduction to pyspark newAnam Mahmood
 
IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.
IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.
IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.OW2
 
Continuous Deployment for Deep Learning
Continuous Deployment for Deep LearningContinuous Deployment for Deep Learning
Continuous Deployment for Deep LearningDatabricks
 
Deploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDeploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDatabricks
 

Similar to Open Source AI - News and examples (20)

Inteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for CodeInteligencia artificial, open source e IBM Call for Code
Inteligencia artificial, open source e IBM Call for Code
 
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
 
IBM Developer Model Asset eXchange
IBM Developer Model Asset eXchangeIBM Developer Model Asset eXchange
IBM Developer Model Asset eXchange
 
Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3Optimizing your SparkML pipelines using the latest features in Spark 2.3
Optimizing your SparkML pipelines using the latest features in Spark 2.3
 
How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...
 
Ai pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooksAi pipelines powered by jupyter notebooks
Ai pipelines powered by jupyter notebooks
 
Using LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowUsing LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache Arrow
 
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
 
Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...
Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...
Luciano Resende - Scaling Big Data Interactive Workloads across Kubernetes Cl...
 
Intel 20180608 v2
Intel 20180608 v2Intel 20180608 v2
Intel 20180608 v2
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRB
 
Artificial Intelligence and Machine Learning with the Oracle Data Science Cloud
Artificial Intelligence and Machine Learning with the Oracle Data Science CloudArtificial Intelligence and Machine Learning with the Oracle Data Science Cloud
Artificial Intelligence and Machine Learning with the Oracle Data Science Cloud
 
IBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for EveryoneIBM Developer Model Asset eXchange - Deep Learning for Everyone
IBM Developer Model Asset eXchange - Deep Learning for Everyone
 
Scaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling DownScaling up Deep Learning by Scaling Down
Scaling up Deep Learning by Scaling Down
 
Scaling up deep learning by scaling down
Scaling up deep learning by scaling downScaling up deep learning by scaling down
Scaling up deep learning by scaling down
 
Introduction to pyspark new
Introduction to pyspark newIntroduction to pyspark new
Introduction to pyspark new
 
Center of Excellence
Center of Excellence Center of Excellence
Center of Excellence
 
IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.
IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.
IBM Keynote presentation, OW2con'19, June 12-13, 2019, Paris.
 
Continuous Deployment for Deep Learning
Continuous Deployment for Deep LearningContinuous Deployment for Deep Learning
Continuous Deployment for Deep Learning
 
Deploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDeploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNX
 

More from Luciano Resende

A Jupyter kernel for Scala and Apache Spark.pdf
A Jupyter kernel for Scala and Apache Spark.pdfA Jupyter kernel for Scala and Apache Spark.pdf
A Jupyter kernel for Scala and Apache Spark.pdfLuciano Resende
 
Using Elyra for COVID-19 Analytics
Using Elyra for COVID-19 AnalyticsUsing Elyra for COVID-19 Analytics
Using Elyra for COVID-19 AnalyticsLuciano Resende
 
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Luciano Resende
 
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayStrata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayLuciano Resende
 
Scaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloadsScaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloadsLuciano Resende
 
Jupyter Enterprise Gateway Overview
Jupyter Enterprise Gateway OverviewJupyter Enterprise Gateway Overview
Jupyter Enterprise Gateway OverviewLuciano Resende
 
IoT Applications and Patterns using Apache Spark & Apache Bahir
IoT Applications and Patterns using Apache Spark & Apache BahirIoT Applications and Patterns using Apache Spark & Apache Bahir
IoT Applications and Patterns using Apache Spark & Apache BahirLuciano Resende
 
Getting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache BahirGetting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache BahirLuciano Resende
 
Building analytical microservices powered by jupyter kernels
Building analytical microservices powered by jupyter kernelsBuilding analytical microservices powered by jupyter kernels
Building analytical microservices powered by jupyter kernelsLuciano Resende
 
Building iot applications with Apache Spark and Apache Bahir
Building iot applications with Apache Spark and Apache BahirBuilding iot applications with Apache Spark and Apache Bahir
Building iot applications with Apache Spark and Apache BahirLuciano Resende
 
An Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
An Enterprise Analytics Platform with Jupyter Notebooks and Apache SparkAn Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
An Enterprise Analytics Platform with Jupyter Notebooks and Apache SparkLuciano Resende
 
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017Luciano Resende
 
What's new in Apache SystemML - Declarative Machine Learning
What's new in Apache SystemML  - Declarative Machine LearningWhat's new in Apache SystemML  - Declarative Machine Learning
What's new in Apache SystemML - Declarative Machine LearningLuciano Resende
 
Big analytics meetup - Extended Jupyter Kernel Gateway
Big analytics meetup - Extended Jupyter Kernel GatewayBig analytics meetup - Extended Jupyter Kernel Gateway
Big analytics meetup - Extended Jupyter Kernel GatewayLuciano Resende
 
Jupyter con meetup extended jupyter kernel gateway
Jupyter con meetup   extended jupyter kernel gatewayJupyter con meetup   extended jupyter kernel gateway
Jupyter con meetup extended jupyter kernel gatewayLuciano Resende
 
Writing Apache Spark and Apache Flink Applications Using Apache Bahir
Writing Apache Spark and Apache Flink Applications Using Apache BahirWriting Apache Spark and Apache Flink Applications Using Apache Bahir
Writing Apache Spark and Apache Flink Applications Using Apache BahirLuciano Resende
 
How mentoring can help you start contributing to open source
How mentoring can help you start contributing to open sourceHow mentoring can help you start contributing to open source
How mentoring can help you start contributing to open sourceLuciano Resende
 
SystemML - Declarative Machine Learning
SystemML - Declarative Machine LearningSystemML - Declarative Machine Learning
SystemML - Declarative Machine LearningLuciano Resende
 
Luciano Resende's keynote at Apache big data conference
Luciano Resende's keynote at Apache big data conferenceLuciano Resende's keynote at Apache big data conference
Luciano Resende's keynote at Apache big data conferenceLuciano Resende
 

More from Luciano Resende (20)

A Jupyter kernel for Scala and Apache Spark.pdf
A Jupyter kernel for Scala and Apache Spark.pdfA Jupyter kernel for Scala and Apache Spark.pdf
A Jupyter kernel for Scala and Apache Spark.pdf
 
Using Elyra for COVID-19 Analytics
Using Elyra for COVID-19 AnalyticsUsing Elyra for COVID-19 Analytics
Using Elyra for COVID-19 Analytics
 
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
 
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise GatewayStrata - Scaling Jupyter with Jupyter Enterprise Gateway
Strata - Scaling Jupyter with Jupyter Enterprise Gateway
 
Scaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloadsScaling notebooks for Deep Learning workloads
Scaling notebooks for Deep Learning workloads
 
Jupyter Enterprise Gateway Overview
Jupyter Enterprise Gateway OverviewJupyter Enterprise Gateway Overview
Jupyter Enterprise Gateway Overview
 
IoT Applications and Patterns using Apache Spark & Apache Bahir
IoT Applications and Patterns using Apache Spark & Apache BahirIoT Applications and Patterns using Apache Spark & Apache Bahir
IoT Applications and Patterns using Apache Spark & Apache Bahir
 
Getting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache BahirGetting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache Bahir
 
Building analytical microservices powered by jupyter kernels
Building analytical microservices powered by jupyter kernelsBuilding analytical microservices powered by jupyter kernels
Building analytical microservices powered by jupyter kernels
 
Building iot applications with Apache Spark and Apache Bahir
Building iot applications with Apache Spark and Apache BahirBuilding iot applications with Apache Spark and Apache Bahir
Building iot applications with Apache Spark and Apache Bahir
 
An Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
An Enterprise Analytics Platform with Jupyter Notebooks and Apache SparkAn Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
An Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
 
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
 
What's new in Apache SystemML - Declarative Machine Learning
What's new in Apache SystemML  - Declarative Machine LearningWhat's new in Apache SystemML  - Declarative Machine Learning
What's new in Apache SystemML - Declarative Machine Learning
 
Big analytics meetup - Extended Jupyter Kernel Gateway
Big analytics meetup - Extended Jupyter Kernel GatewayBig analytics meetup - Extended Jupyter Kernel Gateway
Big analytics meetup - Extended Jupyter Kernel Gateway
 
Jupyter con meetup extended jupyter kernel gateway
Jupyter con meetup   extended jupyter kernel gatewayJupyter con meetup   extended jupyter kernel gateway
Jupyter con meetup extended jupyter kernel gateway
 
Writing Apache Spark and Apache Flink Applications Using Apache Bahir
Writing Apache Spark and Apache Flink Applications Using Apache BahirWriting Apache Spark and Apache Flink Applications Using Apache Bahir
Writing Apache Spark and Apache Flink Applications Using Apache Bahir
 
How mentoring can help you start contributing to open source
How mentoring can help you start contributing to open sourceHow mentoring can help you start contributing to open source
How mentoring can help you start contributing to open source
 
SystemML - Declarative Machine Learning
SystemML - Declarative Machine LearningSystemML - Declarative Machine Learning
SystemML - Declarative Machine Learning
 
Luciano Resende's keynote at Apache big data conference
Luciano Resende's keynote at Apache big data conferenceLuciano Resende's keynote at Apache big data conference
Luciano Resende's keynote at Apache big data conference
 
Asf icfoss-mentoring
Asf icfoss-mentoringAsf icfoss-mentoring
Asf icfoss-mentoring
 

Recently uploaded

A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 

Recently uploaded (20)

A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 

Open Source AI - News and examples

  • 1. Open Source @ IBM Open source AI/Machine Learning 2018 / © 2018 IBM Corporation 1 Luciano Resende Data Science Platform Architect
  • 2. About me - Luciano Resende 2 Data Science Platform Architect – IBM – CODAIT • Have been contributing to open source at ASF for over 10 years • Currently contributing to : Jupyter Notebook ecosystem, Apache Bahir, Apache Toree, Apache Spark among other projects related to AI/ML platforms lresende@apache.org https://www.linkedin.com/in/lresende @lresende1975 https://github.com/lresende
  • 3. Open Source @ IBM Center for Open Source Data & AI Technologies (CODAIT) Model Asset eXchange (MAX) Fabric for Deep Learning (FfDL) Jupyter Enterprise Gateway Q&A Agenda 32018 / © 2018 IBM Corporation
  • 4. 4 Learn Open Source @ IBM Program touches 78,000 IBMers annually Consume Virtually all IBM products contain some open source • 40,363 pkgs Per Year Contribute • >62K OS Certs per year • ~10K IBM commits per month Connect > 1000 active IBM Contributors Working in key OS projects 2018 / © 2018 IBM Corporation Open Source participation and usage is simpler than ever
  • 5. 5 Open Source is essential to Developer Advocacy IBM generated open source innovation • 137 Code Open (dWO) projects w/1000+ Github projects • 4 graduates: Node-Red, OpenWhisk, SystemML, Blockchain fabric to full open governance in the last year • developer.ibm.com/code/open/code/ Community • IBM focused on 18 strategic communities • Drive open governance in “Centers of Gravity” • IBM Leaders drive key technologies and assure freedom of action The IBM OS Way is now open sourced • Training, Recognition, Tooling • Organization, Consuming, Contributing 2018 / © 2018 IBM Corporation
  • 6. Center for Open Source Data and AI Technologies (CODAIT) 62018 / © 2018 IBM Corporation
  • 7. 7 IBM’s history of strong AI leadership 1997: Deep Blue • Deep Blue became the first machine to beat a world chess champion in tournament play 2011: Jeopardy! • Watson beat two top Jeopardy! champions 1968, 2001: A Space Odyssey • IBM was a technical advisor • HAL is “the latest in machine intelligence” 2018: Open Tech, AI & emerging standards • New IBM centers of gravity for AI • OS projects increasing exponentially • Emerging global standards in AI 2018 / © 2018 IBM Corporation
  • 8. Center for Open Source Data and AI Technologies CODAIT codait.org 2018 / © 2018 IBM Corporation codait (French) = coder/coded https://m.interglot.com/fr/en/codait CODAIT aims to make AI solutions dramatically easier to create, deploy, and manage in the enterprise Relaunch of the Spark Technology Center (STC) to reflect expanded mission 8
  • 9. CODAIT by the numb3rs CODAIT codait.org 2018 / © 2018 IBM Corporation codait (French) = coder/coded https://m.interglot.com/fr/en/codait The team contributes to over 10 open source projects. These projects include - Spark, Tensorflow, Keras, SystemML, Arrow, Bahir, Toree, Livy, Zeppelin, R4ML, Stocator, Jupyter Enterprise Gateway 17 committers and many contributors in Apache projects- Spark, Arrow, systemML, Bahir, Toree, Livy Over 980 JIRAs and 50,000 lines of code committed to Apache Spark itself, and Over 65,000 LoC into SystemML • Established IBM as the number 1 contributor to Spark Machine Learning in Spark 2.0 release Over 25 product lines within IBM leveraging Apache Spark in some form or another. CODAIT engineers have interacted and interlocked with many of them. Speakers at over 100 conferences, MeetUps, un-conferences etc. 9 Spark code contribution growth by week
  • 10. Center for Open Source Data and AI Technologies 2018 / © 2018 IBM Corporation codait (French) = coder/coded https://m.interglot.com/fr/en/codaitCode - Build and improve practical frameworks to enable more developers to realize immediate value (e.g. FfDL, Tensorflow Jupyter, Spark) Content – Showcase solutions to complex and real world AI problems Community – Bring developers and data scientists to engage with IBM (e.g. MAX) Improving Enterprise AI lifecycle in Open Source Gather Data Analyze Data Machine Learning Deep Learning Deploy Model Maintain Model Python Data Science Stack Fabric for Deep Learning (FfDL) Mleap + PFA Scikit-LearnPandas Apache Spark Apache Spark Jupyter Model Asset eXchange Keras + Tensorflow CODAIT codait.org 10
  • 11. Model Asset eXchange 112018 / © 2018 IBM Corporation Enabling domain experts to use deep learning in the enterprise
  • 12. CODAIT: Enabling End-to-End AI in the Enterprise 122018 / © 2018 IBM Corporation Gather Data Analyze Data Machine Learning Deep Learning Deploy Model Maintain Model Python Data Science Stack Fabric for Deep Learning (FfDL) Mleap + PFA Scikit-LearnPandas Apache Spark Apache Spark Jupyter Model Asset eXchange Keras + Tensorflow
  • 13. Making AI as Ubiquitous as the Telephone 132018 / © 2018 IBM Corporation
  • 14. Q: What is deep learning? A: Machine learning using deep neural networks. 142018 / © 2018 IBM Corporation InceptionV3 Convolutional Neural Net (A “medium-sized” deep learning model) Image Source: https://github.com/tensorflow/models/blob/master/research/inception/g 3doc/inception_v3_architecture.png
  • 15. Characteristics of Deep Learning (1) 15 State-of-the-Art prediction quality in many domains – Image classification – Machine translation – Facial recognition – Time series prediction – Many more 2018 / © 2018 IBM Corporation
  • 16. Characteristics of Deep Learning (2) 16 Large, complex models – Model size generally determined by “how big a model can you fit on your device?” 2018 / © 2018 IBM Corporation Each box ≈ between 32 and 768 linear regression models
  • 17. Characteristics of Deep Learning (3) 17 Poorly understood today …even by experts – Why do the models converge? – Why do the models converge with low loss? – Why do the models generalize? 2018 / © 2018 IBM Corporation
  • 18. Focus of this Talk 18 Incorporating well- understood deep learning models into enterprise applications. 2018 / © 2018 IBM Corporation
  • 19. 192018 / © 2018 IBM Corporation Sounds easy!
  • 20. “cat” The Components of a Deep Learning Model 202018 / © 2018 IBM Corporation Dense (3×8) Dense (8×6) Input (3) Output (2)Dense (6×4) Dense (4×2) Neural Network Graph Weights (not to scale) Driver Program
  • 21. Example: Get an Image Classifier 21 Step 1: Find a suitable neural network graph. – Need to read some papers 2018 / © 2018 IBM Corporation
  • 22. Example: Get an Image Classifier 22 Step 2: Find code to generate the neural network graph 2018 / © 2018 IBM Corporation TensorFlow code to build ResNet50 neural network graph
  • 23. Example: Get an Image Classifier 23 Step 3: Find some pre-trained weights for your graph 2018 / © 2018 IBM Corporation Caffe2 ResNet50 model weights
  • 24. Example: Get an Image Classifier 24 Step 4: Find example code that performs model inference 2018 / © 2018 IBM Corporation TensorFlow code for training and batch inference on ResNet50
  • 25. Example: Get an Image Classifier 25 Step 5: Write your own code to perform model inference on one image at a time Step 6: Package your inference code, graph creation code, and pre- trained weights together Step 7: Deploy your package 2018 / © 2018 IBM Corporation
  • 26. Model Marketplaces 26 Collections of well- understood deep learning models Provide a central place to find known-good implementations of these models 2018 / © 2018 IBM Corporation
  • 27. IBM Model Asset eXchange MAX is a one-stop shop open source ecosystem for data scientists and AI developers to share and consume models that use machine learning engines, such as TensorFlow, PyTorch and Caffe2. It also provides a standard approach to classify, annotate, and deploy these models for prediction and inferencing. MAX https://developer.ibm.com/ code/exchanges/models/ 2018 / © 2018 IBM Corporation 27
  • 28. 282018 / © 2018 IBM Corporation Demo! https://developer.ibm.com/code/exchanges/models/
  • 29. Summary 29 Free, open-source models. Wide variety of domains. Multiple deep learning frameworks. Vetted and tested code and IP. Build and deploy a web service in 30 seconds. Start training on Watson Studio in minutes. 2018 / © 2018 IBM Corporation
  • 30. MAX: Future Plans 30 Many more models – Train with Watson Studio/DLaaS – Run inference on IBM infrastructure Revamped website Integration with Watson Catalog IBMer-uploaded models More IBM Code code patterns showing usage 2018 / © 2018 IBM Corporation https://developer.ibm.com/code/exchanges/models/
  • 31. But if you can’t wait MAX Models at DockerHub 2018 / © 2018 IBM Corporation MAX models are exposed as Docker containers, and published to DockerHub under CODAIT organization. 31 https://hub.docker.com/u/codait/dashboard/
  • 32. MAX and Container Services K8 Deployment Descriptor apiVersion: v1 kind: Pod metadata: name: image-caption-generator namespace: default labels: app: image-caption-generator spec: restartPolicy: Always containers: - env: name: image-caption-generator image: codait/max-image-caption-generator --- apiVersion: v1 kind: Service metadata: labels: app: image-caption-generator component: image-caption-generator name: image-caption-generator spec: ports: - name: http port: 5000 targetPort: 5000 selector: app: image-caption-generator sessionAffinity: None type: NodePort2018 / © 2018 IBM Corporation MAX models require a Kubernetes deployment descriptor to enable easy deployment in IBM Cloud Container Services. 32 Kubectl apply –f image-caption-generator.yaml
  • 33. IBM Cloud Container Service 2018 / © 2018 IBM Corporation IBM Cloud CLI IBM Cloud Container Service plug-in Kubernetes CLI (kubectl) Useful Commands Pointing Kubectl to IBM Cloud Container Service export KUBECONFIG=/Users/lresende/.bluemix/plugins/container- service/clusters/lresende-kubernetes/kube-config-hou02-lresende-kubernetes.yml Accessing Kubernetes dashboard via kubectl proxy kubectl config view -o jsonpath='{.users[0].user.auth-provider.config.id-token}' kubectl proxy http://localhost:8001/ui Deploying application kubectl apply –f image-caption-generator.yml Accessing application bx cs workers lresende-kubernetes kubectl describe service image-caption-generator curl -F "image=@assets/surfing.jpg" -X POST http://184.172.242.55:32229/model/predict curl -F "image=@/Users/lresende/Pictures/375337.jpg" -X POST http://184.172.242.55:32229/model/predict curl -F "image=@/Users/lresende/Pictures/362809.jpg" -X POST http://184.172.242.55:32229/model/predict
  • 34. Click to edit Master title style FfDL Fabric for Deep Learning 2018 / © 2018 IBM Corporation 34 FfDL provides a scalable, resilient, and fault tolerant deep-learning framework
  • 35. Fabric for Deep Learning https://github.com/IBM/FfDL 2018 / © 2018 IBM Corporation FfDL provides a scalable, resilient, and fault tolerant deep-learning framework FfDL Github Page https://github.com/IBM/FfDL FfDL dwOpen Page https://developer.ibm.com/code/open/projects/fa bric-for-deep-learning-ffdl/ FfDL Announcement Blog http://developer.ibm.com/code/2018/03/20/fabri c-for-deep-learning FfDL Technical Architecture Blog http://developer.ibm.com/code/2018/03/20/dem ocratize-ai-with-fabric-for-deep-learning Deep Learning as a Service within Watson Studio https://www.ibm.com/cloud/deep-learning Research paper: “Scalable Multi-Framework Management of Deep Learning Training Jobs” http://learningsys.org/nips17/assets/papers/pape r_29.pdf • Fabric for Deep Learning or FfDL (pronounced as ‘fiddle’) is an open source project which aims at making Deep Learning easily accessible to the people it matters the most i.e. Data Scientists, and AI developers. • FfDL Provides a consistent way to deploy, train and visualize Deep Learning jobs across multiple frameworks like TensorFlow, Caffe, PyTorch, Keras etc. • FfDL is being developed in close collaboration with IBM Research and IBM Watson. It forms the core of Watson`s Deep Learning service in open source. FfDL 35
  • 36. Fabric for Deep Learning https://github.com/IBM/FfDL FfDL is built using Microservices architecture on Kubernetes • FfDL platform uses a microservices architecture to offer resilience, scalability, multi-tenancy, and security without modifying the deep learning frameworks, and with no or minimal changes to model code. • FfDL control plane microservices are deployed as pods on Kubernetes to manage this cluster of GPU- and CPU-enabled machines effectively • Tested Platforms: Minikube, IBM Cloud Public, IBM Cloud Private, GPUs using both Kubernetes feature gate Accelerators and NVidia device plugins 362018 / © 2018 IBM Corporation Try FfDL/DLaaS https://ibm.biz/BdZtab
  • 37. source code training definition Auto-allocation means infrastructure is used only when needed Kubernetes container training artifacts compute cluster NVIDIA Tesla K80, P100, V100 Cloud Object Storage Training assets are managed and tracked. Access to elastic compute leveraging Kubernetes 372018 / © 2018 IBM Corporation
  • 38. NVIDIA GPUs Kubernetes container orchestration training runs containers server cluster dataset Cloud Object Storage Model training distributed across containers 382018 / © 2018 IBM Corporation
  • 39. 39 FfDL: Architecture 2018 / © 2018 IBM Corporation
  • 41. Click to edit Master title style Jupyter Enterprise Gateway 2018 / © 2018 IBM Corporation 41 Provides multi-tenant, scalable and secure remote Jupyter Notebook kernels
  • 43. Jupyter Notebooks © 2018 IBM Corporation 43 Notebooks are interactive computational environments, in which you can combine code execution, rich text, mathematics, plots and rich media.
  • 44. Jupyter Notebooks © 2018 IBM Corporation 44 • Notebook UI runs on the browser • The Notebook Server serves the ’Notebooks’ • Kernels interpret/execute cell contents – Are responsible for code execution – Abstracts different languages
  • 45. Building a Data Science Analytical Platform 45© 2018 IBM Corporation
  • 46. Building an Data Science Platform © 2018 IBM Corporation Large pool of shared computing resources • Enterprise Cloud, Public Cloud or Hybrid • Data in the cloud (Data Lakes/Object Storage) Distributed Consumers • Notebooks running local (users laptop) or as a service (e.g. Jupyter Hub) Different Resource Utilization Patterns • High number of idle resources
  • 47. Vanilla Jupyter Notebooks © 2018 IBM Corporation Gather Data Analyze Data Machine Learning Deep Learning Deploy Model Maintain Model Python Data Science Stack Fabric for Deep Learning (FfDL) Mleap + PFA Scikit-LearnPandas Apache Spark Apache Spark Jupyter Model Asset eXchange Keras + Tensorflow 47 8 8 8 8 0 10 20 30 40 50 60 70 80 4 Nodes 8 Nodes 12 Nodes 16 NodesMaxKernels(4GBHeap) Cluster Size (32GB Nodes) MAXIMUM NUMBER OF SIMULTANEOUS KERNELS Kernel Kernel Kernel Kernel Limitations of Jupyter Notebook Stack • Security limitations • Single user sharing the same privileges • Users can see and control each other process using Jupyter administrative utilities • Scalability limitations • Jupyter Kernels running as local process • Resources are limited by what is available on the one single node that runs all Kernels and associated Spark drivers Kernel
  • 48. Jupyter Enterprise Gateway © 2018 IBM Corporation Jupyter Enterprise Gateway at IBM Code https://developer.ibm.com/code/openprojects/jupyter-enterprise-gateway/ Jupyter Enterprise Gateway source code at GitHub https://github.com/jupyter-incubator/enterprise_gateway Jupyter Enterprise Gateway Documentation http://jupyter-enterprise-gateway.readthedocs.io/en/latest/ Supported Kernels Supported Platforms 49 A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across an Apache Spark or Kubernetes cluster for Enterprise/Cloud use cases Spectrum Conductor +
  • 49. Jupyter Enterprise Gateway © 2018 IBM Corporation Gather Data Analyze Data Machine Learning Deep Learning Deploy Model Maintain Model Python Data Science Stack Fabric for Deep Learning (FfDL) Mleap + PFA Scikit-LearnPandas Apache Spark Apache Spark Jupyter Model Asset eXchange Keras + Tensorflow 50 16 32 48 64 0 10 20 30 40 50 60 70 80 4 Nodes 8 Nodes 12 Nodes 16 NodesMaxKernels(4GBHeap) Cluster Size (32GB Nodes) MAXIMUM NUMBER OF SIMULTANEOUS KERNELS Kernel Kernel KernelKernel Optimized Resource Allocation – Utilize resources on all cluster nodes by running kernels as Spark applications in YARN Cluster Mode. – Pluggable architecture to enable support for additional Resource Managers Enhanced Security – End-to-End secure communications • Secure socket communications • Encrypted HTTP communication using SSL Multiuser support with user impersonation – Enhance security and sandboxing by enabling user impersonation when running kernels (using Kerberos). – Individual HDFS home folder for each notebook user. – Use the same user ID for notebook and batch jobs. KernelKernel Kernel
  • 50. Jupyter Enterprise Gateway – YARN © 2018 IBM Corporation 51 YARN Cluster YARN Workers Gateway Node Jupyter Enterprise Gateway • Multitenancy • Remote kernel lifecycle management via process proxies Spark Executors Spark Executors Spark Executors Yarn Container Jupyter Kernel Spark Driver Impersonation: Alice’s kernel runs under Alice’s user ID. Spark Executors Spark Executors Spark Executors Yarn Container Jupyter Kernel Spark Driver SecurityLayer nb2kg nb2kg Spark Executors Spark Executors Spark Executors Yarn Container Jupyter Kernel Spark Driver Bob Alice
  • 51. Enterprise Gateway & Kubernetes © 2018 IBM Corporation Supported Platforms Kernel Kernel Kernel Kernel Before Jupyter Enterprise Gateway … • Scalability limitations • Resources are limited and the amount required to all kernels needs to be allocated during Notebook Server pod creation. • Resources are limited by what is available on the one single node that runs all Kernels and associated Spark drivers Kernel KernelKernel
  • 52. Jupyter Enterprise Gateway - Kubernetes © 2018 IBM Corporation 53 Container images defined in kernelspec Community image Kernel Spark on K8 Kernel Distributed File System Vanilla Kernels Spark based kernels Gateway nb2kg nb2kg
  • 53. Summary 58© 2018 IBM Corporation
  • 54. Summary © 2018 IBM Corporation 59 • Model Asset Exchange • Curated set of models ready to use or embedded in your application or solution • Fabric for Deep Learning • Provides a consistent way for AI developers and Data Scientists to train their models • Jupyter Enterprise Gateway • Enables your Jupyter Notebook stack to scale in order to build Machine Learning and AI Models more resource effectively MAX https://developer.ibm.com/ code/exchanges/models/
  • 55. 60© 2018 IBM Corporation
  • 56. 61May 17, 2018 / © 2018 IBM Corporation
  • 57. 62© 2018 IBM Corporation