SlideShare a Scribd company logo
1 of 50
ELLEN FRIEDMAN BIG DATA LONDON
Principal Technologist 14 November 2018
7 Successful Habits for Data-Intensive
Applications in Production
2 © 2018 MapR Technologies, Inc. // MapR Confidential
Ellen Friedman, PhD
Principal Technologist, MapR Technologies
Committer Apache Drill & Apache Mahout projects
O’Reilly author
Email efriedman@mapr.com ellenf@apache.org
Twitter @Ellen_Friedman
Contact Information
3 © 2018 MapR Technologies, Inc. // MapR Confidential
.
Images © Friedman & Dunning
Image courtesy Mtell used with permission
Big Value from Big Data in Production across Diverse Businesses
Telecommunications
Smart-Metered Utilities
Heavy Industry
Financial
Transportation
Agritech
4 © 2018 MapR Technologies, Inc. // MapR Confidential
“The future is already here – it’s just
not evenly distributed.”
- William
Gibson
5 © 2018 MapR Technologies, Inc. // MapR Confidential
.
Some organizations more successful at getting large scale systems into production
• 2018 Gartner report stated only 17% of Hadoop-based systems were in production
vs
• Over 90% of our customers have large scale production systems
Why?
Value from Data-Intensive Applications in Production
Image © 2003 Ellen Friedman
6 © 2018 MapR Technologies, Inc. // MapR Confidential
1Data may be in production long before
code is complete
7 © 2018 MapR Technologies, Inc. // MapR Confidential
.
Labs in Canada froze blood samples for years in case they contain valuable information.
• Modern genetic techniques revealed key disease data
• Correlated with outcomes for the donor patients
• The data was preserved before the analysis was even
Real World Example: Biological Samples Contain Data
Image © 2003 Ellen Friedman
8 © 2018 MapR Technologies, Inc. // MapR Confidential
Are you asking the right question?
tnow
Hourly
clicks at
A B C
Which marketing email is most effective?
• Looks like C – Blue is best when you base the
question on data for highest hourly response
rate at a current point in time tnow
• But this is misleading. Why?
9 © 2018 MapR Technologies, Inc. // MapR Confidential
Better way to frame the question:
Which marketing email is most effective based
on response rate at time t after launch?
• Collect and retain data across same time
interval relative to launch
• Looks like B – Green is best performer and
C- Blue is least effective for response rate
Click
rate tnow
t
Click
rate
Time after launch
Launch Measure
A
B
C
Measuring performance at a
constant time after launch gives
consistent comparison
10 © 2018 MapR Technologies, Inc. // MapR Confidential
Better way to frame the question:
Which marketing email is most effective based
on response rate at time t after launch?
• Collect and retain data across same time
interval relative to launch
• Looks like B – Green is best performer and
C- Blue is least effective for response rate
Click
rate tnow
t
Click
rate
Time after launch
Launch Measure
A
B
C
But that’s hard after we have over-
written the old data with current data
11 © 2018 MapR Technologies, Inc. // MapR Confidential
Spot the Difference?
GET	photo.jpg	HTTP/1.1
Host:	lh4.googleusercontent
User-agent:	Mozilla/5.0	(Ma
Accept:	image/png,image/*
Accept-language:	en-US,en
Accept-encoding:	gzip,	defl
Referer:	https://www.google
Connection:	keep-alive
If-none-match:	"v9”
Cache-control:	max-age=0
GET	cc/borken.json	HTTP/1.1
host:	c.qrs.my
user-agent:	Mozilla/4.0	(co
accept:	application/json,	t
accept-language:	en-US,en
accept-encoding:	gzip,	defl
referer:	none
connection:	keep-alive
if-none-match:	"v9”
cache-control:	max-age=0
Attacker requestReal request
12 © 2018 MapR Technologies, Inc. // MapR Confidential
Spot the Difference?
GET	photo.jpg	HTTP/1.1
Host:	lh4.googleusercontent
User-agent:	Mozilla/5.0	(Ma
Accept:	image/png,image/*
Accept-language:	en-US,en
Accept-encoding:	gzip,	defl
Referer:	https://www.google
Connection:	keep-alive
If-none-match:	"v9”
Cache-control:	max-age=0
GET	cc/borken.json	HTTP/1.1
host:	c.qrs.my
user-agent:	Mozilla/4.0	(co
accept:	application/json,	t
accept-language:	en-US,en
accept-encoding:	gzip,	defl
referer:	none
connection:	keep-alive
if-none-match:	"v9”
cache-control:	max-age=0
Attacker requestReal request
13 © 2018 MapR Technologies, Inc. // MapR Confidential
Security expert at a bank preserved headers for web site requests
Detected anomaly in headers for the attackers vs normal (real) requests
But how would you know what data to preserve?
• Pattern of behavior for attackers was allowable for headers
• It was not predictable: but it was different
Domain Knowledge Matters: Detecting Security Attacks
14 © 2018 MapR Technologies, Inc. // MapR Confidential
2End-to-end design for production
15 © 2018 MapR Technologies, Inc. // MapR Confidential
Do you have a good fit between application, its SLAs and practical business goal?
Is there a way to take action on the output of the application?
§ Note: a report is not an action
Do you have real advance planning for production?
§ Note: slapping an SLA onto a complex application & tossing it to IT with hazily
defined critical tasks is not production planning
The missing bits usually aren’t in the application itself
16 © 2018 MapR Technologies, Inc. // MapR Confidential
.
Remember: IT doesn’t have a magic wand…
17 © 2018 MapR Technologies, Inc. // MapR Confidential
.
Remember: IT doesn’t have a magic wand…
18 © 2018 MapR Technologies, Inc. // MapR Confidential
Build a Data Fabric
Flexibility & agility to respond as life changes
19 © 2018 MapR Technologies, Inc. // MapR Confidential
A Global Data Fabric: Edge to On-Premises to Cloud
Data where you want it, compute power where you need it.
20 © 2018 MapR Technologies, Inc. // MapR Confidential
.
Comprehensive View of Data vs Isolated Services
Commonality
Isolation
Both are needed, for different purposes. Best practice achieves a balance.
21 © 2018 MapR Technologies, Inc. // MapR Confidential
3Orchestration of applications and
orchestration of data
22 © 2018 MapR Technologies, Inc. // MapR Confidential
Containerized applications run in different environments on same cluster at same time
Kubernetes is emerging as the leader in orchestration of containerized applications
• You specify what needs to be done, Kubernetes arranges it by running containers
• Kubernetes allows access to services by name
Containerization: flexibility, convenience, predictability
23 © 2018 MapR Technologies, Inc. // MapR Confidential
CNCF reported
• nearly 70% of organizations surveyed use Kubernetes to manage containers
• but the #1 issue Kubernetes users have is storage
You don’t want to store state in containers – defeats flexibility
Biggest Challenge with Kubernetes is Data Persistence
24 © 2018 MapR Technologies, Inc. // MapR Confidential
CNCF reported
• nearly 70% of organizations surveyed use Kubernetes to manage containers
• but the #1 issue Kubernetes users have is storage
You don’t want to store state in containers – defeats flexibility
How do you get the benefits of containerization without being limited to
stateless applications?
Biggest Challenge with Kubernetes is Data Persistence
25 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
Kubernetes
26 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
Kubernetes
rpc
stream
LogFile
27 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
Kubernetes
rpc
stream
LogFile
Data Platform
28 © 2018 MapR Technologies, Inc. // MapR Confidential
Kubernetes for orchestration of applications
+
Dataware for orchestration of data
A Powerful Combination
29 © 2018 MapR Technologies, Inc. // MapR Confidential
4Simplicity is golden
You should not need an army to administer
a cluster
30 © 2018 MapR Technologies, Inc. // MapR Confidential
.
If your system has lots of work-arounds, think again…
31 © 2018 MapR Technologies, Inc. // MapR Confidential
.
If your system has lots of work-arounds, think again…
32 © 2018 MapR Technologies, Inc. // MapR Confidential
Many logistics and processes should be handled by the platform, not by developers at
the application level
• Separation of concerns for system administrators vs developers/data scientists
• More efficient, less risk of error
• Ease of administration
Data Platform Should Simplify
33 © 2018 MapR Technologies, Inc. // MapR Confidential
Orchestrate Data with Dataware
Legacy Applications Big Data 1.0 Applications Next-Gen Applications
MapR Converged Data Platform
High Availability Real Time Unified Security Multi-tenancy Disaster Recovery Global Namespace
Real-Time NoQL Database Stream TransportWeb-Scale Storage
MapR is more than just data storage – it’s like Kubernetes for data.
It’s dataware.
34 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Volume: Directory with Special Management Capabilities
Cluster
Volume mount point
Directories
Files
Streams
Table
Volumes used for easy control of access, multitenancy, data locality & DR
35 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR can run huge numbers of applications on same cluster
• Great performance, optimized resources
• Comprehensive view of data & better collaboration
• Ease of administration
Is your system easy to administer?
We’ve seen a large retail customer manage a cluster >1000 nodes with
just 3 administrators
36 © 2018 MapR Technologies, Inc. // MapR Confidential
Open APIs
Multicloud & Hybrid Cloud Strategy with MapR:
Application
• Unified Security Model
• Data access decoupled from physical
storage location. Globally.
• No lock-in to proprietary APIs
• Full openness
• Data made portable
API Connector
✓
GLOBAL DATA MANAGEMENT
Edge Private Cloud
On Premise
Public Cloud Public Cloud Public Cloud
Uniform computing environment
everywhere
37 © 2018 MapR Technologies, Inc. // MapR Confidential
5Build real multitenancy
38 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Volumes: Easy Data Management
• Multiple volumes span a cluster
• Files, tables, streams in same volume
• Fine-grained control of who has access
• Basis for mirroring, snapshots
• Great advantage for multi-tenancy
39 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Volumes: Control Data Locality
• Place data on specialized hardware
(such as GPUs)
• Meet compliance requirements
• Better optimization of resources
• Great advantage for AI/ machine
learning
40 © 2018 MapR Technologies, Inc. // MapR Confidential
A simpler system is more cost effective, with better performance
• Remember:
Every HBase commit requires a round trip to the namenode if run on HDFS
MapR has no namenode – avoids the problem
• It isn’t magic. It’s just real multitenancy.
Does your system avoid unnecessary sprawl?
We’ve seen a customer collapse 17 HBase clusters in AWS to 1 cluster
running five nodes HBase on MapR with great performance
41 © 2018 MapR Technologies, Inc. // MapR Confidential
6Streaming architecture provides
flexibility
42 © 2018 MapR Technologies, Inc. // MapR Confidential
Stream Transport to Decouple Producers & Consumers
P
P
P
C
C
C
Transport Processing
Kafka /
MapR Streams
“Streaming Microservices” by Ted Dunning & Ellen Friedman, in Encyclopedia of Big Data Technologies, Sherif Sakr and Albert Zomaya, editors, © 2018
(Springer International Publishing)
ebook Streaming Architecture by Ted Dunning & Ellen Friedman © 2016 ( O’Reilly Media), chapter 3:
https://mapr.com/ebooks/streaming-architecture/chapter-03-streaming-platform-for-microservices.html
43 © 2018 MapR Technologies, Inc. // MapR Confidential
Stream-first Architecture Supports Microservices
Real-time
analytics
EMR
Patient Facilities
management
Insurance
audit
A
B
Medical tests
C
Medical test
results
With the right messaging tool you
support multiple classes of use cases
44 © 2018 MapR Technologies, Inc. // MapR Confidential
ResultsRendezvous
Rendezvous Architecture is based on streaming microservices
Scores
ArchiveDecoy
m1
m2
m3
Features /
profiles
InputRaw
Rendezvous Architecture described in:
- Machine Learning Logistics book by Ted Dunning & Ellen Friedman © 2018 (O’Reilly)
- “Rendezvous Architecture” by Ted Dunning & Ellen Friedman, chapter in Encyclopedia of Big Data Technologies. Sherif Sakr and
Albert Zomaya, editors. Springer International Publishing, © 2018 in press.
45 © 2018 MapR Technologies, Inc. // MapR Confidential
7Build a data-aware culture.
That also gives you a production-ready culture.
46 © 2018 MapR Technologies, Inc. // MapR Confidential
DataOps: Brings Flexibility & Focus
• Cross-functional teams cut
across skill guilds
• Better communication &
shared goal keep effort
focused and flexible
Refs:
“DevOps, DataOps & Machine Learning”
by Terry McCann
http://www.hyperbi.co.uk/devops-dataops-
and-machine-learning/
“How to Manage a DataOps Team” by E.
Friedman in RTInsights
https://www.rtinsights.com/how-manage-
dataops-team/
Developer
Data
Scientist /
Data
Engineer
Ops NetworkSite
Reliability
QA
Security
Project 1
Project 2
Project 3
Kubernetes
Data platform
Storage
Required team skill
Occasional help
47 © 2018 MapR Technologies, Inc. // MapR Confidential
AI & Analytics in Production:
How to Make It Work
Free book signing with authors Ted Dunning & Ellen Friedman
MapR stand #145
Wed 1:00 pm – 1:45 pm
Download free pdf via MapR:
https://mapr.com/ebook/ai-and-analytics-in-production/
48 © 2018 MapR Technologies, Inc. // MapR Confidential
Please support women in tech – help build
girls’ dreams of what they can accomplish
© Ellen Friedman 2015#womenintech #datawomen
49 © 2018 MapR Technologies, Inc. // MapR Confidential
Thank you !
50 © 2018 MapR Technologies, Inc. // MapR Confidential
Ellen Friedman, PhD
Principal Technologist, MapR Technologies
Committer Apache Drill & Apache Mahout projects
O’Reilly author
Email efriedman@mapr.com ellenf@apache.org
Twitter @Ellen_Friedman
Contact Information

More Related Content

What's hot

Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
MapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn GloballyMapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn Globallyridhav
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareMapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscapeMapR Technologies
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...MapR Technologies
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
Real-time data integration to the cloud
Real-time data integration to the cloudReal-time data integration to the cloud
Real-time data integration to the cloudSankar Nagarajan
 
HPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the WorldHPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the WorldHPCC Systems
 
Proof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-seriesProof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-seriesDataWorks Summit
 
GE Digital Predix. Mario Testino, General Electrics
GE Digital Predix. Mario Testino, General ElectricsGE Digital Predix. Mario Testino, General Electrics
GE Digital Predix. Mario Testino, General ElectricsData Driven Innovation
 
Augmented OLAP for Big Data Analytics
Augmented OLAP for Big Data AnalyticsAugmented OLAP for Big Data Analytics
Augmented OLAP for Big Data AnalyticsTyler Wishnoff
 
Private Cloud Delivers Big Data in Oil & Gas v4
Private Cloud Delivers Big Data in Oil & Gas v4Private Cloud Delivers Big Data in Oil & Gas v4
Private Cloud Delivers Big Data in Oil & Gas v4Andy Moore
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksJustin Brandenburg
 
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunenMeetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunenDigipolis Antwerpen
 

What's hot (20)

Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
MapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn GloballyMapR Edge : Act Locally Learn Globally
MapR Edge : Act Locally Learn Globally
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
I'm being followed by drones
I'm being followed by dronesI'm being followed by drones
I'm being followed by drones
 
Real-time data integration to the cloud
Real-time data integration to the cloudReal-time data integration to the cloud
Real-time data integration to the cloud
 
Industrial Internet
Industrial InternetIndustrial Internet
Industrial Internet
 
Building a Data Analytics PaaS for Smart Cities
Building a Data Analytics PaaS for Smart CitiesBuilding a Data Analytics PaaS for Smart Cities
Building a Data Analytics PaaS for Smart Cities
 
HPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the WorldHPCC Systems - Using Big Data to Help Feed the World
HPCC Systems - Using Big Data to Help Feed the World
 
Proof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-seriesProof of Concept for Hadoop: storage and analytics of electrical time-series
Proof of Concept for Hadoop: storage and analytics of electrical time-series
 
GE Digital Predix. Mario Testino, General Electrics
GE Digital Predix. Mario Testino, General ElectricsGE Digital Predix. Mario Testino, General Electrics
GE Digital Predix. Mario Testino, General Electrics
 
Augmented OLAP for Big Data Analytics
Augmented OLAP for Big Data AnalyticsAugmented OLAP for Big Data Analytics
Augmented OLAP for Big Data Analytics
 
Smart App@Pivotal by Dat Tran
Smart App@Pivotal by Dat TranSmart App@Pivotal by Dat Tran
Smart App@Pivotal by Dat Tran
 
Private Cloud Delivers Big Data in Oil & Gas v4
Private Cloud Delivers Big Data in Oil & Gas v4Private Cloud Delivers Big Data in Oil & Gas v4
Private Cloud Delivers Big Data in Oil & Gas v4
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
 
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunenMeetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
 

Similar to Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRODUCTION

DataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven OrganizationsDataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven OrganizationsEllen Friedman
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Matt Stubbs
 
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Ellen Friedman
 
Container and Kubernetes without limits
Container and Kubernetes without limitsContainer and Kubernetes without limits
Container and Kubernetes without limitsAntje Barth
 
Who Broke My Cloud? SaaS Monitoring Best Practices
Who Broke My Cloud? SaaS Monitoring Best PracticesWho Broke My Cloud? SaaS Monitoring Best Practices
Who Broke My Cloud? SaaS Monitoring Best PracticesThousandEyes
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDBMongoDB
 
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise WeAreEsynergy
 
Containers and Kubernetes without limits
Containers and Kubernetes without limitsContainers and Kubernetes without limits
Containers and Kubernetes without limitsAntje Barth
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...DataWorks Summit
 
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsBig Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsMatt Stubbs
 
How to Get Going with Kubernetes
How to Get Going with KubernetesHow to Get Going with Kubernetes
How to Get Going with KubernetesTed Dunning
 
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETESBig Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETESMatt Stubbs
 
Cheryl Wiebe - Advanced Analytics in the Industrial World
Cheryl Wiebe - Advanced Analytics in the Industrial WorldCheryl Wiebe - Advanced Analytics in the Industrial World
Cheryl Wiebe - Advanced Analytics in the Industrial WorldRehgan Avon
 
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughtonReal-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughtonSynerzip
 
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...Matt Stubbs
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...ervogler
 
Postgres Vision 2018: The Pragmatic Cloud
Postgres Vision 2018:  The Pragmatic CloudPostgres Vision 2018:  The Pragmatic Cloud
Postgres Vision 2018: The Pragmatic CloudEDB
 
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
Digital Transformation; Digital Twins for Delivering Business Value in IIoTDigital Transformation; Digital Twins for Delivering Business Value in IIoT
Digital Transformation; Digital Twins for Delivering Business Value in IIoTThe Hive
 

Similar to Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRODUCTION (20)

DataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven OrganizationsDataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven Organizations
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
 
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018
 
Container and Kubernetes without limits
Container and Kubernetes without limitsContainer and Kubernetes without limits
Container and Kubernetes without limits
 
Who Broke My Cloud? SaaS Monitoring Best Practices
Who Broke My Cloud? SaaS Monitoring Best PracticesWho Broke My Cloud? SaaS Monitoring Best Practices
Who Broke My Cloud? SaaS Monitoring Best Practices
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDB
 
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
 
Containers and Kubernetes without limits
Containers and Kubernetes without limitsContainers and Kubernetes without limits
Containers and Kubernetes without limits
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
 
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsBig Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business Solutions
 
How to Get Going with Kubernetes
How to Get Going with KubernetesHow to Get Going with Kubernetes
How to Get Going with Kubernetes
 
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETESBig Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
 
Cheryl Wiebe - Advanced Analytics in the Industrial World
Cheryl Wiebe - Advanced Analytics in the Industrial WorldCheryl Wiebe - Advanced Analytics in the Industrial World
Cheryl Wiebe - Advanced Analytics in the Industrial World
 
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughtonReal-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
 
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
Big Data LDN 2017: The Intelligent Edge: What Data-driven Means in the Age of...
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
 
Hadoop In The Real World
Hadoop In The Real WorldHadoop In The Real World
Hadoop In The Real World
 
Postgres Vision 2018: The Pragmatic Cloud
Postgres Vision 2018:  The Pragmatic CloudPostgres Vision 2018:  The Pragmatic Cloud
Postgres Vision 2018: The Pragmatic Cloud
 
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
Digital Transformation; Digital Twins for Delivering Business Value in IIoTDigital Transformation; Digital Twins for Delivering Business Value in IIoT
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
 

More from Matt Stubbs

Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Blueprint Series: Banking In The Cloud – Ultra-high Reliability ArchitecturesBlueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Blueprint Series: Banking In The Cloud – Ultra-high Reliability ArchitecturesMatt Stubbs
 
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...Matt Stubbs
 
Blueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Expedia Partner Solutions, Data PlatformBlueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Expedia Partner Solutions, Data PlatformMatt Stubbs
 
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...Matt Stubbs
 
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.Matt Stubbs
 
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEBig Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEMatt Stubbs
 
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQLBig Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQLMatt Stubbs
 
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTSBig Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTSMatt Stubbs
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Matt Stubbs
 
Big Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: AI VS. GDPRBig Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: AI VS. GDPRMatt Stubbs
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Matt Stubbs
 
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...Matt Stubbs
 
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...Matt Stubbs
 
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...Matt Stubbs
 
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICSBig Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICSMatt Stubbs
 
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSEBig Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSEMatt Stubbs
 
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNINGBig Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNINGMatt Stubbs
 
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...Matt Stubbs
 
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...Matt Stubbs
 
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATEBig Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATEMatt Stubbs
 

More from Matt Stubbs (20)

Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Blueprint Series: Banking In The Cloud – Ultra-high Reliability ArchitecturesBlueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
 
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
 
Blueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Expedia Partner Solutions, Data PlatformBlueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Expedia Partner Solutions, Data Platform
 
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
 
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
 
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEBig Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
 
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQLBig Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
 
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTSBig Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
 
Big Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: AI VS. GDPRBig Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: AI VS. GDPR
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
 
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
 
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
 
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
 
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICSBig Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
 
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSEBig Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
 
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNINGBig Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
 
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
 
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
 
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATEBig Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
 

Recently uploaded

Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 

Recently uploaded (20)

Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 

Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRODUCTION

  • 1. ELLEN FRIEDMAN BIG DATA LONDON Principal Technologist 14 November 2018 7 Successful Habits for Data-Intensive Applications in Production
  • 2. 2 © 2018 MapR Technologies, Inc. // MapR Confidential Ellen Friedman, PhD Principal Technologist, MapR Technologies Committer Apache Drill & Apache Mahout projects O’Reilly author Email efriedman@mapr.com ellenf@apache.org Twitter @Ellen_Friedman Contact Information
  • 3. 3 © 2018 MapR Technologies, Inc. // MapR Confidential . Images © Friedman & Dunning Image courtesy Mtell used with permission Big Value from Big Data in Production across Diverse Businesses Telecommunications Smart-Metered Utilities Heavy Industry Financial Transportation Agritech
  • 4. 4 © 2018 MapR Technologies, Inc. // MapR Confidential “The future is already here – it’s just not evenly distributed.” - William Gibson
  • 5. 5 © 2018 MapR Technologies, Inc. // MapR Confidential . Some organizations more successful at getting large scale systems into production • 2018 Gartner report stated only 17% of Hadoop-based systems were in production vs • Over 90% of our customers have large scale production systems Why? Value from Data-Intensive Applications in Production Image © 2003 Ellen Friedman
  • 6. 6 © 2018 MapR Technologies, Inc. // MapR Confidential 1Data may be in production long before code is complete
  • 7. 7 © 2018 MapR Technologies, Inc. // MapR Confidential . Labs in Canada froze blood samples for years in case they contain valuable information. • Modern genetic techniques revealed key disease data • Correlated with outcomes for the donor patients • The data was preserved before the analysis was even Real World Example: Biological Samples Contain Data Image © 2003 Ellen Friedman
  • 8. 8 © 2018 MapR Technologies, Inc. // MapR Confidential Are you asking the right question? tnow Hourly clicks at A B C Which marketing email is most effective? • Looks like C – Blue is best when you base the question on data for highest hourly response rate at a current point in time tnow • But this is misleading. Why?
  • 9. 9 © 2018 MapR Technologies, Inc. // MapR Confidential Better way to frame the question: Which marketing email is most effective based on response rate at time t after launch? • Collect and retain data across same time interval relative to launch • Looks like B – Green is best performer and C- Blue is least effective for response rate Click rate tnow t Click rate Time after launch Launch Measure A B C Measuring performance at a constant time after launch gives consistent comparison
  • 10. 10 © 2018 MapR Technologies, Inc. // MapR Confidential Better way to frame the question: Which marketing email is most effective based on response rate at time t after launch? • Collect and retain data across same time interval relative to launch • Looks like B – Green is best performer and C- Blue is least effective for response rate Click rate tnow t Click rate Time after launch Launch Measure A B C But that’s hard after we have over- written the old data with current data
  • 11. 11 © 2018 MapR Technologies, Inc. // MapR Confidential Spot the Difference? GET photo.jpg HTTP/1.1 Host: lh4.googleusercontent User-agent: Mozilla/5.0 (Ma Accept: image/png,image/* Accept-language: en-US,en Accept-encoding: gzip, defl Referer: https://www.google Connection: keep-alive If-none-match: "v9” Cache-control: max-age=0 GET cc/borken.json HTTP/1.1 host: c.qrs.my user-agent: Mozilla/4.0 (co accept: application/json, t accept-language: en-US,en accept-encoding: gzip, defl referer: none connection: keep-alive if-none-match: "v9” cache-control: max-age=0 Attacker requestReal request
  • 12. 12 © 2018 MapR Technologies, Inc. // MapR Confidential Spot the Difference? GET photo.jpg HTTP/1.1 Host: lh4.googleusercontent User-agent: Mozilla/5.0 (Ma Accept: image/png,image/* Accept-language: en-US,en Accept-encoding: gzip, defl Referer: https://www.google Connection: keep-alive If-none-match: "v9” Cache-control: max-age=0 GET cc/borken.json HTTP/1.1 host: c.qrs.my user-agent: Mozilla/4.0 (co accept: application/json, t accept-language: en-US,en accept-encoding: gzip, defl referer: none connection: keep-alive if-none-match: "v9” cache-control: max-age=0 Attacker requestReal request
  • 13. 13 © 2018 MapR Technologies, Inc. // MapR Confidential Security expert at a bank preserved headers for web site requests Detected anomaly in headers for the attackers vs normal (real) requests But how would you know what data to preserve? • Pattern of behavior for attackers was allowable for headers • It was not predictable: but it was different Domain Knowledge Matters: Detecting Security Attacks
  • 14. 14 © 2018 MapR Technologies, Inc. // MapR Confidential 2End-to-end design for production
  • 15. 15 © 2018 MapR Technologies, Inc. // MapR Confidential Do you have a good fit between application, its SLAs and practical business goal? Is there a way to take action on the output of the application? § Note: a report is not an action Do you have real advance planning for production? § Note: slapping an SLA onto a complex application & tossing it to IT with hazily defined critical tasks is not production planning The missing bits usually aren’t in the application itself
  • 16. 16 © 2018 MapR Technologies, Inc. // MapR Confidential . Remember: IT doesn’t have a magic wand…
  • 17. 17 © 2018 MapR Technologies, Inc. // MapR Confidential . Remember: IT doesn’t have a magic wand…
  • 18. 18 © 2018 MapR Technologies, Inc. // MapR Confidential Build a Data Fabric Flexibility & agility to respond as life changes
  • 19. 19 © 2018 MapR Technologies, Inc. // MapR Confidential A Global Data Fabric: Edge to On-Premises to Cloud Data where you want it, compute power where you need it.
  • 20. 20 © 2018 MapR Technologies, Inc. // MapR Confidential . Comprehensive View of Data vs Isolated Services Commonality Isolation Both are needed, for different purposes. Best practice achieves a balance.
  • 21. 21 © 2018 MapR Technologies, Inc. // MapR Confidential 3Orchestration of applications and orchestration of data
  • 22. 22 © 2018 MapR Technologies, Inc. // MapR Confidential Containerized applications run in different environments on same cluster at same time Kubernetes is emerging as the leader in orchestration of containerized applications • You specify what needs to be done, Kubernetes arranges it by running containers • Kubernetes allows access to services by name Containerization: flexibility, convenience, predictability
  • 23. 23 © 2018 MapR Technologies, Inc. // MapR Confidential CNCF reported • nearly 70% of organizations surveyed use Kubernetes to manage containers • but the #1 issue Kubernetes users have is storage You don’t want to store state in containers – defeats flexibility Biggest Challenge with Kubernetes is Data Persistence
  • 24. 24 © 2018 MapR Technologies, Inc. // MapR Confidential CNCF reported • nearly 70% of organizations surveyed use Kubernetes to manage containers • but the #1 issue Kubernetes users have is storage You don’t want to store state in containers – defeats flexibility How do you get the benefits of containerization without being limited to stateless applications? Biggest Challenge with Kubernetes is Data Persistence
  • 25. 25 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes
  • 26. 26 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes rpc stream LogFile
  • 27. 27 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes rpc stream LogFile Data Platform
  • 28. 28 © 2018 MapR Technologies, Inc. // MapR Confidential Kubernetes for orchestration of applications + Dataware for orchestration of data A Powerful Combination
  • 29. 29 © 2018 MapR Technologies, Inc. // MapR Confidential 4Simplicity is golden You should not need an army to administer a cluster
  • 30. 30 © 2018 MapR Technologies, Inc. // MapR Confidential . If your system has lots of work-arounds, think again…
  • 31. 31 © 2018 MapR Technologies, Inc. // MapR Confidential . If your system has lots of work-arounds, think again…
  • 32. 32 © 2018 MapR Technologies, Inc. // MapR Confidential Many logistics and processes should be handled by the platform, not by developers at the application level • Separation of concerns for system administrators vs developers/data scientists • More efficient, less risk of error • Ease of administration Data Platform Should Simplify
  • 33. 33 © 2018 MapR Technologies, Inc. // MapR Confidential Orchestrate Data with Dataware Legacy Applications Big Data 1.0 Applications Next-Gen Applications MapR Converged Data Platform High Availability Real Time Unified Security Multi-tenancy Disaster Recovery Global Namespace Real-Time NoQL Database Stream TransportWeb-Scale Storage MapR is more than just data storage – it’s like Kubernetes for data. It’s dataware.
  • 34. 34 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Volume: Directory with Special Management Capabilities Cluster Volume mount point Directories Files Streams Table Volumes used for easy control of access, multitenancy, data locality & DR
  • 35. 35 © 2018 MapR Technologies, Inc. // MapR Confidential MapR can run huge numbers of applications on same cluster • Great performance, optimized resources • Comprehensive view of data & better collaboration • Ease of administration Is your system easy to administer? We’ve seen a large retail customer manage a cluster >1000 nodes with just 3 administrators
  • 36. 36 © 2018 MapR Technologies, Inc. // MapR Confidential Open APIs Multicloud & Hybrid Cloud Strategy with MapR: Application • Unified Security Model • Data access decoupled from physical storage location. Globally. • No lock-in to proprietary APIs • Full openness • Data made portable API Connector ✓ GLOBAL DATA MANAGEMENT Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud Uniform computing environment everywhere
  • 37. 37 © 2018 MapR Technologies, Inc. // MapR Confidential 5Build real multitenancy
  • 38. 38 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Volumes: Easy Data Management • Multiple volumes span a cluster • Files, tables, streams in same volume • Fine-grained control of who has access • Basis for mirroring, snapshots • Great advantage for multi-tenancy
  • 39. 39 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Volumes: Control Data Locality • Place data on specialized hardware (such as GPUs) • Meet compliance requirements • Better optimization of resources • Great advantage for AI/ machine learning
  • 40. 40 © 2018 MapR Technologies, Inc. // MapR Confidential A simpler system is more cost effective, with better performance • Remember: Every HBase commit requires a round trip to the namenode if run on HDFS MapR has no namenode – avoids the problem • It isn’t magic. It’s just real multitenancy. Does your system avoid unnecessary sprawl? We’ve seen a customer collapse 17 HBase clusters in AWS to 1 cluster running five nodes HBase on MapR with great performance
  • 41. 41 © 2018 MapR Technologies, Inc. // MapR Confidential 6Streaming architecture provides flexibility
  • 42. 42 © 2018 MapR Technologies, Inc. // MapR Confidential Stream Transport to Decouple Producers & Consumers P P P C C C Transport Processing Kafka / MapR Streams “Streaming Microservices” by Ted Dunning & Ellen Friedman, in Encyclopedia of Big Data Technologies, Sherif Sakr and Albert Zomaya, editors, © 2018 (Springer International Publishing) ebook Streaming Architecture by Ted Dunning & Ellen Friedman © 2016 ( O’Reilly Media), chapter 3: https://mapr.com/ebooks/streaming-architecture/chapter-03-streaming-platform-for-microservices.html
  • 43. 43 © 2018 MapR Technologies, Inc. // MapR Confidential Stream-first Architecture Supports Microservices Real-time analytics EMR Patient Facilities management Insurance audit A B Medical tests C Medical test results With the right messaging tool you support multiple classes of use cases
  • 44. 44 © 2018 MapR Technologies, Inc. // MapR Confidential ResultsRendezvous Rendezvous Architecture is based on streaming microservices Scores ArchiveDecoy m1 m2 m3 Features / profiles InputRaw Rendezvous Architecture described in: - Machine Learning Logistics book by Ted Dunning & Ellen Friedman © 2018 (O’Reilly) - “Rendezvous Architecture” by Ted Dunning & Ellen Friedman, chapter in Encyclopedia of Big Data Technologies. Sherif Sakr and Albert Zomaya, editors. Springer International Publishing, © 2018 in press.
  • 45. 45 © 2018 MapR Technologies, Inc. // MapR Confidential 7Build a data-aware culture. That also gives you a production-ready culture.
  • 46. 46 © 2018 MapR Technologies, Inc. // MapR Confidential DataOps: Brings Flexibility & Focus • Cross-functional teams cut across skill guilds • Better communication & shared goal keep effort focused and flexible Refs: “DevOps, DataOps & Machine Learning” by Terry McCann http://www.hyperbi.co.uk/devops-dataops- and-machine-learning/ “How to Manage a DataOps Team” by E. Friedman in RTInsights https://www.rtinsights.com/how-manage- dataops-team/ Developer Data Scientist / Data Engineer Ops NetworkSite Reliability QA Security Project 1 Project 2 Project 3 Kubernetes Data platform Storage Required team skill Occasional help
  • 47. 47 © 2018 MapR Technologies, Inc. // MapR Confidential AI & Analytics in Production: How to Make It Work Free book signing with authors Ted Dunning & Ellen Friedman MapR stand #145 Wed 1:00 pm – 1:45 pm Download free pdf via MapR: https://mapr.com/ebook/ai-and-analytics-in-production/
  • 48. 48 © 2018 MapR Technologies, Inc. // MapR Confidential Please support women in tech – help build girls’ dreams of what they can accomplish © Ellen Friedman 2015#womenintech #datawomen
  • 49. 49 © 2018 MapR Technologies, Inc. // MapR Confidential Thank you !
  • 50. 50 © 2018 MapR Technologies, Inc. // MapR Confidential Ellen Friedman, PhD Principal Technologist, MapR Technologies Committer Apache Drill & Apache Mahout projects O’Reilly author Email efriedman@mapr.com ellenf@apache.org Twitter @Ellen_Friedman Contact Information