REAL TIME STREAMING ANALYTICS
@ FORD
June 13, 2017
Agenda
•Original Problem Statement
•Architecture Components
•Data Journey
•Challenges
•Live Demo – Streaming from Dearborn
•RTSA RoadMap & Vision
Product Vision / Mission Statement
•Experiments (BDD 2.0)
• No platform to do ‘Streaming’ Experiments
• How do we enable ‘Self-Service’ Streaming?
•Utility Ingestion
• Existing Storm solution would not scale
operationally the way it had been implemented.
• Today applications develop their own one-off
ingestion solutions to deal with proxy and
firewall rules. How do we reduce the surface
area that is exposed while handling multiple
types of ingest?
SCA-V / BDD BUSINESS VALUE
BDD (Big Data Drive) drives value across the enterprise today and in the
future
Pillar 1
Collection
Pillar 2
Configuration
Pillar 3
Edge Analytics
Enables
• Off cycle credit validation
• Intelligent Customer Interactions
• Vehicle performance insights
• Customer specific city solutions
• Fleet based telematics
• Warranty reduction across fleets
• Powertrain fuel efficiency improvement
• Automotive cybersecurity
• High-touch customer / dealer engagement
• Product feature validation
• Vehicle feature deployment
• Product development lifecycle reduction
• Vehicle diagnostic and prognostic enhancements
[Diagram: Multi-Platform Data and Analytics Ecosystem. Labels: SCA-V (Single Complete Actionable Vehicle), SCA-C (Single Complete Actionable Customer), other, Landing Zone, Discovery Zone, Data Supply Chain, Data and Analytics Ecosystem.]
• Development leverages the product team approach which promotes cross-
functional partnerships in FordLabs, PD, IT and GDI&A
• Developed the first edge computing platform that emulates the fully
networked vehicle 1 and 2 (FNV-1/FNV-2) and provides production-grade,
web-based software to support this vehicle platform
• Created the first real-time streaming application in the enterprise
• Represents a significant shift toward data-driven decision making by
leveraging rich, connected vehicle data. The solution includes Natural
Language Search, Real Time Streaming, vehicle architecture agnosticism,
software deployable anywhere (ePID2.0, TCU, Sync, ECG), and rapid
vehicle data validation processes
• The platform can accommodate a diverse set of vehicles across the fleet
With BDD, we created a cloud agnostic Ford owned and managed
real time streaming solution
BDD 2.0 ACCOMPLISHMENTS: A THIN SLICE
Real Time Streaming Analytics - Conceptual
Real Time streaming is an incremental capability over traditional batch processing to
ingest, transform and score individual streams of real time data
Lambda architecture is a data-processing architecture
designed to handle massive quantities of data by taking
advantage of both batch and stream-processing methods.
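As a rough illustration of the Lambda idea described above, the following Python sketch keeps a batch view and a speed view of the same counter and merges them at query time. The class and method names (`LambdaCounter`, `ingest`, `run_batch`, `query`) are hypothetical and are not taken from RTSA; the in-memory lists and dicts merely stand in for real storage and streaming components.

```python
class LambdaCounter:
    """Toy Lambda architecture: a batch view plus a real-time (speed) view."""

    def __init__(self):
        self.master_log = []   # append-only history kept for batch processing
        self.batch_view = {}   # rebuilt from the full history on each batch run
        self.speed_view = {}   # incremental counts for events since the last batch run

    def ingest(self, key, count=1):
        # Each event is stored for the batch layer and applied to the
        # speed view immediately, so it is visible before the next batch run.
        self.master_log.append((key, count))
        self.speed_view[key] = self.speed_view.get(key, 0) + count

    def run_batch(self):
        # Batch layer: recompute the view from all history, then reset the
        # speed view because the new batch view now covers those events.
        view = {}
        for key, count in self.master_log:
            view[key] = view.get(key, 0) + count
        self.batch_view = view
        self.speed_view = {}

    def query(self, key):
        # Serving step: merge the batch and speed views at read time.
        return self.batch_view.get(key, 0) + self.speed_view.get(key, 0)


counter = LambdaCounter()
counter.ingest("vehicle-a")
counter.ingest("vehicle-a", 2)
before_batch = counter.query("vehicle-a")   # served from the speed view only
counter.run_batch()
counter.ingest("vehicle-a")
after_batch = counter.query("vehicle-a")    # batch view plus one new event
```

The point of the sketch is only that a query sees the same total whether an event has been absorbed by a batch run or not.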
[Diagram: Real-Time path: Routing, Pub/Sub, Processing, Analyze (the model is executed). Batch path: Store, historical persistence (model is trained, optimized and deployed).]
Real Time Streaming Analytics – Conceptual
RTSA – Analytics & Data Flow Life-Cycle
1 Real Time Streaming Data ingested, routed, transformed
2 Data passed from speed layer to batch/storage layer
3 Analytical apps consuming/producing data in the real-time speed layer
4 Historical data analyzed, models developed and trained
5 Trained analytical models deployed to the real-time speed layer
Apps
Data
Analytics
Speed
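The five life-cycle steps could be sketched roughly as follows in Python. This is an illustration under stated assumptions, not the RTSA implementation: `train_threshold_model` and `SpeedLayer` are hypothetical names, the "model" is a simple mean and standard deviation threshold chosen only to keep the example short, and a plain list stands in for the batch/storage layer.

```python
from statistics import mean, pstdev


def train_threshold_model(history, k=3.0):
    """Step 4 (batch side): fit a simple anomaly threshold on historical readings."""
    return {"mean": mean(history), "limit": k * pstdev(history)}


class SpeedLayer:
    """Hypothetical stand-in for the real-time speed layer."""

    def __init__(self):
        self.model = None   # no model until one is deployed (step 5)
        self.store = []     # stand-in for the batch/storage layer

    def deploy(self, model):
        # Step 5: a trained model is pushed into the speed layer.
        self.model = model

    def process(self, reading):
        # Steps 1-2: ingest the reading and pass it on to storage.
        self.store.append(reading)
        # Step 3: score the reading if a model has been deployed.
        if self.model is None:
            return None
        return abs(reading - self.model["mean"]) > self.model["limit"]


layer = SpeedLayer()
unscored = layer.process(10.0)              # no model deployed yet

model = train_threshold_model([10, 10, 10, 10, 12, 8])
layer.deploy(model)
normal = layer.process(11.0)                # within the learned band
anomaly = layer.process(20.0)               # far outside the learned band
```

Every reading reaches storage whether or not a model is deployed, which is what lets the batch side retrain later on the full history.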
Demonstration
BDD Dashboard: http://bdd-vase.apps-q01.pcfqaecc.ford.com/#/
SAS ESP: RTSA
[Diagram: Dynamic Stream Routing. Labels: Vehicle, Intelligent Mobile Apps, EDGE/IoT, Public Internet, WebSocket, Azure CLOUD, EventHub/IoTHub, NiFi (two instances), Apps XYZ (two instances), Firewall (two instances, each with P, M, M, L), Ford Network and Data Center, HDFS. Flows: Pull*, Push, Push.]
*Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
Live Demo - Data Flow Narrative
1 Data from OpenXC ingested via Cloud Foundry WebSocket
2 Data routed from Cloud to Ford data center via NiFi
3 Specific data consumed by an analytical app
4 Data published to Kafka on prem
5 Data persisted in Hadoop on prem
Live Demo
Real Time Streaming Analytics – Physical
HBase
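The live demo data flow can be approximated with a small routing sketch in Python. Everything here is an assumption made for illustration: `StreamRouter` is a hypothetical class, the three lists stand in for the analytical app, the on-prem Kafka topic and the on-prem Hadoop store, and the messages follow an OpenXC-style JSON shape with `name`, `value` and `timestamp` fields. In the deck itself the routing is done by NiFi, not by application code like this.

```python
import json


class StreamRouter:
    """Hypothetical router: delivers each message to every sink whose predicate matches."""

    def __init__(self):
        self.routes = []   # list of (predicate, sink) pairs

    def add_route(self, predicate, sink):
        self.routes.append((predicate, sink))

    def handle(self, raw_message):
        # Parse the incoming JSON text and fan it out to the matching sinks.
        message = json.loads(raw_message)
        delivered = 0
        for predicate, sink in self.routes:
            if predicate(message):
                sink.append(message)
                delivered += 1
        return delivered


# In-memory stand-ins for the destinations named in the narrative.
analytics_app, kafka_topic, hadoop_store = [], [], []

router = StreamRouter()
# Step 3: the analytical app only consumes a specific signal.
router.add_route(lambda m: m.get("name") == "vehicle_speed", analytics_app)
# Steps 4-5: all data is published and persisted.
router.add_route(lambda m: True, kafka_topic)
router.add_route(lambda m: True, hadoop_store)

speed_hits = router.handle(
    '{"name": "vehicle_speed", "value": 42.5, "timestamp": 1497366000.0}'
)
other_hits = router.handle(
    '{"name": "fuel_level", "value": 61.0, "timestamp": 1497366001.0}'
)
```

Routes are plain predicates, so adding or removing a consumer does not touch the ingest path, which is the property the "Dynamic Stream Routing" label points at.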
Summary of Key Concepts
RTSA is….
•Fully developed, managed, and deployed by Ford
•We own the data at every step
•Fully cloud and data center agnostic
•Push and pull capable
•No additional Ford Data Center Exposure
•Horizontally scalable
With BDD (Big Data Drive), we created a cloud agnostic Ford owned and
managed real time streaming solution
• RTSA product to provide foundational enterprise services:
–Data ingest
–Data Processing
–Stream Routing
• Including Cloud to On-premise
–Analytics
–Data Persistence On-premise
Roadmap
Ingestion, Transformation, Processing, and Persistence of
Streaming Data in Real-Time
Foundational services available in production environment Q1 for
applications promoted from experiment status.
[Diagram: Dynamic Stream Routing (Vehicle, Intelligent Mobile Apps, EDGE/IoT, Public Internet, WebSocket, Azure CLOUD, EventHub/IoTHub, NiFi, Apps XYZ, Firewall, Ford Network and Data Center, HDFS, HBase). Flows: Pull, Push, Push.]
Other Opportunities
[Diagram: Dynamic Stream Routing with a REST entry point added (Vehicle, Intelligent Mobile Apps, EDGE/IoT, Public Internet, WebSocket, REST, Azure CLOUD, EventHub/IoTHub, NiFi, Apps XYZ, Firewall, Ford Network and Data Center, HDFS, HBase). Flows: Pull, Push, Push.]
Other Opportunities
• NY FordHub Cisco Meraki WiFi
• Data started flowing 2/28 via RTSA
• Production infrastructure in Q1
[Diagram: Dynamic Stream Routing with a REST entry point (Vehicle, Intelligent Mobile Apps, EDGE/IoT, Public Internet, WebSocket, REST, Azure CLOUD, EventHub/IoTHub, NiFi, Apps XYZ, Firewall, Ford Network and Data Center, HDFS, HBase). Flows: Pull, Push, Push.]
Other Opportunities??
[Diagram: Dynamic Stream Routing extended with Third Party Data Sources and Third Party Data Consumers (as needed). Entry points: REST, WebSocket, REST, MQTT. Other labels: Vehicle, Intelligent Mobile Apps, EDGE/IoT, Public Internet, Azure CLOUD, EventHub/IoTHub, NiFi, Apps XYZ, Firewall, Ford Network and Data Center, HDFS. Flows: Pull, Push, Push.]
Event and/or Streaming Data Made Available to Authorized Third Party Partners as needed
• DPF Regen
• Silver
• Security
• Plant Floor
• ControlTec
• LCV Telematics
• MiniFi
• Cisco Meraki
-Dealer WiFi
-Other Hubs
HBase
This Is The End
•Discussion
•Questions
Andrea Siudara
Tom Bryans
Melissa Richards
Kevin Cooper
RTSA Product Owner
Tracy Hewiit
Dan Totten
Core RTSA Organization
RTSA Product Organization
3/11/2017
Laura Churchill
PM
T Young
J Niemiec
G Gwidz
D Hickey
Jill Johnson
PM
Raju Doma
Delivery Supervisor
C Petras
E Ulicny
D Godwin
GDIA
Information Technology
GDIA
Smart Mobility Analytics
Appendix

More Related Content

What's hot

Fast Start Failover DataGuard
Fast Start Failover DataGuardFast Start Failover DataGuard
Fast Start Failover DataGuardBorsaniya Vaibhav
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseDatabricks
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schemaSayed Ahmed
 
The State of Spark in the Cloud with Nicolas Poggi
The State of Spark in the Cloud with Nicolas PoggiThe State of Spark in the Cloud with Nicolas Poggi
The State of Spark in the Cloud with Nicolas PoggiSpark Summit
 
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache KafkaKafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafkaconfluent
 
SQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster RecoverySQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster RecoveryMichael Poremba
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkDatabricks
 
Tutorial On Database Management System
Tutorial On Database Management SystemTutorial On Database Management System
Tutorial On Database Management Systempsathishcs
 
Basic oracle-database-administration
Basic oracle-database-administrationBasic oracle-database-administration
Basic oracle-database-administrationsreehari orienit
 
No sql distilled-distilled
No sql distilled-distilledNo sql distilled-distilled
No sql distilled-distilledrICh morrow
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 
Building a Spatial Database in PostgreSQL
Building a Spatial Database in PostgreSQLBuilding a Spatial Database in PostgreSQL
Building a Spatial Database in PostgreSQLKudos S.A.S
 
Mejores prácticas para migrar sus bases de datos a AWS
Mejores prácticas para migrar sus bases de datos a AWSMejores prácticas para migrar sus bases de datos a AWS
Mejores prácticas para migrar sus bases de datos a AWSAmazon Web Services LATAM
 
Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSGlusterFS
 
Design Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data AnalyticsDesign Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data AnalyticsDataWorks Summit
 

What's hot (20)

Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
 
Fast Start Failover DataGuard
Fast Start Failover DataGuardFast Start Failover DataGuard
Fast Start Failover DataGuard
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schema
 
Big data architecture
Big data architectureBig data architecture
Big data architecture
 
The State of Spark in the Cloud with Nicolas Poggi
The State of Spark in the Cloud with Nicolas PoggiThe State of Spark in the Cloud with Nicolas Poggi
The State of Spark in the Cloud with Nicolas Poggi
 
The CAP Theorem
The CAP Theorem The CAP Theorem
The CAP Theorem
 
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache KafkaKafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
 
SQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster RecoverySQL Server High Availability and Disaster Recovery
SQL Server High Availability and Disaster Recovery
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Tutorial On Database Management System
Tutorial On Database Management SystemTutorial On Database Management System
Tutorial On Database Management System
 
Basic oracle-database-administration
Basic oracle-database-administrationBasic oracle-database-administration
Basic oracle-database-administration
 
No sql distilled-distilled
No sql distilled-distilledNo sql distilled-distilled
No sql distilled-distilled
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
Building a Spatial Database in PostgreSQL
Building a Spatial Database in PostgreSQLBuilding a Spatial Database in PostgreSQL
Building a Spatial Database in PostgreSQL
 
Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
Nosql databases
Nosql databasesNosql databases
Nosql databases
 
Mejores prácticas para migrar sus bases de datos a AWS
Mejores prácticas para migrar sus bases de datos a AWSMejores prácticas para migrar sus bases de datos a AWS
Mejores prácticas para migrar sus bases de datos a AWS
 
Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
 
Design Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data AnalyticsDesign Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data Analytics
 

Similar to Real Time Streaming Architecture at Ford

Preventative Maintenance of Robots in Automotive Industry
Preventative Maintenance of Robots in Automotive IndustryPreventative Maintenance of Robots in Automotive Industry
Preventative Maintenance of Robots in Automotive IndustryDataWorks Summit/Hadoop Summit
 
Schnellere Digitalisierung mit einer cloudbasierten Datenstrategie
Schnellere Digitalisierung mit einer cloudbasierten DatenstrategieSchnellere Digitalisierung mit einer cloudbasierten Datenstrategie
Schnellere Digitalisierung mit einer cloudbasierten DatenstrategieMongoDB
 
MongoDB World 2019: Wipro Software Defined Everything Powered by MongoDB
MongoDB World 2019: Wipro Software Defined Everything Powered by MongoDBMongoDB World 2019: Wipro Software Defined Everything Powered by MongoDB
MongoDB World 2019: Wipro Software Defined Everything Powered by MongoDBMongoDB
 
DEVNET-1166 Open SDN Controller APIs
DEVNET-1166	Open SDN Controller APIsDEVNET-1166	Open SDN Controller APIs
DEVNET-1166 Open SDN Controller APIsCisco DevNet
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Yellowfin
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Big Data Value Association
 
Webinar: Enterprise Trends for Database-as-a-Service
Webinar: Enterprise Trends for Database-as-a-ServiceWebinar: Enterprise Trends for Database-as-a-Service
Webinar: Enterprise Trends for Database-as-a-ServiceMongoDB
 
Daimler’s Community Approach to TAS Platform Monitoring
Daimler’s Community Approach to TAS Platform MonitoringDaimler’s Community Approach to TAS Platform Monitoring
Daimler’s Community Approach to TAS Platform MonitoringVMware Tanzu
 
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Odinot Stanislas
 
z Systems redefining Enterprise IT for digital business - Alain Poquillon
z Systems redefining Enterprise IT for digital business - Alain Poquillonz Systems redefining Enterprise IT for digital business - Alain Poquillon
z Systems redefining Enterprise IT for digital business - Alain PoquillonNRB
 
Functional AI and Pervasive Networking in Automotive
 Functional AI and Pervasive Networking in Automotive Functional AI and Pervasive Networking in Automotive
Functional AI and Pervasive Networking in AutomotiveAlison Chaiken
 
In memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGainIn memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGainData Con LA
 
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged PlatformMicrosoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged PlatformHitachi Vantara
 
Brocade Software Networking Presentation at Interface 2016
Brocade Software Networking Presentation at Interface 2016Brocade Software Networking Presentation at Interface 2016
Brocade Software Networking Presentation at Interface 2016Scott Sims
 
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and SimulinkApplying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and SimulinkGerardo Pardo-Castellote
 
IND3: Predix for Transportation (Predix Transform 2016)
IND3: Predix for Transportation (Predix Transform 2016)IND3: Predix for Transportation (Predix Transform 2016)
IND3: Predix for Transportation (Predix Transform 2016)Predix
 
IMS01 IMS Keynote
IMS01   IMS KeynoteIMS01   IMS Keynote
IMS01 IMS KeynoteRobert Hain
 
Cloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera, Inc.
 

Similar to Real Time Streaming Architecture at Ford (20)

Preventative Maintenance of Robots in Automotive Industry
Preventative Maintenance of Robots in Automotive IndustryPreventative Maintenance of Robots in Automotive Industry
Preventative Maintenance of Robots in Automotive Industry
 
Forecast key1 0615_ak_evening
Forecast key1 0615_ak_eveningForecast key1 0615_ak_evening
Forecast key1 0615_ak_evening
 
Schnellere Digitalisierung mit einer cloudbasierten Datenstrategie
Schnellere Digitalisierung mit einer cloudbasierten DatenstrategieSchnellere Digitalisierung mit einer cloudbasierten Datenstrategie
Schnellere Digitalisierung mit einer cloudbasierten Datenstrategie
 
MongoDB World 2019: Wipro Software Defined Everything Powered by MongoDB
MongoDB World 2019: Wipro Software Defined Everything Powered by MongoDBMongoDB World 2019: Wipro Software Defined Everything Powered by MongoDB
MongoDB World 2019: Wipro Software Defined Everything Powered by MongoDB
 
DEVNET-1166 Open SDN Controller APIs
DEVNET-1166	Open SDN Controller APIsDEVNET-1166	Open SDN Controller APIs
DEVNET-1166 Open SDN Controller APIs
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
 
Webinar: Enterprise Trends for Database-as-a-Service
Webinar: Enterprise Trends for Database-as-a-ServiceWebinar: Enterprise Trends for Database-as-a-Service
Webinar: Enterprise Trends for Database-as-a-Service
 
Daimler’s Community Approach to TAS Platform Monitoring
Daimler’s Community Approach to TAS Platform MonitoringDaimler’s Community Approach to TAS Platform Monitoring
Daimler’s Community Approach to TAS Platform Monitoring
 
Big Data Ready Enterprise
Big Data Ready Enterprise Big Data Ready Enterprise
Big Data Ready Enterprise
 
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
 
z Systems redefining Enterprise IT for digital business - Alain Poquillon
z Systems redefining Enterprise IT for digital business - Alain Poquillonz Systems redefining Enterprise IT for digital business - Alain Poquillon
z Systems redefining Enterprise IT for digital business - Alain Poquillon
 
Functional AI and Pervasive Networking in Automotive
 Functional AI and Pervasive Networking in Automotive Functional AI and Pervasive Networking in Automotive
Functional AI and Pervasive Networking in Automotive
 
In memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGainIn memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGain
 
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged PlatformMicrosoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
 
Brocade Software Networking Presentation at Interface 2016
Brocade Software Networking Presentation at Interface 2016Brocade Software Networking Presentation at Interface 2016
Brocade Software Networking Presentation at Interface 2016
 
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and SimulinkApplying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
 
IND3: Predix for Transportation (Predix Transform 2016)
IND3: Predix for Transportation (Predix Transform 2016)IND3: Predix for Transportation (Predix Transform 2016)
IND3: Predix for Transportation (Predix Transform 2016)
 
IMS01 IMS Keynote
IMS01   IMS KeynoteIMS01   IMS Keynote
IMS01 IMS Keynote
 
Cloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera - IoT & Smart Cities
Cloudera - IoT & Smart Cities
 

More from DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Real Time Streaming Architecture at Ford

  • 1. REAL TIME STREAMING ANALYTICS @ FORD, June 13, 2017
  • 2. Agenda • Original Problem Statement • Architecture Components • Data Journey • Challenges • Live Demo – Streaming from Dearborn • RTSA RoadMap & Vision
  • 3. Product Vision / Mission Statement • Experiments (BDD 2.0): No platform to do ‘Streaming’ experiments. How do we enable ‘Self-Service’ streaming? • Utility Ingestion: The existing Storm solution would not scale operationally the way it had been implemented. Today applications develop their own one-off ingestion solutions to deal with proxy and firewall rules. How do we reduce the surface area that is exposed while handling multiple types of ingest?
  • 4. SCA-V / BDD BUSINESS VALUE. BDD (Big Data Drive) drives value across the enterprise today and in the future. Pillar 1: Collection. Pillar 2: Configuration. Pillar 3: Edge Analytics. Enables: • Off cycle credit validation • Intelligent Customer Interactions • Vehicle performance insights • Customer specific city solutions • Fleet based telematics • Warranty reduction across fleets • Powertrain fuel efficiency improvement • Automotive cybersecurity • High-touch customer / dealer engagement • Product feature validation • Vehicle feature deployment • Product development lifecycle reduction • Vehicle diagnostic and prognostic enhancements
  • 5. [Diagram] Multi-Platform Data and Analytics Ecosystem. Labels: SCA-V (Single Complete Actionable Vehicle); SCA-C (Single Complete Actionable Customer); other; Landing Zone; Discovery Zone; Data Supply Chain; Data and Analytics Ecosystem
  • 6. BDD 2.0 ACCOMPLISHMENTS: A THIN SLICE • Development leverages the product team approach, which promotes cross-functional partnerships in FordLabs, PD, IT and GDI&A • Developed the first edge computing platform, which emulates the fully networked vehicle-1 and 2 (FNV-1/FNV-2) and provides production-grade web-based software to support this vehicle platform • Created the first real-time streaming application in the enterprise • Represents a significant shift toward data-driven decision making by leveraging rich, connected vehicle data. The solution includes Natural Language Search, Real Time Streaming, vehicle architecture agnosticism, software deployable anywhere (ePID2.0, TCU, Sync, ECG), and rapid vehicle data validation processes • The platform can accommodate a diverse set of vehicles across the fleet. With BDD, we created a cloud agnostic Ford owned and managed real time streaming solution
  • 7. Real Time Streaming Analytics – Conceptual. Real-time streaming is an incremental capability over traditional batch processing to ingest, transform and score individual streams of real-time data. Lambda architecture is a data-processing architecture designed to handle massive quantities of data by taking advantage of both batch and stream-processing methods. [Diagram labels: Routing; Pub/Sub; Processing; Analyze; Store; Real-Time; Batch; Model is trained, optimized and deployed; Historical persistence; The model is executed]
  • 8. Real Time Streaming Analytics – Conceptual. RTSA – Analytics & Data Flow Life-Cycle: (1) Real-time streaming data ingested, routed, transformed (2) Data passed from speed layer to batch/storage layer (3) Analytical apps consuming/producing data in the real-time speed layer (4) Historical data analyzed, models developed and trained (5) Trained analytical models deployed to the real-time speed layer. [Diagram labels: Routing; Pub/Sub; Processing; Analyze; Store; Real-Time; Batch; Model is trained, optimized and deployed; Historical persistence; The model is executed; Apps; Data; Analytics; Speed]
  • 10. Live Demo: Real Time Streaming Analytics – Physical. Live Demo - Data Flow Narrative: (1) Data from OpenXC ingested via Cloud Foundry WebSocket (2) Data routed from Cloud to Ford data center via NiFi (3) Specific data consumed by an analytical app (4) Data published to Kafka on prem (5) Data persisted in Hadoop on prem. [Architecture diagram labels: Vehicle; Intelligent Mobile Apps; EDGE/IoT; Public Internet; Azure CLOUD; WebSocket; NiFi; EventHub/IoTHub; Apps XYZ; Push; Firewall; P M M L; Ford Network and Data Center; NiFi Pull*; HDFS; HBase; Dynamic Stream Routing] *Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
  • 11. Summary of Key Concepts. RTSA is: • Fully developed, managed, and deployed by Ford • We own the data at every step • Fully cloud and data center agnostic • Push and pull capable • No additional Ford Data Center exposure • Horizontally scalable. With BDD (Big Data Drive), we created a cloud agnostic Ford owned and managed real time streaming solution
  • 12. Roadmap. Ingestion, Transformation, Processing, and Persistence of Streaming Data in Real-Time • RTSA product to provide foundational enterprise services: – Data ingest – Data Processing – Stream Routing (including Cloud to On-premise) – Analytics – Data Persistence On-premise • Foundational services available in production environment Q1 for applications promoted from experiment status.
  • 13. Other Opportunities. [Architecture diagram with the same labels as slide 10: Vehicle; Intelligent Mobile Apps; EDGE/IoT; Public Internet; Azure CLOUD; WebSocket; NiFi; EventHub/IoTHub; Apps XYZ; Push; Firewall; P M M L; Ford Network and Data Center; NiFi Pull*; HDFS; HBase; Dynamic Stream Routing] *Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
  • 14. Other Opportunities • NY FordHub Cisco Meraki WiFi • Data started flowing 2/28 via RTSA • Production infrastructure in Q1. [Architecture diagram with the same labels as slide 10, plus a REST ingest path] *Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
  • 15. Other Opportunities?? [Architecture diagram with the same labels as slide 10, plus a REST ingest path] *Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
  • 16. Event and/or Streaming Data Made Available to Authorized Third Party Partners as needed • DPF Regen • Silver • Security • Plant Floor • ControlTec • LCV Telematics • MiniFi • Cisco Meraki: Dealer WiFi, Other Hubs. [Architecture diagram with the same labels as slide 10, plus: Third Party Data Sources; Third Party Data Consumers (as needed); REST; WebSocket; MQTT] *Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
  • 17. This Is The End • Discussion • Questions
  • 18. RTSA Product Organization 3/11/2017. Andrea Siudara; Tom Bryans; Melissa Richards; Kevin Cooper; RTSA Product Owner; Tracy Hewiit; Dan Totten; Core RTSA Organization; Laura Churchill PM; T Young; J Niemiec; G Gwidz; D Hickey; Jill Johnson PM; Raju Doma Delivery Supervisor; C Petras; E Ulicny; D Godwin; GDIA Information Technology; GDIA Smart Mobility Analytics
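
The live-demo data flow on slide 10 (ingest, route, consume, publish, persist) can be illustrated with a small sketch. This is not Ford's implementation: in the deck the routing is done by NiFi, while here plain Python lists stand in for the analytical app, Kafka, and Hadoop. The routing table, the sink names, and the sample messages are all hypothetical, and the `name`/`value`/`timestamp` message shape is an assumption loosely modelled on OpenXC-style JSON.

```python
import json
from collections import defaultdict

# Hypothetical routing table: signal name -> destinations (illustrative only).
ROUTES = {
    "vehicle_speed": ["analytics_app", "kafka", "hdfs"],
    "engine_speed": ["kafka", "hdfs"],
}
# Assumption: signals without an explicit route are only persisted.
DEFAULT_ROUTE = ["hdfs"]


def route(raw_message, sinks):
    """Parse one JSON message and append it to every sink its signal maps to."""
    event = json.loads(raw_message)
    destinations = ROUTES.get(event.get("name"), DEFAULT_ROUTE)
    for destination in destinations:
        sinks[destination].append(event)
    return destinations


def run_demo():
    """Route a tiny made-up stream and report how many events each sink got."""
    sinks = defaultdict(list)  # in-memory stand-ins for real systems
    stream = [
        '{"name": "vehicle_speed", "value": 42.0, "timestamp": 1.0}',
        '{"name": "engine_speed", "value": 1800, "timestamp": 1.1}',
        '{"name": "fuel_level", "value": 71.5, "timestamp": 1.2}',
    ]
    for raw in stream:
        route(raw, sinks)
    return {name: len(events) for name, events in sinks.items()}
```

Keeping the routing rules in a table rather than in code mirrors the "Dynamic Stream Routing" label on the diagram: adding a consumer for a signal means changing data, not the routing function.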

Editor's Notes

  1. 1) Intro RTSA - Lambda 2) BDD was to validate and instantiate the RTSA 3) Demo - Live Drive - Oldie but goodies - Huey 4) Vision 5) Roadmap - production plans - NY Hub - BDD 2: Continued support for experiments - Plant floor (FIS) - Security - Silver - DPF regen - Dealer WiFi (Meraki)
  2. GDIA is building an enterprise single complete and actionable data and analytics eco-system, centered around SCA-C, focused on ingesting and curating Ford’s internal applications and warehouses and providing analytics-as-a-service opportunities. This is important work and can be accomplished with a shared vision and roadmap with IT. But as we enter the emerging world of connectivity-driven customer experience management and data-driven everything, the data and analytics ecosystem must expand to include other edge nodes, including the car. This integrated multi-platform data analytics ecosystem cannot be delivered by GDIA and IT alone. The partnership needs to be expanded to include PD. Winners are moving through build->measure->learn fastest. Our born into competitors understand this. Just another node. Not part of data analytics ecosystem. Emerging requires shared understanding with PD.
  3. Real-time analytics is a term used to refer to analytics that are able to be accessed as they come into a system. In general, the term analytics is used to define data patterns that provide meaning to a business or other entity, where analysts collect valuable information by sorting through and analyzing that data. Vast amounts of data are flowing at high velocity over the wire today. Organizations that can process and act on this streaming data in real time can dramatically improve efficiencies and differentiate themselves in the market. Some additional bullet points for the ‘what is real time streaming’: Real-time data ingestion and analysis – the speed of today’s processing systems has moved from classical data-warehousing batch reporting to the realm of real-time processing and analytics. Real-time means near-zero latency and access to information whenever it is required.
  4. DPF Regen, Silver, Security, Plant Floor, Cisco Meraki Dealer WiFi, New York Hub
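
The notes above contrast batch reporting with real-time processing, and slides 7 and 8 combine the two in a Lambda-style life cycle: a model is trained on historical data in the batch layer and executed against each event in the speed layer. The following is a minimal sketch of that loop under stated assumptions. The "model" is a deliberately simple mean-plus-k-standard-deviations threshold, and `train_threshold`, `SpeedLayer`, `run_lifecycle`, and the sample values are hypothetical names and data chosen for illustration, not anything taken from the deck.

```python
from statistics import mean, pstdev


def train_threshold(history, k=3.0):
    """Batch layer: derive an anomaly threshold from historical values."""
    return mean(history) + k * pstdev(history)


class SpeedLayer:
    """Speed layer: scores each event on arrival with the deployed model."""

    def __init__(self, threshold):
        self.threshold = threshold

    def deploy(self, threshold):
        """Swap in a newly trained model without stopping the stream."""
        self.threshold = threshold

    def score(self, value):
        """Return True when the value exceeds the current threshold."""
        return value > self.threshold


def run_lifecycle():
    """Walk through the five life-cycle steps from slide 8 on made-up data."""
    history = [50.0, 52.0, 48.0, 51.0, 49.0]  # historical persistence
    speed = SpeedLayer(train_threshold(history))
    flags = []
    for value in [50.5, 90.0, 49.5]:
        flags.append(speed.score(value))  # steps 1 and 3: ingest and score
        history.append(value)             # step 2: pass to the batch layer
    speed.deploy(train_threshold(history))  # steps 4 and 5: retrain, redeploy
    return flags, speed.threshold
```

Each event is scored the moment it arrives rather than after the next batch run, while retraining still happens offline on the full history, which is the split between the two layers that the slides describe.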