SlideShare a Scribd company logo
1 of 19
Download to read offline
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 1
Boeing F/A-18F Super Hornet at takeoff at Danish Air Show 2014 on June 22
Image attribution: Slaunger
CC BY-SA 3.0
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 2
The views of the author(s) expressed herein do not necessarily
represent those of the U.S. Navy or Department of Defense
(DoD). Presentation of this material does not constitute or imply its
endorsement, recommendation, or favoring by the DoD.
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 3
• Team of 20 data scientists and
software engineers
• Navy civil servants and
contractor support personnel
• Naval Air Warfare Center
Training Systems Division in
Orlando, FL
NAWCTSD Enterprise Research Data
Science
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 4
Issue
Unscheduled maintenance and
component degradation impacting
readiness
Hornet Health Assessment and Readiness Tool (HhART)
Create a real-time monitoring tool for a fleet of aircraft that is driven by engineering-
approved features and models that can be used to assist maintenance decisions and
predict component degradation
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 5
SME ETL Data science Deployment Feedback
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 6
SME ETL Data science Deployment Feedback
• Engineers
• Designers
• Maintainers
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 7
SME ETL Data science Deployment Feedback
• Engineers
• Designers
• Maintainers
Engagement
Education
Communication
Trust
Culture
EngineersMaintainers
Leaders
Data
scientists
Process
engineers
Developers
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 8
SME ETL Data science Deployment Feedback
• Engineers
• Designers
• Maintainers
SES 405 - Exploration Systems Engineering (ASU)
System Hierarchy Module (9)
• Complex system of systems
• Data scientists can support at
each level
• Each component is unique
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 9
• Confounding effects
• Inconsistent recording resolutions
• Both discrete and continuous data
• Changing schemas between software versions
SME ETL Data science Deployment Feedback
• Identify
• Acquire
• Load
• Engineers
• Designers
• Maintainers
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 10
Batch analysis Streaming
SME ETL Data science Deployment Feedback
• Identify
• Acquire
• Load
• Engineers
• Designers
• Maintainers
Flight data Storage Warehouse
Staging Analysis
Cleaning
Validation
Analysis
Results
Models
Flight data Storage
Analysis
Livedata
Models
Streaming
Logic-based metrics
Dashboard
GitLab is a registered trademark of GitLab, Inc.
Apache Kafka, Kafka, and the Kafka logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other
countries.
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 11
SME ETL Data science Deployment Feedback
• Identify
• Acquire
• Load
• Explore
• Develop
• Iterate
• Engineers
• Designers
• Maintainers
• Learn
• Understand the problem domain
• Analyze
• Find indicators of the problem
• Develop
• Features to predict the indicators
• Refine
• Iterate with the SME to better
define the features
Iterate
Learn
Analyze
Develop
Refine
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 12
• Logic
• Supply engineers with data science
superpowers
• Enhance error detection
• Moderately predictive
• Deep learning
• Learn what normal behavior is
• Detect complex parameter
interaction
• Highly predictive
SME ETL Data science Deployment Feedback
• Identify
• Acquire
• Load
• Explore
• Develop
• Iterate
• Engineers
• Designers
• Maintainers
Images
created
using
public
NASA
DashLink
dataset
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 13
SME ETL Data science Deployment Feedback
• Identify
• Acquire
• Load
• Explore
• Develop
• Iterate
• Engineers
• Designers
• Maintainers
• Anomaly detection
• Finding errant behavior in noisy
signals
• Virtual sensors
• Replicating normal behavior
• Information compression
• Targeting specific interactions
ModelInput
ModelOutput
Reconstruction
Error
Potential
Anomalies
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 14
• Feature score normalization
• Aggregate scores
• Seeing trends
SME ETL Data science Deployment Feedback
• Engineers
• Designers
• Maintainers
• Identify
• Acquire
• Load
• Explore
• Develop
• Iterate
• Live
• Targeted
• Relevant
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 15
SME ETL Data science Deployment Feedback
• Engineers
• Designers
• Maintainers
• Identify
• Acquire
• Load
• Explore
• Develop
• Iterate
• Live
• Targeted
• Relevant
• Speed
• On-prem vs. cloud
• Automation
• Security
• Hardening containers
• RMF, C-ATO
• Platform One
• AF CSO- software.af.mil
• Live monitoring system
Kubernetes and the Kubernetes logo are registered trademarks of The Linux
Foundation.
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 16
• Critical at all stages of the process cycle
• User needs should drive development
• Tool should be accurate and explainable
• Regularly communicate with users
SME ETL Data science Deployment Feedback
• Engineers
• Designers
• Maintainers
• Identify
• Acquire
• Load
• Explore
• Develop
• Iterate
• Live
• Targeted
• Relevant
• Communicate
• Incorporate
• Update
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 17
• Continuous
• Tailored to new platforms and
sub-systems
• Adapted to meet the needs of the
user
Collaboration
Learning
Developing
RefiningVisualizing
Deploying
Feedback
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 18
Lessons learned
• Data science is most effective when tightly integrated into
organizational structure
• Progress is difficult at best without Data Scientist understanding of
the system
Near-term goals
• Move to include additional platforms
• Deploying HhART at the edge in collaboration with partners
UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 19

More Related Content

Similar to Using Apache Spark for Predicting Degrading and Failing Parts in Aviation

IoT Development; Managing hardware and software Development
IoT Development; Managing hardware and software DevelopmentIoT Development; Managing hardware and software Development
IoT Development; Managing hardware and software DevelopmentIntland Software GmbH
 
Beyond JIRA: When Issue Tracking Alone Isn't Enough
Beyond JIRA: When Issue Tracking Alone Isn't Enough Beyond JIRA: When Issue Tracking Alone Isn't Enough
Beyond JIRA: When Issue Tracking Alone Isn't Enough Perforce
 
Removing Barriers Between Dev and Ops
Removing Barriers Between Dev and OpsRemoving Barriers Between Dev and Ops
Removing Barriers Between Dev and OpsVMware Tanzu
 
Data as a Strategic Asset
Data as a Strategic AssetData as a Strategic Asset
Data as a Strategic AssetTamrMarketing
 
Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...
Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...
Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...donaghmccabe
 
Architecting with power vm
Architecting with power vmArchitecting with power vm
Architecting with power vmCharlie Cler
 
JDK 8 and JDK 8 Updates in OpenJDK
JDK 8 and JDK 8 Updates in OpenJDKJDK 8 and JDK 8 Updates in OpenJDK
JDK 8 and JDK 8 Updates in OpenJDKWolfgang Weigend
 
Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...
Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...
Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...Skytap Cloud
 
A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...CollabNet
 
Coverage Solutions on Emulators
Coverage Solutions on EmulatorsCoverage Solutions on Emulators
Coverage Solutions on EmulatorsDVClub
 
Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)
Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)
Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)Johann Lombardi
 
SLTS kernel and base-layer development in the Civil Infrastructure Platform
SLTS kernel and base-layer development in the Civil Infrastructure PlatformSLTS kernel and base-layer development in the Civil Infrastructure Platform
SLTS kernel and base-layer development in the Civil Infrastructure PlatformYoshitake Kobayashi
 
Software Developer - Contractor
Software Developer - ContractorSoftware Developer - Contractor
Software Developer - ContractorTremaine Island
 
OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]
OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]
OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]vasuballa
 
Design and Optimize your code for high-performance with Intel® Advisor and I...
Design and Optimize your code for high-performance with Intel®  Advisor and I...Design and Optimize your code for high-performance with Intel®  Advisor and I...
Design and Optimize your code for high-performance with Intel® Advisor and I...Tyrone Systems
 
Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...
Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...
Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...Bart Jonkers
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stackinside-BigData.com
 

Similar to Using Apache Spark for Predicting Degrading and Failing Parts in Aviation (20)

IoT Development; Managing hardware and software Development
IoT Development; Managing hardware and software DevelopmentIoT Development; Managing hardware and software Development
IoT Development; Managing hardware and software Development
 
Beyond JIRA: When Issue Tracking Alone Isn't Enough
Beyond JIRA: When Issue Tracking Alone Isn't Enough Beyond JIRA: When Issue Tracking Alone Isn't Enough
Beyond JIRA: When Issue Tracking Alone Isn't Enough
 
Kota_Udayachandrakumar
Kota_UdayachandrakumarKota_Udayachandrakumar
Kota_Udayachandrakumar
 
Symvionics general overview
Symvionics general overview Symvionics general overview
Symvionics general overview
 
Removing Barriers Between Dev and Ops
Removing Barriers Between Dev and OpsRemoving Barriers Between Dev and Ops
Removing Barriers Between Dev and Ops
 
Data as a Strategic Asset
Data as a Strategic AssetData as a Strategic Asset
Data as a Strategic Asset
 
vBrownBagIT-VCDX
vBrownBagIT-VCDXvBrownBagIT-VCDX
vBrownBagIT-VCDX
 
Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...
Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...
Openstack Summit Vancouver 2015 - Maintaining and Operating Swift at Public C...
 
Architecting with power vm
Architecting with power vmArchitecting with power vm
Architecting with power vm
 
JDK 8 and JDK 8 Updates in OpenJDK
JDK 8 and JDK 8 Updates in OpenJDKJDK 8 and JDK 8 Updates in OpenJDK
JDK 8 and JDK 8 Updates in OpenJDK
 
Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...
Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...
Webinar: Build Better Software: Removing the Constraints Slowing Dev/Test Tea...
 
A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...
 
Coverage Solutions on Emulators
Coverage Solutions on EmulatorsCoverage Solutions on Emulators
Coverage Solutions on Emulators
 
Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)
Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)
Introduction to the DAOS Scale-out object store (HLRS Workshop, April 2017)
 
SLTS kernel and base-layer development in the Civil Infrastructure Platform
SLTS kernel and base-layer development in the Civil Infrastructure PlatformSLTS kernel and base-layer development in the Civil Infrastructure Platform
SLTS kernel and base-layer development in the Civil Infrastructure Platform
 
Software Developer - Contractor
Software Developer - ContractorSoftware Developer - Contractor
Software Developer - Contractor
 
OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]
OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]
OOW16 - Maintenance Strategies for Oracle E-Business Suite [CON6725]
 
Design and Optimize your code for high-performance with Intel® Advisor and I...
Design and Optimize your code for high-performance with Intel®  Advisor and I...Design and Optimize your code for high-performance with Intel®  Advisor and I...
Design and Optimize your code for high-performance with Intel® Advisor and I...
 
Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...
Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...
Increased Developer Productivity for IoT with Java and Reactive Blocks (Oracl...
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stack
 

More from Databricks

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Databricks
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Databricks
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of HadoopDatabricks
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringDatabricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixDatabricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationDatabricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchDatabricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesDatabricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesDatabricks
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsDatabricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkDatabricks
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkDatabricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesDatabricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkDatabricks
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeDatabricks
 

More from Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Recently uploaded

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 

Recently uploaded (20)

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 

Using Apache Spark for Predicting Degrading and Failing Parts in Aviation

  • 1. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 1 Boeing F/A-18F Super Hornet at takeoff at Danish Air Show 2014 on June 22 Image attribution: Slaunger CC BY-SA 3.0
  • 2. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 2 The views of the author(s) expressed herein do not necessarily represent those of the U.S. Navy or Department of Defense (DoD). Presentation of this material does not constitute or imply its endorsement, recommendation, or favoring by the DoD.
  • 3. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 3 • Team of 20 data scientists and software engineers • Navy civil servants and contractor support personnel • Naval Air Warfare Center Training Systems Division in Orlando, FL NAWCTSD Enterprise Research Data Science
  • 4. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 4 Issue Unscheduled maintenance and component degradation impacting readiness Hornet Health Assessment and Readiness Tool (HhART) Create a real-time monitoring tool for a fleet of aircraft that is driven by engineering- approved features and models that can be used to assist maintenance decisions and predict component degradation
  • 5. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 5 SME ETL Data science Deployment Feedback
  • 6. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 6 SME ETL Data science Deployment Feedback • Engineers • Designers • Maintainers
  • 7. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 7 SME ETL Data science Deployment Feedback • Engineers • Designers • Maintainers Engagement Education Communication Trust Culture EngineersMaintainers Leaders Data scientists Process engineers Developers
  • 8. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 8 SME ETL Data science Deployment Feedback • Engineers • Designers • Maintainers SES 405 - Exploration Systems Engineering (ASU) System Hierarchy Module (9) • Complex system of systems • Data scientists can support at each level • Each component is unique
  • 9. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 9 • Confounding effects • Inconsistent recording resolutions • Both discrete and continuous data • Changing schemas between software versions SME ETL Data science Deployment Feedback • Identify • Acquire • Load • Engineers • Designers • Maintainers
  • 10. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 10 Batch analysis Streaming SME ETL Data science Deployment Feedback • Identify • Acquire • Load • Engineers • Designers • Maintainers Flight data Storage Warehouse Staging Analysis Cleaning Validation Analysis Results Models Flight data Storage Analysis Livedata Models Streaming Logic-based metrics Dashboard GitLab is a registered trademark of GitLab, Inc. Apache Kafka, Kafka, and the Kafka logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.
  • 11. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 11 SME ETL Data science Deployment Feedback • Identify • Acquire • Load • Explore • Develop • Iterate • Engineers • Designers • Maintainers • Learn • Understand the problem domain • Analyze • Find indicators of the problem • Develop • Features to predict the indicators • Refine • Iterate with the SME to better define the features Iterate Learn Analyze Develop Refine
  • 12. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 12 • Logic • Supply engineers with data science superpowers • Enhance error detection • Moderately predictive • Deep learning • Learn what normal behavior is • Detect complex parameter interaction • Highly predictive SME ETL Data science Deployment Feedback • Identify • Acquire • Load • Explore • Develop • Iterate • Engineers • Designers • Maintainers Images created using public NASA DashLink dataset
  • 13. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 13 SME ETL Data science Deployment Feedback • Identify • Acquire • Load • Explore • Develop • Iterate • Engineers • Designers • Maintainers • Anomaly detection • Finding errant behavior in noisy signals • Virtual sensors • Replicating normal behavior • Information compression • Targeting specific interactions ModelInput ModelOutput Reconstruction Error Potential Anomalies
  • 14. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 14 • Feature score normalization • Aggregate scores • Seeing trends SME ETL Data science Deployment Feedback • Engineers • Designers • Maintainers • Identify • Acquire • Load • Explore • Develop • Iterate • Live • Targeted • Relevant
  • 15. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 15 SME ETL Data science Deployment Feedback • Engineers • Designers • Maintainers • Identify • Acquire • Load • Explore • Develop • Iterate • Live • Targeted • Relevant • Speed • On-prem vs. cloud • Automation • Security • Hardening containers • RMF, C-ATO • Platform One • AF CSO- software.af.mil • Live monitoring system Kubernetes and the Kubernetes logo are registered trademarks of The Linux Foundation.
  • 16. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 16 • Critical at all stages of the process cycle • User needs should drive development • Tool should be accurate and explainable • Regularly communicate with users SME ETL Data science Deployment Feedback • Engineers • Designers • Maintainers • Identify • Acquire • Load • Explore • Develop • Iterate • Live • Targeted • Relevant • Communicate • Incorporate • Update
  • 17. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 17 • Continuous • Tailored to new platforms and sub-systems • Adapted to meet the needs of the user Collaboration Learning Developing RefiningVisualizing Deploying Feedback
  • 18. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 18 Lessons learned • Data science is most effective when tightly integrated into organizational structure • Progress is difficult at best without Data Scientist understanding of the system Near-term goals • Move to include additional platforms • Deploying HhART at the edge in collaboration with partners
  • 19. UNCLASSIFIED // APPROVED FOR PUBLIC RELEASE 19