SlideShare a Scribd company logo
1 of 24
Download to read offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Stephen Moon
Senior Solutions Architect, Department of Defense Team
Data Supply Chain Pipeline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What is Supply Change Management?
Wikipedia... “In commerce, supply chain management (SCM), the
management of the flow of goods and services, involves the movement
and storage of raw materials, of work-in-process inventory, and of
finished goods from point of origin to point of consumption.
Interconnected or interlinked networks, channels and node businesses
combine in the provision of products and services required by end
customers in a supply chain. Supply-chain management has been
defined as the "design, planning, execution, control, and monitoring of
supply chain activities with the objective of creating net value, building a
competitive infrastructure, leveraging worldwide logistics, synchronizing
supply with demand and measuring performance globally.”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Customer Use Case
• An organization has tens, hundreds, or thousands of
disparate, heterogeneous systems, applications, or
devices across multiple organizational domains.
• Each organizational domain has multiple systems of
record storing data about the same entities and it’s
attributes
• Leaders need to be able to ask single-domain and multi-
domain questions and receive a single, accurate answer
• For example… what is my force readiness if a conflict
arises in <region>?
• This is a multi-domain question
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What is the Importance of Data?
• Data is a critical organizational asset, not an IT function
asset
• Data + Context + Relationships = Information
• Information provides strategic, tactical, and operational
advantages by enabling execution of the mission more
rapidly with greater agility and precision than the
adversary.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Handling
! Without good quality data, the best modeling techniques in the world won’t deliver accurate results
• Prepare raw data for model training
• Training Sets, Development Sets, Test Sets
• Historical event data required to build a machine learning model should be stored in the data wareho
Importance of Data for Machine / Deep Learning
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Supply Chain Pipeline Mission Statement
Democratize data by creating an accurate and consistent
360° view of the organization with the purpose of
providing actionable strategic, tactical, and operational
insight in order to enable execution excellence.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Supply Chain Pipeline Operating Model
Current State Future State
Ross, Jeanne W, et al. Enterprise Architecture As Strategy: Creating a Foundation for Business Execution. Harvard Business Review
Press, 2006.
https://www.amazon.com/dp/B004OC07EE/ref=dp-kindle-redirect?_encoding=UTF8&btkr=1
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Volume
Velocity
Variety
Veracity
Value
Kilobytes (KB)
Megabytes (MB)
Gigabytes (GB)
Terabytes (TB)
Petabytes (PB)
Time-based
Event-driven
────────
KB/s
MB/s
GB/s
Structured | Semi-structured | Unstructured
Accuracy
Authenticity
Consistency
Reliability
Decision Making
Strategic
Tactical
Operational
The Five Vs of Data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Collect
&
Process
Organize
&
Catalog
Explore
&
Discover
Analyze
&
Report
Suppliers Consumers
Search
&
Prepare
Data Supply Chain Pipeline (DSCP)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Name Build It and Make Them Come
Statement Customers should by able to access the data where it lives regardless of their own location
Name Minimal Disruption
Statement Minimize/Eliminate disruption to data producing systems, applications, or devices
Name Configure-to-Customize
Statement Leverage configurable components to meet at least 80% of the requirements
Name Decoupling
Statement Pipeline stages are independent of one another
DSCP Architecture & Design Principles
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Architecture
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Design
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Minimal Viable Product (MVP)
Initial Operating Capability (IOC)
Collection & Processing
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Collection
! There is no “tool” for collecting data from disparate,
heterogeneous suppliers
• Focus on cultivating the organization competencies and
the processes for engaging with suppliers
• Build tiger teams who understand the organizational
domains
• Develop templates for Memorandums of Understanding
(MoU) and Interface Control Documents (ICD)
• Use AWS Cloud Services to build and enable the right
technology solutions for data collection use cases
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Collection
The result will be a small set of common Collection &
Processing patterns that can be standardized, automated,
and scaled to hundreds or thousands of suppliers on AWS’
Cloud Services.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Architecture Pattern Design Pattern Methodologies
Extraction, Transformation, & Loading Change Data Capture
• Push (Source initiated)
• Pull (Target initiated)
• Transaction Log (Synchronous /
Asynchronous)
• Slowly Changing Dimensions (Type 0:6)
Enterprise Application Integration Publish / Subscribe
• Topic-based
• Content-based
DSCP Collection Architecture & Design Patterns
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Interest
Influence
Control
Control
Applications/Systems which are owned and/or
directly managed by the collecting organization
Influence
Applications/Systems of which the collecting
organization is an internal or external stakeholder but
does not own or manage the application/system
Interest
Applications/Systems of which the collecting
organization has a concern for the data but does not
have control or influence over the application/system
Why is this important?
Determines how data is going to be collected!
DSCP Collection, Circles of Concern
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hard Rules Soft Rules
Quality rules that are independent of
business value or meaning.
Example 1
A social security number must be
numeric and contain exactly 9 digits.
Example 2
An address must be valid per the USPS.
Policy rules based on context that add
or alter business value or meaning.
Example 2
A mailing address for prescription
drugs cannot be a P.O. box.
Example 1
A social security number and date of
birth must be a unique combination.
DSCP Processing, Business Rules
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Technical
L0
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Technical
L1
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Technical L2,
Structured
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DSCP Next Steps
1) Engage your AWS Solutions Architect
2) Develop a set of questions
3) Identify the information required to answer the questions
4) Identify suppliers of the relevant data
5) Build the solution platform
6) Collect & Process the data
7) Start answering questions
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank You!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
moonstep@amazon.com
Senior Solutions Architect
Department of Defense (DoD)
Worldwide Public Sector (WWPS)
Amazon Web Services (AWS)
Stephen Moon

More Related Content

What's hot

Evtm 281 07_bi2015_infographic_r2h
Evtm 281 07_bi2015_infographic_r2hEvtm 281 07_bi2015_infographic_r2h
Evtm 281 07_bi2015_infographic_r2h
Nadia Smith
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
Kartik Padmanabhan
 
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
MongoDB
 
Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Design On A Dime Dashboard Edition Laura Edell Gibbons 3Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Laura Edell
 

What's hot (20)

DataArchiva - A Native Data Archiving Solution for Salesforce
DataArchiva - A Native Data Archiving Solution for Salesforce DataArchiva - A Native Data Archiving Solution for Salesforce
DataArchiva - A Native Data Archiving Solution for Salesforce
 
Evtm 281 07_bi2015_infographic_r2h
Evtm 281 07_bi2015_infographic_r2hEvtm 281 07_bi2015_infographic_r2h
Evtm 281 07_bi2015_infographic_r2h
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big Data
 
Data Mashups for Analytics
Data Mashups for AnalyticsData Mashups for Analytics
Data Mashups for Analytics
 
Build a Next-Generation Clinical Operational Metrics Solution
Build a Next-Generation Clinical Operational Metrics SolutionBuild a Next-Generation Clinical Operational Metrics Solution
Build a Next-Generation Clinical Operational Metrics Solution
 
Chief Data Officer (CDO) Organization Roles
Chief Data Officer (CDO) Organization RolesChief Data Officer (CDO) Organization Roles
Chief Data Officer (CDO) Organization Roles
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
 
VILT - Archiving and Decommissioning with OpenText InfoArchive
VILT - Archiving and Decommissioning with OpenText InfoArchiveVILT - Archiving and Decommissioning with OpenText InfoArchive
VILT - Archiving and Decommissioning with OpenText InfoArchive
 
Data lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperativeData lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperative
 
Optimize SAP, archiving your content and data
Optimize SAP, archiving your content and dataOptimize SAP, archiving your content and data
Optimize SAP, archiving your content and data
 
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
 
The art of implementing data lineage
The art of implementing data lineageThe art of implementing data lineage
The art of implementing data lineage
 
Sap ilm detailed presentation
Sap ilm detailed presentationSap ilm detailed presentation
Sap ilm detailed presentation
 
Unlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location IntelligenceUnlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location Intelligence
 
Oracle big data publix sector 1
Oracle big data publix sector 1Oracle big data publix sector 1
Oracle big data publix sector 1
 
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
 
How to create a successful data archiving strategy for your Salesforce Org.
How to create a successful data archiving strategy for your Salesforce Org.How to create a successful data archiving strategy for your Salesforce Org.
How to create a successful data archiving strategy for your Salesforce Org.
 
Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Design On A Dime Dashboard Edition Laura Edell Gibbons 3Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Design On A Dime Dashboard Edition Laura Edell Gibbons 3
 
The Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud WorldThe Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud World
 
CPS Hyperion
CPS HyperionCPS Hyperion
CPS Hyperion
 

Similar to AWS Public Sector Summit 2018, Data Supply Chain Pipeline

Similar to AWS Public Sector Summit 2018, Data Supply Chain Pipeline (20)

Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
 
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain PipelineThe Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
 
Cloud Deep Dive: Total Cost of Ownership - John Enoch
Cloud Deep Dive: Total Cost of Ownership - John EnochCloud Deep Dive: Total Cost of Ownership - John Enoch
Cloud Deep Dive: Total Cost of Ownership - John Enoch
 
Ask an Amazon Redshift Customer Anything (ANT389) - AWS re:Invent 2018
Ask an Amazon Redshift Customer Anything (ANT389) - AWS re:Invent 2018Ask an Amazon Redshift Customer Anything (ANT389) - AWS re:Invent 2018
Ask an Amazon Redshift Customer Anything (ANT389) - AWS re:Invent 2018
 
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
 
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
 
Digital Transformation: Empowering People to Adapt to the Cloud
Digital Transformation: Empowering People to Adapt to the CloudDigital Transformation: Empowering People to Adapt to the Cloud
Digital Transformation: Empowering People to Adapt to the Cloud
 
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
 
Overview Best Practices for Large Scale Migrations - Transformation Day Phila...
Overview Best Practices for Large Scale Migrations - Transformation Day Phila...Overview Best Practices for Large Scale Migrations - Transformation Day Phila...
Overview Best Practices for Large Scale Migrations - Transformation Day Phila...
 
Best Practices for Large Scale Migrations - AWS Transformation Day Boston 2018
Best Practices for Large Scale Migrations - AWS Transformation Day Boston 2018Best Practices for Large Scale Migrations - AWS Transformation Day Boston 2018
Best Practices for Large Scale Migrations - AWS Transformation Day Boston 2018
 
BI & Analytics - A Datalake on AWS
BI & Analytics - A Datalake on AWSBI & Analytics - A Datalake on AWS
BI & Analytics - A Datalake on AWS
 
Large-Scale Migration: Best Practices - ENT210 - Chicago AWS Summit
Large-Scale Migration: Best Practices - ENT210 - Chicago AWS SummitLarge-Scale Migration: Best Practices - ENT210 - Chicago AWS Summit
Large-Scale Migration: Best Practices - ENT210 - Chicago AWS Summit
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
Enterprise Cloud Adoption
Enterprise Cloud Adoption Enterprise Cloud Adoption
Enterprise Cloud Adoption
 
An Overview of Best Practices for Large-Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large-Scale Migrations - AWS Transformation...An Overview of Best Practices for Large-Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large-Scale Migrations - AWS Transformation...
 
Uses of Data Lakes: Data Analytics Week SF
Uses of Data Lakes: Data Analytics Week SFUses of Data Lakes: Data Analytics Week SF
Uses of Data Lakes: Data Analytics Week SF
 
Customer Uses of Data Lakes
Customer Uses of Data LakesCustomer Uses of Data Lakes
Customer Uses of Data Lakes
 
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
 
BI & Analytics
BI & AnalyticsBI & Analytics
BI & Analytics
 
Large Scale Migrations - Transformation Day Montreal 2018
Large Scale Migrations - Transformation Day Montreal 2018Large Scale Migrations - Transformation Day Montreal 2018
Large Scale Migrations - Transformation Day Montreal 2018
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 

AWS Public Sector Summit 2018, Data Supply Chain Pipeline

  • 1. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Stephen Moon Senior Solutions Architect, Department of Defense Team Data Supply Chain Pipeline
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What is Supply Change Management? Wikipedia... “In commerce, supply chain management (SCM), the management of the flow of goods and services, involves the movement and storage of raw materials, of work-in-process inventory, and of finished goods from point of origin to point of consumption. Interconnected or interlinked networks, channels and node businesses combine in the provision of products and services required by end customers in a supply chain. Supply-chain management has been defined as the "design, planning, execution, control, and monitoring of supply chain activities with the objective of creating net value, building a competitive infrastructure, leveraging worldwide logistics, synchronizing supply with demand and measuring performance globally.”
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Customer Use Case • An organization has tens, hundreds, or thousands of disparate, heterogeneous systems, applications, or devices across multiple organizational domains. • Each organizational domain has multiple systems of record storing data about the same entities and it’s attributes • Leaders need to be able to ask single-domain and multi- domain questions and receive a single, accurate answer • For example… what is my force readiness if a conflict arises in <region>? • This is a multi-domain question
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What is the Importance of Data? • Data is a critical organizational asset, not an IT function asset • Data + Context + Relationships = Information • Information provides strategic, tactical, and operational advantages by enabling execution of the mission more rapidly with greater agility and precision than the adversary.
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Handling ! Without good quality data, the best modeling techniques in the world won’t deliver accurate results • Prepare raw data for model training • Training Sets, Development Sets, Test Sets • Historical event data required to build a machine learning model should be stored in the data wareho Importance of Data for Machine / Deep Learning
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Supply Chain Pipeline Mission Statement Democratize data by creating an accurate and consistent 360° view of the organization with the purpose of providing actionable strategic, tactical, and operational insight in order to enable execution excellence.
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Supply Chain Pipeline Operating Model Current State Future State Ross, Jeanne W, et al. Enterprise Architecture As Strategy: Creating a Foundation for Business Execution. Harvard Business Review Press, 2006. https://www.amazon.com/dp/B004OC07EE/ref=dp-kindle-redirect?_encoding=UTF8&btkr=1
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Volume Velocity Variety Veracity Value Kilobytes (KB) Megabytes (MB) Gigabytes (GB) Terabytes (TB) Petabytes (PB) Time-based Event-driven ──────── KB/s MB/s GB/s Structured | Semi-structured | Unstructured Accuracy Authenticity Consistency Reliability Decision Making Strategic Tactical Operational The Five Vs of Data
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Collect & Process Organize & Catalog Explore & Discover Analyze & Report Suppliers Consumers Search & Prepare Data Supply Chain Pipeline (DSCP)
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Name Build It and Make Them Come Statement Customers should by able to access the data where it lives regardless of their own location Name Minimal Disruption Statement Minimize/Eliminate disruption to data producing systems, applications, or devices Name Configure-to-Customize Statement Leverage configurable components to meet at least 80% of the requirements Name Decoupling Statement Pipeline stages are independent of one another DSCP Architecture & Design Principles
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Architecture
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Design
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Minimal Viable Product (MVP) Initial Operating Capability (IOC) Collection & Processing
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Collection ! There is no “tool” for collecting data from disparate, heterogeneous suppliers • Focus on cultivating the organization competencies and the processes for engaging with suppliers • Build tiger teams who understand the organizational domains • Develop templates for Memorandums of Understanding (MoU) and Interface Control Documents (ICD) • Use AWS Cloud Services to build and enable the right technology solutions for data collection use cases
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Collection The result will be a small set of common Collection & Processing patterns that can be standardized, automated, and scaled to hundreds or thousands of suppliers on AWS’ Cloud Services.
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Architecture Pattern Design Pattern Methodologies Extraction, Transformation, & Loading Change Data Capture • Push (Source initiated) • Pull (Target initiated) • Transaction Log (Synchronous / Asynchronous) • Slowly Changing Dimensions (Type 0:6) Enterprise Application Integration Publish / Subscribe • Topic-based • Content-based DSCP Collection Architecture & Design Patterns
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Interest Influence Control Control Applications/Systems which are owned and/or directly managed by the collecting organization Influence Applications/Systems of which the collecting organization is an internal or external stakeholder but does not own or manage the application/system Interest Applications/Systems of which the collecting organization has a concern for the data but does not have control or influence over the application/system Why is this important? Determines how data is going to be collected! DSCP Collection, Circles of Concern
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hard Rules Soft Rules Quality rules that are independent of business value or meaning. Example 1 A social security number must be numeric and contain exactly 9 digits. Example 2 An address must be valid per the USPS. Policy rules based on context that add or alter business value or meaning. Example 2 A mailing address for prescription drugs cannot be a P.O. box. Example 1 A social security number and date of birth must be a unique combination. DSCP Processing, Business Rules
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Technical L0
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Technical L1
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Technical L2, Structured
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DSCP Next Steps 1) Engage your AWS Solutions Architect 2) Develop a set of questions 3) Identify the information required to answer the questions 4) Identify suppliers of the relevant data 5) Build the solution platform 6) Collect & Process the data 7) Start answering questions
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank You!
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. moonstep@amazon.com Senior Solutions Architect Department of Defense (DoD) Worldwide Public Sector (WWPS) Amazon Web Services (AWS) Stephen Moon