SlideShare a Scribd company logo
1 of 14
TheThe BI SandboxBI Sandbox
Madison, Wisconsin AreaMadison, Wisconsin Area
Business Intelligence & Data WarehousingBusiness Intelligence & Data Warehousing
Discussion GroupDiscussion Group
Production ETL
Analytic Data LayerData Acquisition
Layer
Operational Data Layer
BI architecture at a glance …
Legacy
Source
Systems
Legacy
Source
Systems
New
Source
Systems
New
Source
Systems
TriageTriage
ConformedConformed
StorageStorage
AreaArea
batch
transaction OperationalOperational
Data StoresData Stores
OperationalOperational
Data StoresData Stores
XML
Message
XML
Message
DataData
MartsMarts
AnalysisAnalysis
SandboxesSandboxes
Other Sources:
Operational systems
 User supplied data
Manual Loads
BI architecture at a glance …
Operational Data Layer Analytic Data Layer
ConformedConformed
StorageStorage
AreaArea
OperationalOperational
Data StoresData Stores
OperationalOperational
Data StoresData Stores
DataData
MartsMarts
Consolidated
data feeds
(legacy & new)
to downstream
systems
Consolidated
data feeds
(legacy & new)
to downstream
systems
Near real-time
data feeds of new
systems’ data
Near real-time
data feeds of new
systems’ data
Standardized
reporting, ad
hoc reporting
and analysis,
data mining,
predictive
models
Standardized
reporting, ad
hoc reporting
and analysis,
data mining,
predictive
models
Standardized
reporting
Standardized
reporting
AnalysisAnalysis
SandboxesSandboxes
What do you think of when you hear
“sandbox”?
Sandboxes are places to play where
The sand and box are provided
You bring your own toys
What you create is temporary

Obviously some of us are more talented
with sandboxes than others…
Which is the best analogy for a BI
environment?
Assembly Line
Assembly Line
A Predictive Model Test Bed
A Predictive Model Test Bed
A Library
A Library
An Artist’s Studio
An Artist’s Studio
An Information Goldmine
An Information Goldmine
sandbox noun /'san(d) , bäks/
The BI Sandbox, defined
Responsibilities • To facilitate short term ad-hoc exploratory analysis.
• To remove roadblocks to client self-service (minimizing the need for I/S
assistance) with short term ad-hoc exploratory analysis.
• To avoid the creation of unmanaged spreadsheet based data on user
desktops or shared network drives.
• To better enable short term ad-hoc exploratory analysis to be converted to
long term operational analysis as needed (through traceability)
Collaborators Semantic Layer, Operational Data Layer (ODL), Analytic Data Layer (ADL)
Rationale Typically reporting and analysis is ongoing, consistent, and can be enabled by
production structures such as ODSs and data marts.
Occasionally, business requirements indicate a need for temporary or ad-hoc
exploratory data analysis that cannot be supported by existing data structures.
These business requirements often results in unmanaged disparate spreadsheet data
on individual user desktops or shared network drives.
Sandboxes are meant to mitigate the risk that these ad hoc data sets are created
through inconsistent techniques and the subsequent risk that analytical results
discovered by using them are hard to trace and convert to a more permanent
process; and doing so typically requires a complex project to convert the untraceable
data set, integration, and analytical rules into repeatable rules.
The BI Sandbox, defined
Issues and
Notes
• Sandbox data sets will be short-lived.
• The sandbox will support Ad hoc analysis.
• Sandbox data sets will be intended for a specific purpose.
• Reporting generated from the sandbox will not be considered “official”.
• Sandbox data sets should be transitional.
• Sandboxes, if they cannot be decommissioned, should be transitioned into
production structures (e.g., ODSs or data marts).
• Sandbox data set structure/format will be dependent on access tools.
• Sandbox data set composition and quality will be dependent on the source.
• Sandbox check-out (data validation) strategy will be the responsibility of the
end user.
• Sandbox data sets should require minimal I/S intervention.
• Sandbox data can come from external or user supplied sources.
• Data acquisition from operational systems is restricted.
• Sandbox data will not be automatically refreshed on a regular basis.
• Naming standards do not apply to sandbox structures.
The BI Sandbox, the real why
• Shed light on data integration work clients do
whether I/S wishes to acknowledge it or not
• Increase partnership between I/S and business
– I/S has an appropriate solution to offer for more real
problems
• Most innovation doesn’t happen in well-defined
structures
The BI Sandbox, the how
Provide a place to play
• Typically SAS storage
Bring your own toys
• Manual loads of data from various sources including
• Data marts
• ODSs
• Operational systems
• User-supplied data sets
Create & Learn
• Use analysis tools (Business Objects, SAS, Excel) to
explore the data and discover
Transfer what you learn elsewhere
• Covert discoveries into operational changes to build
value
The BI Sandbox, the limitations
• Joins between disparate sources on natural keys
alone
– Operational system keys
– Functional keys
• No cleansing, no column renaming, minimal
metadata, no data modeling
• No automated refresh process
The BI Sandbox, the examples
• Prototyping new enterprise measure
• Experimenting with integration of disparate data
sources
• Predictive model creation, testing & validation
(in parallel with production development)
Discussion

More Related Content

What's hot

Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management DATAVERSITY
 
Structure of iso 27001
Structure of iso 27001Structure of iso 27001
Structure of iso 27001CUNIX INDIA
 
Atm devices and interfaces
Atm devices and interfacesAtm devices and interfaces
Atm devices and interfacesShanza Sohail
 
Maximum Availability Architecture - Best Practices for Oracle Database 19c
Maximum Availability Architecture - Best Practices for Oracle Database 19cMaximum Availability Architecture - Best Practices for Oracle Database 19c
Maximum Availability Architecture - Best Practices for Oracle Database 19cGlen Hawkins
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Data warehouse 12 reconciled data layers
Data warehouse  12 reconciled data layersData warehouse  12 reconciled data layers
Data warehouse 12 reconciled data layersVaibhav Khanna
 
Property Graphs in APEX.pptx
Property Graphs in APEX.pptxProperty Graphs in APEX.pptx
Property Graphs in APEX.pptxssuser923120
 
Data Warehouse Architecture.pptx
Data Warehouse Architecture.pptxData Warehouse Architecture.pptx
Data Warehouse Architecture.pptx22PCS007ANBUF
 
Database management system presentation
Database management system presentationDatabase management system presentation
Database management system presentationsameerraaj
 
Privacy-ready Data Protection Program Implementation
Privacy-ready Data Protection Program ImplementationPrivacy-ready Data Protection Program Implementation
Privacy-ready Data Protection Program ImplementationEryk Budi Pratama
 
Urgensi RUU Perlindungan Data Pribadi
Urgensi RUU Perlindungan Data PribadiUrgensi RUU Perlindungan Data Pribadi
Urgensi RUU Perlindungan Data PribadiEryk Budi Pratama
 
Essential Reference and Master Data Management
Essential Reference and Master Data ManagementEssential Reference and Master Data Management
Essential Reference and Master Data ManagementDATAVERSITY
 
Relational database- Fundamentals
Relational database- FundamentalsRelational database- Fundamentals
Relational database- FundamentalsMohammed El Hedhly
 
‏‏Chapter 8: Reference and Master Data Management
‏‏Chapter 8: Reference and Master Data Management ‏‏Chapter 8: Reference and Master Data Management
‏‏Chapter 8: Reference and Master Data Management Ahmed Alorage
 

What's hot (20)

Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
Structure of iso 27001
Structure of iso 27001Structure of iso 27001
Structure of iso 27001
 
Atm devices and interfaces
Atm devices and interfacesAtm devices and interfaces
Atm devices and interfaces
 
DBA
DBADBA
DBA
 
Maximum Availability Architecture - Best Practices for Oracle Database 19c
Maximum Availability Architecture - Best Practices for Oracle Database 19cMaximum Availability Architecture - Best Practices for Oracle Database 19c
Maximum Availability Architecture - Best Practices for Oracle Database 19c
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Data warehouse 12 reconciled data layers
Data warehouse  12 reconciled data layersData warehouse  12 reconciled data layers
Data warehouse 12 reconciled data layers
 
Property Graphs in APEX.pptx
Property Graphs in APEX.pptxProperty Graphs in APEX.pptx
Property Graphs in APEX.pptx
 
planning & project management for DWH
planning & project management for DWHplanning & project management for DWH
planning & project management for DWH
 
Data Warehouse Architecture.pptx
Data Warehouse Architecture.pptxData Warehouse Architecture.pptx
Data Warehouse Architecture.pptx
 
SABSA Implementation(Part VI)_ver1-0
SABSA Implementation(Part VI)_ver1-0SABSA Implementation(Part VI)_ver1-0
SABSA Implementation(Part VI)_ver1-0
 
Database management system presentation
Database management system presentationDatabase management system presentation
Database management system presentation
 
Privacy-ready Data Protection Program Implementation
Privacy-ready Data Protection Program ImplementationPrivacy-ready Data Protection Program Implementation
Privacy-ready Data Protection Program Implementation
 
Intan rahayu tata cara sertifikasi kelaikan sistem elektronik
Intan rahayu tata cara sertifikasi kelaikan sistem elektronikIntan rahayu tata cara sertifikasi kelaikan sistem elektronik
Intan rahayu tata cara sertifikasi kelaikan sistem elektronik
 
CISSP Chapter 1 BCP
CISSP Chapter 1 BCPCISSP Chapter 1 BCP
CISSP Chapter 1 BCP
 
DAMA International DMBOK V2 - Comparison with V1
DAMA International DMBOK V2 - Comparison with V1DAMA International DMBOK V2 - Comparison with V1
DAMA International DMBOK V2 - Comparison with V1
 
Urgensi RUU Perlindungan Data Pribadi
Urgensi RUU Perlindungan Data PribadiUrgensi RUU Perlindungan Data Pribadi
Urgensi RUU Perlindungan Data Pribadi
 
Essential Reference and Master Data Management
Essential Reference and Master Data ManagementEssential Reference and Master Data Management
Essential Reference and Master Data Management
 
Relational database- Fundamentals
Relational database- FundamentalsRelational database- Fundamentals
Relational database- Fundamentals
 
‏‏Chapter 8: Reference and Master Data Management
‏‏Chapter 8: Reference and Master Data Management ‏‏Chapter 8: Reference and Master Data Management
‏‏Chapter 8: Reference and Master Data Management
 

Similar to Madison WI BI Sandbox Group Discusses Data Exploration

Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
CS8091_BDA_Unit_I_Analytical_Architecture
CS8091_BDA_Unit_I_Analytical_ArchitectureCS8091_BDA_Unit_I_Analytical_Architecture
CS8091_BDA_Unit_I_Analytical_ArchitecturePalani Kumar
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Introduction to data mining and data warehousing
Introduction to data mining and data warehousingIntroduction to data mining and data warehousing
Introduction to data mining and data warehousingEr. Nawaraj Bhandari
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?RTTS
 
BD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdfBD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdferamfatima43
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxUNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxshruthisweety4
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)Moacyr Passador
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
Dbms and it infrastructure
Dbms and  it infrastructureDbms and  it infrastructure
Dbms and it infrastructureprojectandppt
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016Kent Graziano
 

Similar to Madison WI BI Sandbox Group Discusses Data Exploration (20)

Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
CS8091_BDA_Unit_I_Analytical_Architecture
CS8091_BDA_Unit_I_Analytical_ArchitectureCS8091_BDA_Unit_I_Analytical_Architecture
CS8091_BDA_Unit_I_Analytical_Architecture
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
DW 101
DW 101DW 101
DW 101
 
data resource management
 data resource management data resource management
data resource management
 
Introduction to data mining and data warehousing
Introduction to data mining and data warehousingIntroduction to data mining and data warehousing
Introduction to data mining and data warehousing
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
BD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdfBD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdf
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxUNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Dbms and it infrastructure
Dbms and  it infrastructureDbms and  it infrastructure
Dbms and it infrastructure
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Madison WI BI Sandbox Group Discusses Data Exploration

  • 1. TheThe BI SandboxBI Sandbox Madison, Wisconsin AreaMadison, Wisconsin Area Business Intelligence & Data WarehousingBusiness Intelligence & Data Warehousing Discussion GroupDiscussion Group
  • 2. Production ETL Analytic Data LayerData Acquisition Layer Operational Data Layer BI architecture at a glance … Legacy Source Systems Legacy Source Systems New Source Systems New Source Systems TriageTriage ConformedConformed StorageStorage AreaArea batch transaction OperationalOperational Data StoresData Stores OperationalOperational Data StoresData Stores XML Message XML Message DataData MartsMarts AnalysisAnalysis SandboxesSandboxes Other Sources: Operational systems  User supplied data Manual Loads
  • 3. BI architecture at a glance … Operational Data Layer Analytic Data Layer ConformedConformed StorageStorage AreaArea OperationalOperational Data StoresData Stores OperationalOperational Data StoresData Stores DataData MartsMarts Consolidated data feeds (legacy & new) to downstream systems Consolidated data feeds (legacy & new) to downstream systems Near real-time data feeds of new systems’ data Near real-time data feeds of new systems’ data Standardized reporting, ad hoc reporting and analysis, data mining, predictive models Standardized reporting, ad hoc reporting and analysis, data mining, predictive models Standardized reporting Standardized reporting AnalysisAnalysis SandboxesSandboxes
  • 4. What do you think of when you hear “sandbox”? Sandboxes are places to play where The sand and box are provided You bring your own toys What you create is temporary 
  • 5. Obviously some of us are more talented with sandboxes than others…
  • 6. Which is the best analogy for a BI environment? Assembly Line Assembly Line A Predictive Model Test Bed A Predictive Model Test Bed A Library A Library An Artist’s Studio An Artist’s Studio An Information Goldmine An Information Goldmine
  • 8. The BI Sandbox, defined Responsibilities • To facilitate short term ad-hoc exploratory analysis. • To remove roadblocks to client self-service (minimizing the need for I/S assistance) with short term ad-hoc exploratory analysis. • To avoid the creation of unmanaged spreadsheet based data on user desktops or shared network drives. • To better enable short term ad-hoc exploratory analysis to be converted to long term operational analysis as needed (through traceability) Collaborators Semantic Layer, Operational Data Layer (ODL), Analytic Data Layer (ADL) Rationale Typically reporting and analysis is ongoing, consistent, and can be enabled by production structures such as ODSs and data marts. Occasionally, business requirements indicate a need for temporary or ad-hoc exploratory data analysis that cannot be supported by existing data structures. These business requirements often results in unmanaged disparate spreadsheet data on individual user desktops or shared network drives. Sandboxes are meant to mitigate the risk that these ad hoc data sets are created through inconsistent techniques and the subsequent risk that analytical results discovered by using them are hard to trace and convert to a more permanent process; and doing so typically requires a complex project to convert the untraceable data set, integration, and analytical rules into repeatable rules.
  • 9. The BI Sandbox, defined Issues and Notes • Sandbox data sets will be short-lived. • The sandbox will support Ad hoc analysis. • Sandbox data sets will be intended for a specific purpose. • Reporting generated from the sandbox will not be considered “official”. • Sandbox data sets should be transitional. • Sandboxes, if they cannot be decommissioned, should be transitioned into production structures (e.g., ODSs or data marts). • Sandbox data set structure/format will be dependent on access tools. • Sandbox data set composition and quality will be dependent on the source. • Sandbox check-out (data validation) strategy will be the responsibility of the end user. • Sandbox data sets should require minimal I/S intervention. • Sandbox data can come from external or user supplied sources. • Data acquisition from operational systems is restricted. • Sandbox data will not be automatically refreshed on a regular basis. • Naming standards do not apply to sandbox structures.
  • 10. The BI Sandbox, the real why • Shed light on data integration work clients do whether I/S wishes to acknowledge it or not • Increase partnership between I/S and business – I/S has an appropriate solution to offer for more real problems • Most innovation doesn’t happen in well-defined structures
  • 11. The BI Sandbox, the how Provide a place to play • Typically SAS storage Bring your own toys • Manual loads of data from various sources including • Data marts • ODSs • Operational systems • User-supplied data sets Create & Learn • Use analysis tools (Business Objects, SAS, Excel) to explore the data and discover Transfer what you learn elsewhere • Covert discoveries into operational changes to build value
  • 12. The BI Sandbox, the limitations • Joins between disparate sources on natural keys alone – Operational system keys – Functional keys • No cleansing, no column renaming, minimal metadata, no data modeling • No automated refresh process
  • 13. The BI Sandbox, the examples • Prototyping new enterprise measure • Experimenting with integration of disparate data sources • Predictive model creation, testing & validation (in parallel with production development)