Lessons learned from American Tower
Building a flexible and affordable enterprise data warehouse with the expressor semantic data integration system

Who is ATC?
  • Cellular and broadcast tower ownership and operation
  • Over 30,000 towers
  • Leader in tower industry worldwide
  • Operations in US, Mexico, Brazil, India
  • $1.6B revenue (worldwide), 1200 employees (US)

Business Challenges
  • International operations
      • 4 markets at present, plus future expansions
  • Business model
      • Real estate
  • Outdated reporting environment
      • Single purpose data marts
      • Lengthy, redundant data extracts
      • No clear definition of contents
      • Poor reporting

Enterprise Data Warehouse Program
  • Started October 2008
  • Improve user experience for reporting
      • Integrated data
      • Improved reporting tool
      • Faster data refreshes
  • Three-part solution
      • Data Warehouse: Kimball methodology, SQL Server 2005
      • BI reporting tool: Cognos 8
      • Governance process: Business-led management of DW

Selecting the right tools
  • Reporting tool (Cognos 8)
      • Lengthy vendor selection process, close business involvement
  • Database for DW (SQL Server 2005)
      • Technical evaluation (data volumes and future capacity requirements)
      • Experience of existing personnel
  • ETL tool (initially SSIS)
      • Experience of existing personnel
      • Budget (used existing SQL Server licenses)
      • Technical evaluation (could live with shortcomings)

ETL Structure - High Level
  • Three-step process
  • Separate jobs for each process
  • [Diagram: Source System -> Extract -> Transform -> Load -> EDW]
      • Extract: raw data pulls
      • Transform: prepare data
      • Load: load facts and dimensions (SCD)
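
The three-step flow above can be sketched as three separate functions, one per job. This is a minimal illustration, not ATC's implementation: the `DimRow` fields, the cleaning rules, and the choice of a Type 2 slowly changing dimension (the slide says only "SCD") are all assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class DimRow:
    tower_number: str               # business key (hypothetical)
    market: str                     # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date] = None  # None marks the current version


def extract(source):
    """Extract job: raw data pull, rows copied unchanged."""
    return [dict(row) for row in source]


def transform(raw):
    """Transform job: prepare data by trimming and standardising values."""
    return [
        {
            "tower_number": r["tower_number"].strip().upper(),
            "market": r["market"].strip().title(),
        }
        for r in raw
    ]


def load_dimension(dim, prepared, as_of):
    """Load job: assumed Type 2 SCD, expire changed rows and append new versions."""
    current = {d.tower_number: d for d in dim if d.valid_to is None}
    for row in prepared:
        existing = current.get(row["tower_number"])
        if existing is not None and existing.market == row["market"]:
            continue  # attribute unchanged, keep the current version
        if existing is not None:
            existing.valid_to = as_of  # close out the old version
        new_row = DimRow(row["tower_number"], row["market"], as_of)
        dim.append(new_row)
        current[new_row.tower_number] = new_row
    return dim
```

Keeping each step as its own function mirrors the "separate jobs for each process" point: any one step can be rerun or replaced without touching the other two.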

ETL Structure - Detail
  • Three-layer process for each step (diagram shows Extract)
      • Scheduler: control flow, timing
      • Metadata Wrapper: data quality, logging, etc.
      • ETL Execution: actual data transfer / processing
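
One way to picture the three layers is a scheduler loop that calls a wrapper, which in turn calls the actual ETL work. The sketch below is illustrative only: the function names, the row-count check standing in for "data quality", and the job names in the usage note are assumptions, not details from the deck.

```python
import logging

logger = logging.getLogger("etl.wrapper")


def run_with_wrapper(job_name, execute, min_rows=1):
    """Metadata wrapper layer: logging plus a simple data quality check."""
    logger.info("start %s", job_name)
    try:
        rows = list(execute())  # ETL execution layer: the actual data transfer
    except Exception:
        logger.exception("failed %s", job_name)
        raise
    if len(rows) < min_rows:
        raise ValueError(
            f"{job_name}: expected at least {min_rows} rows, got {len(rows)}"
        )
    logger.info("finished %s (%d rows)", job_name, len(rows))
    return rows


def schedule(jobs):
    """Scheduler layer: control flow, runs jobs in order and stops on failure."""
    results = {}
    for name, execute in jobs:
        results[name] = run_with_wrapper(name, execute)
    return results
```

For example, `schedule([("extract_towers", pull_towers)])` would log the start and finish of a hypothetical `pull_towers` callable and fail fast if it returned no rows.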

SSIS Issues
  • Functionality
      • Bulk updates
      • Awkward scripting (two languages, not well integrated)
  • Performance
      • Oracle extracts not performing optimally

Opportunity for expressor
  • Became aware of expressor mid-2009
  • Proof of concept to establish benefits
      • Performance: 8-24x faster for Oracle extracts (1-4 channels)
      • Scripting: expressor datascript very powerful
      • Functionality: bulk updates, general capabilities
  • Acquisition made much easier by low cost of adoption

ETL Structure - expressor
  • Three-step process
      • Extract: raw data pulls
      • Transform: prepare data
      • Load: load facts and dimensions (SCD)
  • expressor benefits
      • Improved performance
      • Semantic rationalization
      • Bulk updates
      • Scripting
      • ETL functionality

ETL Structure
  • Three-layer process for each step (diagram shows Extract)
      • Scheduler: control flow, timing
      • Metadata Wrapper: data quality, logging, etc.
          • Change execution method via metadata
      • ETL Execution: actual data transfer / processing
          • Replace with expressor
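
"Change execution method via metadata" suggests that each job's engine is looked up from a metadata record rather than hard-coded, so a job can move from SSIS to expressor by editing one row. The sketch below shows that idea only. The runner functions are stubs that return a label; they do not call any real SSIS or expressor interface, and the job names and metadata layout are hypothetical.

```python
def run_ssis_stub(job):
    """Stand-in for launching an SSIS package (hypothetical, no real call)."""
    return f"ssis:{job}"


def run_expressor_stub(job):
    """Stand-in for launching an expressor dataflow (hypothetical, no real call)."""
    return f"expressor:{job}"


# Registry of execution methods the wrapper knows about.
ENGINES = {"ssis": run_ssis_stub, "expressor": run_expressor_stub}

# Hypothetical job metadata; in practice this would live in a control table.
JOB_METADATA = [
    {"job": "extract_oracle_leases", "engine": "expressor"},
    {"job": "load_dim_tower", "engine": "ssis"},
]


def run_all(metadata, engines=ENGINES):
    """Dispatch each job to the engine named in its metadata record."""
    results = []
    for entry in metadata:
        runner = engines[entry["engine"]]  # KeyError flags an unknown engine
        results.append(runner(entry["job"]))
    return results
```

With this layout, migrating `load_dim_tower` would mean changing its `engine` value to `"expressor"`; the scheduler and wrapper code stay untouched.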

expressor Downstream Benefits
  • Semantic dictionary
      • Clarify confusing business terms
          • Multiple formats for Tower Number
          • Differing business terms for same concept (milestone / date)
      • Direct input from BAs into data modeling / ETL
  • Growth potential
      • Add channels as needed
  • Development / Maintenance
      • Simpler ETL
      • Fewer stored procedures or “inventive” solutions
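
The two semantic dictionary problems named above, several formats for one identifier and several names for one concept, can be illustrated with a small lookup-and-normalise sketch. This is plain Python, not expressor's semantic type system, and the tower number formats, the six-digit canonical width, and the synonym table are invented for the example.

```python
import re

# Hypothetical synonym table: differing business terms mapped to one canonical term.
SYNONYMS = {
    "milestone": "milestone_date",
    "date": "milestone_date",
    "tower no": "tower_number",
    "tower #": "tower_number",
}


def canonical_term(name):
    """Map a source column name to its canonical business term."""
    key = name.strip().lower()
    return SYNONYMS.get(key, key.replace(" ", "_"))


def canonical_tower_number(value, width=6):
    """Reduce assumed tower number formats to zero-padded digits."""
    digits = re.sub(r"\D", "", str(value))  # drop prefixes, dashes, spaces
    if not digits:
        raise ValueError(f"no digits in tower number: {value!r}")
    return digits.zfill(width)
```

Under these assumptions, `"US-1234"`, `1234`, and `"001234"` all normalise to `"001234"`, so downstream joins see a single representation.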

expressor Challenges
  • expressor datascript is different from MS / .Net world
      • Very powerful scripting language
  • Single library
      • All semantic terms share a namespace
  • Process / Flow control
  • Involving BAs in semantic rationalization
      • Requires process change outside development team

Lessons Learned
  • SSIS is “free”, but you get what you pay for
      • Functionality limitations; we didn’t know what we were missing
      • Performance
  • Transition requires effort
      • Small learning curve for expressor datascript
  • Semantic Rationalization process impacts
      • Work with all affected groups
  • ETL Architecture preparation pays off
      • Plan for scalable hardware and flexible software