SlideShare a Scribd company logo
1 of 11
Download to read offline
Processing Large
Data sets for the National
Broadband Map with FME
Reed Whittington
Principal
National Broadband Map
www.broadbandmap.gov/techology

Massive amounts of data for all states all
providers wireline and wireless

Coverage statistics needed for each census
block for all counties and all states

Process Must be Automated so processing can
be repeated for data updates

Interoperability with Oracle and ESRI GDB

Short 1 month project cycle
Start Mid Dec 2010 working on site
Project Characteristics and
Challenges

FME was chosen for factors including:
Interoperability with Oracle and ESRI
Scalable Automated processing
Fast overlay and dissolve operations
End customer suggested FME

Professional Services provided by GIS INT
10+ Years FME Integration Experience
Responsiveness
Recommended by Safe
Interoperability Solution

Modular Design
One-Function Workflow with SQL query
published so that workflow can be called to
iterate across states or states and counties
Oracle Spatial index was used to read just the
polygons required
• Coverage that cross Census Blocks returned
from SQL SELECT
Implementation Highlights
AND COUNTYFIPS = 017
Where Clauses allow processing distinct parts
of data set.
Sample Workspace Interface
to be Iterated Over
The Iteration Driver – Initial

The actual iteration is done in a separate FME
workspace that reads the parts to process

We realized halfway through processing a need
to start processing over but not reprocess any
parts that had succeed.
– Manual tracking initially

Calling a workspace from parent workspace
WorkspaceRunner
• Used without FME Server
• A manual process would be used to distribute
processing over multiple CPUs with a command
line call passing in State to process.
Improvements / Tweaks on
the Iterator Pattern
The return status of each iteration (workspace
call) can be stored and used later to reprocess
fails.
FME Server can be utilized by replacing
– WorkspaceRunner
• Used to run another workspace on the same
Desktop box with
– FMEServerWorkspaceRunner
• Used to queue FME Server hosted workspace or
– FMEServerJobSubmitter
• Also used to queue FME Server job
Workflow Example - FME Workflow
Submitter Iterator with Status
Tracker
Whats Next?

Workflow Submitter Iterator Improvements
– Optimization – faster database
– Better Logging – Retrieve child log when
failed with FMEServerLogFileRetriever
– Metrics - estimate percent completed
– Job Control – Allow iterator to be killed or
halted and started up on another engine
– More Parameter driven

Backfit/Upgrade to replace existing
installations
Thank You!

Questions?

For more information:
 Reed Whittington
 GIS Interop Solutions
 reedwhit@gisint.com
 www.gisint.com
 www.linkedin.com/in/reedwhit

More Related Content

What's hot

ENMAX Supporting Processes and Improving GIS Data Workflows with FME
ENMAX Supporting Processes and Improving GIS Data Workflows with FMEENMAX Supporting Processes and Improving GIS Data Workflows with FME
ENMAX Supporting Processes and Improving GIS Data Workflows with FME
Safe Software
 
Adaptive Execution Support for Malleable Computation
Adaptive Execution Support for Malleable ComputationAdaptive Execution Support for Malleable Computation
Adaptive Execution Support for Malleable Computation
Qian Lin
 
MapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large ClustersMapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large Clusters
Abolfazl Asudeh
 
2015 dec 8 svc comprestimator
2015 dec 8 svc comprestimator2015 dec 8 svc comprestimator
2015 dec 8 svc comprestimator
hellocn
 

What's hot (18)

A Whirlwind Tour of Spatial Joins
A Whirlwind Tour of Spatial JoinsA Whirlwind Tour of Spatial Joins
A Whirlwind Tour of Spatial Joins
 
ENMAX Supporting Processes and Improving GIS Data Workflows with FME
ENMAX Supporting Processes and Improving GIS Data Workflows with FMEENMAX Supporting Processes and Improving GIS Data Workflows with FME
ENMAX Supporting Processes and Improving GIS Data Workflows with FME
 
Fault Tolerance and Processing Semantics in Apache Apex
Fault Tolerance and Processing Semantics in Apache ApexFault Tolerance and Processing Semantics in Apache Apex
Fault Tolerance and Processing Semantics in Apache Apex
 
Reading init param
Reading init paramReading init param
Reading init param
 
Innovile Automated Configuration Management
Innovile Automated Configuration ManagementInnovile Automated Configuration Management
Innovile Automated Configuration Management
 
Mule Batch Filter
Mule Batch FilterMule Batch Filter
Mule Batch Filter
 
A Bit of Hocus Pocus
A Bit of Hocus PocusA Bit of Hocus Pocus
A Bit of Hocus Pocus
 
Maximizing GIS Performance In Your Sleep
Maximizing GIS Performance In Your SleepMaximizing GIS Performance In Your Sleep
Maximizing GIS Performance In Your Sleep
 
Flux - An open sourced Workflow orchestrator from Flipkart
Flux - An open sourced Workflow orchestrator from FlipkartFlux - An open sourced Workflow orchestrator from Flipkart
Flux - An open sourced Workflow orchestrator from Flipkart
 
Adaptive Execution Support for Malleable Computation
Adaptive Execution Support for Malleable ComputationAdaptive Execution Support for Malleable Computation
Adaptive Execution Support for Malleable Computation
 
MapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large ClustersMapReduce : Simplified Data Processing on Large Clusters
MapReduce : Simplified Data Processing on Large Clusters
 
Mustafa Iqbal Cv
Mustafa Iqbal CvMustafa Iqbal Cv
Mustafa Iqbal Cv
 
Improving Resource Utilization in Cloud using Application Placement Heuristics
Improving Resource Utilization in Cloud using Application Placement HeuristicsImproving Resource Utilization in Cloud using Application Placement Heuristics
Improving Resource Utilization in Cloud using Application Placement Heuristics
 
Application of the automation know-how within the EGS-CC project
Application of the automation know-how within the EGS-CC projectApplication of the automation know-how within the EGS-CC project
Application of the automation know-how within the EGS-CC project
 
Recursion
RecursionRecursion
Recursion
 
Load balancing
Load balancingLoad balancing
Load balancing
 
MapReduce: Simplified Data Processing on Large Clusters
MapReduce: Simplified Data Processing on Large ClustersMapReduce: Simplified Data Processing on Large Clusters
MapReduce: Simplified Data Processing on Large Clusters
 
2015 dec 8 svc comprestimator
2015 dec 8 svc comprestimator2015 dec 8 svc comprestimator
2015 dec 8 svc comprestimator
 

Similar to Processing Large Datasets for the National Broadband Map with FME

Threading Successes 03 Gamebryo
Threading Successes 03   GamebryoThreading Successes 03   Gamebryo
Threading Successes 03 Gamebryo
guest40fc7cd
 
Applications of parellel computing
Applications of parellel computingApplications of parellel computing
Applications of parellel computing
pbhopi
 
Survey of task scheduler
Survey of task schedulerSurvey of task scheduler
Survey of task scheduler
elisha25
 
From Raw to Cooked: In-Depth FME Desktop to FME Server Workflow
From Raw to Cooked: In-Depth FME Desktop to FME Server WorkflowFrom Raw to Cooked: In-Depth FME Desktop to FME Server Workflow
From Raw to Cooked: In-Depth FME Desktop to FME Server Workflow
Safe Software
 
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera ClusterWebinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
Continuent
 

Similar to Processing Large Datasets for the National Broadband Map with FME (20)

A Safer's Guide to Best Practices for Optimizing Jobs on FME Server
A Safer's Guide to Best Practices for Optimizing Jobs on FME ServerA Safer's Guide to Best Practices for Optimizing Jobs on FME Server
A Safer's Guide to Best Practices for Optimizing Jobs on FME Server
 
A Safer's Guide to Best Practices for Optimizing Jobs on FME Server
A Safer's Guide to Best Practices for Optimizing Jobs on FME ServerA Safer's Guide to Best Practices for Optimizing Jobs on FME Server
A Safer's Guide to Best Practices for Optimizing Jobs on FME Server
 
Back to FME School - Day 2: Your Data and FME
Back to FME School - Day 2: Your Data and FMEBack to FME School - Day 2: Your Data and FME
Back to FME School - Day 2: Your Data and FME
 
FME Server 2021.0 at Scale: Advanced Job Control
FME Server 2021.0 at Scale: Advanced Job ControlFME Server 2021.0 at Scale: Advanced Job Control
FME Server 2021.0 at Scale: Advanced Job Control
 
FME Server_Ken Bragg - Safe Software FME World Tour 2013
FME Server_Ken Bragg - Safe Software FME World Tour 2013FME Server_Ken Bragg - Safe Software FME World Tour 2013
FME Server_Ken Bragg - Safe Software FME World Tour 2013
 
Oracle Database -- How to Improve Interoperability using FME
Oracle Database -- How to Improve Interoperability using FME Oracle Database -- How to Improve Interoperability using FME
Oracle Database -- How to Improve Interoperability using FME
 
Threading Successes 03 Gamebryo
Threading Successes 03   GamebryoThreading Successes 03   Gamebryo
Threading Successes 03 Gamebryo
 
Autosar Basics hand book_v1
Autosar Basics  hand book_v1Autosar Basics  hand book_v1
Autosar Basics hand book_v1
 
Capital One's Next Generation Decision in less than 2 ms
Capital One's Next Generation Decision in less than 2 msCapital One's Next Generation Decision in less than 2 ms
Capital One's Next Generation Decision in less than 2 ms
 
Applications of parellel computing
Applications of parellel computingApplications of parellel computing
Applications of parellel computing
 
Survey of task scheduler
Survey of task schedulerSurvey of task scheduler
Survey of task scheduler
 
Software Performance
Software Performance Software Performance
Software Performance
 
Enhanced Reframework Session_16-07-2022.pptx
Enhanced Reframework Session_16-07-2022.pptxEnhanced Reframework Session_16-07-2022.pptx
Enhanced Reframework Session_16-07-2022.pptx
 
Next-Gen Decision Making in Under 2ms
Next-Gen Decision Making in Under 2msNext-Gen Decision Making in Under 2ms
Next-Gen Decision Making in Under 2ms
 
Using FME Server and Engines to Convert Large Amounts of Data
Using FME Server and Engines to Convert Large Amounts of DataUsing FME Server and Engines to Convert Large Amounts of Data
Using FME Server and Engines to Convert Large Amounts of Data
 
From Raw to Cooked: In-Depth FME Desktop to FME Server Workflow
From Raw to Cooked: In-Depth FME Desktop to FME Server WorkflowFrom Raw to Cooked: In-Depth FME Desktop to FME Server Workflow
From Raw to Cooked: In-Depth FME Desktop to FME Server Workflow
 
Ultimate Real-Time — Monitor Anything, Update Anything
Ultimate Real-Time — Monitor Anything, Update AnythingUltimate Real-Time — Monitor Anything, Update Anything
Ultimate Real-Time — Monitor Anything, Update Anything
 
Taking Full Advantage of Galera Multi Master Cluster
Taking Full Advantage of Galera Multi Master ClusterTaking Full Advantage of Galera Multi Master Cluster
Taking Full Advantage of Galera Multi Master Cluster
 
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera ClusterWebinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
 
PROCESS.pptx
PROCESS.pptxPROCESS.pptx
PROCESS.pptx
 

More from Safe Software

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Safe Software
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software
 
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfIntroducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Safe Software
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Safe Software
 
Taking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New HeightsTaking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New Heights
Safe Software
 
Initiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
Safe Software
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Safe Software
 

More from Safe Software (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action:  Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action:  Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
The Critical Role of Spatial Data in Today's Data Ecosystem
The Critical Role of Spatial Data in Today's Data EcosystemThe Critical Role of Spatial Data in Today's Data Ecosystem
The Critical Role of Spatial Data in Today's Data Ecosystem
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Mastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GISMastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GIS
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri
Geospatial Synergy: Amplifying Efficiency with FME & EsriGeospatial Synergy: Amplifying Efficiency with FME & Esri
Geospatial Synergy: Amplifying Efficiency with FME & Esri
 
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfIntroducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
 
Breaking Barriers & Leveraging the Latest Developments in AI Technology
Breaking Barriers & Leveraging the Latest Developments in AI TechnologyBreaking Barriers & Leveraging the Latest Developments in AI Technology
Breaking Barriers & Leveraging the Latest Developments in AI Technology
 
Best Practices to Navigating Data and Application Integration for the Enterpr...
Best Practices to Navigating Data and Application Integration for the Enterpr...Best Practices to Navigating Data and Application Integration for the Enterpr...
Best Practices to Navigating Data and Application Integration for the Enterpr...
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
 
New Year's Fireside Chat with Safe Software’s Founders
New Year's Fireside Chat with Safe Software’s FoundersNew Year's Fireside Chat with Safe Software’s Founders
New Year's Fireside Chat with Safe Software’s Founders
 
Taking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New HeightsTaking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New Heights
 
Initiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Processing Large Datasets for the National Broadband Map with FME

  • 1. Processing Large Data sets for the National Broadband Map with FME Reed Whittington Principal
  • 3.  Massive amounts of data for all states all providers wireline and wireless  Coverage statistics needed for each census block for all counties and all states  Process Must be Automated so processing can be repeated for data updates  Interoperability with Oracle and ESRI GDB  Short 1 month project cycle Start Mid Dec 2010 working on site Project Characteristics and Challenges
  • 4.  FME was chosen for factors including: Interoperability with Oracle and ESRI Scalable Automated processing Fast overlay and dissolve operations End customer suggested FME  Professional Services provided by GIS INT 10+ Years FME Integration Experience Responsiveness Recommended by Safe Interoperability Solution
  • 5.  Modular Design One-Function Workflow with SQL query published so that workflow can be called to iterate across states or states and counties Oracle Spatial index was used to read just the polygons required • Coverage that cross Census Blocks returned from SQL SELECT Implementation Highlights
  • 6. AND COUNTYFIPS = 017 Where Clauses allow processing distinct parts of data set. Sample Workspace Interface to be Iterated Over
  • 7. The Iteration Driver – Initial  The actual iteration is done in a separate FME workspace that reads the parts to process  We realized halfway through processing a need to start processing over but not reprocess any parts that had succeed. – Manual tracking initially  Calling a workspace from parent workspace WorkspaceRunner • Used without FME Server • A manual process would be used to distribute processing over multiple CPUs with a command line call passing in State to process.
  • 8. Improvements / Tweaks on the Iterator Pattern The return status of each iteration (workspace call) can be stored and used later to reprocess fails. FME Server can be utilized by replacing – WorkspaceRunner • Used to run another workspace on the same Desktop box with – FMEServerWorkspaceRunner • Used to queue FME Server hosted workspace or – FMEServerJobSubmitter • Also used to queue FME Server job
  • 9. Workflow Example - FME Workflow Submitter Iterator with Status Tracker
  • 10. Whats Next?  Workflow Submitter Iterator Improvements – Optimization – faster database – Better Logging – Retrieve child log when failed with FMEServerLogFileRetriever – Metrics - estimate percent completed – Job Control – Allow iterator to be killed or halted and started up on another engine – More Parameter driven  Backfit/Upgrade to replace existing installations
  • 11. Thank You!  Questions?  For more information:  Reed Whittington  GIS Interop Solutions  reedwhit@gisint.com  www.gisint.com  www.linkedin.com/in/reedwhit