Colorado State Address Dataset 
Automated Processing 
Nathan Lowry, GIS Outreach Coordinator 
State of Colorado 
September 23, 2014
Common Data Model 
● Allows local and state-wide querying, analysis, and integration … 
● Accommodates information exchanges 
▪ Hierarchical - City to County, County to Region, Region to State 
▪ Among neighboring jurisdictions (e.g., County to County) 
● Allows profiles to provide data in standard forms for specific 
objectives 
▪ NENA CLDXF for NG-911 
▪ USPS Pub-28 for CASS 
▪ ArcGIS Geocoding (for quality comparisons, etc.) 
● It’s more efficient (less work) and preserves more quality (less loss)
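One way to picture the profiles above: a single fully spelled-out record in the common model is rendered into whatever form a given consumer needs. The sketch below is illustrative only. The field names, the `CommonAddress` class, the two rendering functions, the sample address, and the tiny abbreviation tables are all assumptions made for the example; they are not the State's schema, and the tables are a small subset of what USPS Pub-28 lists.

```python
from dataclasses import dataclass

# Tiny illustrative subsets (assumption); the published tables are much longer.
DIRECTIONAL_ABBREVIATIONS = {"NORTH": "N", "SOUTH": "S", "EAST": "E", "WEST": "W"}
STREET_TYPE_ABBREVIATIONS = {"STREET": "ST", "AVENUE": "AVE", "BOULEVARD": "BLVD"}


@dataclass(frozen=True)
class CommonAddress:
    """Hypothetical common-model record: every element is fully spelled out."""
    number: str
    pre_directional: str
    street_name: str
    post_type: str


def to_full_profile(addr: CommonAddress) -> str:
    """Render the record as stored, with no abbreviation."""
    parts = [addr.number, addr.pre_directional, addr.street_name, addr.post_type]
    return " ".join(p for p in parts if p)


def to_postal_profile(addr: CommonAddress) -> str:
    """Render a postal-style delivery line by abbreviating known elements."""
    parts = [
        addr.number,
        DIRECTIONAL_ABBREVIATIONS.get(addr.pre_directional, addr.pre_directional),
        addr.street_name,
        STREET_TYPE_ABBREVIATIONS.get(addr.post_type, addr.post_type),
    ]
    return " ".join(p for p in parts if p)


# Made-up sample address, not taken from the dataset.
record = CommonAddress("123", "NORTH", "MAIN", "STREET")
print(to_full_profile(record))    # 123 NORTH MAIN STREET
print(to_postal_profile(record))  # 123 N MAIN ST
```

Going from spelled-out to abbreviated is a simple lookup, while the reverse direction is ambiguous; that is one reason to store the explicit form and derive the profiles from it.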
FGDC-STD-016-2011 
United States Thoroughfare, Landmark, and Postal Address Data Standard 
Of Greatest Significance: 
1. Everything* is ‘fully explicit’ (fully spelled-out) 
No abbreviations allowed; no ambiguity 
*The only exception is two-letter state postal codes (e.g., “CO” = Colorado) 
2. You will express exactly how each address will be parsed 
Parsing is no longer subject to interpretation 
The break-down is stored in the data for each record 
3. Each address must be assigned a Unique Identifier (UID) 
Multiple representations of the same address can be “tied together” if and only if (iff) addresses are assigned UIDs. 
These are big changes that few have yet implemented 
● Our common data model is designed to accommodate both: 
▪ your current state and 
▪ this “to be” state
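The three points above can be sketched as a record that stores its own parse, keeps every element spelled out, and carries a UID so that alternate representations can be tied together. This is a minimal sketch under stated assumptions: the field names loosely follow the standard's vocabulary, while `ParsedAddress`, `group_by_uid`, the use of a random UUID as the identifier, and the sample values are hypothetical and not the State's schema.

```python
from collections import defaultdict
from dataclasses import dataclass
import uuid


@dataclass(frozen=True)
class ParsedAddress:
    """Hypothetical record: the parse is stored with the data, not re-derived."""
    address_uid: str                  # shared by every representation of one address
    address_number: str
    street_name_pre_directional: str
    street_name: str
    street_name_post_type: str
    place_name: str
    state_name: str                   # the one allowed abbreviation: two-letter code
    zip_code: str

    def full_address(self) -> str:
        """Join the non-empty street elements in their stored order."""
        elements = [
            self.address_number,
            self.street_name_pre_directional,
            self.street_name,
            self.street_name_post_type,
        ]
        return " ".join(e for e in elements if e)


def group_by_uid(records):
    """Tie together records that share a UID."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec.address_uid].append(rec.full_address())
    return dict(groups)


# Assumption: any stable, unique identifier scheme would serve; UUID4 is just convenient.
uid = str(uuid.uuid4())

# Made-up sample values: one location known by a street address and a highway alias.
official = ParsedAddress(uid, "123", "NORTH", "MAIN", "STREET", "EXAMPLEVILLE", "CO", "80000")
alias = ParsedAddress(uid, "123", "", "STATE HIGHWAY 99", "", "EXAMPLEVILLE", "CO", "80000")

for shared_uid, forms in group_by_uid([official, alias]).items():
    print(shared_uid, forms)
```

Without the shared UID there is nothing in the two strings themselves that says they describe the same place, which is the point of the "if and only if" on the slide.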
Presuppositions (and how each turned out): 
● SQL Server Integration Services (SSIS) 
o Parallel processing, fast translations - True. 
o Most compatible with SQL Server - Irrelevant* 
o Developed by DBAs for DBAs - No, developed by app developers for app developers 
▪ (i.e., normalization tools) - Hah, hah, hah, hah, hah! 
o No additional cost - (This one was borne out) 
o I learned French instead of Spanish - (SSIS instead of Python) 
● No Parsing 
o I will translate, but it’ll be the locals’ responsibility to pre-parse... - No parsing, no geocoding* 
o In addition, no last lines, no geocoding* 
● 6-8 weeks of processing - 6-8 months of processing
Automating Processes
Colorado State Address Dataset 
Automated and Manual Processes
Automating Processes
Observations 
● SQL Server Integration Services (SSIS) 
○ SSIS is quirky 
○ SSIS Expression Language is Swahili 
○ A modeling canvas may be more effective for design 
○ SSIS can integrate with many other server processes (FTP) 
● Parsing and “Last Lining” will give CO jurisdictions a 
leg up 
○ The level of effort can be significant 
○ CLDXF Street Naming and Address Numbering Conventions 
● Standards 
○ Jurisdictional pretypes, sequencers - minor tweaks 
○ Subaddress conventions need ... something
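To give a sense of what parsing and "last lining" involve, here is a deliberately small Python sketch. It is not the SSIS process described in these slides: the function names, the lookup tables, and the sample values are assumptions for illustration, and it only handles the simple pattern "number, optional directional, name, optional type".

```python
# Small illustrative lookups (assumption); real tables are much larger.
DIRECTIONALS = {"N": "NORTH", "S": "SOUTH", "E": "EAST", "W": "WEST"}
STREET_TYPES = {"ST": "STREET", "AVE": "AVENUE", "RD": "ROAD", "BLVD": "BOULEVARD"}


def parse_street_address(raw: str) -> dict:
    """Split a simple street address into elements, spelling each one out."""
    tokens = raw.upper().replace(".", "").split()
    parsed = {"number": "", "pre_directional": "", "street_name": "", "post_type": ""}

    # A leading all-digit token is taken as the address number.
    if tokens and tokens[0].isdigit():
        parsed["number"] = tokens.pop(0)

    # Treat a leading directional as such only when a street name follows it.
    if len(tokens) > 1 and (tokens[0] in DIRECTIONALS or tokens[0] in DIRECTIONALS.values()):
        first = tokens.pop(0)
        parsed["pre_directional"] = DIRECTIONALS.get(first, first)

    # Treat a trailing street type as such only when a street name precedes it.
    if len(tokens) > 1 and (tokens[-1] in STREET_TYPES or tokens[-1] in STREET_TYPES.values()):
        last = tokens.pop()
        parsed["post_type"] = STREET_TYPES.get(last, last)

    parsed["street_name"] = " ".join(tokens)
    return parsed


def last_line(place_name: str, state: str, zip_code: str) -> str:
    """Compose the place, state, and ZIP line that accompanies the street address."""
    return f"{place_name.upper()} {state.upper()} {zip_code}"


# Made-up sample values.
print(parse_street_address("123 N. Main St"))
print(last_line("Exampleville", "co", "80000"))
```

Even this toy version needs guards for ambiguous tokens, and it ignores subaddresses, fractional numbers, and jurisdiction-specific pretypes entirely, which hints at why the level of effort can be significant.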
Opportunities 
● Standards 
○ Improvement via implementation 
○ Coalescence on Subaddresses 
● Common implementations of data models 
○ Reduce the cost of development 
○ Makes sharing of code useful and possible 
● Common code 
○ Shared parsing tools 
○ Shared applications
Questions? 
Thank You!
