SlideShare a Scribd company logo
Open Spatial Data
Progress towards a reusable gazetteer
                                               th
        Open Data Group – 16 April 2012
                   @ianibbo



This work is licensed under a Creative Commons Attribution 3.0 Unported License.
Overview
Original Problem
  How to transition a central gov't funded aggregation of
   childcare and positive activities with a budget of
   >2m / year to an open data* model running on £60/
   month hardware
     Retaining security (Of a certain level)
     Retaining functionality
(See http://www.madwdata.org.uk/blog/id/394)
2 Major Costs To Mitigate
Large cluster of proprietary OS hosts, ~12 front
 end web servers, hot backup sql server
  Migrated to 1*Pound Host server ~£60/month, server
   has 2 hard drives, hot backup, off site rsync
Data costs – BPH Address-Point data – Used for
 geocoding incoming records and lookups on
 search terms. OS Boundary Line
  ???
Some Noise
Open Spatial Data Consultation......
Open Spatial Data
Ordnance Survey Open Data
  http://www.ordnancesurvey.co.uk/oswebsite/products/os-lo
Code Point Open
  Postcodes to Northing/Easting
OS Locator
  Gazetteer of road names (And other features)
Obtained by registering on website, requesting,
 getting email, following link, …..
The reality of CodePoint Open
The core data is “Open”
Missing the one vital link between CodePoint
 Open and OS Locator – PostCode → Road
 Names / Identifiers.
If you're happy to display Postcodes without road
   names, it's ideal.
Last Mile Problem.
Finding an automated way to link the 2 is hard!
Licensed data is now open, but out of date
Address Point
Still Licensed
Expensive
Probably not that useful anyway for most projects
Problem with focus on “Open Data”
Everyone ends up implementing their own
 gazetteer
Large scale providers have rate limits and
 introduce external dependencies / Speed
 issues
People want local geo-coding (for lots of different
 reasons).
Having rolled your own gazetteer, you discover
 you need to handle updates (Full replacements)
It's not an end in itself
Vision
A stand-alone gazetteer web app designed for
  local network use with features for importing
  updates from OS, reconciling multiple data
  sources and performing geo-coding lookups.
Available Tools
Apache SOLR
  Long-Standing stalwart of the open data and search
    community
  Schemas slightly clunky
  Several spatial options, all with different strengths /
   weaknesses. Multiple points a problem in some.
ElasticSearch
  Schema Free, Apparently Solid Spatial, Multi Points
  Good integration with Mongo via Rivers
Problems / Issues
ES Spatial search hard to do directly via a COOL
 URL
  Spatial query syntax is expressive, but complex and
   needs JSON sub-documents
Need service wrappers
But thats easily done
Updates!
Missed Level of Abstraction
(Common to many open data sets?)

                   Local
                   Copy




                    C
                     o
  Sourc             m      Processin
    e               pa         g
                    re


      NOSQL Like            ES Ideal for
      Mongo is ideal for    this
      this
Progress
Starting to extract code from existing services
 into a generic spatial app
https://github.com/ianibo/AnOpenGazetteerFramewo
Work progressing under aegis of GIST Mobile
 group / Open Data group
Workable Gaz now, but command line interface
 for importing.
Questions / Comments?
Some supporting info
Original Project – FOI request to DfE
                         Total costs - First 3 years

          7000000
                                           Local Authority         Consultation sem-
          6000000                          Revenue                 inars
                                           Local Authority Capi-   Methods Consulting
          5000000                          tal
                                           Central Office of In-   Engine Group
          4000000                          formation
                                           Qi Consulting           Digital Public
          3000000                          Redhouse                Tribal Education
                                           DfE Staff Costs
          2000000

          1000000

                0
               2008-09   2009-10            2010-11
First 3 years - Non LA costs

2500000

                                              Central Office of In-
2000000                                       formation
                                              Qi Consulting
                                              Redhouse
1500000
                                              DfE Staff Costs
                                              Consultation sem-
                                              inars
1000000
                                              Methods Consulting
                                              Engine Group
 500000                                       Digital Public
                                              Tribal Education

      0
     2008-09    2009-10           2010-11

More Related Content

Similar to Open spatial processing

Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12 Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12
NatashaIRWS
 
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
BIWUG
 
Blockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech LabBlockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech Lab
SilverTouchTechLab
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo
 
Case Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless EnterpriseCase Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless Enterprise
Appirio
 
Broadband World Forum 2012 Highlights
Broadband World Forum 2012 HighlightsBroadband World Forum 2012 Highlights
Broadband World Forum 2012 Highlights
Alan Quayle
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in Technology
Calen Legaspi
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in Technology
Orange and Bronze Software Labs
 
5 2-nessi-stuart campbell
5 2-nessi-stuart campbell5 2-nessi-stuart campbell
5 2-nessi-stuart campbell
Digital Business Innovation Community
 
Resume_ASA
Resume_ASAResume_ASA
Resume_ASA
Self-employed
 
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisatiesData Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Multiscope
 
1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...
IBM
 
How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02
Data Science London
 
EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14
Nagi Reddy B
 
Using Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy ManagementUsing Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy Management
Edward Curry
 
Top Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama SoftwareTop Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama Software
Panorama Software
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
BigDataExpo
 
Big Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech LabBig Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech Lab
SilverTouchTechLab
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Jade Global
 
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Eric Athas
 

Similar to Open spatial processing (20)

Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12 Irw seminar slides for am Sharepoint event on 25.4.12
Irw seminar slides for am Sharepoint event on 25.4.12
 
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
Sharepointarchitecturereal worldscenariofundamentals-150419043032-conversion-...
 
Blockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech LabBlockchain by Silver Touch Tech Lab
Blockchain by Silver Touch Tech Lab
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Case Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless EnterpriseCase Study - Appirio as the Serverless Enterprise
Case Study - Appirio as the Serverless Enterprise
 
Broadband World Forum 2012 Highlights
Broadband World Forum 2012 HighlightsBroadband World Forum 2012 Highlights
Broadband World Forum 2012 Highlights
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in Technology
 
Lesser Known Opportunities in Technology
Lesser Known Opportunities in TechnologyLesser Known Opportunities in Technology
Lesser Known Opportunities in Technology
 
5 2-nessi-stuart campbell
5 2-nessi-stuart campbell5 2-nessi-stuart campbell
5 2-nessi-stuart campbell
 
Resume_ASA
Resume_ASAResume_ASA
Resume_ASA
 
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisatiesData Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
Data Pioneers - Roland Haeve (Atos Nederland) - Big data in organisaties
 
1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...
 
How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02
 
EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14EDW_Recommendation_Smartmonk_26-12-14
EDW_Recommendation_Smartmonk_26-12-14
 
Using Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy ManagementUsing Linked Data and the Internet of Things for Energy Management
Using Linked Data and the Internet of Things for Energy Management
 
Top Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama SoftwareTop Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama Software
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
 
Big Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech LabBig Data and Analytics by Silver Touch Tech Lab
Big Data and Analytics by Silver Touch Tech Lab
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
 
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
Introduction to Triton Webcast Metrics for NPR Member Stations - 3/8/12
 

More from ianibbo

Slides from Gangplank discussion
Slides from Gangplank discussionSlides from Gangplank discussion
Slides from Gangplank discussion
ianibbo
 
Ki presv2
Ki presv2Ki presv2
Ki presv2
ianibbo
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8d
ianibbo
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8d
ianibbo
 
Aggregator appoverview
Aggregator appoverviewAggregator appoverview
Aggregator appoverview
ianibbo
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011
ianibbo
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011
ianibbo
 
Setting up and open fidy dev environment
Setting up and open fidy dev environmentSetting up and open fidy dev environment
Setting up and open fidy dev environment
ianibbo
 
OpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture OverviewOpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture Overview
ianibbo
 
GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011
ianibbo
 
GIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield AppGIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield App
ianibbo
 
Making Agile Pay
Making Agile PayMaking Agile Pay
Making Agile Pay
ianibbo
 
Geekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web PrimerGeekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web Primer
ianibbo
 

More from ianibbo (13)

Slides from Gangplank discussion
Slides from Gangplank discussionSlides from Gangplank discussion
Slides from Gangplank discussion
 
Ki presv2
Ki presv2Ki presv2
Ki presv2
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8d
 
Xcri aggr dev8d
Xcri aggr dev8dXcri aggr dev8d
Xcri aggr dev8d
 
Aggregator appoverview
Aggregator appoverviewAggregator appoverview
Aggregator appoverview
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011
 
Gist od2-feb-2011
Gist od2-feb-2011Gist od2-feb-2011
Gist od2-feb-2011
 
Setting up and open fidy dev environment
Setting up and open fidy dev environmentSetting up and open fidy dev environment
Setting up and open fidy dev environment
 
OpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture OverviewOpenFIDY Module & Architecture Overview
OpenFIDY Module & Architecture Overview
 
GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011GIST Mobile Group - Welcome jan2011
GIST Mobile Group - Welcome jan2011
 
GIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield AppGIST Open Conversation - Sheffield App
GIST Open Conversation - Sheffield App
 
Making Agile Pay
Making Agile PayMaking Agile Pay
Making Agile Pay
 
Geekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web PrimerGeekup Sheffield Semantic Web Primer
Geekup Sheffield Semantic Web Primer
 

Recently uploaded

A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 

Recently uploaded (20)

A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 

Open spatial processing

  • 1. Open Spatial Data Progress towards a reusable gazetteer th Open Data Group – 16 April 2012 @ianibbo This work is licensed under a Creative Commons Attribution 3.0 Unported License.
  • 2.
  • 3. Overview Original Problem How to transition a central gov't funded aggregation of childcare and positive activities with a budget of >2m / year to an open data* model running on £60/ month hardware Retaining security (Of a certain level) Retaining functionality (See http://www.madwdata.org.uk/blog/id/394)
  • 4. 2 Major Costs To Mitigate Large cluster of proprietary OS hosts, ~12 front end web servers, hot backup sql server Migrated to 1*Pound Host server ~£60/month, server has 2 hard drives, hot backup, off site rsync Data costs – BPH Address-Point data – Used for geocoding incoming records and lookups on search terms. OS Boundary Line ???
  • 5. Some Noise Open Spatial Data Consultation......
  • 6. Open Spatial Data Ordnance Survey Open Data http://www.ordnancesurvey.co.uk/oswebsite/products/os-lo Code Point Open Postcodes to Northing/Easting OS Locator Gazetteer of road names (And other features) Obtained by registering on website, requesting, getting email, following link, …..
  • 7. The reality of CodePoint Open The core data is “Open” Missing the one vital link between CodePoint Open and OS Locator – PostCode → Road Names / Identifiers. If you're happy to display Postcodes without road names, it's ideal. Last Mile Problem. Finding an automated way to link the 2 is hard! Licensed data is now open, but out of date
  • 8. Address Point Still Licensed Expensive Probably not that useful anyway for most projects
  • 9. Problem with focus on “Open Data” Everyone ends up implementing their own gazetteer Large scale providers have rate limits and introduce external dependencies / Speed issues People want local geo-coding (for lots of different reasons). Having rolled your own gazetteer, you discover you need to handle updates (Full replacements) It's not an end in itself
  • 10. Vision A stand-alone gazetteer web app designed for local network use with features for importing updates from OS, reconciling multiple data sources and performing geo-coding lookups.
  • 11. Available Tools Apache SOLR Long-Standing stalwart of the open data and search community Schemas slightly clunky Several spatial options, all with different strengths / weaknesses. Multiple points a problem in some. ElasticSearch Schema Free, Apparently Solid Spatial, Multi Points Good integration with Mongo via Rivers
  • 12. Problems / Issues ES Spatial search hard to do directly via a COOL URL Spatial query syntax is expressive, but complex and needs JSON sub-documents Need service wrappers But thats easily done Updates!
  • 13. Missed Level of Abstraction (Common to many open data sets?) Local Copy C o Sourc m Processin e pa g re NOSQL Like ES Ideal for Mongo is ideal for this this
  • 14. Progress Starting to extract code from existing services into a generic spatial app https://github.com/ianibo/AnOpenGazetteerFramewo Work progressing under aegis of GIST Mobile group / Open Data group Workable Gaz now, but command line interface for importing.
  • 16. Some supporting info Original Project – FOI request to DfE Total costs - First 3 years 7000000 Local Authority Consultation sem- 6000000 Revenue inars Local Authority Capi- Methods Consulting 5000000 tal Central Office of In- Engine Group 4000000 formation Qi Consulting Digital Public 3000000 Redhouse Tribal Education DfE Staff Costs 2000000 1000000 0 2008-09 2009-10 2010-11
  • 17. First 3 years - Non LA costs 2500000 Central Office of In- 2000000 formation Qi Consulting Redhouse 1500000 DfE Staff Costs Consultation sem- inars 1000000 Methods Consulting Engine Group 500000 Digital Public Tribal Education 0 2008-09 2009-10 2010-11