SlideShare a Scribd company logo
1 of 27
From Data Chaos to Actionable Intelligence: How the Convergence of the GeoWeb and Semantic Web is Revolutionizing the Way We Process Information Where 2.0 2008 Sean Gorman FortiusOne Inc.
AKA: Trying to GeoHack Without Getting Arrested
Geography of the Web Paul Torrens Steve Coast Martin Dodge Anthony Townsend Matt Zook
Big Data Sets Algorithmic Analysis Map Logical Results to Physical Realities
NYSE – SIAC Routing Vulnerabilities
Then these guys show up
Then these guys rescue you
 
Geography on the Web
Small data sets with largely unstructured content
Crowdsourcing large structured data sets with quantitative capabilities
Didn’t we try this last year?
What happens when your database reaches 1,683,185,246 features
 
Rinse and repeat per query type
or
Build a lightweight object database
 
 
OpenLocation Lots of silos of GeoData how can we begin to interconnect them Infochimps.org
Can we federate our data
OpenLocation Linked repositories of data form a graph of content
Node “ Y ” is connected to Node “ X ” by the link “User Action” User adds structured dataset “ X ” to their 3 rd  Party URL string “ Y ” Structured Dataset “ X” Third Party URL “ Y”
Structured Dataset “ X” Third Party URL “ Y” Third Party URL “ Z” Data Set Degree X 2 Y 1 Z 1
Creates the Ability to Intelligently Serve Content User searches on a term and gets a result based on tags and full text query then weighted by degree and user ratings: The system can now recommend data URL “Y” and “Z” as data that may be useful context for dataset “X” even though they may have no tags in common The graph can be encapsulated in code and communicated to the outside world Structured Dataset “ X” Third Party URL “ Y” Third Party URL “ Z”
1. Bring a new class of content/data to the Web 2. Intelligently mix that content with the rest of the Web 3. Enable the content to answer meaningful questions for users
If you would like to check out the Finder! Beta see us for a card or go here: http://finder.geocommons.com / If you would like to talk about federating geodata email me at: [email_address]

More Related Content

What's hot

Participatory Sensing through Social Networks
Participatory Sensing through Social NetworksParticipatory Sensing through Social Networks
Participatory Sensing through Social NetworksIoannis Krontiris
 
The Emergence of a Third Wave of Open Data
The Emergence of a Third Wave of Open DataThe Emergence of a Third Wave of Open Data
The Emergence of a Third Wave of Open Datathegovlabnyu
 
Open data presentation 2013 v0 5
Open data presentation 2013 v0 5Open data presentation 2013 v0 5
Open data presentation 2013 v0 5Alan Kong
 
Linked Open Government Data: What’s Next?
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?Li Ding
 
Mac373 med312 data journalism lecture
Mac373 med312 data journalism lectureMac373 med312 data journalism lecture
Mac373 med312 data journalism lectureRob Jewitt
 
Denver's Open Data Initiative
Denver's Open Data InitiativeDenver's Open Data Initiative
Denver's Open Data InitiativeAllan Glen
 
Building a Digital Manifesto
Building a Digital ManifestoBuilding a Digital Manifesto
Building a Digital ManifestoSarah Granger
 
Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013
Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013
Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013AmbasciatadelCanada
 
Data power and Civic Hacking at mySociety
Data power and Civic Hacking at mySocietyData power and Civic Hacking at mySociety
Data power and Civic Hacking at mySocietysbaack
 
The GIS Guide to Public Domain Data
The GIS Guide to Public Domain DataThe GIS Guide to Public Domain Data
The GIS Guide to Public Domain DataEsri
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...Elena Simperl
 
Memory Connected
Memory ConnectedMemory Connected
Memory ConnectedLi Ding
 
VGI Overview - Crowdsourcing Participatory Mapping
VGI Overview - Crowdsourcing Participatory MappingVGI Overview - Crowdsourcing Participatory Mapping
VGI Overview - Crowdsourcing Participatory MappingDany Laksono
 
Treasury Board of Canada - Open Government / Open Data in Canada - July 2013
Treasury Board of Canada - Open Government / Open Data in Canada - July 2013Treasury Board of Canada - Open Government / Open Data in Canada - July 2013
Treasury Board of Canada - Open Government / Open Data in Canada - July 2013AmbasciatadelCanada
 
Public policy and the information revolution 4.20.12
Public policy and the information revolution 4.20.12Public policy and the information revolution 4.20.12
Public policy and the information revolution 4.20.12Greg Wass
 
How Linked Data is transforming eGovernment
How Linked Data is transforming eGovernmentHow Linked Data is transforming eGovernment
How Linked Data is transforming eGovernmentNikos Loutas
 

What's hot (20)

Participatory Sensing through Social Networks
Participatory Sensing through Social NetworksParticipatory Sensing through Social Networks
Participatory Sensing through Social Networks
 
The Emergence of a Third Wave of Open Data
The Emergence of a Third Wave of Open DataThe Emergence of a Third Wave of Open Data
The Emergence of a Third Wave of Open Data
 
Open data presentation 2013 v0 5
Open data presentation 2013 v0 5Open data presentation 2013 v0 5
Open data presentation 2013 v0 5
 
Linked Open Government Data: What’s Next?
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?
 
Mac373 med312 data journalism lecture
Mac373 med312 data journalism lectureMac373 med312 data journalism lecture
Mac373 med312 data journalism lecture
 
Denver's Open Data Initiative
Denver's Open Data InitiativeDenver's Open Data Initiative
Denver's Open Data Initiative
 
mit data vr jan17
mit data vr jan17mit data vr jan17
mit data vr jan17
 
Data journalism Overview
Data journalism OverviewData journalism Overview
Data journalism Overview
 
Building a Digital Manifesto
Building a Digital ManifestoBuilding a Digital Manifesto
Building a Digital Manifesto
 
Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013
Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013
Domenico Donvito - Istat - Open Data in Official Statistics - 10 July 2013
 
Data power and Civic Hacking at mySociety
Data power and Civic Hacking at mySocietyData power and Civic Hacking at mySociety
Data power and Civic Hacking at mySociety
 
The GIS Guide to Public Domain Data
The GIS Guide to Public Domain DataThe GIS Guide to Public Domain Data
The GIS Guide to Public Domain Data
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
 
Data, Indicators and Maps on Homelessness
Data, Indicators and Maps on HomelessnessData, Indicators and Maps on Homelessness
Data, Indicators and Maps on Homelessness
 
Data journalism
Data journalismData journalism
Data journalism
 
Memory Connected
Memory ConnectedMemory Connected
Memory Connected
 
VGI Overview - Crowdsourcing Participatory Mapping
VGI Overview - Crowdsourcing Participatory MappingVGI Overview - Crowdsourcing Participatory Mapping
VGI Overview - Crowdsourcing Participatory Mapping
 
Treasury Board of Canada - Open Government / Open Data in Canada - July 2013
Treasury Board of Canada - Open Government / Open Data in Canada - July 2013Treasury Board of Canada - Open Government / Open Data in Canada - July 2013
Treasury Board of Canada - Open Government / Open Data in Canada - July 2013
 
Public policy and the information revolution 4.20.12
Public policy and the information revolution 4.20.12Public policy and the information revolution 4.20.12
Public policy and the information revolution 4.20.12
 
How Linked Data is transforming eGovernment
How Linked Data is transforming eGovernmentHow Linked Data is transforming eGovernment
How Linked Data is transforming eGovernment
 

Viewers also liked

Gsapp xcities gorman_upload
Gsapp xcities gorman_uploadGsapp xcities gorman_upload
Gsapp xcities gorman_uploadseagor
 
Geo Web F1 Enterprise
Geo Web F1 EnterpriseGeo Web F1 Enterprise
Geo Web F1 Enterpriseseagor
 
Oscar twitter geo_sentiment
Oscar twitter geo_sentimentOscar twitter geo_sentiment
Oscar twitter geo_sentimentseagor
 
Black Friday Twitter Brand Analysis
Black Friday Twitter Brand AnalysisBlack Friday Twitter Brand Analysis
Black Friday Twitter Brand Analysisseagor
 
Wherecamp Lightningtalk - GeoMining Social Media
Wherecamp Lightningtalk - GeoMining Social MediaWherecamp Lightningtalk - GeoMining Social Media
Wherecamp Lightningtalk - GeoMining Social Mediaseagor
 
Just in Time Analytics - Where Conference
Just in Time Analytics - Where ConferenceJust in Time Analytics - Where Conference
Just in Time Analytics - Where Conferenceseagor
 
The State of Big Data for Geo - ESRI Big Data Meetup
The State of Big Data for Geo - ESRI Big Data MeetupThe State of Big Data for Geo - ESRI Big Data Meetup
The State of Big Data for Geo - ESRI Big Data Meetupseagor
 

Viewers also liked (7)

Gsapp xcities gorman_upload
Gsapp xcities gorman_uploadGsapp xcities gorman_upload
Gsapp xcities gorman_upload
 
Geo Web F1 Enterprise
Geo Web F1 EnterpriseGeo Web F1 Enterprise
Geo Web F1 Enterprise
 
Oscar twitter geo_sentiment
Oscar twitter geo_sentimentOscar twitter geo_sentiment
Oscar twitter geo_sentiment
 
Black Friday Twitter Brand Analysis
Black Friday Twitter Brand AnalysisBlack Friday Twitter Brand Analysis
Black Friday Twitter Brand Analysis
 
Wherecamp Lightningtalk - GeoMining Social Media
Wherecamp Lightningtalk - GeoMining Social MediaWherecamp Lightningtalk - GeoMining Social Media
Wherecamp Lightningtalk - GeoMining Social Media
 
Just in Time Analytics - Where Conference
Just in Time Analytics - Where ConferenceJust in Time Analytics - Where Conference
Just in Time Analytics - Where Conference
 
The State of Big Data for Geo - ESRI Big Data Meetup
The State of Big Data for Geo - ESRI Big Data MeetupThe State of Big Data for Geo - ESRI Big Data Meetup
The State of Big Data for Geo - ESRI Big Data Meetup
 

Similar to Where 2.0 NoSQL Presentation 2008 - GeoIQ

The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic webTony Dobaj
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltoolssuresh sood
 
DCA Symposium 6 Feb 2023.pdf
DCA Symposium 6 Feb 2023.pdfDCA Symposium 6 Feb 2023.pdf
DCA Symposium 6 Feb 2023.pdfAlan Morrison
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Rinke Hoekstra
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so farElena Simperl
 
Self adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofSelf adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofNurfadhlina Mohd Sharef
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008Blogtalk 2008
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...Edward Curry
 
Open data Websmatch
Open data WebsmatchOpen data Websmatch
Open data Websmatchdata publica
 
Omitola birmingham cityuniv
Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityunivTope Omitola
 
鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107皓仁 柯
 
Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do Haklae Kim
 
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Amit Sheth
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?Elena Simperl
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextMurad Daryousse
 

Similar to Where 2.0 NoSQL Presentation 2008 - GeoIQ (20)

The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic web
 
Broad Data
Broad DataBroad Data
Broad Data
 
Semantic Web vision and its relevance to Open Digital Data for MGI
Semantic Web vision and its relevance to Open Digital Data for MGISemantic Web vision and its relevance to Open Digital Data for MGI
Semantic Web vision and its relevance to Open Digital Data for MGI
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltools
 
Ngdm09 han gao
Ngdm09 han gaoNgdm09 han gao
Ngdm09 han gao
 
DCA Symposium 6 Feb 2023.pdf
DCA Symposium 6 Feb 2023.pdfDCA Symposium 6 Feb 2023.pdf
DCA Symposium 6 Feb 2023.pdf
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
Self adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofSelf adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation of
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
 
Open data Websmatch
Open data WebsmatchOpen data Websmatch
Open data Websmatch
 
Omitola birmingham cityuniv
Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityuniv
 
鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107
 
Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do
 
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data Context
 

Where 2.0 NoSQL Presentation 2008 - GeoIQ

  • 1. From Data Chaos to Actionable Intelligence: How the Convergence of the GeoWeb and Semantic Web is Revolutionizing the Way We Process Information Where 2.0 2008 Sean Gorman FortiusOne Inc.
  • 2. AKA: Trying to GeoHack Without Getting Arrested
  • 3. Geography of the Web Paul Torrens Steve Coast Martin Dodge Anthony Townsend Matt Zook
  • 4. Big Data Sets Algorithmic Analysis Map Logical Results to Physical Realities
  • 5. NYSE – SIAC Routing Vulnerabilities
  • 6. Then these guys show up
  • 7. Then these guys rescue you
  • 8.  
  • 10. Small data sets with largely unstructured content
  • 11. Crowdsourcing large structured data sets with quantitative capabilities
  • 12. Didn’t we try this last year?
  • 13. What happens when your database reaches 1,683,185,246 features
  • 14.  
  • 15. Rinse and repeat per query type
  • 16. or
  • 17. Build a lightweight object database
  • 18.  
  • 19.  
  • 20. OpenLocation Lots of silos of GeoData how can we begin to interconnect them Infochimps.org
  • 21. Can we federate our data
  • 22. OpenLocation Linked repositories of data form a graph of content
  • 23. Node “ Y ” is connected to Node “ X ” by the link “User Action” User adds structured dataset “ X ” to their 3 rd Party URL string “ Y ” Structured Dataset “ X” Third Party URL “ Y”
  • 24. Structured Dataset “ X” Third Party URL “ Y” Third Party URL “ Z” Data Set Degree X 2 Y 1 Z 1
  • 25. Creates the Ability to Intelligently Serve Content User searches on a term and gets a result based on tags and full text query then weighted by degree and user ratings: The system can now recommend data URL “Y” and “Z” as data that may be useful context for dataset “X” even though they may have no tags in common The graph can be encapsulated in code and communicated to the outside world Structured Dataset “ X” Third Party URL “ Y” Third Party URL “ Z”
  • 26. 1. Bring a new class of content/data to the Web 2. Intelligently mix that content with the rest of the Web 3. Enable the content to answer meaningful questions for users
  • 27. If you would like to check out the Finder! Beta see us for a card or go here: http://finder.geocommons.com / If you would like to talk about federating geodata email me at: [email_address]

Editor's Notes

  1. Our story started with trying to GeoHack while avoiding Jonny Law
  2. Specifically we started off researching the geography of the web – what is the structure of the plumbing of the Internet – the autonomous system graph, the router graph, IP addresses. It was a great collaboration of colleagues at UCL, Berkley, NYU and GMU.
  3. Our specialty was really big data sets like the router and AS graphs. We’d use statistical mechanics and graph theory to model and analyze the data then map those logical results to their physical realities.
  4. This resulted in discovering little tidbits like how to take down the NYSE.
  5. Resulting in the guys in dark suits showing up
  6. Fortunately the folks at IQT – the intelligence communities venture fund – rescued us.
  7. IQT has been part of several key technologies driving the GeoWeb. One of their early investments was metacarta the folks behind openlayers which many of us in the community benefit from. They were also the funding behind Keyhole – commonly known as Google Earth the iconic application of the GeoWeb. They continued to fund the leading edge with technologies like SketchUp. IQT and the government at large have been a big impetus and source of funding pushing GeoWeb innovation.
  8. While we started researching the “Geography of the Web” it was “”Geography on the Web” that had captured the worlds attention. The creation of mirror worlds on the web with amazing satellite imagery and 3D visualizations
  9. But…the majority of data populating these mirror worlds were small and largely descriptive text and photos.
  10. Leveraging our love for big data sets we developed GeoCommons – a crowdsourced repository of large structured datasets with quantitative capabilities
  11. Yes last year on this stage we launched GeoCommons – so what happened – where did it go in one year.
  12. We got lots of people contributing data – so much data we killed our database
  13. This chart shows why. The problem with big datasets is they fill up databases very rapidly. 95% of the data in GeoCommons had over 100 features and over half had more that 5000 features and several had more than 100,000 features.
  14. The problem was we were trying store and query heterogeneous data at scale. To do this we had to normalize the data, causing the tables to get huge. Then we had to optimize to not only manage the table size, but do that for all queries types and functions. It was a Sisyphean cycle – everything helped but nothing solved the problem – the rock rolled to the bottom of the hill each time. So, you can keep fighting the battle….
  15. Leveraging a lightweight object database construct we were able to rapidly store, mine, access, and translate data – we were able to fit well over a billion features into 15 gigs of storage. The best illustration is to see a demo of the platform in action.
  16. Introducing Finder!
  17. We are going to leverage the GeoCommons platform to next build Maker – a cartographically empowered map creation tool and Atlas – a collaborative application to allow users to tell multimedia stories around maps.
  18. We are not the only folks bringing geo-content to the web. There are many clever people doing innovative work to bring more rich data to the masses, but how can we begin to interconnect it all.
  19. Can we federate our data
  20. We’ve been having some early discussions on how we can create a network of data repositories to form a graph of content. Data stored securely and linked together – where the cloud and the data are owned by everyone.
  21. Allowing a user to grab a URL from one source via web services and map it with a structured dataset via object federation from another source. By consciously mapping these two datasets together the user has created semantic meaning between them – going beyond just the syntax of tags. For instance a user may map crime rates from Every Block with housing prices from Zillow to solve a problem on where to buy a house. They could have no tags or syntax relating them but now they have a far more meaningful semantic relationship.
  22. These relations form a graph relating content semantically together across the web. Whether you implement with RDF, ATOM or any other practical approach you have a highly efficient computational construct to relate content.
  23. This allows you to make relevant recommendations to the user to solve meaningful problems.
  24. Our goal is to deliver technology that accomplished three big objectives. I believe that an ecosystem of companies striving for similar objective will be what constitute another evolution of the Web.