ORGpedia
The Open Organizational
Data Project
Prof. Beth Simone Noveck
Joel Gurin, Executive Director
Website: info@3RoundStones.com
@3RoundStones
Main +1-877-290-2127
Direct +1-571-331-3758
ORGpedia site development team:
Luke Ruth, Application Developer
David Wood, CTO
Bernadette Hyland, CEO
Funded by the
Project: Do Tank
http://dotank.nyls.edu
Friday, June 21, 13
Agenda
• Update on ORGpedia site development
• Project status
• Datasets
• Functionality review
• Review Next steps
• NYU stakeholder review
• Engaging contributors
Friday, June 21, 13
Problem Statement
• Taxpayers spend billions to have government collect data
• Machine readable content is the new default for
government (OS&T M13-13)
• Yet finding, accessing and combining data is difficult ...
• ORGpedia: the Open Organizational Data Project is
opening new possibilities
• Based entirely on OpenWeb Standards, Open
Source & open government data
• Leverages the crowd for data augmentation
Friday, June 21, 13
Project Status Highlights
ORGpedia Phase 1 began
15-April 2013
Completed as of 20 June 2013
• Site live (QA)
• Data platform installed on
Rackspace Cloud
• Use case: Combine open
government data from
regulatory agencies for
nuclear plants
• Data sets: EPA Facilities,
EPA Toxic Releases, NRC
Violations, SEC, Open
Street Maps, DBpedia
• User Contributions
• Wikipedia
• Open Corporates
• Visualizations: maps &
charts
• Support for public &
authenticated users thru
Google, Facebook,Yahoo!
Friday, June 21, 13
ORGpedia Datasets
• U.S. EPA
• Facilities
• Toxic Releases
• US Nuclear Regulatory
Commission
• Violations
Friday, June 21, 13
ORGpedia is Innovative
• Combines US Government Regulatory data
without the expense, time and high failure rate of
traditional approaches
• Leverages crowdsourcing to fix dirty data and
builds on expert and/or local knowledge
• Open vs. proprietary approach
• Creates opportunity for new businesses &
startups
Friday, June 21, 13
Technical Innovation
• Linked Data --> Easy combination of datasets
• Leverages an innovative Open Source data
platform (Callimachus)
• Crowdsourced contributions
• Contributions overlay (do not overwrite)
authoritative data
• Web-scale
Friday, June 21, 13
Friday, June 21, 13
“Linked Data allows for
cooperation without coordination”
- DavidWood
Friday, June 21, 13
OpenStack Cloud Services Amazon Web Services
Traditional networks
Persistent URL (PURL)
Services
Linked Data
Services
Mobile Web Print
Friday, June 21, 13
ORGpedia is Interactive
• The public can view, comment, discuss
• Contributors can:
• overlay controlled fields
• Associate facilities with Linked Open
Data and other Web content (e.g. stock
quotes, images)
• Template-driven approach make extending
the site easy
Friday, June 21, 13
ORGpedia
User Contributions
Add/modify
• EPA Facility name
• Wikipedia abstract
• Corporate Owner(s)
• Wikipedia abstract
• Open Corporates ID
• Stock ticker
• SEC ID
• Related images
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Edit icons appear
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Friday, June 21, 13
Contribution
history
maintained
Friday, June 21, 13
ORGpedia Site Summary
• Addresses a compelling problem
• Live site
• Hosted on the cloud
• Driven by Open data
• Government
• Linked Open Data
• Crowdsourced
• Open Source Data platform
• Template driven approach
• Drives hundreds of millions of web pages
Callimachus
Friday, June 21, 13
Pitching ORGpedia ...
About the project and case studies
Friday, June 21, 13
This work is Copyright © 2013 3 Round Stones Inc.
It is licensed under the Creative Commons Attribution 3.0 Unported License
Full details at: http://creativecommons.org/licenses/by/3.0/
You are free:
to Share — to copy, distribute and transmit the work
to Remix — to adapt the work
Under the following conditions:
Attribution. You must attribute the work in the manner specified by the
author or licensor (but not in any way that suggests that they endorse
you or your use of the work).
Share Alike. If you alter, transform, or build upon this work, you may
distribute the resulting work only under the same or similar license to this
one.
Friday, June 21, 13

ORGpedia: The Open Organizational Data Project

  • 1.
    ORGpedia The Open Organizational DataProject Prof. Beth Simone Noveck Joel Gurin, Executive Director Website: info@3RoundStones.com @3RoundStones Main +1-877-290-2127 Direct +1-571-331-3758 ORGpedia site development team: Luke Ruth, Application Developer David Wood, CTO Bernadette Hyland, CEO Funded by the Project: Do Tank http://dotank.nyls.edu Friday, June 21, 13
  • 2.
    Agenda • Update onORGpedia site development • Project status • Datasets • Functionality review • Review Next steps • NYU stakeholder review • Engaging contributors Friday, June 21, 13
  • 3.
    Problem Statement • Taxpayersspend billions to have government collect data • Machine readable content is the new default for government (OS&T M13-13) • Yet finding, accessing and combining data is difficult ... • ORGpedia: the Open Organizational Data Project is opening new possibilities • Based entirely on OpenWeb Standards, Open Source & open government data • Leverages the crowd for data augmentation Friday, June 21, 13
  • 4.
    Project Status Highlights ORGpediaPhase 1 began 15-April 2013 Completed as of 20 June 2013 • Site live (QA) • Data platform installed on Rackspace Cloud • Use case: Combine open government data from regulatory agencies for nuclear plants • Data sets: EPA Facilities, EPA Toxic Releases, NRC Violations, SEC, Open Street Maps, DBpedia • User Contributions • Wikipedia • Open Corporates • Visualizations: maps & charts • Support for public & authenticated users thru Google, Facebook,Yahoo! Friday, June 21, 13
  • 5.
    ORGpedia Datasets • U.S.EPA • Facilities • Toxic Releases • US Nuclear Regulatory Commission • Violations Friday, June 21, 13
  • 6.
    ORGpedia is Innovative •Combines US Government Regulatory data without the expense, time and high failure rate of traditional approaches • Leverages crowdsourcing to fix dirty data and builds on expert and/or local knowledge • Open vs. proprietary approach • Creates opportunity for new businesses & startups Friday, June 21, 13
  • 7.
    Technical Innovation • LinkedData --> Easy combination of datasets • Leverages an innovative Open Source data platform (Callimachus) • Crowdsourced contributions • Contributions overlay (do not overwrite) authoritative data • Web-scale Friday, June 21, 13
  • 8.
  • 9.
    “Linked Data allowsfor cooperation without coordination” - DavidWood Friday, June 21, 13
  • 10.
    OpenStack Cloud ServicesAmazon Web Services Traditional networks Persistent URL (PURL) Services Linked Data Services Mobile Web Print Friday, June 21, 13
  • 11.
    ORGpedia is Interactive •The public can view, comment, discuss • Contributors can: • overlay controlled fields • Associate facilities with Linked Open Data and other Web content (e.g. stock quotes, images) • Template-driven approach make extending the site easy Friday, June 21, 13
  • 12.
    ORGpedia User Contributions Add/modify • EPAFacility name • Wikipedia abstract • Corporate Owner(s) • Wikipedia abstract • Open Corporates ID • Stock ticker • SEC ID • Related images Friday, June 21, 13
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
    ORGpedia Site Summary •Addresses a compelling problem • Live site • Hosted on the cloud • Driven by Open data • Government • Linked Open Data • Crowdsourced • Open Source Data platform • Template driven approach • Drives hundreds of millions of web pages Callimachus Friday, June 21, 13
  • 31.
    Pitching ORGpedia ... Aboutthe project and case studies Friday, June 21, 13
  • 32.
    This work isCopyright © 2013 3 Round Stones Inc. It is licensed under the Creative Commons Attribution 3.0 Unported License Full details at: http://creativecommons.org/licenses/by/3.0/ You are free: to Share — to copy, distribute and transmit the work to Remix — to adapt the work Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one. Friday, June 21, 13