IBM Open Data
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

IBM Open Data

on

  • 2,815 views

Open Data is a global trend. This presentation describes part of the trend, focuses on one of the most advanced initiatives - Dublinked.ie in Ireland and provides guidance on how to get started.

Open Data is a global trend. This presentation describes part of the trend, focuses on one of the most advanced initiatives - Dublinked.ie in Ireland and provides guidance on how to get started.

Statistics

Views

Total Views
2,815
Views on SlideShare
2,814
Embed Views
1

Actions

Likes
7
Downloads
104
Comments
0

1 Embed 1

http://www.linkedin.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

IBM Open Data Presentation Transcript

  • 1. Open DataAnders Quitzau, Innovation Executive andersq@dk.ibm.com 1
  • 2. IBM Open DataWhat is Open Data ?Open data is:•  Data that is generally provided at no cost and with no license constraint•  Machine readable and often people readable•  Accessed via files or APIs especially for linked data. The data may be static or dynamic.Why is Open Data being used ?•  Transparency : to build stakeholder understanding and confidence. Increasing data visibility both inside and outside the organization•  Cost efficiency: to reduce the cost and effort of developing, operating and delivering services (both IT and civic services)•  Economic : to enable innovation by small agile businesses to develop new insight, new services and new products (e.g. tools for ‘co-creation’, innovative systems of engagement)Who?•  Governments agencies and the research community•  Private industry looking for a competitive advantage © 2013 IBM Corporation 2
  • 3. Open Data is big and growingMcKinsey and Associates estimates the Potential revenue to EU governmentsannual economic value of big, open liquid could be around €27 b - Gartnerhealth data at about $350 billion annuallyOpen data today:162 governments43 countries> 1 million data sets “The API which has easily 10 times More traffic than the website, Open weather data collected by the National Oceanic and Atmospheric Association has an annual estimated economic value of $10 billion “$7bn worth of items on eBay through APIs” of items on eBay through “$7bn worth h"p://visual.ly/open-­‐data-­‐movement   3
  • 4. IBM Open DataAnd in Denmark – Open Data movement is emerging © 2013 IBM Corporation
  • 5. And IBM is participating….. Maintains a catalog of open data provided by city TJW Research project to build governments with a semantic model for smart visualization tools and cities collaboration Developed strategies for creating visualizations that Web site with Cognos can enable citizens make software that can be used Helsinki use of and benefit from to visualize open data open data ManyEyes   Research project with the Nantes Intelligent transportation City of Minneapolis. Uses CoOp   internal and open data to improve Traffic, Public Intelligent Transportation is using Open Data from Nantes to feed Traffic Prediction Safety, Planning KPI’s Open data portal to Uses open data to create facilitate data-driven high-resolution weather innovation in an urban forecasts environment 5
  • 6. IBM Open DataBuildings – UK public buildings – example © 2013 IBM Corporation
  • 7. IBM Open DataNew York City OpenData – Building Footprints © 2013 IBM Corporation
  • 8. Dublinked – Open data as a catalyst for innovation andeconomic development •  Open  Data  from   public  sector  for   public  –  and   restricted  -­‐   consump=on   •  Innova=on   network  of   academic,  public  &   commercial   research   •  Sandbox  for   experiment  and   test  plus  poten=al   roll-­‐out  
  • 9. IBM Open DataDublinked Site Update •  New  data  added  almost  daily   •  Increasing  spread  of  contribu=on  from   Authori=es   •  Live  data  is  most  downloaded   •  NUIM  now  managing  members  directly     Content  is  growing  nicely,  our  focus  has  shiLed  to   differen=ated  func=on  ….     How  can  we  make  Dublinked  the  most  advanced   Data  Ecosystem  in  the  World?   © 2013 IBM Corporation
  • 10. IBM Open DataInformation layers on; on; Traffic sensors, finding, buildings, air quality, Information layers Traffic sensors, way way finding, buildings, air Dublin city datanoise, cycling, road works,road works, complaints... quality, noise, cycling, complaints... Fire Brigade, Waste, Water, Transport, Utilities Parks, Planning, Environment, Heritage, Litter, Maintenance Arts, Culture, Sports & Recreation Housing, Social and Community Services © 2010 IBM Corporation © 2013 IBM Corporation
  • 11. IBM Open DataLive data: Integrate live streaming data (via IBM System S) Next steps: live dataNext steps: Integrate live streaming data (via IBM System S) Real Time  data  goes  “stale  “  very  quickly  and   becomes historic data`. DCC has Real Time data on Dublin Bus Fleet Allows IBM access to data and provide server space. 1000 vehicles provide location information every 20 – 30 Seconds © 2013 IBM Corporation © 2010 IBM Corporation
  • 12. IBM Open DataLinked Open Data All data is now using publishing using linked data standards – i.e. the data is structured in a way that that it can be interlinked and become more useful. This allows DUBLinked to share information in a way that can be read automatically by any computer via the internet. This enables data from different sources to be connected and queried. © 2013 IBM Corporation
  • 13. IBM Open DataBenefits seen - Dublinked•  Interrnal Efficiency Benefits•  Demonstrate that city administrators are doing a reasonable, good job•  Demonstrate that you are complying with regulatory requirements and reporting•  Compare yourself with other organisations, cities, countries etc•  Grow new businesses and jobs © 2013 IBM Corporation
  • 14. IBM Open DataOpen Data Start-up Mypp.ie © 2013 IBM Corporation
  • 15. IBM Open DataLinked Open Data - Continued Metadata  is  linked  to  external  websites,  like  DBPedia,  when  appropriate.  This  removes   ambiguity  when  talking  about  things  (e.g.  “Apple”  –  the  fruit  or  the  manufacturer  of  iPads  ?)   and  can  allow  cross-­‐site  query  capability:   © 2013 IBM Corporation
  • 16. IBM Open DataLinked Open Data - Continued Metadata is linked to external vocabularies, like IPSV, when appropriate. This allows cross linking to other open data sites that use the same vocabularies: © 2013 IBM Corporation
  • 17. IBM Open DataOnline mapping of files with geospatial fields Files  that  contain  geospa=al  informa=on  such  as  longitude  and  la=tude  can  be  mapped   automa=cally  :   © 2013 IBM Corporation
  • 18. IBM Open DataExample ReasonableCity Problem   How  can  we  provide  City  decision  makers  with  explana=ons  and  diagnoses   for  events  by  applying  machine  reasoning  techniques  to  a  fusion  of   massive,  rich,  complex  and  dynamic  data?  How  can  we  move  from   explana=on  to  predic=on?   Challenges   •  Iden?fying  relevant  data  and  informa=on   •  Capturing  and  represen?ng  anomalies   •  Correla?ng  knowledge  on  heterogeneous  data  sources   •  Advanced  fusion  of  data   Goals   •  Iden=fica=on  of  the  nature  and  cause  of  changes   •  Explaining  logical  connec?on  of  knowledge  across  space  and  =me   •  Move  from  explana=on  to  predic=on   Detection to Diagnosis? Anomaly Detected: Delayed buses, congested roads © 2013 IBM Corporation
  • 19. IBM Open Data Next steps: Gov2Gov and beyondAnd there certainlyinterest in interest in DublinkedThere is very broad is broad Dublinked Property management Commercial valuations and rates Business & Retail Housing Tourism Mapping Events Pool resources Share results Demographics Heritage Environment Crime Waste Collection Health Fault Reporting Water Transport & Access Planning © 2013 IBM Corporation © 2010 IBM Corporation
  • 20. IBM Open DataEmerging Business Models, examples•  Premium Product / Service. HospitalRegisters.com•  Freemium Product / Service. public transportation in urban areas.•  Open Source. OpenCorporates and OpenPolis•  Infrastructural Razor & Blades. Public Data Sets on Amazon Web Service•  Demand-Oriented Platform. DataMarket and Infochimps•  Supply-Oriented Platform. Socrata and Microsoft Open Government Data Initiative•  Free, as Branded Advertising. IBM City Forward, IBM Many Eyes or Google Public Data Explorer•  White-Label Development.. This business model has not consolidated yet, but some embryonic attempts seem to be particularly promising.Source: Business Model & Policy Innovation Unit at the Istituto Superiore Mario Boella © 2013 IBM Corporation
  • 21. IBM Open DataDublinked membership fee structure © 2013 IBM Corporation
  • 22. IBM Open DataNext steps Dublinked & Social Program Data: –  Release non-sensitive social data to Dublinked –  For example, Housing Stock data –  Research on anonymisation & privacy Research for Smarter Social Programs (outside Dublinked) –  Multilevel analysis of high-cost/high-need regions and citizens •  Identify regions where service providers are overloaded or inaccessible •  Recommend alternative service providers to balance demand •  Forecast regions that will become problematic –  Identifying patterns to provide recommendations for interventions •  Identify patterns for those that are chronically homeless •  Highlight those that are identified as returning to recommend intervention –  Forecasting future requirements and emerging ‘hot spots’ •  Are risk factors, such as substance abuse, propagated through the social graph? •  Are social costs  propagated  through  the  social  or  spa=al  graph?     © 2013 IBM Corporation
  • 23. IBM Open DataHow to open up data•  Collect the data together into a single file or set of files.•  Resolve any licensing conflicts; for instance, different parts of the same dataset may be owned by different parties.•  Choose an open data license, such as the Open Data Commons Attribution License. Alternatives, including share-alike licenses are available from the Open Data Commons site.•  Upload the data to a publicly-accessible part of your website. Registration should not be required to access the data.•  Include the following information along with the data: •  License details (from step 3) •  Technical details of the format that the data is stored in. Note that this does not need to be a standard format, as long as you can explain it to users, but it should not be in formats that require proprietary software to use (such as XLS), or in non-machine- readable formats (such as PDF). •  Details of when the data was last updated, and will be updated next. •  Provenance of the data, including details of original creator. •  Methodology information, such as how data was collected, calibrated, and transformed prior to upload. (credit: http://opendatahandbook.org) © 2013 IBM Corporation
  • 24. IBM Open DataOpen data is ‘just’ data, but there are a few new considerations Most current To date Open Open Data is by Open Data can Open Data is Open Data Open Data sets Data has been its very nature come from part of a represents a are not that mostly static ,but diverse in anywhere with spectrum of value migration large, but they real-time data content, format no guarantees visibility Most from the data are growing streams growing and semantic about quality or orgs have a mix itself to higher (e.g.. COSM) and can be authenticity of data from value analytics linked private to open and service Big  Data   Volume   Velocity   Variety   Veracity   Visibility   Value   Data  at   Data  in   Data  in                                   Data  in   Data  in           Data  of   Rest   Mo?on   Many  Forms   Doubt   the  Open   Many  Values   Terabytes to Streaming data, Uncertainty due to Structured, Open data is exabytes of milliseconds to data inconsistency Large range of data unstructured, generally open to existing data to seconds to & incompleteness, values from free text, multimedia anyone. Which process respond ambiguities, (data philanthropy raises issues of latency, deception, to high value privacy. Security model monetization and provenance approximations Big Data + Open Data + Co-Creation = New opportunities © 2013 IBM Corporation 24
  • 25. IBM Open Data The technology; Conceptual roadmap for the system Open Innovation Portal Publication & Annotation Privacy & Security Challenges include .. Scalable privacy and security of resources Catalog & Navigation Search & Query Automated assimilation and sharing of resources Knowledge Representation & Visualization & Analytics Reasoning Robust models to organize and represent resources and their context Open REST Web Services API Represent knowledge efficiently for continuous machine reasoning and diagnosis Enterprise Platform IBM IOC IBM Connections Compose resources for development, mash-up & Interaction with Industry Solutions Content Sharing & Collaboration visualization IBM Enterprise Cloud Scalable compute, storage & network infrastructure Key IBM Research IBM Products & Services Dublin Provider Enterprise Citizen City 1…N Partners & People © 2013 IBM Corporation © 2010 IBM Corporation
  • 26. IBM Open DataIBM have the technologies to deliver a scalable and robust Open DataplatformValue Proposition:•  IBM has the strength & depth of technology to help clients exploit Open Data as a Strategic Information Asset•  We can bring a robust Data lifecycle management & governance ( trust, privacy, . ) approach to Open Data, This is key !IBM have technologies needed e.g. :•  Information Server for data quality•  Vivisimo (for linked data access ) & Initiate for federated Master Data Management•  Guardium for privacy management and Optim for application archive•  Infosphere Streams for Data-in-Motion•  BigInsights Hadoop-based analysis•  New Cloud-hosted spreadsheet functionality © 2013 IBM Corporation 26
  • 27. IBM Open DataHaving opened up – now what?•  Tell the world! •  Understanding your audience •  Post your material on third-party sites •  Making your communications more social-media friendly•  Social media•  Engage the publich: •  Unconferences, Meetups and Barcamps•  Making things! Challenges, requests, hackdays, prizes and prototypes•  (credits: http://opendatahandbook.org) © 2013 IBM Corporation
  • 28. IBM Open Data Anders  Quitzau,  Innova=on  Execu=ve  –  andersq@dk.ibm.com   Peter  Lange,  CTO  IBM  Smarter  Ci=es  –  petlan@dk.ibm.com   © 2013 IBM Corporation