Flanders Open Data Day II - KeyNote - Erik Mannens

Published in: Technology, Business

  1. ELIS – Multimedia Lab. What if we need Open Data, Linked Data, and Big Data together? dr. Erik Mannens, @erikmannens
  2. Open Data
  3. Way of … Thinking
  4. Silos of Data
  5. “Stop Hugging your Data”
  7. e.g. … Open Learning
  8. Open Linked Data
  9. Way of … Publishing
  10. Semantic Web
  12. Connect your Silos
  13. 5-stars (Technical Perspective): Open Linked Data (Tim Berners-Lee)
      • Make your Stuff available on the Web
      • Make it available as Structured Data
      • In a non-proprietary Format
      • Use URLs to identify Things, so one can point at your Stuff
      • Link your Data to other People’s Data to provide Context
  14. 5-stars (Organisational Perspective): Open Data Engagement (Tim Davies)
      • Be Demand-driven
      • Provide Context
      • Support Conversation
      • Build Skills & Capacity
      • Collaborate with the Community
  15. 5-stars (Functional Perspective): Open Data Portal Functionalities (iMinds)
      • Dataset Registry
      • Metadata Provider
      • Co-creation Platform
      • Data Publishing Platform
      • Common Data Hub
  16. Data as Commodity
  17. Sidenote: R&Wbase
  19. 15’ Open Data Publishing Framework, e.g. data.gent.be, opendata.antwerpen.be
  20. Publishes 2 to 5 Star Data: tdt/core, tdt/input, triple store
  21. RESTful API for Developers: triple store, core, RESTful data adapter; CSV, XLS, JSON, XML, SPARQL endpoint, ...; e.g. datatank.gent.be/Grondgebied/Straten or data.irail.be/NMBS/Stations
  22. R&Wbase: git for triples
  23. Read/Write LINKED DATA
  24. TRIPLE STORES: are they up for the challenge?
  25. Distributed Triple Version Control: Commits, Deltas, Virtual graphs, Versions (store, describe, identify, resolve)
  26. LIVE triples require fast version retrieval through a LIGHTWEIGHT algorithm
  27. Store triples using QUADS: <subject> <predicate> <object> <context>
  28. R&Wbase provides VERSION control and PROVENANCE for TRIPLESTORES with direct GRAPH access
  29. BIG Data
  30. Way of … Analyzing
  31. How Difficult Can It Be?
  32. Collaborative Effort found Higgs Boson
  33. Deep understanding of some key Big Data markets: Banking Industry, Healthcare Industry, Marketing Industry, Smart Cities
  34. Big (Data) Bang in Banking
      • The US Securities and Exchange Commission has estimated that it would need to collect 20 terabytes of data per month to monitor all US capital market activity
      • Unstructured data comprises some 80% of the total data held by the average financial institution
      • The total number of non-cash payments in the EU amounted to 90.6 billion in 2011
      • The total number of automatic teller machines (ATMs) in the EU in 2011 was 0.44 million
      • The number of point-of-sale (POS) terminals in the EU was 8.8 million in 2011
  35. What if it were OPEN & LINKED
  36. e.g. … OpenSpending
  37. e.g. … OpenSpending
  38. e.g. … OpenBank
  39. e.g. … OpenCorporates
  40. e.g. … OpenCorporates - Belgium
  41. Big (Data) Bang in Healthcare
      • Medical images are increasing by 20-40% annually
      • Electronic medical records: in 2009, 99% of primary care physicians in the Netherlands used EMRs, compared to 46% in the United States and 36% in Canada
      • Medical research, in which 100,000 participants are genotyped (ca. 1.5 GB/person), could result in a staggering 150 terabytes of data
      • As of July 2012 PatientsLikeMe members have shared 4,029,661 symptom reports about 7,338 symptoms and 548,650 treatment histories about 12,838 treatments
  42. What if it were OPEN & LINKED
  43. e.g. … PatientsLikeMe
  44. e.g. … 23AndMe
  45. e.g. … PlayStation III
  46. e.g. … OpenPhacts
  47. e.g. … DisQover (iMinds – Ontoforce)
  48. Big (Data) Bang in Marketing
      • Data use is expected to grow by as much as 44 times, amounting to some 35.2 ZB (zettabytes; 1 ZB is a billion terabytes) globally
      • Walmart handles more than 1 million customer transactions every hour, which are imported into databases estimated to contain more than 2.5 petabytes of data
      • Twitter has 200 million tweets per day or approximately 46 MB/sec of data created (August 2011)
      • 25% of search results for the World’s Top 20 largest brands are links to user-generated content
      • YouTube has 3 billion visitors per day, 48 hours of video is uploaded per minute (May 2011)
      • There are over 200,000,000 blogs: 34% of their posts are opinions about products & brands
  49. What if it were OPEN & LINKED
  50. e.g. … Consumers in 1990
  51. e.g. … Consumers in 2000
  52. e.g. … Consumers since 2010
  53. The Tyranny of the Empowered ConsYOUmers
  56. e.g. … GoodRelations
  57. e.g. … Nike
  58. Big (Data) Bang in Smart Cities
      • Data use is expected to grow by as much as 44 times, amounting to some 35.2 ZB (zettabytes; 1 ZB is a billion terabytes) globally
      • Sensors, social media feeds, photos, video and cellphone GPS signals account for 2.5 quintillion bytes of data per day
      • More than 50% of the global population lives in cities and this number is forecast to rise to 69% by 2050
      • The number of city residents is expected to grow from 3.5 billion to 5 billion in the next 20 years
      • ‘Internet of Things’ Age is approaching: 25 billion devices connected to the Internet by 2015 and 50 billion by 2020
      • Access to public data is estimated to be worth €27 billion in the EU
      • ICT-enabled energy efficiency could translate into over €600 billion worth of cost savings for the public and private sector
  59. What if it were OPEN & LINKED
  60. e.g. … OpenTransport
  61. e.g. … OpenTransport
  62. e.g. … OpenEnergyMonitor
  65. e.g. … Big Data … in Iceland?
  66. e.g. … a Trillion Sensors … in Iceland!
  72. QUESTIONS? Thoughts? dr. Erik Mannens, erik.mannens@ugent.be, @erikmannens
  73. Credits
      • EMC - Greenplum
      • Peter Hinssen
      • Scott Brinker
      • Jim Lecinski
      • David Armano
      • Did not have time to check all licenses of the Flickr photos – in my defense, I did not kill anyone nor did I in any way insult and/or infringe the CIA, NSA, NDA, or any other JAA (Just Another Acronym)
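
Slide 13 lists five cumulative criteria for Open Linked Data. As a rough illustration, the sketch below counts how many consecutive criteria a dataset meets, in the order given on the slide. The `Dataset` class and its field names are hypothetical and only mirror the slide's wording; they are not part of any tool named in the talk.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dataset:
    """Hypothetical description of a published dataset (not a real schema)."""
    on_the_web: bool
    structured: bool
    non_proprietary_format: bool
    uses_urls_for_things: bool
    linked_to_other_data: bool


def star_rating(ds: Dataset) -> int:
    """Count the criteria met in slide order, stopping at the first one missed."""
    criteria = [
        ds.on_the_web,
        ds.structured,
        ds.non_proprietary_format,
        ds.uses_urls_for_things,
        ds.linked_to_other_data,
    ]
    stars = 0
    for met in criteria:
        if not met:
            break
        stars += 1
    return stars


# Example: a CSV file on the Web, without URLs for things or outgoing links.
csv_on_web = Dataset(True, True, True, False, False)
print(star_rating(csv_on_web))
```

Stopping at the first unmet criterion reflects the cumulative reading of the stars: each star presupposes the ones before it.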
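
Slide 21 describes a RESTful API that serves the same resource in several formats, with example paths such as data.irail.be/NMBS/Stations. A minimal sketch of building such a resource URL follows. Two things here are assumptions, not taken from the slides: the `http://` scheme, and selecting the format by appending a `.<format>` suffix to the path. The function name `resource_url` is also hypothetical.

```python
# Formats named on slide 21 (the SPARQL endpoint is left out of this sketch).
SUPPORTED_FORMATS = {"csv", "xls", "json", "xml"}


def resource_url(host: str, package: str, resource: str, fmt: str) -> str:
    """Build a URL for one resource in one output format.

    Assumption: the scheme and the ".<format>" suffix are illustrative only;
    the slides show the host and path but not how a format is requested.
    """
    fmt = fmt.lower()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    return f"http://{host}/{package}/{resource}.{fmt}"


print(resource_url("data.irail.be", "NMBS", "Stations", "json"))
```

The function only builds a string and makes no network request, so it can be checked without access to the services mentioned in the talk.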
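
Slide 25 names commits, deltas, versions and virtual graphs as the building blocks of distributed triple version control. The sketch below shows one simple way these pieces could fit together: each delta records the triples added and removed relative to a parent version, and a version is resolved into a graph by replaying the deltas from the root. This is an illustration under those assumptions, not the R&Wbase implementation; the `Delta` class, the `resolve` function and the `ex:` example triples are all made up.

```python
from dataclasses import dataclass
from typing import Optional

Triple = tuple[str, str, str]


@dataclass(frozen=True)
class Delta:
    """Changes introduced by one commit, relative to its parent version."""
    version: str
    parent: Optional[str]
    added: frozenset[Triple]
    removed: frozenset[Triple]


def resolve(deltas: dict[str, Delta], version: str) -> set[Triple]:
    """Rebuild the graph for a version by replaying deltas from the root."""
    chain = []
    current: Optional[str] = version
    while current is not None:
        delta = deltas[current]
        chain.append(delta)
        current = delta.parent
    graph: set[Triple] = set()
    for delta in reversed(chain):  # oldest delta first
        graph -= delta.removed
        graph |= delta.added
    return graph


# Hypothetical history: v1 adds a triple, v2 adds another, v3 removes it again.
t1 = ("ex:Ghent", "ex:locatedIn", "ex:Flanders")
t2 = ("ex:Ghent", "ex:label", "Gent")
history = [
    Delta("v1", None, frozenset({t1}), frozenset()),
    Delta("v2", "v1", frozenset({t2}), frozenset()),
    Delta("v3", "v2", frozenset(), frozenset({t2})),
]
deltas = {d.version: d for d in history}
print(sorted(resolve(deltas, "v2")))
```

Replaying the whole chain is the simplest approach; it gets slower as the history grows, which is presumably why slide 26 asks for fast version retrieval through a lightweight algorithm.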
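
Slide 27 proposes storing triples as quads of the form `<subject> <predicate> <object> <context>`. The sketch below illustrates the general idea that the fourth element can say which graph a triple belongs to, so one flat list of quads can be split back into separate graphs. The slides do not say what R&Wbase puts in the context position, so the context values and the `Quad` and `group_by_context` names are hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple


class Quad(NamedTuple):
    subject: str
    predicate: str
    obj: str
    context: str  # assumption: names the graph this triple belongs to


def group_by_context(quads):
    """Split a flat list of quads into one set of triples per context."""
    graphs = defaultdict(set)
    for q in quads:
        graphs[q.context].add((q.subject, q.predicate, q.obj))
    return dict(graphs)


# Hypothetical quads spread over two contexts.
quads = [
    Quad("ex:Ghent", "ex:locatedIn", "ex:Flanders", "ex:graph1"),
    Quad("ex:Ghent", "ex:label", "Gent", "ex:graph2"),
    Quad("ex:Antwerp", "ex:locatedIn", "ex:Flanders", "ex:graph1"),
]
graphs = group_by_context(quads)
print(len(graphs["ex:graph1"]), len(graphs["ex:graph2"]))
```

Keeping the context as an ordinary fourth column means an existing quad-capable triple store can hold this data without a schema change.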
