SlideShare a Scribd company logo
1 of 20
Download to read offline
Data All the Way Down
Jeni Tennison
@JeniT
http://www.jenitennison.com/blog/
Data All the Way Down
• challenges of complex open data
• layered approach to data publishing
• essential steps
• benefits
Complex Datasets
• too much for a single spreadsheet
• need to navigate
 • browse through data
 • look at slices of larger dataset
 • view summary statistics

• need to explain
 • definitions of terms, provisos & disclaimers
User Challenge
• complex data sets have range of users
 • different hardware / platforms
 • different tasks / goals
 • different ability / understanding

• no one interface satisfies everyone
• data owners cannot satisfy everyone
• create ecosystem around open data
visualisation / data gap   end user vs reuser
Visualisations
• approachable for real people
• necessary for stakeholder buy-in
• beauty is in what's left out
 • advertisement or taster of rich datasets
 • often not possible in official data

• leaves questions unanswered
 • what if we looked at the data in a different way?
Raw Data
• importable into own data store
 • often only interested in particular slice
 • data set may be massive / changing

• run whatever analysis you want
 • requires at least some programming skills
 • analysis might not be appropriate for the data

• documentation probably lacking
bridging the gap                         layered data access

Photo by Nikita Kravchuk http://www.flickr.com/photos/mi55er/3845619153/
Layered Architecture
• user interface
 • navigation and global understanding

• API
 • curated, targeted, programmable access

• query
 • free-form programmable access

• raw data
legislation.gov.uk   lists as Atom feeds
legislation.gov.uk   content as XML
legislation.gov.uk   layer other views
organograms   navigable visualisation
organograms   JSON data
organograms   RDF / XML / HTML
organograms   SPARQL query
organograms   raw data
Key Techniques
• resource-driven design (good URIs)
• every page built based on API calls
• explicit links to API access
 • for bonus points, link to your transformation code

• consistent terminology
 • clear mapping from UI to API

• caching & access control at each level
Benefits
• fork at any point
 • don't like the visualisation / API? create your own!

• everyone is human
 • reusers gain understanding from user interface

• visualisation benefits the stack
 • API oriented towards achieving a goal
 • visual validation of data improves quality
Questions?

More Related Content

What's hot (7)

Google App Engine - exploiting limitations
Google App Engine - exploiting limitationsGoogle App Engine - exploiting limitations
Google App Engine - exploiting limitations
 
Alfresco Day Vienna 2015 - Technical Track - Extending Share: Real world exam...
Alfresco Day Vienna 2015 - Technical Track - Extending Share: Real world exam...Alfresco Day Vienna 2015 - Technical Track - Extending Share: Real world exam...
Alfresco Day Vienna 2015 - Technical Track - Extending Share: Real world exam...
 
10x10 on <link />
10x10 on <link />10x10 on <link />
10x10 on <link />
 
Rest api
Rest apiRest api
Rest api
 
Web Scraping Technologies
Web Scraping TechnologiesWeb Scraping Technologies
Web Scraping Technologies
 
Alfresco Day Vienna 2015 - Technical Track - REST API of the Future
Alfresco Day Vienna 2015 - Technical Track - REST API of the FutureAlfresco Day Vienna 2015 - Technical Track - REST API of the Future
Alfresco Day Vienna 2015 - Technical Track - REST API of the Future
 
Troubleshooting Exchange Hybrid Deployments
Troubleshooting Exchange Hybrid DeploymentsTroubleshooting Exchange Hybrid Deployments
Troubleshooting Exchange Hybrid Deployments
 

Viewers also liked (6)

IBES Health and biomedical informatics
IBES  Health and biomedical informaticsIBES  Health and biomedical informatics
IBES Health and biomedical informatics
 
Granada0611 digital humanities
Granada0611 digital humanitiesGranada0611 digital humanities
Granada0611 digital humanities
 
Open Data: Dreams to Reality
Open Data: Dreams to RealityOpen Data: Dreams to Reality
Open Data: Dreams to Reality
 
Semantic Web and RDF: Can we reach escape velocity?
Semantic Web and RDF: Can we reach escape velocity?Semantic Web and RDF: Can we reach escape velocity?
Semantic Web and RDF: Can we reach escape velocity?
 
Blogcomments
BlogcommentsBlogcomments
Blogcomments
 
Porting terminologies to the Semantic Web
Porting terminologies to the Semantic WebPorting terminologies to the Semantic Web
Porting terminologies to the Semantic Web
 

Similar to Data All the Way Down

Design Reviews for Operations - Velocity Europe 2014
Design Reviews for Operations - Velocity Europe 2014Design Reviews for Operations - Velocity Europe 2014
Design Reviews for Operations - Velocity Europe 2014
Mandi Walls
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
Peter Haase
 

Similar to Data All the Way Down (20)

Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Presto Summit 2018 - 02 - LinkedIn
Presto Summit 2018  - 02 - LinkedInPresto Summit 2018  - 02 - LinkedIn
Presto Summit 2018 - 02 - LinkedIn
 
Architecture Principles CodeStock
Architecture Principles CodeStock Architecture Principles CodeStock
Architecture Principles CodeStock
 
Data Ingestion Engine
Data Ingestion EngineData Ingestion Engine
Data Ingestion Engine
 
Fusion 3 Overview Webinar
Fusion 3 Overview Webinar Fusion 3 Overview Webinar
Fusion 3 Overview Webinar
 
How to Manage and Troubleshoot Search: A Practical Guide
How to Manage and Troubleshoot Search: A Practical GuideHow to Manage and Troubleshoot Search: A Practical Guide
How to Manage and Troubleshoot Search: A Practical Guide
 
Design Reviews for Operations - Velocity Europe 2014
Design Reviews for Operations - Velocity Europe 2014Design Reviews for Operations - Velocity Europe 2014
Design Reviews for Operations - Velocity Europe 2014
 
Open Data and APIs - DataWeave
Open Data and APIs - DataWeaveOpen Data and APIs - DataWeave
Open Data and APIs - DataWeave
 
Multi View Constructed Right
Multi View Constructed RightMulti View Constructed Right
Multi View Constructed Right
 
Engage 2019: Modernising Your Domino and XPages Applications
Engage 2019: Modernising Your Domino and XPages Applications Engage 2019: Modernising Your Domino and XPages Applications
Engage 2019: Modernising Your Domino and XPages Applications
 
Webinar: Personalized Retail Search & Recommendations with Fusion
Webinar: Personalized Retail Search & Recommendations with FusionWebinar: Personalized Retail Search & Recommendations with Fusion
Webinar: Personalized Retail Search & Recommendations with Fusion
 
UI Dev in Big data world using open source
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open source
 
NoSQL Simplified: Schema vs. Schema-less
NoSQL Simplified: Schema vs. Schema-lessNoSQL Simplified: Schema vs. Schema-less
NoSQL Simplified: Schema vs. Schema-less
 
SPSOrlando - InfoPath 2010 Jumpstarter
SPSOrlando - InfoPath 2010 JumpstarterSPSOrlando - InfoPath 2010 Jumpstarter
SPSOrlando - InfoPath 2010 Jumpstarter
 
apidays LIVE Paris 2021 - Detecting and Protecting PII by Rob Dickinson, Resu...
apidays LIVE Paris 2021 - Detecting and Protecting PII by Rob Dickinson, Resu...apidays LIVE Paris 2021 - Detecting and Protecting PII by Rob Dickinson, Resu...
apidays LIVE Paris 2021 - Detecting and Protecting PII by Rob Dickinson, Resu...
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Harness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data LakeHarness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data Lake
 
Exploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban ForestryExploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban Forestry
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
 
Cloud patterns at Carleton University
Cloud patterns at Carleton UniversityCloud patterns at Carleton University
Cloud patterns at Carleton University
 

More from Jeni Tennison

Why Everyone Needs an Open Data Strategy
Why Everyone Needs an Open Data StrategyWhy Everyone Needs an Open Data Strategy
Why Everyone Needs an Open Data Strategy
Jeni Tennison
 
Commercial value of open data
Commercial value of open dataCommercial value of open data
Commercial value of open data
Jeni Tennison
 

More from Jeni Tennison (11)

How much faith should we have in data? - ODI Friday Lunchtime Lecture
How much faith should we have in data? - ODI Friday Lunchtime LectureHow much faith should we have in data? - ODI Friday Lunchtime Lecture
How much faith should we have in data? - ODI Friday Lunchtime Lecture
 
The challenges of building a strong data infrastructure
The challenges of building a strong data infrastructureThe challenges of building a strong data infrastructure
The challenges of building a strong data infrastructure
 
BCS Address Day - Open Addresses
BCS Address Day - Open AddressesBCS Address Day - Open Addresses
BCS Address Day - Open Addresses
 
Knowledge for Everyone
Knowledge for EveryoneKnowledge for Everyone
Knowledge for Everyone
 
Why Everyone Needs an Open Data Strategy
Why Everyone Needs an Open Data StrategyWhy Everyone Needs an Open Data Strategy
Why Everyone Needs an Open Data Strategy
 
Open Data: A New tool for Government
Open Data: A New tool for GovernmentOpen Data: A New tool for Government
Open Data: A New tool for Government
 
Commercial value of open data
Commercial value of open dataCommercial value of open data
Commercial value of open data
 
Legislation.gov.uk
Legislation.gov.ukLegislation.gov.uk
Legislation.gov.uk
 
Collisions, Chimera and Consonance in Web Content
Collisions, Chimera and Consonance in Web ContentCollisions, Chimera and Consonance in Web Content
Collisions, Chimera and Consonance in Web Content
 
How the Web of Data Will be Won
How the Web of Data Will be WonHow the Web of Data Will be Won
How the Web of Data Will be Won
 
OpenTech 2008: Power of Information - Rewiring the London Gazette with RDFa
OpenTech 2008: Power of Information - Rewiring the London Gazette with RDFaOpenTech 2008: Power of Information - Rewiring the London Gazette with RDFa
OpenTech 2008: Power of Information - Rewiring the London Gazette with RDFa
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Data All the Way Down

  • 1. Data All the Way Down Jeni Tennison @JeniT http://www.jenitennison.com/blog/
  • 2. Data All the Way Down • challenges of complex open data • layered approach to data publishing • essential steps • benefits
  • 3. Complex Datasets • too much for a single spreadsheet • need to navigate • browse through data • look at slices of larger dataset • view summary statistics • need to explain • definitions of terms, provisos & disclaimers
  • 4. User Challenge • complex data sets have range of users • different hardware / platforms • different tasks / goals • different ability / understanding • no one interface satisfies everyone • data owners cannot satisfy everyone • create ecosystem around open data
  • 5. visualisation / data gap end user vs reuser
  • 6. Visualisations • approachable for real people • necessary for stakeholder buy-in • beauty is in what's left out • advertisement or taster of rich datasets • often not possible in official data • leaves questions unanswered • what if we looked at the data in a different way?
  • 7. Raw Data • importable into own data store • often only interested in particular slice • data set may be massive / changing • run whatever analysis you want • requires at least some programming skills • analysis might not be appropriate for the data • documentation probably lacking
  • 8. bridging the gap layered data access Photo by Nikita Kravchuk http://www.flickr.com/photos/mi55er/3845619153/
  • 9. Layered Architecture • user interface • navigation and global understanding • API • curated, targeted, programmable access • query • free-form programmable access • raw data
  • 10. legislation.gov.uk lists as Atom feeds
  • 11. legislation.gov.uk content as XML
  • 12. legislation.gov.uk layer other views
  • 13. organograms navigable visualisation
  • 14. organograms JSON data
  • 15. organograms RDF / XML / HTML
  • 16. organograms SPARQL query
  • 17. organograms raw data
  • 18. Key Techniques • resource-driven design (good URIs) • every page built based on API calls • explicit links to API access • for bonus points, link to your transformation code • consistent terminology • clear mapping from UI to API • caching & access control at each level
  • 19. Benefits • fork at any point • don't like the visualisation / API? create your own! • everyone is human • reusers gain understanding from user interface • visualisation benefits the stack • API oriented towards achieving a goal • visual validation of data improves quality