SlideShare a Scribd company logo
F I N D A N D U N D E R S TA N D D ATA




                  Best Practices for

       Publishing Data


Hjalmar Gislason, founder & CEO - hg@datamarket.com   October, 2012
Hjalmar
                Gislason
                Founder and CEO




Twitter: @datamarket
Slides: http://blog.datamarket.com/
Heavy
Data Consumers

    Providers of

 Data Delivery
  Technology
Computers                                                    Humans




    |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Computers                                                      Humans

• Structure                                                          • Understand and
                                                                       use




      |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Computers                                                      Humans

• Structure                                                          • Understand and
                                                                       use




      |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Publishing for Computers


1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
Simple Formats




"Don't anthropomorphize computers
           - they hate it."
                     - Unknown
Simple Formats
Simple Formats:
Tim Berners-Lee’s Five Stars




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Simple Formats:
You lost me at “Semantics”




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Standards will emerge and there will
be more and more of them



                     • RDF
                     • OData vs. GData
                     • DSPL
                     • SDMX




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique ids and meta-data




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique ids and meta-data




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique ids and meta-data




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique IDs and meta-data

  • Must: Unique ID, Title, Last updated
  • Should: Meta-data


  • Why?
   • No need for scraping
       • Less load on your end
   • Ensures full coverage
   • Ensures content removal and updates




        |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique IDs and meta-data

  • Hard to emphasize enough!


  • Unique IDs for everything: Datsets, columns, entities, ...


  • Why?
    • Continuity: A small change for a man = giant leap for a
      computer




        |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique IDs and meta-data

  • Any relevant contextual information
   • URL(s), descriptions, methodology, next updated, authors,
     keywords, units, license information, ...




        |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels

   #1 reason for not publishing data:




   “There are errors in the data and I don't
       want others to discover them”




       |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels

   #1 reason for not publishing data:




      “There are errors in the data and I do
         want others to discover them”




       |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Publishing for Computers


1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
Computers                                                         Humans

• Structure                                                             • Understand
                                                                          and use




      |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Publishing for Humans


1. Search / Discovery
2. Visualization
3. Download
Search / Discovery

  • Requirements differ from web/text search
   • A lot less textual content to base on
     • Synonyms, dictionaries, autocomplete
   • But (hopefully) good meta-data = facets and filtering


  • Give people ways to browse
   • Categories vs. tags vs. search
   • Serendipity: Random, related, interesting...
Search / Discovery
Visualize
109 columns
     x
  340 lines
     =
37.060 cells
Visualize

  • What you should offer depends on the data


  • Statistical data
    • Focus on the most common charts and get them right
    • Do NOT invent new visualizations or chart types


  • Use standards compatible technologies
    • No Flash!
    • Charting and visualization libraries
Visualize
Visualize
Download

  • Make it easy to use your data outside your tools
   • Play nicely with those providing functionality beyond what
     you can offer: Tableau, R, SAS, MathLab, Mathematica,
     SPSS, ...




  • Provide downloads in the formats most commonly used by your
    users:
   • Raw data: Excel, CSV, feeds (R, Excel live feeds, APIs)
   • Charts and visualizations: Bitmap, vector, PPT, embeds?
Computers                                                       Humans

• Structure                                                           • Understand and use
 • Simple formats                                                         • Search / Discovery
 • Indexes, unique IDs and                                                • Visualization
   meta-data                                                              • Download
 • FAQs and feedback
   channels




       |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
F I N D A N D U N D E R S TA N D D ATA



              Hjalmar Gislason, founder & CEO



Twitter: @datamarket · Facebook: DataMarket · E-mail: hg@datamarket.com

More Related Content

Similar to Strata NY: Best Practices for Publishing Data

Data Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceData Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-Service
DATAVERSITY
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big Data
Clark Boyd
 
Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s Home
DATAVERSITY
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
DATAVERSITY
 
intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
DATAVERSITY
 
01-Introduction.pdf
01-Introduction.pdf01-Introduction.pdf
01-Introduction.pdf
ngVnThng12
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
StampedeCon
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business Intelligence
DATAVERSITY
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data Integration
DATAVERSITY
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
DATAVERSITY
 
Changing nature of data and its implications on analytics
Changing nature of data and its implications on analyticsChanging nature of data and its implications on analytics
Changing nature of data and its implications on analytics
Ashnikbiz
 
Big data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturersBig data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturers
Janet Dorenkott
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Sujit Ghosh
 
Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
Matt Turner
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
DATAVERSITY
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
varshakumar21
 
Digital Economics
Digital EconomicsDigital Economics
Digital Economics
Lee Schlenker
 
Big_Data.pptx
Big_Data.pptxBig_Data.pptx
Big_Data.pptx
mohamedibrahim946387
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
DATAVERSITY
 

Similar to Strata NY: Best Practices for Publishing Data (20)

Data Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceData Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-Service
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big Data
 
Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s Home
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
 
intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
 
01-Introduction.pdf
01-Introduction.pdf01-Introduction.pdf
01-Introduction.pdf
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business Intelligence
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data Integration
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
Changing nature of data and its implications on analytics
Changing nature of data and its implications on analyticsChanging nature of data and its implications on analytics
Changing nature of data and its implications on analytics
 
Big data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturersBig data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturers
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013
 
Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
Digital Economics
Digital EconomicsDigital Economics
Digital Economics
 
Big_Data.pptx
Big_Data.pptxBig_Data.pptx
Big_Data.pptx
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 

More from Hjalmar Gislason

Icelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurshipIcelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurship
Hjalmar Gislason
 
Níu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpunNíu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpun
Hjalmar Gislason
 
What does a random place on Earth look like?
What does a random place on Earth look like?What does a random place on Earth look like?
What does a random place on Earth look like?
Hjalmar Gislason
 
Unified Intelligence
Unified IntelligenceUnified Intelligence
Unified Intelligence
Hjalmar Gislason
 
Eruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve SystemEruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve System
Hjalmar Gislason
 
Data Visualizations and Storytelling
Data Visualizations and StorytellingData Visualizations and Storytelling
Data Visualizations and Storytelling
Hjalmar Gislason
 
ICIJ Conference April 2012
ICIJ Conference April 2012ICIJ Conference April 2012
ICIJ Conference April 2012
Hjalmar Gislason
 
Data Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with dataData Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with data
Hjalmar Gislason
 
9 things nobody told me about the start-up business
9 things nobody told me about the start-up business9 things nobody told me about the start-up business
9 things nobody told me about the start-up business
Hjalmar Gislason
 
Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)
Hjalmar Gislason
 
Data visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with dataData visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with data
Hjalmar Gislason
 
DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011
Hjalmar Gislason
 
DataMarket at Nordic Techpolitics
DataMarket at Nordic TechpoliticsDataMarket at Nordic Techpolitics
DataMarket at Nordic Techpolitics
Hjalmar Gislason
 
DataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in BergenDataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in Bergen
Hjalmar Gislason
 
The Business of Open Data
The Business of Open DataThe Business of Open Data
The Business of Open Data
Hjalmar Gislason
 
DataMarket - Iceland (english)
DataMarket - Iceland (english)DataMarket - Iceland (english)
DataMarket - Iceland (english)
Hjalmar Gislason
 
Dokkan sept-2010
Dokkan sept-2010Dokkan sept-2010
Dokkan sept-2010
Hjalmar Gislason
 
DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009
Hjalmar Gislason
 
DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010
Hjalmar Gislason
 
Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010
Hjalmar Gislason
 

More from Hjalmar Gislason (20)

Icelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurshipIcelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurship
 
Níu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpunNíu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpun
 
What does a random place on Earth look like?
What does a random place on Earth look like?What does a random place on Earth look like?
What does a random place on Earth look like?
 
Unified Intelligence
Unified IntelligenceUnified Intelligence
Unified Intelligence
 
Eruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve SystemEruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve System
 
Data Visualizations and Storytelling
Data Visualizations and StorytellingData Visualizations and Storytelling
Data Visualizations and Storytelling
 
ICIJ Conference April 2012
ICIJ Conference April 2012ICIJ Conference April 2012
ICIJ Conference April 2012
 
Data Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with dataData Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with data
 
9 things nobody told me about the start-up business
9 things nobody told me about the start-up business9 things nobody told me about the start-up business
9 things nobody told me about the start-up business
 
Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)
 
Data visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with dataData visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with data
 
DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011
 
DataMarket at Nordic Techpolitics
DataMarket at Nordic TechpoliticsDataMarket at Nordic Techpolitics
DataMarket at Nordic Techpolitics
 
DataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in BergenDataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in Bergen
 
The Business of Open Data
The Business of Open DataThe Business of Open Data
The Business of Open Data
 
DataMarket - Iceland (english)
DataMarket - Iceland (english)DataMarket - Iceland (english)
DataMarket - Iceland (english)
 
Dokkan sept-2010
Dokkan sept-2010Dokkan sept-2010
Dokkan sept-2010
 
DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009
 
DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010
 
Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010
 

Recently uploaded

Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s DholeraTata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Avirahi City Dholera
 
What are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdfWhat are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdf
HumanResourceDimensi1
 
Attending a job Interview for B1 and B2 Englsih learners
Attending a job Interview for B1 and B2 Englsih learnersAttending a job Interview for B1 and B2 Englsih learners
Attending a job Interview for B1 and B2 Englsih learners
Erika906060
 
FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134
LR1709MUSIC
 
Exploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social DreamingExploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social Dreaming
Nicola Wreford-Howard
 
amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05
marketing317746
 
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
BBPMedia1
 
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdfModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
fisherameliaisabella
 
CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptxCADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
fakeloginn69
 
VAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and RequirementsVAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and Requirements
uae taxgpt
 
Skye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto AirportSkye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto Airport
marketingjdass
 
Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...
Lviv Startup Club
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
Cynthia Clay
 
Premium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern BusinessesPremium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern Businesses
SynapseIndia
 
The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...
balatucanapplelovely
 
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
BBPMedia1
 
Project File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdfProject File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdf
RajPriye
 
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-indiafalcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
Falcon Invoice Discounting
 
Affordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n PrintAffordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n Print
Navpack & Print
 
Cracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptxCracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptx
Workforce Group
 

Recently uploaded (20)

Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s DholeraTata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s Dholera
 
What are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdfWhat are the main advantages of using HR recruiter services.pdf
What are the main advantages of using HR recruiter services.pdf
 
Attending a job Interview for B1 and B2 Englsih learners
Attending a job Interview for B1 and B2 Englsih learnersAttending a job Interview for B1 and B2 Englsih learners
Attending a job Interview for B1 and B2 Englsih learners
 
FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134
 
Exploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social DreamingExploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social Dreaming
 
amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05amptalk_RecruitingDeck_english_2024.06.05
amptalk_RecruitingDeck_english_2024.06.05
 
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...
 
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdfModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
 
CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptxCADAVER AS OUR FIRST TEACHER anatomt in your.pptx
CADAVER AS OUR FIRST TEACHER anatomt in your.pptx
 
VAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and RequirementsVAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and Requirements
 
Skye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto AirportSkye Residences | Extended Stay Residences Near Toronto Airport
Skye Residences | Extended Stay Residences Near Toronto Airport
 
Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
 
Premium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern BusinessesPremium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern Businesses
 
The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...
 
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
 
Project File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdfProject File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdf
 
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-indiafalcon-invoice-discounting-a-premier-platform-for-investors-in-india
falcon-invoice-discounting-a-premier-platform-for-investors-in-india
 
Affordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n PrintAffordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n Print
 
Cracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptxCracking the Workplace Discipline Code Main.pptx
Cracking the Workplace Discipline Code Main.pptx
 

Strata NY: Best Practices for Publishing Data

  • 1. F I N D A N D U N D E R S TA N D D ATA Best Practices for Publishing Data Hjalmar Gislason, founder & CEO - hg@datamarket.com October, 2012
  • 2. Hjalmar Gislason Founder and CEO Twitter: @datamarket Slides: http://blog.datamarket.com/
  • 3.
  • 4.
  • 5. Heavy Data Consumers Providers of Data Delivery Technology
  • 6. Computers Humans | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 7. Computers Humans • Structure • Understand and use | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 8. Computers Humans • Structure • Understand and use | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 9. Publishing for Computers 1. Simple formats 2. Indexes, unique IDs and meta-data 3. FAQs and feedback channels
  • 10. Simple Formats "Don't anthropomorphize computers - they hate it." - Unknown
  • 12. Simple Formats: Tim Berners-Lee’s Five Stars | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 13. Simple Formats: You lost me at “Semantics” | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 14. Standards will emerge and there will be more and more of them • RDF • OData vs. GData • DSPL • SDMX | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 15. Indexes, unique ids and meta-data | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 16. Indexes, unique ids and meta-data | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 17. Indexes, unique ids and meta-data | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 18. Indexes, unique IDs and meta-data • Must: Unique ID, Title, Last updated • Should: Meta-data • Why? • No need for scraping • Less load on your end • Ensures full coverage • Ensures content removal and updates | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 19. Indexes, unique IDs and meta-data • Hard to emphasize enough! • Unique IDs for everything: Datsets, columns, entities, ... • Why? • Continuity: A small change for a man = giant leap for a computer | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 20. Indexes, unique IDs and meta-data • Any relevant contextual information • URL(s), descriptions, methodology, next updated, authors, keywords, units, license information, ... | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 21. FAQs and feedback channels #1 reason for not publishing data: “There are errors in the data and I don't want others to discover them” | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 22. FAQs and feedback channels #1 reason for not publishing data: “There are errors in the data and I do want others to discover them” | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 23. FAQs and feedback channels | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 24. FAQs and feedback channels | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 25. Publishing for Computers 1. Simple formats 2. Indexes, unique IDs and meta-data 3. FAQs and feedback channels
  • 26. Computers Humans • Structure • Understand and use | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 27. Publishing for Humans 1. Search / Discovery 2. Visualization 3. Download
  • 28. Search / Discovery • Requirements differ from web/text search • A lot less textual content to base on • Synonyms, dictionaries, autocomplete • But (hopefully) good meta-data = facets and filtering • Give people ways to browse • Categories vs. tags vs. search • Serendipity: Random, related, interesting...
  • 31.
  • 32. 109 columns x 340 lines = 37.060 cells
  • 33.
  • 34.
  • 35.
  • 36. Visualize • What you should offer depends on the data • Statistical data • Focus on the most common charts and get them right • Do NOT invent new visualizations or chart types • Use standards compatible technologies • No Flash! • Charting and visualization libraries
  • 39. Download • Make it easy to use your data outside your tools • Play nicely with those providing functionality beyond what you can offer: Tableau, R, SAS, MathLab, Mathematica, SPSS, ... • Provide downloads in the formats most commonly used by your users: • Raw data: Excel, CSV, feeds (R, Excel live feeds, APIs) • Charts and visualizations: Bitmap, vector, PPT, embeds?
  • 40. Computers Humans • Structure • Understand and use • Simple formats • Search / Discovery • Indexes, unique IDs and • Visualization meta-data • Download • FAQs and feedback channels | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 41. F I N D A N D U N D E R S TA N D D ATA Hjalmar Gislason, founder & CEO Twitter: @datamarket · Facebook: DataMarket · E-mail: hg@datamarket.com