Your SlideShare is downloading. ×
  • Like
Bancilhon
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Now you can save presentations on your phone or tablet

Available for both IPhone and Android

Text the download link to your phone

Standard text messaging rates apply

Bancilhon

  • 566 views
Published

 

Published in Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
566
On SlideShare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
6
Comments
0
Likes
1

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Formats for Open Data François Bancilhon twitter.com/fbancilhon www.data-publica.com Share-PSI Workshop Brussels May 10, 2011
  • 2. Data Publica● Develop the most complete and in-depth knowledge of French electronic data. Provide a complete directory of public data in France.● Set up a DataStore, where people can find data provided by us (data hunting) and by outside vendors (data reseller)
  • 3. CAVEAT● I strongly support the 10 principles of the Sunlight foundation● From bad to good, there is a spectrum, I support improvement rather than rejection of everything that is not perfect● This work derived from the recommendation of GFII (Groupement Français de lIndustrie de lInformation)
  • 4. Summary● Open formats at the physical level● Standard formats at the conceptual level● Agreement on anonymization● Providing source data with pdf data● Privileging XML● Definition of exchange formats
  • 5. Physical level● At the physical level (text, image, video, etc.), provide ● an open format (a standard for which anyone can build tools) ● a format compatible with the commonly used tools
  • 6. Conceptual level● For every vertical, define standards that take into account the specificity of the area● Standards to be elaborated by researchers, users and industry representatives, at the European level● Examples: Inspire, ITS, XBRL, OAI
  • 7. Anonymization● Provide an operational definition of anonymization● Standards for it and operational qualification● Make up ways to anonymize while keeping some meaning● Need for European standard and technology
  • 8. Providing source data with pdf● PDF is a good format for consumer display● PDF is a bad format for re-use● Most of the time PDF is produced from some other source format● Request that PDF is provided together with its source (not always that simple)
  • 9. Pushing for XML● Principle of improvement: the move to XML from organizations that were publishing in some other unfriendly format (eg PDF), is a good thing
  • 10. Define exchange formats● Most open data formats are based on the use that the public body is making internally of this data● Define instead an exchange format based on transmission rather that on internal usage
  • 11. Questions?francois.bancilhon@data-publica.com www.data-publica.com twitter.com/fbancilhon