SlideShare a Scribd company logo
1 of 18
Download to read offline
On Metadata for Open Data
   Yannis Charalabidis
       25.04.2012
Introduction


We will try in the next slides to show you what is
    the level of expectation from metadata
   handling from a 2nd generation open data
                       system
Imagine you are in front of the ENGAGE system,
     and you have your URI from a dataset,
            somewhere in the cloud,
      (copied as string in the clipboard)

                 And begin …
Prescreening: User only gives URI
         of the dataset

   Enter (paste) the URI of your dataset


   _
(then for 30 seconds you see this
        screen, changing)
      Progress of ENGAGE Resource Prescreening:
                ( 45% ) of jobs completed

                     Managed to :
                      Identify xls file
              Autofill, provisionally: Title
             Autofill, provisionally: Creator
              Create unique ENGAGE URI
                  Calculate keywords
            Autofill, provisionally: keywords
                             …
                             …
(When finishing import, the report)
                                Report
 ENGAGE managed to automatically, provisionally fill in ( 21 ) of 43 metadata
                       attributes for your dataset.

                       Your current validity is at ( 45% )

For your dataset to be inserted in the database, you need to continue filling
                         in ( 5 ) mandatory attributes.
          Your dataset will then be inserted with validity ( 55% )

If all ( 17 ) non-mandatory attributes are filled in, validity will be maximum, at
                        70% / limit of the insertion phase.


     Please select next action:     Continue           Park       Cancel
After import …

… and then, we enter the metadata insertion
        page with pre-filled data, etc.

When we finish, we get a similar final report.

AND NOW THE ENGAGE METADATA set, that
       makes all that a possibility:
But,before, some semantics:
Attribute characteristics – notation:

(M) :   attribute is Mandatory (cannot be empty)
(*) :   attribute takes values from a controlled list of terms (codelist), or tree (dag of terms), or table
(+) :   takes values from an extendible list or tree. User may extend the list during insertion
(a) :   an auto-filling list (as suggestion) or otherwise automatically calculated attribute
(m) :   attribute accepts multiple values
(v) :   attribute entry can be verified through a type-checking algorithm

(( x )) : x is possible, but as an option
no tag : attribute is a simple string entry



---------- for the future -------------
(c0), (c1), (c2), (c3) : the importance of attribute in completeness calculation (c3 is higher – mostly important)
(q0), (q1), (q2), (q3) : the importance of attribute in data quality calculation (q3 is higher – mostly important)
A. The core attributes
                                                                                            Size of
                                                                                                       Existing
Metadata Attribute                               Type of Attribute    Type of codelist     codelist
                                                                                           (nodes)    codelists

Title
                                                     (M) ((a))
Automatic: extracted from the dataset headline                                -               -           -
                                                      String
of the URI/dataset provided

Publisher                                            (M)(*)(+)                              100 X     Greece
                                                                      Tree of Strings
PUB admin tree (100 per country, extendible)      Pointer to Tree                          country     (ENG)

Creator
PUB admin tree (100 per country, extendible)         (M)(*)(+)                              100 X     Greece
                                                                     Tree of PS entities
Prompt: same as the publisher                     Pointer to Tree                          country     (ENG)

Code
Automatic: ENGAGE automatic classification
                                                    (M)(*)(a)
system (date,country,PSector,type,etc) or                                     -               -           -
                                                      String
ENGAGE URI

User                                                                                                      -
                                                      (*)(a)
The user who uploads that. Automatic filling                           Table of Users         -
                                                 Pointer to Table
from table of users / login
B. The outer core attributes
                                                                                             Size of
                                                                                                           Existing
Metadata Attribute                                  Type of Attribute   Type of codelist    codelist
                                                                                                          codelists
                                                                                            (nodes)

Subject
                                                       (M)(*)(+)                           All resource
Text describing the resource in one sentence                             List of strings                    NO
                                                     Pointer to List                         subjects
It can be stored in a list and reused

Type
List of types: dataset, linkable dataset,              (M)(*)(m)
                                                                         List of strings       10           ENG
visualization, textual information, executable        Pointer to list
binary, unknown

Format
                                                        (M)(*)(+)        List of strings       50           ENG
xls xml odata … jpd pdf … (appr. 50 format types)
                                                      Pointer to list
Language
ISO simplified (5 < 20 (EU) < ISO (3000).            (M)(*) ((a)) (m)
                                                                         List of strings       200        ISO List
Automatic: extract from language settings (when       Pointer to List
                                                                                                           (ENG)
XLS / ISO)

Country
                                                       (M)(*)(m)                                          ISO List
5 ENGAGE countries < rest of 27 EU < other                               List of strings       200
                                                     Pointer to List                                       (ENG)
countries ISO country list
C. The Public Sector Context
                                                                                             Size of    Existing
Metadata Attribute                                   Type of Attribute   Type of codelist   codelist   codelists
                                                                                            (nodes)


Public Sector Domain
Tree of sectors (20: finance, health, social
                                                         (*)(m)(+)
security, etc)                                                           Tree of strings      20       ENG, GR
                                                      Pointer to Tree
Automatic : can be calculated from Creator, if all
public sector entities have a domain

Relative Public Service
List of public services (i2010 20 basic services,
                                                        (*)(m)(+)
plus “other-reward service”, “othr permission                             List of strings     24       ENG, GR
                                                      Pointer to List
service”, “Other registry entry service”, “Other
personal documents service”)
Relative Information System
                                                        (*)(m)(+)
List of EU and national main information systems                          List of strings     200         GR
                                                      Pointer to List
(50+50*country)
Legal Framework
Main EU directives on open data (10), main
                                                                          Table of Legal
national laws and decrees on open data (10 X             (*)(m)(+)                            100         GR
                                                                            Elements
country)
D. The Scientific Context
                                                                                             Size of
                                                                                                        Existing
Metadata Attribute                                   Type of Attribute   Type of codelist   codelist
                                                                                                       codelists
                                                                                            (nodes)

Scientific Sector                                         (*)(m)
                                                                         Tree of strings      100      Science
ENGAGE Tree of Scientific Domains                     Pointer to Tree

Scientific Usage of Resource
ENGAGE tree of scientific types/usages: events           (*)(m)(+)
                                                                         Tree of strings      20       Science
data (nature or man-made), financial data, health     Pointer to Tree
data, etc (20)

Intended Audience
List of possible audiences: citizens, enterprises,
                                                        (*)(m)(+)
researchers, public sector managers, public                               Tree of strings     20       ENGAGE
                                                      Pointer to List
sector officers, policy makers, members of
National Parliament, MEP’s, NGO’s etc

Keywords
Initial list made / proposed by ENGAGE System
                                                       (*)(m)(+)(a)
with countries, Psector Domain, Science Domain,                           List of strings     200          -
                                                      Pointer to List
Usage. Also get from linked areas / domains /
types etc
E. URL’s – URI’s - Links
                                                                                             Size of
                                                                                                            Existing
Metadata Attribute                                  Type of Attribute   Type of codelist    codelist
                                                                                            (nodes)        codelists


Type of Source Link
                                                         (*)(+)
URL / URI / DOI / WS / RSS/ ENGAGE / other                               List of Strings       10            ENG
                                                     Pointer to List


Source Link (URL)                                                                          Codelist is
String or ENGAGE URL (*)(a). Automatic: put the       (*) (+) ((a))                        the full list
                                                                         List of Strings                     Yes
URL of ENGAGE site                                   Pointer to List                       of URI’s in
                                                                                            ENGAGE
Type of Resource link
                                                         (*)(+)
URL / URI / DOI / WS / RSS/ ENGAGE other                                 List of Strings       10            ENG
                                                     Pointer to List
Resource Link                                                                              Codelist is
String or ENGAGE (a). Automatic lists the link it     (*) (+) ((a))                        the full list
                                                                         List of Strings                     Yes
already has.                                         Pointer to List                       of URI’s in
                                                                                            ENGAGE
Relevant Resources                                                                         Codelist is
List of existing URI’s in the system . Automatic:                                          the full list
                                                      (*)(m)(+)(a)       List of Strings                     Yes
calculates from matching domain+type+                                                      of URI’s in
                                                                                            ENGAGE
F. Linked Data
                                                                                        Size of
                                                                                                   Existing
Metadata Attribute                              Type of Attribute   Type of codelist   codelist
                                                                                       (nodes)    codelists


Linking status
Linkable, linked, non-linked, non-linkable,           (*)
                                                                     List of Strings      5         YES
unknown                                          Pointer to List


Linked Data Set
                                                 (*)(m)(+)(a)(d)
URI of a linked dataset.                                              List of URI’s    No limit       -
                                                 Pointer to List
Details of link:
   Linking Type (PK match)                       Pointer to List     List of Strings      1           -
   Matching column of this resource                  String                 -             -           -
   Matching column of linked resource                String                 -             -           -
   Columns of this resource, to be included        (m) String               -             -           -
   Columns of linked resource, to be included      (m) String               -             -           -

Visualisations                                   (*)(m)(+)(a)(d)
                                                                      List of URI’s    No limit       -
Links to visualisations of current resource      Pointer to List
G. Dates and Status
                                                                                        Size of
                                                                                                   Existing
Metadata Attribute                              Type of Attribute   Type of codelist   codelist
                                                                                       (nodes)    codelists

                                                      (v)
Consideration Started on                                                    -             -           -
                                                     DATE
                                                      (v)
Initial Approval / Planning Started on                                      -             -           -
                                                     DATE
                                                      (v)
Planned to be valid on                                                      -             -           -
                                                     DATE
                                                      (v)
Validity Started on                                                         -             -           -
                                                     DATE
                                                      (v)
Validity to finish on                                                       -             -           -
                                                     DATE
                                                      (v)
Rejected on                                                                 -             -           -
                                                     DATE
                                                      (v)
Substituted on                                                              -             -           -
                                                     DATE
Status
Considered, planned, valid, valid and linked,
                                                    (*) (a)
rejected, outdated, substituted.                                     List of Strings      8         ENG
                                                 Pointer to List
Automatic: calculation through DATES
H. Rating
                                                                                           Size of
                                                                                                      Existing
Metadata Attribute                                 Type of Attribute   Type of codelist   codelist
                                                                                          (nodes)    codelists

Metadata Completeness
Automatic: calculated by filled / empty non        Number (1-100)             -              -           -
mandatory items
Metadata Quality
Automatic: calculated by specific filled / empty
                                                   Number (1-100)             -              -           -
non mandatory items

Citizen Rating
                                                   Number (1-100)             -              -           -
As reported / calculated by relative users
Researcher Rating
                                                   Number (1-100)             -              -           -
As reported / calculated by relative users
Business Rating                                    Number (1-100)
As reported / calculated by relative users
Number of Downloads
                                                       Number                 -              -           -
As reported by the ENGAGE System
Density of Downloads
                                                      Number %                -              -           -
As number per total period of validity to date
Not to forget: Metadata codelists
where there, since the Hearing … !


    An Infrastructure for Open, Linked
   Governmental Data Provision towards
    Research Communities and Citizens


          Proposal Evaluation Hearing
              Brussels 23/2/2011
Q6: Which types of metadata will you select?

•    Exploit work already done by the consortium (DELFT, NTUA, AEGEAN, STFC) in public
     sector metadata schemas
•    Multi-facet design: take under consideration the fact that the data may be used in
     different contexts, such as research, policy making or by citizens
•    Take under consideration the fact that data sources may provide wildly differing
     metadata – go towards metadata standardisation for Open Data / a major
     contribution of ENGAGE
•    Two-phase metadata design within ENGAGE workplan (Task C1.2: Data and knowledge
     representation annotation and linking methods). Initial proposal based on Dublin Core,
     UK eGovernment Metadata Schema and eGMS+, is as following:

    Metadata ENGAGE Set
    Identifier                    Title                             Creator
    Publisher                     Country                           Source
    Type (*)                      Format (*)                        Language (*)
    Sector (*)                    Subject (*)                       Keywords (*)
    Relative Public Service (*)   Relative Information System       URL / URI / DOI
    Validity Date (from – to)     Audience (*)                      Legal Framework
    Status (*)                    Relevant Resources                Linkded Data Sets (*)
                                                       (*) Indicates Controlled Lists / Taxonomies

More Related Content

What's hot

Intro to Python (High School) Unit #2
Intro to Python (High School) Unit #2Intro to Python (High School) Unit #2
Intro to Python (High School) Unit #2Jay Coskey
 
Materi 6 user definedfunction
Materi 6 user definedfunctionMateri 6 user definedfunction
Materi 6 user definedfunctionAl Frilantika
 
Java class 5
Java class 5Java class 5
Java class 5Edureka!
 
Introduction à Scala - Michel Schinz - January 2010
Introduction à Scala - Michel Schinz - January 2010Introduction à Scala - Michel Schinz - January 2010
Introduction à Scala - Michel Schinz - January 2010JUG Lausanne
 
A Proposition for Business Process Modeling
A Proposition for Business Process ModelingA Proposition for Business Process Modeling
A Proposition for Business Process ModelingAng Chen
 
One Monad to Rule Them All
One Monad to Rule Them AllOne Monad to Rule Them All
One Monad to Rule Them AllJohn De Goes
 
Demystifying functional programming with Scala
Demystifying functional programming with ScalaDemystifying functional programming with Scala
Demystifying functional programming with ScalaDenis
 
Rcpp: Seemless R and C++
Rcpp: Seemless R and C++Rcpp: Seemless R and C++
Rcpp: Seemless R and C++Romain Francois
 
Scala categorytheory
Scala categorytheoryScala categorytheory
Scala categorytheoryKnoldus Inc.
 
Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...
Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...
Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...Eelco Visser
 
Rcpp: Seemless R and C++
Rcpp: Seemless R and C++Rcpp: Seemless R and C++
Rcpp: Seemless R and C++Romain Francois
 
Functional programming in Scala
Functional programming in ScalaFunctional programming in Scala
Functional programming in ScalaDamian Jureczko
 
Domänenspezifische Sprachen mit Xtext
Domänenspezifische Sprachen mit XtextDomänenspezifische Sprachen mit Xtext
Domänenspezifische Sprachen mit XtextDr. Jan Köhnlein
 

What's hot (20)

Intro to Python (High School) Unit #2
Intro to Python (High School) Unit #2Intro to Python (High School) Unit #2
Intro to Python (High School) Unit #2
 
Materi 6 user definedfunction
Materi 6 user definedfunctionMateri 6 user definedfunction
Materi 6 user definedfunction
 
Java class 5
Java class 5Java class 5
Java class 5
 
Ruby Programming Assignment Help
Ruby Programming Assignment HelpRuby Programming Assignment Help
Ruby Programming Assignment Help
 
Spsl vi unit final
Spsl vi unit finalSpsl vi unit final
Spsl vi unit final
 
Introduction à Scala - Michel Schinz - January 2010
Introduction à Scala - Michel Schinz - January 2010Introduction à Scala - Michel Schinz - January 2010
Introduction à Scala - Michel Schinz - January 2010
 
A Proposition for Business Process Modeling
A Proposition for Business Process ModelingA Proposition for Business Process Modeling
A Proposition for Business Process Modeling
 
One Monad to Rule Them All
One Monad to Rule Them AllOne Monad to Rule Them All
One Monad to Rule Them All
 
Demystifying functional programming with Scala
Demystifying functional programming with ScalaDemystifying functional programming with Scala
Demystifying functional programming with Scala
 
JavaYDL17
JavaYDL17JavaYDL17
JavaYDL17
 
Rcpp: Seemless R and C++
Rcpp: Seemless R and C++Rcpp: Seemless R and C++
Rcpp: Seemless R and C++
 
Scala categorytheory
Scala categorytheoryScala categorytheory
Scala categorytheory
 
Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...
Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...
Model-Driven Software Development - Pretty-Printing, Editor Services, Term Re...
 
Real World Scalaz
Real World ScalazReal World Scalaz
Real World Scalaz
 
Introducing scala
Introducing scalaIntroducing scala
Introducing scala
 
Rcpp: Seemless R and C++
Rcpp: Seemless R and C++Rcpp: Seemless R and C++
Rcpp: Seemless R and C++
 
Music workflow4
Music workflow4Music workflow4
Music workflow4
 
Functional programming in Scala
Functional programming in ScalaFunctional programming in Scala
Functional programming in Scala
 
Wien15 java8
Wien15 java8Wien15 java8
Wien15 java8
 
Domänenspezifische Sprachen mit Xtext
Domänenspezifische Sprachen mit XtextDomänenspezifische Sprachen mit Xtext
Domänenspezifische Sprachen mit Xtext
 

Similar to On metadata for Open Data

T3chFest 2016 - The polyglot programmer
T3chFest 2016 - The polyglot programmerT3chFest 2016 - The polyglot programmer
T3chFest 2016 - The polyglot programmerDavid Muñoz Díaz
 
HFile: A Block-Indexed File Format to Store Sorted Key-Value Pairs
HFile: A Block-Indexed File Format to Store Sorted Key-Value PairsHFile: A Block-Indexed File Format to Store Sorted Key-Value Pairs
HFile: A Block-Indexed File Format to Store Sorted Key-Value PairsSchubert Zhang
 
Os Vanrossum
Os VanrossumOs Vanrossum
Os Vanrossumoscon2007
 
Standardizing on a single N-dimensional array API for Python
Standardizing on a single N-dimensional array API for PythonStandardizing on a single N-dimensional array API for Python
Standardizing on a single N-dimensional array API for PythonRalf Gommers
 
James Jesus Bermas on Crash Course on Python
James Jesus Bermas on Crash Course on PythonJames Jesus Bermas on Crash Course on Python
James Jesus Bermas on Crash Course on PythonCP-Union
 
Mike Taulty OData (NxtGen User Group UK)
Mike Taulty OData (NxtGen User Group UK)Mike Taulty OData (NxtGen User Group UK)
Mike Taulty OData (NxtGen User Group UK)ukdpe
 
Generics Past, Present and Future (Latest)
Generics Past, Present and Future (Latest)Generics Past, Present and Future (Latest)
Generics Past, Present and Future (Latest)RichardWarburton
 
Changes and Bugs: Mining and Predicting Development Activities
Changes and Bugs: Mining and Predicting Development ActivitiesChanges and Bugs: Mining and Predicting Development Activities
Changes and Bugs: Mining and Predicting Development ActivitiesThomas Zimmermann
 
An Overview Of Python With Functional Programming
An Overview Of Python With Functional ProgrammingAn Overview Of Python With Functional Programming
An Overview Of Python With Functional ProgrammingAdam Getchell
 
Spark Sql and DataFrame
Spark Sql and DataFrameSpark Sql and DataFrame
Spark Sql and DataFramePrashant Gupta
 
Scripting in InduSoft Web Studio
Scripting in InduSoft Web StudioScripting in InduSoft Web Studio
Scripting in InduSoft Web StudioAVEVA
 
U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...
U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...
U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...Michael Rys
 
Data Analysis with R (combined slides)
Data Analysis with R (combined slides)Data Analysis with R (combined slides)
Data Analysis with R (combined slides)Guy Lebanon
 
Xbase - Implementing Domain-Specific Languages for Java
Xbase - Implementing Domain-Specific Languages for JavaXbase - Implementing Domain-Specific Languages for Java
Xbase - Implementing Domain-Specific Languages for Javameysholdt
 

Similar to On metadata for Open Data (20)

Hfile格式详细介绍
Hfile格式详细介绍Hfile格式详细介绍
Hfile格式详细介绍
 
T3chFest 2016 - The polyglot programmer
T3chFest 2016 - The polyglot programmerT3chFest 2016 - The polyglot programmer
T3chFest 2016 - The polyglot programmer
 
HFile: A Block-Indexed File Format to Store Sorted Key-Value Pairs
HFile: A Block-Indexed File Format to Store Sorted Key-Value PairsHFile: A Block-Indexed File Format to Store Sorted Key-Value Pairs
HFile: A Block-Indexed File Format to Store Sorted Key-Value Pairs
 
Os Vanrossum
Os VanrossumOs Vanrossum
Os Vanrossum
 
syllabusCS.pdf
syllabusCS.pdfsyllabusCS.pdf
syllabusCS.pdf
 
Standardizing on a single N-dimensional array API for Python
Standardizing on a single N-dimensional array API for PythonStandardizing on a single N-dimensional array API for Python
Standardizing on a single N-dimensional array API for Python
 
James Jesus Bermas on Crash Course on Python
James Jesus Bermas on Crash Course on PythonJames Jesus Bermas on Crash Course on Python
James Jesus Bermas on Crash Course on Python
 
Pune Clojure Course Outline
Pune Clojure Course OutlinePune Clojure Course Outline
Pune Clojure Course Outline
 
Mike Taulty OData (NxtGen User Group UK)
Mike Taulty OData (NxtGen User Group UK)Mike Taulty OData (NxtGen User Group UK)
Mike Taulty OData (NxtGen User Group UK)
 
Arduino reference
Arduino referenceArduino reference
Arduino reference
 
Generics Past, Present and Future (Latest)
Generics Past, Present and Future (Latest)Generics Past, Present and Future (Latest)
Generics Past, Present and Future (Latest)
 
Changes and Bugs: Mining and Predicting Development Activities
Changes and Bugs: Mining and Predicting Development ActivitiesChanges and Bugs: Mining and Predicting Development Activities
Changes and Bugs: Mining and Predicting Development Activities
 
An Overview Of Python With Functional Programming
An Overview Of Python With Functional ProgrammingAn Overview Of Python With Functional Programming
An Overview Of Python With Functional Programming
 
Spark Sql and DataFrame
Spark Sql and DataFrameSpark Sql and DataFrame
Spark Sql and DataFrame
 
Scripting in InduSoft Web Studio
Scripting in InduSoft Web StudioScripting in InduSoft Web Studio
Scripting in InduSoft Web Studio
 
Arduino reference
Arduino   referenceArduino   reference
Arduino reference
 
U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...
U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...
U-SQL Killer Scenarios: Custom Processing, Big Cognition, Image and JSON Proc...
 
Data Analysis with R (combined slides)
Data Analysis with R (combined slides)Data Analysis with R (combined slides)
Data Analysis with R (combined slides)
 
Managing console input
Managing console inputManaging console input
Managing console input
 
Xbase - Implementing Domain-Specific Languages for Java
Xbase - Implementing Domain-Specific Languages for JavaXbase - Implementing Domain-Specific Languages for Java
Xbase - Implementing Domain-Specific Languages for Java
 

More from Yannis Charalabidis

Truths and Myths of Innovation and Entrepreneurship
Truths and Myths of Innovation and EntrepreneurshipTruths and Myths of Innovation and Entrepreneurship
Truths and Myths of Innovation and EntrepreneurshipYannis Charalabidis
 
IMTs testimonials: The case of IMAPS in the GR Public Sector
IMTs testimonials: The case of IMAPS in the GR Public SectorIMTs testimonials: The case of IMAPS in the GR Public Sector
IMTs testimonials: The case of IMAPS in the GR Public SectorYannis Charalabidis
 
Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας
Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας
Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας Yannis Charalabidis
 
ΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική Αυτοδιοίκηση
ΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική ΑυτοδιοίκησηΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική Αυτοδιοίκηση
ΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική ΑυτοδιοίκησηYannis Charalabidis
 
Αρχαία Ελληνική Φιλοσοφία
Αρχαία Ελληνική ΦιλοσοφίαΑρχαία Ελληνική Φιλοσοφία
Αρχαία Ελληνική ΦιλοσοφίαYannis Charalabidis
 
Digital Governance & Artificial Intelligence
Digital Governance & Artificial IntelligenceDigital Governance & Artificial Intelligence
Digital Governance & Artificial IntelligenceYannis Charalabidis
 
The Generations of Digital governance : From Paper to Robots
The Generations of Digital governance : From Paper to RobotsThe Generations of Digital governance : From Paper to Robots
The Generations of Digital governance : From Paper to RobotsYannis Charalabidis
 
MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures
 MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures
MANYLAWS : EU-Wide Legal Text Mining Using Big Data InfrastructuresYannis Charalabidis
 
Digital government Challanges for Greece (slides in Greek)
Digital government Challanges for Greece (slides in Greek)Digital government Challanges for Greece (slides in Greek)
Digital government Challanges for Greece (slides in Greek)Yannis Charalabidis
 
Digital Governance Science Base
Digital Governance Science Base Digital Governance Science Base
Digital Governance Science Base Yannis Charalabidis
 
Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...
Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...
Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...Yannis Charalabidis
 
Ψηφιακή Διακυβέρνηση και Διαλειτουργικότητα
Ψηφιακή Διακυβέρνηση και ΔιαλειτουργικότηταΨηφιακή Διακυβέρνηση και Διαλειτουργικότητα
Ψηφιακή Διακυβέρνηση και ΔιαλειτουργικότηταYannis Charalabidis
 
ManyLaws CEF Project, on legal informatics
ManyLaws CEF Project, on legal informatics ManyLaws CEF Project, on legal informatics
ManyLaws CEF Project, on legal informatics Yannis Charalabidis
 
Aegean Startups 2018 - Ομάδες και Διαδικασίες Β Φάσης
Aegean Startups 2018 - Ομάδες και Διαδικασίες Β ΦάσηςAegean Startups 2018 - Ομάδες και Διαδικασίες Β Φάσης
Aegean Startups 2018 - Ομάδες και Διαδικασίες Β ΦάσηςYannis Charalabidis
 
ΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟ
ΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟ
ΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟYannis Charalabidis
 
Παρουσίαση του ΚΕΗΔ
Παρουσίαση του ΚΕΗΔΠαρουσίαση του ΚΕΗΔ
Παρουσίαση του ΚΕΗΔYannis Charalabidis
 
¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση
¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση
¨Ενα Μανιφέστο για την Ηλεκτρονική ΔιακυβέρνησηYannis Charalabidis
 
Kαινοτομία και επιχειρηματικότητα Chapter 5
Kαινοτομία και επιχειρηματικότητα Chapter 5Kαινοτομία και επιχειρηματικότητα Chapter 5
Kαινοτομία και επιχειρηματικότητα Chapter 5Yannis Charalabidis
 
Passive expert - sourcing, for policy making in the EU
Passive expert - sourcing,  for policy making in the EUPassive expert - sourcing,  for policy making in the EU
Passive expert - sourcing, for policy making in the EUYannis Charalabidis
 

More from Yannis Charalabidis (20)

Truths and Myths of Innovation and Entrepreneurship
Truths and Myths of Innovation and EntrepreneurshipTruths and Myths of Innovation and Entrepreneurship
Truths and Myths of Innovation and Entrepreneurship
 
IMTs testimonials: The case of IMAPS in the GR Public Sector
IMTs testimonials: The case of IMAPS in the GR Public SectorIMTs testimonials: The case of IMAPS in the GR Public Sector
IMTs testimonials: The case of IMAPS in the GR Public Sector
 
Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας
Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας
Μελέτη των Διαδικτυακών Τόπων των Δήμων της Ελλάδας
 
ΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική Αυτοδιοίκηση
ΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική ΑυτοδιοίκησηΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική Αυτοδιοίκηση
ΜΕΛΕΤΗ - Ψηφιακή Διακυβέρνηση στην Τοπική Αυτοδιοίκηση
 
Αρχαία Ελληνική Φιλοσοφία
Αρχαία Ελληνική ΦιλοσοφίαΑρχαία Ελληνική Φιλοσοφία
Αρχαία Ελληνική Φιλοσοφία
 
Digital Governance & Artificial Intelligence
Digital Governance & Artificial IntelligenceDigital Governance & Artificial Intelligence
Digital Governance & Artificial Intelligence
 
The Generations of Digital governance : From Paper to Robots
The Generations of Digital governance : From Paper to RobotsThe Generations of Digital governance : From Paper to Robots
The Generations of Digital governance : From Paper to Robots
 
EIT-HEI Prometheus Project
EIT-HEI Prometheus ProjectEIT-HEI Prometheus Project
EIT-HEI Prometheus Project
 
MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures
 MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures
MANYLAWS : EU-Wide Legal Text Mining Using Big Data Infrastructures
 
Digital government Challanges for Greece (slides in Greek)
Digital government Challanges for Greece (slides in Greek)Digital government Challanges for Greece (slides in Greek)
Digital government Challanges for Greece (slides in Greek)
 
Digital Governance Science Base
Digital Governance Science Base Digital Governance Science Base
Digital Governance Science Base
 
Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...
Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...
Ψηφιακός Μετασχηματισμός και Διακυβέρνηση: Διεθνείς Πολιτικές και Νέες Τεχνολ...
 
Ψηφιακή Διακυβέρνηση και Διαλειτουργικότητα
Ψηφιακή Διακυβέρνηση και ΔιαλειτουργικότηταΨηφιακή Διακυβέρνηση και Διαλειτουργικότητα
Ψηφιακή Διακυβέρνηση και Διαλειτουργικότητα
 
ManyLaws CEF Project, on legal informatics
ManyLaws CEF Project, on legal informatics ManyLaws CEF Project, on legal informatics
ManyLaws CEF Project, on legal informatics
 
Aegean Startups 2018 - Ομάδες και Διαδικασίες Β Φάσης
Aegean Startups 2018 - Ομάδες και Διαδικασίες Β ΦάσηςAegean Startups 2018 - Ομάδες και Διαδικασίες Β Φάσης
Aegean Startups 2018 - Ομάδες και Διαδικασίες Β Φάσης
 
ΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟ
ΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟ
ΝΕΕΣ ΤΕΧΝΟΛΟΓΙΚΕΣ ΚΑΤΕΥΘΥΝΣΕΙΣ ΓΙΑ ΤΟ ΕΠΙΧΕΙΡΗΣΙΑΚΟ ΛΟΓΙΣΜΙΚΟ
 
Παρουσίαση του ΚΕΗΔ
Παρουσίαση του ΚΕΗΔΠαρουσίαση του ΚΕΗΔ
Παρουσίαση του ΚΕΗΔ
 
¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση
¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση
¨Ενα Μανιφέστο για την Ηλεκτρονική Διακυβέρνηση
 
Kαινοτομία και επιχειρηματικότητα Chapter 5
Kαινοτομία και επιχειρηματικότητα Chapter 5Kαινοτομία και επιχειρηματικότητα Chapter 5
Kαινοτομία και επιχειρηματικότητα Chapter 5
 
Passive expert - sourcing, for policy making in the EU
Passive expert - sourcing,  for policy making in the EUPassive expert - sourcing,  for policy making in the EU
Passive expert - sourcing, for policy making in the EU
 

Recently uploaded

Extra-120324-Visite-Entreprise-icare.pdf
Extra-120324-Visite-Entreprise-icare.pdfExtra-120324-Visite-Entreprise-icare.pdf
Extra-120324-Visite-Entreprise-icare.pdfInfopole1
 
Scenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenariosScenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenariosErol GIRAUDY
 
TrustArc Webinar - How to Live in a Post Third-Party Cookie World
TrustArc Webinar - How to Live in a Post Third-Party Cookie WorldTrustArc Webinar - How to Live in a Post Third-Party Cookie World
TrustArc Webinar - How to Live in a Post Third-Party Cookie WorldTrustArc
 
UiPath Studio Web workshop series - Day 1
UiPath Studio Web workshop series  - Day 1UiPath Studio Web workshop series  - Day 1
UiPath Studio Web workshop series - Day 1DianaGray10
 
Flow Control | Block Size | ST Min | First Frame
Flow Control | Block Size | ST Min | First FrameFlow Control | Block Size | ST Min | First Frame
Flow Control | Block Size | ST Min | First FrameKapil Thakar
 
EMEA What is ThousandEyes? Webinar
EMEA What is ThousandEyes? WebinarEMEA What is ThousandEyes? Webinar
EMEA What is ThousandEyes? WebinarThousandEyes
 
Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024
Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024
Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024Alkin Tezuysal
 
CyberSecurity - Computers In Libraries 2024
CyberSecurity - Computers In Libraries 2024CyberSecurity - Computers In Libraries 2024
CyberSecurity - Computers In Libraries 2024Brian Pichman
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNeo4j
 
Automation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projectsAutomation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projectsDianaGray10
 
My key hands-on projects in Quantum, and QAI
My key hands-on projects in Quantum, and QAIMy key hands-on projects in Quantum, and QAI
My key hands-on projects in Quantum, and QAIVijayananda Mohire
 
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdfQ4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdfTejal81
 
Stobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through TokenizationStobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through TokenizationStobox
 
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENTSIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENTxtailishbaloch
 
Planetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile BrochurePlanetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile BrochurePlanetek Italia Srl
 
From the origin to the future of Open Source model and business
From the origin to the future of  Open Source model and businessFrom the origin to the future of  Open Source model and business
From the origin to the future of Open Source model and businessFrancesco Corti
 
Top 10 Squarespace Development Companies
Top 10 Squarespace Development CompaniesTop 10 Squarespace Development Companies
Top 10 Squarespace Development CompaniesTopCSSGallery
 
Technical SEO for Improved Accessibility WTS FEST
Technical SEO for Improved Accessibility  WTS FESTTechnical SEO for Improved Accessibility  WTS FEST
Technical SEO for Improved Accessibility WTS FESTBillieHyde
 
March Patch Tuesday
March Patch TuesdayMarch Patch Tuesday
March Patch TuesdayIvanti
 
The New Cloud World Order Is FinOps (Slideshow)
The New Cloud World Order Is FinOps (Slideshow)The New Cloud World Order Is FinOps (Slideshow)
The New Cloud World Order Is FinOps (Slideshow)codyslingerland1
 

Recently uploaded (20)

Extra-120324-Visite-Entreprise-icare.pdf
Extra-120324-Visite-Entreprise-icare.pdfExtra-120324-Visite-Entreprise-icare.pdf
Extra-120324-Visite-Entreprise-icare.pdf
 
Scenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenariosScenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenarios
 
TrustArc Webinar - How to Live in a Post Third-Party Cookie World
TrustArc Webinar - How to Live in a Post Third-Party Cookie WorldTrustArc Webinar - How to Live in a Post Third-Party Cookie World
TrustArc Webinar - How to Live in a Post Third-Party Cookie World
 
UiPath Studio Web workshop series - Day 1
UiPath Studio Web workshop series  - Day 1UiPath Studio Web workshop series  - Day 1
UiPath Studio Web workshop series - Day 1
 
Flow Control | Block Size | ST Min | First Frame
Flow Control | Block Size | ST Min | First FrameFlow Control | Block Size | ST Min | First Frame
Flow Control | Block Size | ST Min | First Frame
 
EMEA What is ThousandEyes? Webinar
EMEA What is ThousandEyes? WebinarEMEA What is ThousandEyes? Webinar
EMEA What is ThousandEyes? Webinar
 
Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024
Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024
Design and Modeling for MySQL SCALE 21X Pasadena, CA Mar 2024
 
CyberSecurity - Computers In Libraries 2024
CyberSecurity - Computers In Libraries 2024CyberSecurity - Computers In Libraries 2024
CyberSecurity - Computers In Libraries 2024
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4j
 
Automation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projectsAutomation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projects
 
My key hands-on projects in Quantum, and QAI
My key hands-on projects in Quantum, and QAIMy key hands-on projects in Quantum, and QAI
My key hands-on projects in Quantum, and QAI
 
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdfQ4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
 
Stobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through TokenizationStobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
 
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENTSIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
 
Planetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile BrochurePlanetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile Brochure
 
From the origin to the future of Open Source model and business
From the origin to the future of  Open Source model and businessFrom the origin to the future of  Open Source model and business
From the origin to the future of Open Source model and business
 
Top 10 Squarespace Development Companies
Top 10 Squarespace Development CompaniesTop 10 Squarespace Development Companies
Top 10 Squarespace Development Companies
 
Technical SEO for Improved Accessibility WTS FEST
Technical SEO for Improved Accessibility  WTS FESTTechnical SEO for Improved Accessibility  WTS FEST
Technical SEO for Improved Accessibility WTS FEST
 
March Patch Tuesday
March Patch TuesdayMarch Patch Tuesday
March Patch Tuesday
 
The New Cloud World Order Is FinOps (Slideshow)
The New Cloud World Order Is FinOps (Slideshow)The New Cloud World Order Is FinOps (Slideshow)
The New Cloud World Order Is FinOps (Slideshow)
 

On metadata for Open Data

  • 1. On Metadata for Open Data Yannis Charalabidis 25.04.2012
  • 2. Introduction We will try in the next slides to show you what is the level of expectation from metadata handling from a 2nd generation open data system
  • 3. Imagine you are in front of the ENGAGE system, and you have your URI from a dataset, somewhere in the cloud, (copied as string in the clipboard) And begin …
  • 4. Prescreening: User only gives URI of the dataset Enter (paste) the URI of your dataset _
  • 5. (then for 30 seconds you see this screen, changing) Progress of ENGAGE Resource Prescreening: ( 45% ) of jobs completed Managed to : Identify xls file Autofill, provisionally: Title Autofill, provisionally: Creator Create unique ENGAGE URI Calculate keywords Autofill, provisionally: keywords … …
  • 6. (When finishing import, the report) Report ENGAGE managed to automatically, provisionally fill in ( 21 ) of 43 metadata attributes for your dataset. Your current validity is at ( 45% ) For your dataset to be inserted in the database, you need to continue filling in ( 5 ) mandatory attributes. Your dataset will then be inserted with validity ( 55% ) If all ( 17 ) non-mandatory attributes are filled in, validity will be maximum, at 70% / limit of the insertion phase. Please select next action: Continue Park Cancel
  • 7. After import … … and then, we enter the metadata insertion page with pre-filled data, etc. When we finish, we get a similar final report. AND NOW THE ENGAGE METADATA set, that makes all that a possibility:
  • 8. But,before, some semantics: Attribute characteristics – notation: (M) : attribute is Mandatory (cannot be empty) (*) : attribute takes values from a controlled list of terms (codelist), or tree (dag of terms), or table (+) : takes values from an extendible list or tree. User may extend the list during insertion (a) : an auto-filling list (as suggestion) or otherwise automatically calculated attribute (m) : attribute accepts multiple values (v) : attribute entry can be verified through a type-checking algorithm (( x )) : x is possible, but as an option no tag : attribute is a simple string entry ---------- for the future ------------- (c0), (c1), (c2), (c3) : the importance of attribute in completeness calculation (c3 is higher – mostly important) (q0), (q1), (q2), (q3) : the importance of attribute in data quality calculation (q3 is higher – mostly important)
  • 9. A. The core attributes Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist (nodes) codelists Title (M) ((a)) Automatic: extracted from the dataset headline - - - String of the URI/dataset provided Publisher (M)(*)(+) 100 X Greece Tree of Strings PUB admin tree (100 per country, extendible) Pointer to Tree country (ENG) Creator PUB admin tree (100 per country, extendible) (M)(*)(+) 100 X Greece Tree of PS entities Prompt: same as the publisher Pointer to Tree country (ENG) Code Automatic: ENGAGE automatic classification (M)(*)(a) system (date,country,PSector,type,etc) or - - - String ENGAGE URI User - (*)(a) The user who uploads that. Automatic filling Table of Users - Pointer to Table from table of users / login
  • 10. B. The outer core attributes Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist codelists (nodes) Subject (M)(*)(+) All resource Text describing the resource in one sentence List of strings NO Pointer to List subjects It can be stored in a list and reused Type List of types: dataset, linkable dataset, (M)(*)(m) List of strings 10 ENG visualization, textual information, executable Pointer to list binary, unknown Format (M)(*)(+) List of strings 50 ENG xls xml odata … jpd pdf … (appr. 50 format types) Pointer to list Language ISO simplified (5 < 20 (EU) < ISO (3000). (M)(*) ((a)) (m) List of strings 200 ISO List Automatic: extract from language settings (when Pointer to List (ENG) XLS / ISO) Country (M)(*)(m) ISO List 5 ENGAGE countries < rest of 27 EU < other List of strings 200 Pointer to List (ENG) countries ISO country list
  • 11. C. The Public Sector Context Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist codelists (nodes) Public Sector Domain Tree of sectors (20: finance, health, social (*)(m)(+) security, etc) Tree of strings 20 ENG, GR Pointer to Tree Automatic : can be calculated from Creator, if all public sector entities have a domain Relative Public Service List of public services (i2010 20 basic services, (*)(m)(+) plus “other-reward service”, “othr permission List of strings 24 ENG, GR Pointer to List service”, “Other registry entry service”, “Other personal documents service”) Relative Information System (*)(m)(+) List of EU and national main information systems List of strings 200 GR Pointer to List (50+50*country) Legal Framework Main EU directives on open data (10), main Table of Legal national laws and decrees on open data (10 X (*)(m)(+) 100 GR Elements country)
  • 12. D. The Scientific Context Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist codelists (nodes) Scientific Sector (*)(m) Tree of strings 100 Science ENGAGE Tree of Scientific Domains Pointer to Tree Scientific Usage of Resource ENGAGE tree of scientific types/usages: events (*)(m)(+) Tree of strings 20 Science data (nature or man-made), financial data, health Pointer to Tree data, etc (20) Intended Audience List of possible audiences: citizens, enterprises, (*)(m)(+) researchers, public sector managers, public Tree of strings 20 ENGAGE Pointer to List sector officers, policy makers, members of National Parliament, MEP’s, NGO’s etc Keywords Initial list made / proposed by ENGAGE System (*)(m)(+)(a) with countries, Psector Domain, Science Domain, List of strings 200 - Pointer to List Usage. Also get from linked areas / domains / types etc
  • 13. E. URL’s – URI’s - Links Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist (nodes) codelists Type of Source Link (*)(+) URL / URI / DOI / WS / RSS/ ENGAGE / other List of Strings 10 ENG Pointer to List Source Link (URL) Codelist is String or ENGAGE URL (*)(a). Automatic: put the (*) (+) ((a)) the full list List of Strings Yes URL of ENGAGE site Pointer to List of URI’s in ENGAGE Type of Resource link (*)(+) URL / URI / DOI / WS / RSS/ ENGAGE other List of Strings 10 ENG Pointer to List Resource Link Codelist is String or ENGAGE (a). Automatic lists the link it (*) (+) ((a)) the full list List of Strings Yes already has. Pointer to List of URI’s in ENGAGE Relevant Resources Codelist is List of existing URI’s in the system . Automatic: the full list (*)(m)(+)(a) List of Strings Yes calculates from matching domain+type+ of URI’s in ENGAGE
  • 14. F. Linked Data Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist (nodes) codelists Linking status Linkable, linked, non-linked, non-linkable, (*) List of Strings 5 YES unknown Pointer to List Linked Data Set (*)(m)(+)(a)(d) URI of a linked dataset. List of URI’s No limit - Pointer to List Details of link: Linking Type (PK match) Pointer to List List of Strings 1 - Matching column of this resource String - - - Matching column of linked resource String - - - Columns of this resource, to be included (m) String - - - Columns of linked resource, to be included (m) String - - - Visualisations (*)(m)(+)(a)(d) List of URI’s No limit - Links to visualisations of current resource Pointer to List
  • 15. G. Dates and Status Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist (nodes) codelists (v) Consideration Started on - - - DATE (v) Initial Approval / Planning Started on - - - DATE (v) Planned to be valid on - - - DATE (v) Validity Started on - - - DATE (v) Validity to finish on - - - DATE (v) Rejected on - - - DATE (v) Substituted on - - - DATE Status Considered, planned, valid, valid and linked, (*) (a) rejected, outdated, substituted. List of Strings 8 ENG Pointer to List Automatic: calculation through DATES
  • 16. H. Rating Size of Existing Metadata Attribute Type of Attribute Type of codelist codelist (nodes) codelists Metadata Completeness Automatic: calculated by filled / empty non Number (1-100) - - - mandatory items Metadata Quality Automatic: calculated by specific filled / empty Number (1-100) - - - non mandatory items Citizen Rating Number (1-100) - - - As reported / calculated by relative users Researcher Rating Number (1-100) - - - As reported / calculated by relative users Business Rating Number (1-100) As reported / calculated by relative users Number of Downloads Number - - - As reported by the ENGAGE System Density of Downloads Number % - - - As number per total period of validity to date
  • 17. Not to forget: Metadata codelists where there, since the Hearing … ! An Infrastructure for Open, Linked Governmental Data Provision towards Research Communities and Citizens Proposal Evaluation Hearing Brussels 23/2/2011
  • 18. Q6: Which types of metadata will you select? • Exploit work already done by the consortium (DELFT, NTUA, AEGEAN, STFC) in public sector metadata schemas • Multi-facet design: take under consideration the fact that the data may be used in different contexts, such as research, policy making or by citizens • Take under consideration the fact that data sources may provide wildly differing metadata – go towards metadata standardisation for Open Data / a major contribution of ENGAGE • Two-phase metadata design within ENGAGE workplan (Task C1.2: Data and knowledge representation annotation and linking methods). Initial proposal based on Dublin Core, UK eGovernment Metadata Schema and eGMS+, is as following: Metadata ENGAGE Set Identifier Title Creator Publisher Country Source Type (*) Format (*) Language (*) Sector (*) Subject (*) Keywords (*) Relative Public Service (*) Relative Information System URL / URI / DOI Validity Date (from – to) Audience (*) Legal Framework Status (*) Relevant Resources Linkded Data Sets (*) (*) Indicates Controlled Lists / Taxonomies