Going local with a world-class data infrastructure: Enabling SDMX for research support

Published in: Technology, Education
  • Leverage existing data if we curate for machines. Ad 5, local perspective: infrastructure development. From the SDMX infrastructure perspective: leverage existing infrastructure.
  • Metadata records. Supplementary materials (code please!). Curate for machines. Metadata: conceptual (what), methodological (how), quality ~ usability (wow!).

    1. Going Local with a World-Class Data Infrastructure: Enabling SDMX for Research Support
       Rob Grim
       Head Research Support / Research Data Specialist
       Executive Manager, Open Data Foundation (ODaF)
       Library & IT Services, Tilburg University (Netherlands)
       IASSIST 2012, June 8, Washington
    2. Research Data Support? With SDMX?
       • Why should we support researchers anyway?
       • Why should a university use a complex set of standards such as SDMX to support research?
       • CARDS World Taxation Indicators Curation project
       • Collaborative research SDMX
       • Workflow support
       • Infrastructure development
       • Metadata management
       • What does it take to get SDMX up and running?
       Diagram label: Capture
       Slide footer: 13-6-2012
    3. Research Data Support (Tilburg University)
       1. Archive research data and supplementary materials
       2. Register data sources used and provenance information
       3. Assist with dataset description to improve accessibility of datasets
       4. Integrated library and data catalogue
       5. Subject portals, e.g. "European Values Study"
       6. Financial Data Support
       Screenshot labels: Dataset available!; DDI and RDF in metadata record (hidden)
    4. Research (Data) Support
       1. "Research Support", often used as a synonym for IT support
       2. Current research data services focus on data archiving, DMPs, curation
       3. Simple approaches to data sharing
       4. Portfolio of research data tools needed to support academic practices
       5. Potential of metadata management undervalued
       Aim for "Need to have" instead of "Nice to have"
       Landscape tools (diagram labels): Dataverse Network (DVN); Archiving + Access Management; SDMX; Metadata Repository; Questasy; Survey documentation; SDMX Data Repository
    5. Metadata Management
       Diagram labels: Metadata Registry; Capture; Dissemination; Tools; Time-series data
       Sources shown: Eurostat; OECD.Stat
       Tools needed!
    6. Why SDMX?
       1. SDMX allows us to capture and manage "data intelligence" in a formalized and structured way
       2. The SDMX information model is useful for describing time-series data from different disciplines
       3. SDMX offers a means to prevent unnecessary replication of data
       4. SDMX offers a means to deal with confidential data and IPR
       5. The standard is widely used; training materials and tutorials are available
       6. SDMX IT tools are available for different platforms: Java, .NET
       7. FAO OpenSDMX initiative (D4Science)
       8. Researchers want "something" like OECD.Stat
       Diagram labels: Curation; SDMX; Capture; FAO; OECD.Stat
    7. Workflow
       Existing tools
       Time series metadata: concepts, dimensions, attributes, …
       Verbs:
       • … Extract from PDF, CSV
       • … Convert to SDMX-ML
       • … Code 4 Registry
       URL: WTI
       Table: Overview of registry structures
       Agency                 ECB  FAOStat  GIST  IAEG  ISO  SDMX
       Agency Scheme            0        0     0     0    0     1
       Categorisations          0        0     0     0    0     0
       Category schemes         1        0     0     0    0     1
       Codelists                2        8     2    10    0     9
       Concept schemes          1        1     0     2    1     1
       DSD                      3        1     0     1    0     0
       Partial rows: Dataflows: 1, 1; Data provider scheme: 1, 1; Provision agreement: 2 (the transcript does not preserve which agency columns these values belong to)
    8. Where are we now?
       • Production workflow for SDMX
       • Populating the metadata registry
         • Enter (hierarchical) codelists
         • Concept IDs
         • Concept Schemes
         • DSDs
         • Dataflows
         • SDMX-ML Generic format
       • WTI Fusion Registry
       • SDMX data repository
         • Keep data in the original formats (csv, txt, Stata)
         • Convert data from a database to SDMX
         • Specific-purpose database for an SDMX-compliant system
         • Other: collaborate with FAO, Open SDMX?
       Figure source: SDMX Information Model
    9. Metadata registry: Fusion Registry
       Screenshot labels: Codelists; Code Values
    10. Concept Scheme
    11. Category Scheme
    12. CARDS project: World Taxation Indicators
       1. Georgia State University, International Center for Public Policy, World Tax Indicators Portal
       2. Tilburg University, Prof. Jenny Ligthart
       3. Lack of data on personal income tax (PIT), corporate income tax (CIT), value added tax (VAT) and other tax indicators
       4. Incomplete series, missing countries, tax data difficult to access (addendums) and difficult to compare
       5. Work of the WTI group: statutory tax rates. Tilburg: effective tax rates, corporate income tax.
       6. The "raw" data stem from the IMF/GFS and the OECD/Revenue Statistics.
    13. Lessons learnt so far
       • Support of senior management is needed to get beyond the project/pilot stage
       • SDMX standards are complex: steep learning curve
       • Capacity building is a must (tip: Eurostat SDMX tutorials)
       • SDMX data repository: collaborate with other organizations
       • Focus on DSDs, full target and partial identifiers, hierarchical code lists
       • Fusion Registry upgrade
       • Additional (academic) partners welcome to leverage the macroeconomic time series registry and repository
    14. Acknowledgements
       • CARDS was funded by SURF. The CARDS project was undertaken in 2011 in the framework of the SURFshare programme: Access to Research Data
       • WTI group and Prof. Jenny Ligthart
       Final thought
       Don't forget! Before you ask "What can you do for your country?", ask yourself: "What can metadata management do for you?"
       References
       1. Burgi-Schmelz, A. (2009). Data to the rescue. Why improved statistical information will be key for prevention of future crises. Finance and Development, 46(1), 31-43.
       2. Peter, K. S., Buttrick, S., & Duncan, D. Data appendix to "Global reform of personal income taxation, 1981-2005: Evidence from 189 countries".
       3. Peter, K. S., Buttrick, S., & Duncan, D. (2010). Global reform of personal income taxation, 1981-2005: Evidence from 189 countries. National Tax Journal, 6(3).
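
The workflow on slides 7 and 8 (extract from CSV, convert to SDMX-ML) could look roughly like the sketch below. It is not the CARDS project's actual tooling. The column names, the dimension list, the codes and every value in the sample are made up for illustration, and the element names only loosely follow the SDMX-ML generic data layout: there are no namespaces, no header, and nothing is validated against the SDMX schemas.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample data: codes and values are invented, not real tax figures.
SAMPLE_CSV = """\
COUNTRY,INDICATOR,TIME_PERIOD,OBS_VALUE
AA,TAX_X,2002,2.0
AA,TAX_X,2001,1.0
BB,TAX_X,2001,3.0
"""

# Assumed dimensions that identify one time series (would come from a DSD).
DIMENSIONS = ("COUNTRY", "INDICATOR")


def csv_to_generic_xml(csv_text):
    """Group CSV rows into series and emit a simplified, SDMX-like DataSet."""
    series = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        key = tuple(row[dim] for dim in DIMENSIONS)
        series.setdefault(key, []).append((row["TIME_PERIOD"], row["OBS_VALUE"]))

    dataset = ET.Element("DataSet")
    for key, observations in series.items():
        series_el = ET.SubElement(dataset, "Series")
        key_el = ET.SubElement(series_el, "SeriesKey")
        for dim, code in zip(DIMENSIONS, key):
            ET.SubElement(key_el, "Value", id=dim, value=code)
        # Sort observations by time period so each series reads chronologically.
        for period, value in sorted(observations):
            obs_el = ET.SubElement(series_el, "Obs")
            ET.SubElement(obs_el, "ObsDimension", value=period)
            ET.SubElement(obs_el, "ObsValue", value=value)
    return dataset


dataset = csv_to_generic_xml(SAMPLE_CSV)
print(ET.tostring(dataset, encoding="unicode"))
```

Grouping rows by their dimension values first mirrors the idea that a DSD defines which columns identify a series and which hold the observation. A production conversion would take that list from the registry rather than hard-coding it.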

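Slides 8 and 13 single out hierarchical codelists as a point of attention. A minimal sketch of such a codelist, assuming a simple parent reference per code, is shown here. The `Code` class, the helper functions and the code IDs are hypothetical and are not taken from the Fusion Registry or from the WTI codelists.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional


@dataclass(frozen=True)
class Code:
    id: str
    name: str
    parent: Optional[str] = None  # id of the parent code, None for a root


def build_codelist(codes: Iterable[Code]) -> Dict[str, Code]:
    """Index codes by id and check that ids are unique and parents exist."""
    codelist: Dict[str, Code] = {}
    for code in codes:
        if code.id in codelist:
            raise ValueError(f"duplicate code id: {code.id}")
        codelist[code.id] = code
    for code in codelist.values():
        if code.parent is not None and code.parent not in codelist:
            raise ValueError(f"unknown parent {code.parent!r} for {code.id!r}")
    return codelist


def path_to(codelist: Dict[str, Code], code_id: str) -> List[str]:
    """Return the ids from the root down to code_id, rejecting cycles."""
    path: List[str] = []
    current: Optional[str] = code_id
    while current is not None:
        if current in path:
            raise ValueError(f"cycle detected at {current!r}")
        path.append(current)
        current = codelist[current].parent
    return list(reversed(path))


# Hypothetical codes, loosely inspired by the tax types named on slide 12.
codelist = build_codelist([
    Code("TAX", "All taxes"),
    Code("PIT", "Personal income tax", parent="TAX"),
    Code("CIT", "Corporate income tax", parent="TAX"),
    Code("VAT", "Value added tax", parent="TAX"),
])
print(path_to(codelist, "CIT"))
```

Checking parents and cycles at load time is one way to catch entry mistakes before codes are pushed to a registry.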