SlideShare a Scribd company logo
Discovering & Dealing with Data

                                       Presented by


                           Kimberly Silk, MLS, Data Librarian,
                    Martin Prosperity Institute, University of Toronto




17 September 2012
Agenda
    • The MPI information environment
    • Common data sources & authority
    • Data management, discovery and access
    • What is Open Data? Big Data?
    • Fun with data visualization
    • Q&A



2
About the MPI
• The Martin Prosperity Institute is a economic
  think-tank; we are part of the Rotman School
  within the University of Toronto
• My client group consists of grad students, post-
  docs, visiting faculty and researchers who use
  social-science data to support their research
• To support their research process, I procure,
  curate, preserve and make discoverable data sets.
• The MPI has our own data repository that has
  grown to 4 TB in size.
                                                  3
Data Sources
    • Common & Very authoritative sources
      – StatsCan via the Data Liberation Initiative
      – Bureau of Labor Statistics, Bureau of Economic
        Analysis, American Fact Finder (Census)
      – OECD eLibrary
      – World Bank
      – Int’l sources such as UK Data Archive, Swedish
        National Data Service, etc.
      – Pew Research Center
      – Gallup
4
More data sources
    • Less authoritative??
      – Chinese Data Center
      – Rolling Stone
      – MySpace
      – CrunchBase




5
Data Challenge: Discovery

• Lots of research data
  being collected and
  added, but no method
  to manage it, catalogue
  it, or make it findable
• Demands from various
  clients: faculty,
  students, researchers,
  staff, administration
• The shared network
  drive was no longer
  effective




                            6
Show & Share…
    • We want the world to see our data catalogue
    • But, we don’t want the world to be able to
      copy or change what’s in the catalogue, or the
      catalogue itself
    • We need to manage access to our data; who
      are you? Where are you from? Why do you
      want the data? What are you going to do with
      it? Will you share your results?

7
Data Discovery Platforms
    • I reviewed several platforms that would work in
      an academic environment:
      – Nesstar – developed in Norway by Norwegian Social
        Science Data Services, used by StatsCan, UK Data
        Archive, NORC at UChicago
      – Islandora – Open source system based on Fedora
        developed at UPEI
      – ODESI – proprietary system developed and used by
        Scholars Portal
      – Dataverse – Open source system developed by the
        Institute for Quantitative Social Science at Harvard,
        used by NBER, and many academic think tanks.

8
Dataverse
    • Dataverse was a good choice since we could
      install an iteration at UToronto, in the UToronto
      cloud, and I could manage it myself
    • It was free, and my colleagues at Scholar’s Portal
      was interested in installing it – I was the perfect
      guinea pig
    • Slowly, I am cataloguing my data collection; I
      have set up a lending agreement, and it’s working
      very well.
    • Demo:
      http://dataverse.scholarsportal.info/dvn/dv/mpi

9
Open Data
 • Open data is an idea, that certain data should be
   freely available to everyone to use, reuse, and
   redistribute without restriction.
 • Governments around the world have begun to
   “open up” some of their data: US, UK, New
   Zealand, Norway, Russia, Australia, Morocco,
   Netherlands, Chile, Spain, Uruguay, France, Brazil,
   Estonia, Portugal, etc.
 • State- and municipal-levels of government have
   also created open data sites.

10
Open Data Opportunities…
 • Governments open up their data to foster
   better citizenship and improve transparency
 • Open Data can spur grass-roots innovation:
   citizens access open data to use in software
   programs to solve problems, such as finding a
   local daycare, knowing when the next bus will
   come, reporting crime on-the-fly, or watching
   congress proceedings in real time.

11
… and Challenges
 • Open Data takes commitment. Successful
   implementations have a dedicated team of
   people who decide what data to release
   according to usefulness and demand
 • The data must be anonymized, cleansed and
   in a non-proprietary format
 • Organizations must be prepared to listen to
   the citizens, be responsive, and trouble-shoot.
 • Open data is a public service.

12
Big Data
 • Big Data is a collection of data sets that is too
   large for the average database management tool
   (Access and Excel, for instance).
 • Examples come from meteorology, genomics and
   physics. At MPI we wrestle with large GIS data
   sets (maps and satellite data), and deal with data
   at the terabyte (1 trillion bytes) level.
 • Larger data sets deal with petabytes (1
   quadrillion bytes) and exabytes (1 quintillion
   bytes).

13
Data Visualizations
 • The visual representation of data ---- literally,
   a picture can say a thousand [numbers]
 • Edward Tufte is a key pioneer:
   http://www.edwardtufte.com/tufte/
 • Fantastic examples at Flowing Data:
   http://flowingdata.com/
 • RSA Animate: http://www.thersa.org/


14
Q&A

                                (and, Thank You!)



                           Kimberly Silk, MLS, Data Librarian,
                    Martin Prosperity Institute, University of Toronto
                          kimberly.silk@martinprosperity.org




17 September 2012

More Related Content

What's hot

The HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesThe HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational Services
Robert H. McDonald
 
LOD/LAM Presentation
LOD/LAM PresentationLOD/LAM Presentation
LOD/LAM Presentation
Hafabe
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
petermurrayrust
 
Linked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen CoyleLinked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen Coyle
Biblioteca Nacional de España
 
Efforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesEfforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research Libraries
LIBER Europe
 
LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey
LIBER Europe
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
Alison Hitchens
 
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
National Information Standards Organization (NISO)
 
A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter
Getaneh Alemu
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
Hazel Hall
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Stefan Dietze
 
DYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the HumanitiesDYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the Humanities
ariadnenetwork
 
Open data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni KaijageOpen data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni Kaijage
African Open Science Platform
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and libraries
Alison Hitchens
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)
Alison Hitchens
 
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for KnowledgeNovember 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
National Information Standards Organization (NISO)
 
Increase usage of online resources Edina presentation
Increase usage of online resources Edina presentationIncrease usage of online resources Edina presentation
Increase usage of online resources Edina presentation
JISC RSC Eastern
 
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP exampleLife of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
Arhiv družboslovnih podatkov
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
LEARN Project
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste..."Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
Biblioteca Nacional de España
 

What's hot (20)

The HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational ServicesThe HathiTrust Research Center: An Overview of Advanced Computational Services
The HathiTrust Research Center: An Overview of Advanced Computational Services
 
LOD/LAM Presentation
LOD/LAM PresentationLOD/LAM Presentation
LOD/LAM Presentation
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
 
Linked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen CoyleLinked data: what it means, why it matters. Karen Coyle
Linked data: what it means, why it matters. Karen Coyle
 
Efforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research LibrariesEfforts to Promote Open Science in European Research Libraries
Efforts to Promote Open Science in European Research Libraries
 
LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey LIBER Webinar: Research Data Services Survey
LIBER Webinar: Research Data Services Survey
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
 
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
 
A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
 
DYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the HumanitiesDYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the Humanities
 
Open data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni KaijageOpen data and open access landscape in Tanzania/Zaituni Kaijage
Open data and open access landscape in Tanzania/Zaituni Kaijage
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and libraries
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)
 
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for KnowledgeNovember 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
 
Increase usage of online resources Edina presentation
Increase usage of online resources Edina presentationIncrease usage of online resources Edina presentation
Increase usage of online resources Edina presentation
 
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP exampleLife of a data archive: Workflow, staff, skills, partnerships. ADP example
Life of a data archive: Workflow, staff, skills, partnerships. ADP example
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste..."Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
 

Viewers also liked

Day2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with dataDay2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with data
groundwatercop
 
Dealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to InfinityDealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to Infinity
Great Wide Open
 
Dealing With Data
Dealing With DataDealing With Data
Dealing With Data
psjelinek
 
Back-to-School Survey 2016
Back-to-School Survey 2016Back-to-School Survey 2016
Back-to-School Survey 2016
Deloitte United States
 
The Near Future of CSS
The Near Future of CSSThe Near Future of CSS
The Near Future of CSS
Rachel Andrew
 
Essential things that should always be in your car
Essential things that should always be in your carEssential things that should always be in your car
Essential things that should always be in your car
Eason Chan
 
Classroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and AdolescentsClassroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and Adolescents
Shelly Sanchez Terrell
 

Viewers also liked (7)

Day2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with dataDay2 1 nijsten_dealing with data
Day2 1 nijsten_dealing with data
 
Dealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to InfinityDealing with Unstructured Data: Scaling to Infinity
Dealing with Unstructured Data: Scaling to Infinity
 
Dealing With Data
Dealing With DataDealing With Data
Dealing With Data
 
Back-to-School Survey 2016
Back-to-School Survey 2016Back-to-School Survey 2016
Back-to-School Survey 2016
 
The Near Future of CSS
The Near Future of CSSThe Near Future of CSS
The Near Future of CSS
 
Essential things that should always be in your car
Essential things that should always be in your carEssential things that should always be in your car
Essential things that should always be in your car
 
Classroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and AdolescentsClassroom Management Tips for Kids and Adolescents
Classroom Management Tips for Kids and Adolescents
 

Similar to APLIC 2012: Discovering & Dealing with Data

Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
Hamilton Public Library
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
Hamilton Public Library
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
Marieke Guy
 
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for LibrariansOpen Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
Communication and Media Studies, Carleton University
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
AkhirulAminulloh2
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
HASHEMHASH
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
ENUG
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
National Information Standards Organization (NISO)
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
Marieke Guy
 
ICPSR Data Services
ICPSR Data ServicesICPSR Data Services
ICPSR Data Services
ICPSR
 
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
Martin Kaltenböck
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
Abby Clobridge
 
ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13
ICPSR
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
hsuleslie
 
Open Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and BenefitsOpen Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and Benefits
ariadnenetwork
 
RDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a NutshellRDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a Nutshell
Research Data Alliance
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
African Open Science Platform
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
ICPSR
 
Requirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research DataRequirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research Data
ariadnenetwork
 

Similar to APLIC 2012: Discovering & Dealing with Data (20)

Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
 
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for LibrariansOpen Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Gettind data used
Gettind data usedGettind data used
Gettind data used
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
 
ICPSR Data Services
ICPSR Data ServicesICPSR Data Services
ICPSR Data Services
 
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open DataODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
ODI Node Vienna: Best Practise Beispiele für: Open Innovation mittels Open Data
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
 
ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13ICPSR Workshop Template - 2012/13
ICPSR Workshop Template - 2012/13
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Open Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and BenefitsOpen Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and Benefits
 
RDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a NutshellRDA - The Research Data Alliance in a Nutshell
RDA - The Research Data Alliance in a Nutshell
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Requirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research DataRequirements for Open Sharing of Archaeological Research Data
Requirements for Open Sharing of Archaeological Research Data
 

More from Hamilton Public Library

OLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library PracticeOLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library Practice
Hamilton Public Library
 
OLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-upOLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-up
Hamilton Public Library
 
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library ValueOLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
Hamilton Public Library
 
Constructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and ComponentsConstructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and Components
Hamilton Public Library
 
Library Space Use Study: What we Learned
Library Space Use Study: What we Learned Library Space Use Study: What we Learned
Library Space Use Study: What we Learned
Hamilton Public Library
 
Surfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship EcosystemSurfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship Ecosystem
Hamilton Public Library
 
Library Value Projects
Library Value ProjectsLibrary Value Projects
Library Value Projects
Hamilton Public Library
 
Trends in Demonstrating Library Value
Trends in Demonstrating Library ValueTrends in Demonstrating Library Value
Trends in Demonstrating Library Value
Hamilton Public Library
 
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
Hamilton Public Library
 
L-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in CanadaL-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in Canada
Hamilton Public Library
 
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Hamilton Public Library
 
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Hamilton Public Library
 
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Hamilton Public Library
 
Evidence-Based Innovation
Evidence-Based InnovationEvidence-Based Innovation
Evidence-Based Innovation
Hamilton Public Library
 
Library Impact Studies: Lessons Learned
Library Impact Studies: Lessons LearnedLibrary Impact Studies: Lessons Learned
Library Impact Studies: Lessons Learned
Hamilton Public Library
 
Data, Metrics, and our Profession
Data, Metrics, and our ProfessionData, Metrics, and our Profession
Data, Metrics, and our Profession
Hamilton Public Library
 
CLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of LibrariesCLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of Libraries
Hamilton Public Library
 
So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...
Hamilton Public Library
 
TRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information ProfessionalTRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information ProfessionalHamilton Public Library
 
Internet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable InnovationInternet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable InnovationHamilton Public Library
 

More from Hamilton Public Library (20)

OLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library PracticeOLA Super Conference 2019: Data Skills for 21st Century Library Practice
OLA Super Conference 2019: Data Skills for 21st Century Library Practice
 
OLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-upOLA Super Conference 2019: Research Round-up
OLA Super Conference 2019: Research Round-up
 
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library ValueOLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
OLA Super Conference 2019: Changing Stakeholder Perceptions About Library Value
 
Constructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and ComponentsConstructing a Strategic Plan: Essential Processes and Components
Constructing a Strategic Plan: Essential Processes and Components
 
Library Space Use Study: What we Learned
Library Space Use Study: What we Learned Library Space Use Study: What we Learned
Library Space Use Study: What we Learned
 
Surfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship EcosystemSurfacing Integration in the Digital Scholarship Ecosystem
Surfacing Integration in the Digital Scholarship Ecosystem
 
Library Value Projects
Library Value ProjectsLibrary Value Projects
Library Value Projects
 
Trends in Demonstrating Library Value
Trends in Demonstrating Library ValueTrends in Demonstrating Library Value
Trends in Demonstrating Library Value
 
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
All Together Now: Collaboration and Coordination in Canada's Digital Scholars...
 
L-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in CanadaL-Index: Designing a New Method for Measuring Library Impact in Canada
L-Index: Designing a New Method for Measuring Library Impact in Canada
 
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
Ink On Our Hands: Plotting the Map of Canada's Integrated Digital Scholarship...
 
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
Library Evaluation in 3 Parts - Presented by Dr. Bill Irwin, Computers in Lib...
 
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
Strategic Metrics Workshop: Computers in Libraries Conference, April 2015
 
Evidence-Based Innovation
Evidence-Based InnovationEvidence-Based Innovation
Evidence-Based Innovation
 
Library Impact Studies: Lessons Learned
Library Impact Studies: Lessons LearnedLibrary Impact Studies: Lessons Learned
Library Impact Studies: Lessons Learned
 
Data, Metrics, and our Profession
Data, Metrics, and our ProfessionData, Metrics, and our Profession
Data, Metrics, and our Profession
 
CLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of LibrariesCLA 2014: The Economic Impact of Libraries
CLA 2014: The Economic Impact of Libraries
 
So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...So Much More: The Economic Impact of Toronto Public Library on the City of To...
So Much More: The Economic Impact of Toronto Public Library on the City of To...
 
TRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information ProfessionalTRY 2011 - Mentoring the 21st Century Information Professional
TRY 2011 - Mentoring the 21st Century Information Professional
 
Internet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable InnovationInternet Librarian 2010 - Using Design Thinking to Enable Innovation
Internet Librarian 2010 - Using Design Thinking to Enable Innovation
 

Recently uploaded

GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
Globus
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
UiPathCommunity
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 

Recently uploaded (20)

GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 

APLIC 2012: Discovering & Dealing with Data

  • 1. Discovering & Dealing with Data Presented by Kimberly Silk, MLS, Data Librarian, Martin Prosperity Institute, University of Toronto 17 September 2012
  • 2. Agenda • The MPI information environment • Common data sources & authority • Data management, discovery and access • What is Open Data? Big Data? • Fun with data visualization • Q&A 2
  • 3. About the MPI • The Martin Prosperity Institute is a economic think-tank; we are part of the Rotman School within the University of Toronto • My client group consists of grad students, post- docs, visiting faculty and researchers who use social-science data to support their research • To support their research process, I procure, curate, preserve and make discoverable data sets. • The MPI has our own data repository that has grown to 4 TB in size. 3
  • 4. Data Sources • Common & Very authoritative sources – StatsCan via the Data Liberation Initiative – Bureau of Labor Statistics, Bureau of Economic Analysis, American Fact Finder (Census) – OECD eLibrary – World Bank – Int’l sources such as UK Data Archive, Swedish National Data Service, etc. – Pew Research Center – Gallup 4
  • 5. More data sources • Less authoritative?? – Chinese Data Center – Rolling Stone – MySpace – CrunchBase 5
  • 6. Data Challenge: Discovery • Lots of research data being collected and added, but no method to manage it, catalogue it, or make it findable • Demands from various clients: faculty, students, researchers, staff, administration • The shared network drive was no longer effective 6
  • 7. Show & Share… • We want the world to see our data catalogue • But, we don’t want the world to be able to copy or change what’s in the catalogue, or the catalogue itself • We need to manage access to our data; who are you? Where are you from? Why do you want the data? What are you going to do with it? Will you share your results? 7
  • 8. Data Discovery Platforms • I reviewed several platforms that would work in an academic environment: – Nesstar – developed in Norway by Norwegian Social Science Data Services, used by StatsCan, UK Data Archive, NORC at UChicago – Islandora – Open source system based on Fedora developed at UPEI – ODESI – proprietary system developed and used by Scholars Portal – Dataverse – Open source system developed by the Institute for Quantitative Social Science at Harvard, used by NBER, and many academic think tanks. 8
  • 9. Dataverse • Dataverse was a good choice since we could install an iteration at UToronto, in the UToronto cloud, and I could manage it myself • It was free, and my colleagues at Scholar’s Portal was interested in installing it – I was the perfect guinea pig • Slowly, I am cataloguing my data collection; I have set up a lending agreement, and it’s working very well. • Demo: http://dataverse.scholarsportal.info/dvn/dv/mpi 9
  • 10. Open Data • Open data is an idea, that certain data should be freely available to everyone to use, reuse, and redistribute without restriction. • Governments around the world have begun to “open up” some of their data: US, UK, New Zealand, Norway, Russia, Australia, Morocco, Netherlands, Chile, Spain, Uruguay, France, Brazil, Estonia, Portugal, etc. • State- and municipal-levels of government have also created open data sites. 10
  • 11. Open Data Opportunities… • Governments open up their data to foster better citizenship and improve transparency • Open Data can spur grass-roots innovation: citizens access open data to use in software programs to solve problems, such as finding a local daycare, knowing when the next bus will come, reporting crime on-the-fly, or watching congress proceedings in real time. 11
  • 12. … and Challenges • Open Data takes commitment. Successful implementations have a dedicated team of people who decide what data to release according to usefulness and demand • The data must be anonymized, cleansed and in a non-proprietary format • Organizations must be prepared to listen to the citizens, be responsive, and trouble-shoot. • Open data is a public service. 12
  • 13. Big Data • Big Data is a collection of data sets that is too large for the average database management tool (Access and Excel, for instance). • Examples come from meteorology, genomics and physics. At MPI we wrestle with large GIS data sets (maps and satellite data), and deal with data at the terabyte (1 trillion bytes) level. • Larger data sets deal with petabytes (1 quadrillion bytes) and exabytes (1 quintillion bytes). 13
  • 14. Data Visualizations • The visual representation of data ---- literally, a picture can say a thousand [numbers] • Edward Tufte is a key pioneer: http://www.edwardtufte.com/tufte/ • Fantastic examples at Flowing Data: http://flowingdata.com/ • RSA Animate: http://www.thersa.org/ 14
  • 15. Q&A (and, Thank You!) Kimberly Silk, MLS, Data Librarian, Martin Prosperity Institute, University of Toronto kimberly.silk@martinprosperity.org 17 September 2012