Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

PASTEUR4OA Data visualisation

1,858 views

Published on

Slides that show the PASTEUR4OA data visualisation work

Published in: Education
  • Be the first to comment

PASTEUR4OA Data visualisation

  1. 1. An Introduction to Data Visualisation For Open Access advocacy 25/08/2015 An Introduction to Data Visualisation 1 Marieke Guy Marieke.guy@okfn.org
  2. 2. PASTEUR4OA » PASTEUR4OA: Open Access Policy Alignment Strategies for European Union Research » EU-funded project 2014 – 2016 » Supports the development and/or reinforcement of open access strategies and policies at the national level and facilitate their coordination among all Member States » 15 partners across Europe » Key nodes - Europe-wide network of centres of expertise on open access and scholarly communication » EC’s policy for Open Access in Horizon 2020 is the foundation of the PASTEUR4OA advocacy work 25/08/2015 An Introduction to Data Visualisation 2 http://pasteur4oa.eu/
  3. 3. ROARMAP » ROARMAP: Registry of Open Access Repository Mandates and Policies » Searchable international registry charting the growth of open access mandates adopted by universities, research institutions and research funders » PASTEUR4OA work involved developing a new classification scheme for the registry » Project has also added almost 300 new policy entries to the ROARMAP database. On 24th August 2015 contains 725 policies. » Interesting data, wanted to show in interesting ways! 25/08/2015 An Introduction to Data Visualisation 3 http://roarmap.eprints.org/
  4. 4. WRITING A SPECIFICATION
  5. 5. Data Viz Brief » Exploit the ROARMAP database and create visuals to be produced for advocates to use in presentations, literature etc. » Develop ways to show the statistics in an attractive form, e.g.: ›  Maps (Europe, the world, individual continents) showing where policies are ›  Charts showing data such as funder type, what is mandatory, etc. » Create interactive online visualisations using data dumps » Use live data if possible » For policy makers, institutions, researchers, individuals 25/08/2015 An Introduction to Data Visualisation 5 http://pasteur4oa-dataviz.okfn.org/
  6. 6. Spec: Things to think about » What is the budget for the work? » How much time and resources are available for the work? » How will you get the data out of the system it is in? API, URL or other? » Where will you store the visualisations? » Where will you store the new data created? Will you release it openly? » How often will the data be updated? » Who can help you with your work? » What is genuinely do-able? 25/08/2015 An Introduction to Data Visualisation 6
  7. 7. 25/08/2015 An Introduction to Data Visualisation 7 From Tableau
  8. 8. DECIDE EARLY ON THE QUESTIONS YOU WANT TO ANSWER
  9. 9. TALK TO EXPERTS AND SEE WHAT THEY THINK
  10. 10. CHOOSING DATASETS
  11. 11. 25/08/2015 An Introduction to Data Visualisation 11 Open Access Policies by country – shows the top 10 countries From Infogram
  12. 12. Data Sets Used » ROARMAP data - http://roarmap.eprints.org/ » Open Access (OA) policy effectiveness study – collected by PASTEUR4OA Project » Bibliographic metadata indexed by the Thompson-Reuters database (WoK) - http://wokinfo.com/ » Publication date was estimated based on the Altmetrics database and WoK publication date - http://www.altmetric.com/ 25/08/2015 An Introduction to Data Visualisation 12
  13. 13. Data Sets: Things to think about » Is it a reputable source? » Is it openly available? » How up to date is the data? » Is the data easy to get out and work on? » Has the data been manipulated? » Are there omissions of data? » Will you need to combine data sets? » Do you need to add in other data e.g. country codes/names 25/08/2015 An Introduction to Data Visualisation 13
  14. 14. An Introduction to Data Visualisation 1425/08/2015 14 From Tableau
  15. 15. MANIPULATING DATA
  16. 16. Useful Excel Functions » Sum – adding up » Count – the number of cells in a range that have numbers in them » Vlookup - lets you search for specific information in your spreadsheet » Concatenate – lets you combine text from different cells into one cell » Trim – removes extra spaces » Substitute – like replace but more versatile » Sheet/page layout – hide rows, filter data, lock columns etc. 25/08/2015 An Introduction to Data Visualisation 16
  17. 17. Data: Things to Think About » You do need to be able to use excel…well » Have you checked your data recently? » Have you saved versions of your data and labelled them well? » Are you keeping notes about what you are doing? » Are you being consistent with your data? » Make sure you document processes » Countries can be tricky – make sure you are clear and consistent 25/08/2015 An Introduction to Data Visualisation 17
  18. 18. REGULARLY CARRY OUT SANITY CHECKS ON THE DATA
  19. 19. 25/08/2015 An Introduction to Data Visualisation 19 From Tableau
  20. 20. CONTROL HOW YOUR SPREADSHEET IS LAID OUT - LESS IS MORE
  21. 21. LEARN EARLY ON HOW TO REPLACE YOUR DATA SET IN THE TOOL
  22. 22. SAVE REGULAR VERSIONS OF THE DATA
  23. 23. 25/08/2015 An Introduction to Data Visualisation 23 Open Access Policies Worldwide by Continent   From Infogram
  24. 24. CHOOSING TOOLS
  25. 25. Tools Used »  Tableau Public: http://public.tableau.com/s/ »  Open Refine: http://openrefine.org/ »  Python Pandas: http://pandas.pydata.org/ »  Many Eyes: http://www-01.ibm.com/software/analytics/many-eyes/ »  CartoDB: https://cartodb.com/ »  Raw: http://raw.densitydesign.org/ »  Datawrapper: https://datawrapper.de/ »  Infogram: https://infogr.am »  Easel.ly: http://www.easel.ly/ »  Piktochart: http://piktochart.com/ 25/08/2015 An Introduction to Data Visualisation 25 http://selection.datavisualization.ch/
  26. 26. PLAY WITH LOTS OF TOOLS AND THEN COMMIT TO ONE OR TWO
  27. 27. HAVING SEVERAL MACHINES TO USE CAN BE USEFUL
  28. 28. 25/08/2015 An Introduction to Data Visualisation 28 From Tableau
  29. 29. Tools: Things to Think About » Does it cost to use? » What is the licence? » Do you have the correct OS? » Can you get the visualisation out of the tool? » Can you link to live data? » Can you embed it? » Can you make a graphic of it? » Can you download it? » Does the tool expect you to be able to programme? 25/08/2015 An Introduction to Data Visualisation 29
  30. 30. DELIVERING DATA VISUALISATIONS
  31. 31. Delivery: Things to Think About »  Does it cost to use? »  What are the infographics going to be used for? »  What format should they be in? »  Is there a colour theme? What colours look good? »  Can you create a custom palette »  Can you distinguish between different parts of the chart? »  Is it clear what question the visualisation is answering? »  Is there enough information on the data visualisation? »  Is there a heading, comment box, labels, annotation, legend etc.? »  Is the result honest? Make sure you document where all the visualisations are 25/08/2015 An Introduction to Data Visualisation 31
  32. 32. WORKING ON LIVE DATA IS A WHOLE DIFFERENT BALLGAME
  33. 33. 25/08/2015 An Introduction to Data Visualisation 33 Live data From Google maps
  34. 34. Useful Links » Data viz site: http://pasteur4oa-dataviz.okfn.org/ » Github site: https://github.com/okfn/PASTEUR4OA-dataviz/ » Pad: https://pad.okfn.org/p/pasteur4oa-data-viz » My Tableau profile: https://public.tableau.com/profile/marieke.guy#!/ » My infogram profile: https://infogr.am/mariekeguy » Live ROARMAP data: http://bit.ly/1Ri7rjl » Data stored at: https://zenodo.org/deposit/36339/ » Seeing data : http://seeingdata.org/about-seeing-data/ 25/08/2015 An Introduction to Data Visualisation 34
  35. 35. 25/08/2015 An Introduction to Data Visualisation 35 Open Access Policies Worldwide by Continent From Infogram
  36. 36. http://pasteur4oa-dataviz.okfn.org/
  37. 37. Thanks! 25/08/2015 An Introduction to Data Visualisation 37 Contact… Marieke Guy, Open Knowledge Marieke.guy@okfn.org www.pasteur4oa.eu

×