Analyzing
                     Data about our Data


                           Heather	
  Piwowar	
  @researchremix	
  
                               cofounder,	
  ImpactStory.	
  	
  Postdoc	
  at	
  NESCent.



some photos NC, SA
                                                                  #RDS2013	
  
We want to understand
the impact of our datasets
Something that we
data-people often forget:
IMPACT ISN’T JUST CITATIONS
Many types of engagement:
   • views
   • saves
   • discussions
   • formal references
   • recommendations
Many engagement groups:
   • researchers
   • teachers
   • students
   • policy makers
   • practitioners
journal article citation

formal reference
from a scholar
journal article in-text link

formal reference
from a scholar
blog in-text link

informal discussion
from a scholar
tweet in-text link

informal discussion
from a scholar
... or a teacher, student,
policy maker, ...
Mendeley,
delicious bookmark

save
by someone
for some reason :)
GitHub star

recommendation
by someone
http://www.flickr.com/photos/pixscapes/4331070047
Analyze
Data about our Data
Impact flavour




                 CC-BY-NC by maniacyak on flickr
                 http://www.flickr.com/photos/maniacyak/3432589472
How can we do this?

What is possible now?

What data exists, and how
can we analyze it?
Open metrics, with context, for diverse products.!




                                 Board:
                                 Cameron Neylon
                                 John Wilbanks
Also:

Altmetric.com
Plum Analytics
Data Citation Index
1. More data about our data
a) More metrics exposed
http://dx.doi.org/10.5061/dryad.18
http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/3131/utilization
b) More support for all
   types of engagement
Today: full pallet for articles
Tools to support all stages
of web-native engagement
still lacking for datasets.

  #callToAction
c) more derived metrics,
   more context
2) More open
http://www.flickr.com/photos/quinnanya/2055471833
OPEN text mining of articles
OPEN data from repos
OPEN metrics
     from aggregators
1. More data about our data
2. More open
3. More awareness
http://datapub.cdlib.org/nsf-now-allows-data-in-biosketch-
                     accomplishments/

 Piwowar H. (2013).Value all research products, Nature,
     493 (7431) 159-159. DOI: 10.1038/493159a
Drive demand
Drive change
1. More data about our data
2. More open
3. More awareness
http://www.flickr.com/photos/blmurch/722900022/
thank you!
Jason Priem: cofounder of ImpactStory
Also: Todd Vision, Mike Whitlock, the open science community, and
   those who release their articles, datasets and photos openly.



                   blog.ImpactStory.org
                  team@ImpactStory.org
                       @ImpactStory


                       ImpactStory.org

Analyzing data about our data