NISO Webinar: Research Data Curation Part 2:
E-Science Librarianship
September 18, 2013
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data part 2

About the Webinar

Big data is being collected at a rate that outpaces traditional analytical methods, because the ways in which data can be created and mined keep expanding. Faculty in all disciplines are increasingly creating or incorporating big data in their research, and institutions are building repositories and other tools to manage it all. Managing and curating this data effectively poses many challenges, some similar to those of managing document archives and some quite different. Libraries can assume, and are assuming, a key role in making this information more useful, visible, and accessible, for example by creating taxonomies, designing metadata schemes, and systematizing retrieval methods.

Our panelists will talk about their experience with big data curation, best practices for research data management, and the tools used by libraries as they take on this evolving role.

  1. NISO Webinar: Research Data Curation Part 2: E-Science Librarianship. September 18, 2013. Speakers: Lisa Johnston, Research Services Librarian, Co-Director of the University Digital Conservancy, University of Minnesota Libraries; Sayeed Choudhury, Associate Dean for Research Data Management, Sheridan Libraries of Johns Hopkins University; Carly Strasser, Data Curation Project Manager, UC Curation Center (UC3), California Digital Library. http://www.niso.org/news/events/2013/webinars/data_curation
  2. Academic Libraries Get Ready: Big data is here and it needs a (caring) home. Lisa Johnston, University of Minnesota - Twin Cities. September 18, 2013. Image: http://www.sciencemag.org/content/331/6018.cover-expansion
  3. Research is digital, and this presents some challenges...
  4. Research data can also be big and hard to manage... Image: http://history.dal.ca/Images/pr_hanlon/Parma%20unsorted%20archives.JPG
  5. Federal agencies must improve public access to digital research data (OSTP Memo, Feb 22, 2013). Image: http://www.retronaut.com/wp-content/uploads/2012/12/160.jpg
  6. Preservation, organization, and public access? Sounds like a job for the libraries! Image: http://cdn.slashgear.com/wp-content/uploads/2012/10/google-datacenter-tech-13.jpg
  7. Challenge: How to create sustainable and scalable data curation programs on campus. Image: http://www.businesscomputingworld.co.uk/wp-content/uploads/2012/01/Computer-Connections.jpg
  8. 1. Helping researchers develop better data management skills. Image: http://www.spellboundblog.com/wp-content/uploads/2008/09/floppy_photo.jpg
  9. Data Management Training Initiatives by the University of Minnesota Libraries (2010 - present). Figures shown: 360; 47,862; 58. Labels shown: Visits to http://lib.umn.edu/datamanagement; Faculty and Staff attendees to drop-in Workshop “Creating a Data Management Plan” (RCR CE credit); Graduate students enrolled in open online “Data Management Course” (non-credit, Fall 2012 and Spring 2013).
  10. Interactive Workshops Facilitate Discussion. “Training Researchers on Data Management: A Scalable, Cross-Disciplinary Approach,” available in the Journal of eScience Librarianship (Vol. 1: Iss. 2). Image: http://www.businesscomputingworld.co.uk/wp-content/uploads/2012/01/Computer-Connections.jpg
  11. Data Information Literacy: IMLS-funded project 2011-14. In-Depth Interviews: ●  90-120 minutes ●  4 graduate students ●  1 faculty member ●  Interview tools: z.umn.edu/dil. Johnston, L. and Jeffryes, J. (2013). "Data Management Skills Needed by Structural Engineering Students: A Case Study at the University of Minnesota." J. Prof. Issues Eng. Educ. Pract., DOI:10.1061/(ASCE)EI.1943-5541.0000154 (Feb. 13, 2013).
  12. Online Course for Graduate Students reaches across campus. Jeffryes, J. and Johnston, L. (2013). "An E-Learning Approach to Data Information Literacy Education." 2013 ASEE Annual Conference (Atlanta). http://www.asee.org/public/conferences/20/papers/6956/view
  13. Five “Flipped Classroom” Workshops Coming Fall 2013
  14. Personal Archiving Skills Transferable to Data Info Lit
  15. 2. Listening to the Evolving Campus Need. Image: http://www.businesscomputingworld.co.uk/wp-content/uploads/2012/01/Computer-Connections.jpg
  16. Data service needs will vary (and evolve) across disciplines. Scientific Data: •  Aerospace Engineering •  Astronomical images •  Institute for Health Informatics •  Chemical Engineering Research Lab •  Botany Images of the Bell Museum Herbarium collection. GIS Data: •  Minnesota Geological Survey •  USpatial and TerraPop. Social Sciences/Survey data: •  Office of Institutional Research •  Climate Change Working Group. Arts & Humanities: •  DAH Symposium •  Ojibwe Conversation Data.
  17. Build partnerships with others who are tackling these issues
  18. Bring together data service providers in an informal way. Discussion Topics: ●  Data Storage Options on Campus ●  Metadata Standards ●  Spatial Data ●  Best Practices for De-identifying Research Data ●  Data Repositories (Local, National) ●  Data Services at the Supercomputing Institute ●  Practical Examples for Managing Data (Sciences)
  19. 3. Developing common workflows for archiving data in the library. Image: http://www.fujitsu.com/img/INTSTG/products/bpm/business-process-management-582x240.jpg
  20. Libraries offer data archiving tools and repositories: ●  Institutional Repository for self-deposit of datasets ●  Digital library collections open to user submissions of image, audio, and video ●  GIS data initiative on campus, library partnership
  21. Example Data Archived in the UDC
  22. Data Curation Pilot 2013 will develop a workflow for data curation as a service. Image: http://jivesna.com/images/dspace.png
  23. Pilot will examine six example data sets from a variety of disciplines.
  24. Data Curation Pilot Faculty Responses to “Why Participate”: “I recognize that I'm not the only person in this predicament of storing larger sets of data and that figuring out how to do this well and sustainably will help many, many folks around the University.” “Data curation goes beyond backup and storage. Meanwhile, how to archive, preserve, and provide access to (sometimes large) datasets is still new to many researchers.”
  25. We can help by training researchers to create better data, bringing together campus partners, and implementing curation techniques that scale. Image: http://cdn.slashgear.com/wp-content/uploads/2012/10/google-datacenter-tech-13.jpg
  26. Thanks. http://z.umn.edu/datapilot13
  27. The Library's Role in Enabling Data Interaction for Researchers. NISO Webinar: Research Data Curation Part 2: Libraries and Big Data. Sayeed Choudhury
  28. Data Management Services •  Johns Hopkins University Data Management Services (JHUDMS): http://dmp.data.jhu.edu •  Culmination of over a decade of R&D starting with Sloan Digital Sky Survey (SDSS) •  Implementation of Data Conservancy technology development, educational, workforce development and sustainability programs
  29. Two Stages of JHUDMS •  Pre-proposal consultation and assistance with data management plan preparation for NSF proposals, though rapidly expanding beyond NSF and into other use cases •  Post-proposal data management through JHU Data Archive •  First stage paid for directly by JHU; second stage paid for through line items within NSF proposal budgets
  30. Data Management Layers (for each layer: characteristics; implication for PI; implication relative to NSF)
      •  Curation. Characteristics: adding value throughout life-cycle. Implication for PI: feature extraction; new query capabilities; cross-disciplinary. Implication relative to NSF: competitive advantage; new opportunities.
      •  Preservation. Characteristics: ensuring that data can be fully used and interpreted. Implication for PI: ability to use own data in the future (e.g. 5 yrs); data sharing. Implication relative to NSF: satisfies NSF needs across directorates.
      •  Archiving. Characteristics: data protection including fixity, identifiers. Implication for PI: provides identifiers for sharing, references, etc. Implication relative to NSF: could satisfy most NSF requirements.
      •  Storage. Characteristics: bits on disk, tape, cloud, etc.; backup and restore. Implication for PI: responsible for restore, sharing, staffing. Implication relative to NSF: could be enough for now but not near-term future.
  31. “Big Data” •  What is Big Data? •  There are definitions based on the “V’s” of Big Data (e.g., volume, velocity, variety) •  What is clear is that it’s fundamentally different from “spreadsheet science” •  For me, if a (designated) community’s ability to deal with data is overwhelmed, it’s “Big Data” •  SDSS lessons learned: https://wiki.library.jhu.edu/x/eY1XAQ
  32. Libraries and Big Data •  My assertion is that our community has been overwhelmed by data •  While it’s essential to leverage existing capability, we need to be aware that this is the beginning of the journey •  The goal is not about supporting libraries; it is about supporting scholarship •  Data repositories need to coalesce into infrastructure •  Interaction with data should become seamless
  33. Acknowledgements •  NSF Award OCI-0830976 •  Sheridan Libraries and JHU financial support •  Data Conservancy colleagues for slides •  http://dataconservancy.org •  http://dmp.data.jhu.edu (JHU DMS) •  http://www.dlib.org/dlib/september12/mayernik/09mayernik.html (blueprint document) •  https://www.youtube.com/watch?v=F6iYXNvCRO4 (data management layer stack model)
  34. NISO Webinar: Research Data Curation Part 2: Libraries and Big Data • September 18, 2013. Questions? All questions will be posted with presenter answers on the NISO website following the webinar: http://www.niso.org/news/events/2013/webinars/data_curation
  35. Thank you for joining us today. Please take a moment to fill out the brief online survey. We look forward to hearing from you! THANK YOU
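
The Data Management Layers slide lists fixity as part of the archiving layer. The slides do not say how the JHU Data Archive or the University Digital Conservancy implement it, so the following is only a minimal sketch of the general idea, assuming SHA-256 checksums recorded at deposit time and recomputed later. The function names `sha256_of`, `record_fixity`, and `verify_fixity` are hypothetical and chosen for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large datasets do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def record_fixity(paths):
    """Build a manifest mapping each file path to its digest at deposit time."""
    return {str(p): sha256_of(p) for p in paths}


def verify_fixity(manifest):
    """Return the paths that are missing or whose digest no longer matches."""
    failed = []
    for name, expected in manifest.items():
        if not Path(name).is_file() or sha256_of(name) != expected:
            failed.append(name)
    return failed
```

In use, an archive would store the manifest alongside the deposited files and rerun `verify_fixity` on a schedule; an empty result means every file still matches its recorded checksum.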