Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data part 2
 

Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data part 2

on

  • 2,755 views

About the Webinar ...

About the Webinar

Big data is being collected at a rate that is surpassing traditional analytical methods due to the constantly expanding ways in which data can be created and mined. Faculty in all disciplines are increasingly creating and/or incorporating big data into their research and institutions are creating repositories and other tools to manage it all. There are many challenge to effectively manage and curate this data—challenges that are both similar and different to managing document archives. Libraries can and are assuming a key role in making this information more useful, visible, and accessible, such as creating taxonomies, designing metadata schemes, and systematizing retrieval methods.

Our panelists will talk about their experience with big data curation, best practices for research data management, and the tools used by libraries as they take on this evolving role.

Statistics

Views

Total Views
2,755
Views on SlideShare
787
Embed Views
1,968

Actions

Likes
2
Downloads
35
Comments
0

21 Embeds 1,968

http://www.niso.org 1059
http://www.scilogs.com 628
http://www.infodocket.com 130
http://cloud.feedly.com 84
http://www.newsblur.com 16
http://plus.url.google.com 10
http://www.feedspot.com 7
http://127.0.0.1 6
http://newsgator.cgsh.com 6
http://feedly.com 5
http://newsblur.com 4
http://translate.googleusercontent.com 3
http://silverreader.com 2
https://www.google.com 1
https://www.google.co.in 1
http://feedreader.com 1
http://reader.aol.com 1
http://www.hanrss.com 1
http://www.inoreader.com 1
http://digg.com 1
http://www.google.co.jp 1
More...

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data part 2 Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data part 2 Presentation Transcript

  • NISO Webinar: Research Data Curation Part 2: E-Science Librarianship September 18, 2013 Speakers: Lisa Johnston Research Services Librarian, Co-Director of the University Digital Conservancy, University of Minnesota Libraries Sayeed Choudhury Associate Dean for Research Data Management, Sheridan Libraries of Johns Hopkins University Carly Strasser Data Curation Project Manager, UC Curation Center (UC3), California Digital Library http://www.niso.org/news/events/2013/webinars/data_curation
  • Academic Libraries Get Ready: Big data is here and it needs a (caring) home Lisa Johnston University of Minnesota - Twin Cities September 18, 2013 Image: http://www.sciencemag.org/content/331/6018.cover-expansion
  • Research is digital, and this presents some challenges...
  • Research data can also be big and hard to manage... Image: http://history.dal.ca/Images/pr_hanlon/Parma%20unsorted%20archives.JPG
  • Image: http://www.retronaut.com/wp-content/uploads/2012/12/160.jpg Federal agencies must improve public access to digital research data (OSTP Memo, Feb 22, 2013).
  • Preservation, organization, and public access? Sounds like a job for the libraries! Image Http://cdn.slashgear.com/wp-content/uploads/2012/10/google-datacenter-tech-13.jpg
  • Challenge: How to create sustainable and scaleable data curation programs on campus. Image: http://www.businesscomputingworld.co.uk/wp-content/uploads/2012/01/Computer-Connections.jpg
  • 1. Helping researchers develop better data management skills. Image: http://www.spellboundblog.com/wp-content/uploads/2008/09/floppy_photo.jpg
  • Data Management Training Initiatives by the University of Minnesota Libraries (2010 - present) 360 47,862 58 Visits to http://lib.umn.edu/datamanagement Faculty and Staff attendees to drop-in Workshop “Creating a Data Management Plan” (RCR CE credit) Graduate students enrolled in open online “Data Management Course” (non-credit Fall 2012 and Spring 2013)
  • Image: http://www.businesscomputingworld.co.uk/wp-content/uploads/2012/01/Computer-Connections.jpg ”Training Researchers on Data Management: A Scalable, Cross- Disciplinary Approach," Available in the Journal of eScience Librarianship (Vol. 1: Iss. 2) Interactive Workshops Facilitate Discussion
  • Data Information Literacy: IMLS-funded project 2011-14 In-Depth Interviews ●  90-120 minutes ●  4 graduate students ●  1 faculty member ●  Interview tools: z.umn.edu/dil Johnston, L. and Jeffryes, J. (2013). "Data Management Skills Needed by Structural Engineering Students: A Case Study at the University of Minnesota." J. Prof. Issues Eng. Educ. Pract., DOI:10.1061/(ASCE)EI.1943-5541.0000154 (Feb. 13, 2013).
  • Online Course for Graduate Students reaches across campus Jeffryes, J. and Johnston, L. (2013). "An E-Learning Approach to Data Information Literacy Education." 2013 ASEE Annual Conference (Atlanta). http://www.asee.org/public/conferences/20/papers/6956/view
  • Five “Flipped Classroom” Workshops Coming Fall 2013
  • Personal Archiving Skills Transferable to Data Info Lit
  • 2. Listening to the Evolving Campus Need Image: http://www.businesscomputingworld.co.uk/wp-content/uploads/2012/01/Computer-Connections.jpg
  • Scientific Data: •  Aerospace Engineering •  Astronomical images •  Institute for Health Informatics •  Chemical Engineering Research Lab •  Botany Images of the Bell Museum Herbarium collection GIS Data: •  Minnesota Geological Survey •  USpatial and TerraPop Social Sciences/Survey data: •  Office of Institutional Research •  Climate Change Working Group Arts & Humanities: •  DAH Symposium •  Ojibwe Conversation Data Data service needs will vary (and evolve) across disciplines
  • Build partnerships with others who are tackling these issues
  • Bring together data service providers in an informal way Discussion Topics: ●  Data Storage Options on Campus ●  Metadata Standards ●  Spatial Data ●  Best Practices for De- identifying Research Data ●  Data Repositories (Local, National) ●  Data Services at the Supercomputing institute ●  Practical Examples for Managing data (Sciences)
  • 3. Developing common workflows for archiving data in the library. Image: http://www.fujitsu.com/img/INTSTG/products/bpm/business-process-management-582x240.jpg
  • Libraries offer data archiving tools and repositories ●  Institutional Repository for self-deposit of datasets ●  Digital library collections open to user-submissions for image, audio, and video. ●  GIS data initiative on campus, library partnership.
  • Example Data Archived in the UDC
  • Image: http://jivesna.com/images/dspace.png Data Curation Pilot 2013 will develop a workflow for data curation as a service.
  • Pilot will examine six example data sets from a variety of disciplines.
  • “I recognize that I'm not the only person in this predicament of storing larger sets of data and that figuring out how to do this well and sustainably will help many, many folks around the University.” “Data curation goes beyond backup and storage. Meanwhile, how to archive, preserve, and provide access to (sometimes large) datasets is still new to many researchers” Data Curation Pilot Faculty Responses to “Why Participate”
  • We can help by training researchers to create better data, bringing together campus partners, and implementing curation techniques that scale. Image Http://cdn.slashgear.com/wp-content/uploads/2012/10/google-datacenter-tech-13.jpg
  • Thanks http://z.umn.edu/datapilot13
  • The Library's Role in Enabling Data Interaction for Researchers NISO  Webinar:  Research  Data  Cura6on   Part  2:  Libraries  and  Big  Data   Sayeed  Choudhury  
  • Data  Management  Services   •  Johns  Hopkins  University  Data  Management   Services  (JHUDMS)  –  hJp://dmp.data.jhu.edu   •  Culmina6on  of  over  a  decade  of  R&D  star6ng   with  Sloan  Digital  Sky  Survey  (SDSS)   •  Implementa6on  of  Data  Conservancy   technology  development,  educa6onal,   workforce  development  and  sustainability   programs  
  • Two  Stages  of  JHUDMS   •  Pre-­‐proposal  consulta6on  and  assistance  with   data  management  plan  prepara6on  for  NSF   proposals  –  though  rapidly  expanding  beyond   NSF  and  into  other  use  cases   •  Post-­‐proposal  data  management  through  JHU   Data  Archive   •  First  stage  paid  for  directly  by  JHU;  second   stage  paid  for  through  line  items  within  NSF   proposal  budgets  
  • Data  Management  Layers   Layers   Characteris6cs   Implica6on  for  PI   Implica6on  rela6ve   to  NSF   Cura6on   Adding  value  throughout   life-­‐cycle   •  Feature  Extrac6on   •  New  query   capabili6es   •  Cross-­‐disciplinary   •  Compe66ve   advantage   •  New   opportuni6es   Preserva6on   Ensuring  that  data  can   be  fully  used  and   interpreted   •  Ability  to  use  own   data  in  the  future   (e.g.  5  yrs)   •  Data  sharing     •  Sa6sfies  NSF   needs  across   directorates     Archiving   Data  protec6on  including   fixity,  iden6fiers   •  Provides  iden6fiers   for  sharing,   references,  etc.   •  Could  sa6sfy  most   NSF  requirements   Storage   Bits  on  disk,  tape,  cloud,   etc.   Backup  and  restore   •  Responsible  for:   •  Restore   •  Sharing   •  Staffing   •  Could  be  enough   for  now  but  not   near-­‐term  future  
  • “Big  Data”   •  What  is  Big  Data?   •  There  are  defini6ons  based  on  the  “V’s”  of  Big   Data  (e.g.,  volume,  velocity,  variety)   •  What  is  clear  is  that  it’s  fundamentally  different   from  “spreadsheet  science”   •  For  me,  if  a  (designated)  community’s  ability  to   deal  with  data  is  overwhelmed,  it’s  “Big  Data”   •  SDSS  lessons  learned:   hJps://wiki.library.jhu.edu/x/eY1XAQ  
  • Libraries  and  Big  Data   •  My  asser6on  is  that  our  community  has  been   overwhelmed  by  data   •  While  it’s  essen6al  to  leverage  exis6ng  capability,   we  need  to  be  aware  that  this  is  the  beginning  of   the  journey   •  The  goal  is  not  about  suppor6ng  libraries  –  it  is   about  suppor6ng  scholarship   •  Data  repositories  need  to  coalesce  into   infrastructure   •  Interac6on  with  data  should  become  seamless  
  • •  NSF  Award  OCI-­‐0830976   •  Sheridan  Libraries  and  JHU  financial  support   •  Data  Conservancy  colleagues  for  slides   •  hJp://dataconservancy.org     •  hJp://dmp.data.jhu.edu  -­‐-­‐  JHU  DMS   •  hJp://www.dlib.org/dlib/september12/mayernik/ 09mayernik.html  -­‐-­‐  blueprint  document   •  hJps://www.youtube.com/watch?v=F6iYXNvCRO4  -­‐-­‐   data  management  layer  stack  model   Acknowledgements  
  • NISO Webinar • September 18, 2013 ! ! ! ! ! Questions?! All questions will be posted with presenter answers on the NISO website following the webinar:! ! http://www.niso.org/news/events/2013/webinars/data_curation NISO Webinar: Research Data Curation Part 2: Libraries and Big Data
  • Thank you for joining us today. 
 Please take a moment to fill out the brief online survey. We look forward to hearing from you! THANK YOU