DataUp: Data Curation for Excel

Presentation for the University of North Texas 2012 Open Access Symposium. Provides an overview of the DataUp (formerly DCXL) tool.

Published in: Technology

  1. Facilitating data stewardship practices for scientists. Carly Strasser | carly.strasser@ucop.edu | www.carlystrasser.net. Open Access symposium | University of North Texas | May 2012
  2. UGLY TRUTH: Many Earth | Environmental | Ecological scientists… (image: 5shortessays.blogspot.com)
       • are not taught data management
       • don’t know what metadata are
       • can’t name data centers or repositories
       • don’t share data publicly or store it in an archive
       • aren’t convinced they should share data
  3. Where data end up. [Figure, labels: www, Data, Metadata. Recreated from Klump et al. 2006. Images from Flickr by diylibrarian and csessums, and from blog.order2disorder.com.]
  4. Where data end up. [Figure, labels: www, Data, Metadata. Recreated from Klump et al. 2006. Images from Flickr by diylibrarian and torkildr.]
  5. Intercept the researchers where they already work:
  6. [Bar chart: percent of respondents (0 to 100) who use Excel for these tasks: organizing data, visualizing data, statistics, sharing data. Grouped by frequency of Excel use: rare or occasional use, moderate use, every day or almost every day.]
  7. Facilitate: data management & organization, archiving, sharing, data reuse, reproducibility, publishing
  8. DCXL overview
       • Open source add-in & web application
       • Facilitate data management, sharing, archiving for scientists
       • Focus on atmospheric, ecological, hydrological, and oceanographic data
       • Collect requirements for add-in from scientists, data centers, libraries
  9. Add-in & Web Application? (image: www.ablebits.com)
     Add-in
       • Little pieces of software
       • Download to extend the capabilities of Excel
       • Appear as “ribbon” in Excel
       • Only work with Windows Excel 2007+
       • Available offline but updates difficult
  10. Add-in & Web Application?
     Add-in
       • Little pieces of software
       • Download to extend the capabilities of Excel
       • Appear as “ribbon” in Excel
       • Only work with Windows Excel 2007+
       • Available offline but updates difficult
     Web-based application
       • Websites that do something with info/files provided by user
       • Examples: Facebook, YouTube
       • No program download required but updates easy
       • New user interface to learn
  11. What will DCXL do? What do scientists need?
  12. ~150 scientists
       • No data preservation
           – Unaware of archives
           – Resistant to sharing
       • Poor data documentation
       • 90% use other programs along with Excel
  13. Requirements
       1. Must work for Excel users without the add-in
       2. No additional software necessary
       3. Can be used offline
       4. Perform CSV compatibility checks, reporting, and automated fixes
       5. Add metadata to data file
           a. Can use existing metadata as a template
           b. Add-in can automatically generate some of the metadata where the info is available from the file
       6. Generate a citation for the data file
       7. Deposit data and metadata in a repository
  14. Requirements / Features
       1. Compatibility check
       2. Generate metadata
       3. Generate citation
       4. Post data to repository
  15. DCXL Add-in Ribbon
  16. Open Access?
  17. Vision for Future
       • Community adoption
       • Extension to other programs
           – Google Docs, OpenOffice
       • Incorporation of other metadata schemas
       • Repository adoption
       • Partnerships: FigShare, F1000, USGS, etc.
  18. Website: dcxl.cdlib.org
  19. dcxl.cdlib.org | @dcxlCDL | www.facebook.com/DCXLatCDL | www.carlystrasser.net | carlystrasser@gmail.com | @carlystrasser
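
Slide 13 lists "CSV compatibility checks, reporting, and automated fixes" as a requirement but does not say which checks the add-in performs. The Python sketch below is only an illustration of what such a step could look like. The function names and the specific checks (empty headers, ragged rows, embedded line breaks, stray whitespace) are assumptions chosen for the example, not DataUp's actual behavior.

```python
import csv
import io


def check_csv_compatibility(rows):
    """Report cells likely to cause trouble when a sheet is exported to CSV.

    `rows` is a list of rows, each a list of cell values; the first row is
    treated as the header. The checks are illustrative assumptions, not
    the checks the DataUp add-in actually runs.
    Returns a list of (row_index, column_index_or_None, message) tuples.
    """
    issues = []
    if not rows:
        return issues
    width = len(rows[0])
    for c, name in enumerate(rows[0]):
        if str(name).strip() == "":
            issues.append((0, c, "empty column header"))
    for r, row in enumerate(rows):
        if len(row) != width:
            issues.append((r, None, f"expected {width} cells, found {len(row)}"))
        for c, cell in enumerate(row):
            text = str(cell)
            if "\n" in text or "\r" in text:
                issues.append((r, c, "embedded line break"))
            if text != text.strip():
                issues.append((r, c, "leading or trailing whitespace"))
    return issues


def fix_rows(rows):
    """Automated fix: collapse every run of whitespace (line breaks
    included) to a single space and trim both ends of each cell."""
    return [[" ".join(str(cell).split()) for cell in row] for row in rows]


def to_csv(rows):
    """Serialize rows as CSV text using the standard library writer."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()
```

A caller would run `check_csv_compatibility` first to produce the report, then `fix_rows` for the automated repairs, and finally `to_csv` to write the cleaned data. Ragged rows are reported but not repaired here, since padding or truncating them would require a decision from the scientist.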
