DCXL Lightning Talk: Archiving Small Datasets

Personal Digital Archiving 2012 Conference, Internet Archive, San Francisco, CA


  1. Archiving Small Science Data Sets
     Carly Strasser | carly.strasser@ucop.edu | www.carlystrasser.net
     John Kunze
     Patricia Cruse
     Personal Digital Archiving | February 2012
  2. UGLY TRUTH: Many Earth | Environmental | Ecological scientists…
     • are not taught data management
     • don’t know what metadata are
     • can’t name data centers or repositories
     • don’t share data publicly or store it in an archive
     • aren’t convinced they should share data
     (Image: 5shortessays.blogspot.com)
  3. Where do data end up?
     [Figure recreated from Klump et al. 2006; labels: www, Data, Metadata. Images from Flickr by diylibrarian and csessums, and from blog.order2disorder.com]
  4. Where do data end up?
     [Figure recreated from Klump et al. 2006; labels: www, Data, Metadata. Images from Flickr by diylibrarian and torkildr]
  5. Facilitate
     • Data management & organization
     • Archiving
     • Sharing
     • Data reuse
     • Reproducibility
     • Publishing
  6. Why are you promoting Excel?
     Develop an open source & free Excel add-in
     Add-in:
     • Little pieces of software
     • Download to extend the capabilities of Excel
     • Appear as “ribbon”
     (Image: www.ablebits.com)
  7. Why are you promoting Excel?
     • Everyone uses it
     • Stopgap measure
  8. DCXL Project Goals
     Audience: Earth, atmospheric, environmental, ecological scientists
     Contributors: UC community, DataONE, broader community via conferences
     Method: Collect requirements via surveys, interviews, polls
  9. ~150 scientists
     • No data preservation
       – Unaware of archives
       – Resistant to sharing
     • Poor data documentation
     • 90% use other programs along with Excel
  10. Requirements
     1. Must work for Excel users without the add-in
     2. No additional software (other than add-in and Excel) necessary
     3. Can be used offline
  11. Requirements
     1. Must work for Excel users without the add-in
     2. No additional software (other than add-in and Excel) necessary
     3. Can be used offline
     4. Perform CSV compatibility checks, reporting, and automated fixes
  12. Requirements
     1. Must work for Excel users without the add-in
     2. No additional software (other than add-in and Excel) necessary
     3. Can be used offline
     4. Perform CSV compatibility checks, reporting, and automated fixes
     5. Add metadata to data file
        a. Can use existing metadata as a template
        b. Add-in can automatically generate some of the metadata where the info is available from the file
     6. Generate a citation for the data file
  13. Requirements
     1. Must work for Excel users without the add-in
     2. No additional software (other than add-in and Excel) necessary
     3. Can be used offline
     4. Perform CSV compatibility checks, reporting, and automated fixes
     5. Add metadata to data file
        a. Can use existing metadata as a template
        b. Add-in can automatically generate some of the metadata where the info is available from the file
     6. Generate a citation for the data file
     7. Deposit data and metadata in a repository
  14. dcxl.cdlib.org | DCXLatCDL | @dcxlCDL
  15. dcxl.cdlib.org | @dcxlCDL | www.facebook.com/DCXLatCDL
     www.carlystrasser.net | carlystrasser@gmail.com | @carlystrasser
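
Requirement 4 on the Requirements slides (CSV compatibility checks, reporting, and automated fixes) can be illustrated with a small sketch. The slides do not say which checks the DCXL add-in performs or how it is implemented, so everything below is an assumption made for illustration: the function names (`check_csv_compatibility`, `fix_rows`, `to_csv`) are hypothetical, the table is assumed to arrive as a list of rows of strings with the header in the first row, and the particular checks (empty or duplicate column names, ragged rows, line breaks inside cells, stray whitespace) are only plausible examples of what might make a spreadsheet awkward to export as CSV.

```python
import csv
import io


def check_csv_compatibility(rows):
    """Report issues as (row_index, column_index, message) tuples.

    Assumes rows is a list of lists of strings, header first.
    column_index is None for issues that concern a whole row.
    """
    issues = []
    if not rows:
        return issues
    width = len(rows[0])
    seen = set()
    # Header checks: every column needs a unique, non-empty name.
    for c, name in enumerate(rows[0]):
        key = name.strip()
        if key == "":
            issues.append((0, c, "empty column name"))
        elif key in seen:
            issues.append((0, c, "duplicate column name"))
        seen.add(key)
    # Cell checks, applied to every row including the header.
    for r, row in enumerate(rows):
        if len(row) != width:
            issues.append((r, None, f"row has {len(row)} cells, expected {width}"))
        for c, cell in enumerate(row):
            if "\n" in cell or "\r" in cell:
                issues.append((r, c, "line break inside cell"))
            if cell != cell.strip():
                issues.append((r, c, "leading or trailing whitespace"))
    return issues


def fix_rows(rows):
    """Apply the fixes that are safe to automate.

    Collapses every run of whitespace (line breaks included) to a single
    space, trims cell ends, and pads short rows with empty cells.
    Over-long rows and header problems are left for the user to resolve.
    """
    if not rows:
        return []
    width = len(rows[0])
    fixed = []
    for row in rows:
        cleaned = [" ".join(cell.split()) for cell in row]
        cleaned += [""] * (width - len(cleaned))
        fixed.append(cleaned)
    return fixed


def to_csv(rows):
    """Serialize rows to CSV text using the standard library writer."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Hypothetical sample: a padded header, a stray line break, a short row.
sample = [["site", "temp "], ["A", "1.5\n"], ["B"]]
for issue in check_csv_compatibility(sample):
    print(issue)
print(to_csv(fix_rows(sample)))
```

Separating the report from the fix mirrors the wording of the requirement: the scientist can see what was found before anything is changed, and only mechanical repairs are automated.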
