The Web of Sites: Creating Effective
Web Archiving Appraisal and
Collection Development Policies
Jennifer Wright
Archives and Information Management Team Leader
SAA 2013
Session 408
The Mission of Smithsonian Archives
• Appraise, acquire, and preserve the records of the Smithsonian Institution
• Offer a range of research and reference services
• Establish policy and provide expert guidance on record keeping practices
• Create and promote products and services that broaden understanding of the Smithsonian
Websites as Records
• Smithsonian’s official definition of a record: “any official recorded information, regardless of medium or characteristics, created, received, and maintained by a Smithsonian museum, office, or employee”
Smithsonian Directive 950
Management of the Smithsonian Web
• Sets policies and procedures to ensure the integrity of content, reliability of infrastructure, and usability of websites while protecting privacy of visitors and Smithsonian’s reputation
• Requires Archives to provide dispositions for unit websites, web applications, and online exhibits
• Requires Archives to maintain historical snapshots of Smithsonian websites and related content
Smithsonian Directive 814
Social Media Policy
• Sets policy for opening and maintaining official Smithsonian social media accounts
• Requires that units notify Archives when opening and before closing a social media account
• Requires Archives to maintain registry of social media accounts and to archive information contained in the accounts according to current standards and retention policies
Why Save?
• Websites and social media profiles are Smithsonian’s public face
• Similar to a publication
• May incorporate many types of materials
• May replace other formats
Sounds straightforward.
How complicated could appraisal possibly be?
Smithsonian’s Web Presence
• 257 websites + 10 mobile websites
• 89 blogs
• 26 apps for various platforms
• 578 social media accounts including:
  • 153 Facebook accounts
  • 105 Twitter accounts
  • 66 Flickr accounts
  • 66 YouTube accounts
http://www.si.edu/Connect
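The counts on this slide can be tallied as a quick check of scale. The sketch below only adds up the figures listed above; the variable names are illustrative and not part of the presentation.

```python
# Figures copied from the slide; the dictionary keys are illustrative labels.
presence = {
    "websites": 257,
    "mobile websites": 10,
    "blogs": 89,
    "apps": 26,
    "social media accounts": 578,
}
named_platforms = {"Facebook": 153, "Twitter": 105, "Flickr": 66, "YouTube": 66}

# Total number of distinct web presences listed on the slide.
total_presences = sum(presence.values())

# Social media accounts on platforms other than the four named ones.
named_total = sum(named_platforms.values())
other_platform_accounts = presence["social media accounts"] - named_total

print(total_presences, named_total, other_platform_accounts)
```

By this tally the slide lists 960 web presences, and 188 of the 578 social media accounts sit on platforms other than the four named.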
Why Not Save Everything?
• Some content already transferred to Archives in another format
• Some content is the responsibility of other units
• Some content is collections, not records
• Some content serves only as pointers to other Smithsonian and non-Smithsonian content
Other Issues Affecting Appraisal
• Certain types of files and coding don’t crawl well
  • Flash, JavaScript, some video
  • Organization and coding of site may make it impossible to capture everything wanted and exclude everything unwanted
• Social media terms of service often do not allow crawling
• Users may consider social media interactions to be private
One policy doesn’t fit all
Our Policies: Public Websites
• Permanent records but may exclude:
  • Detailed collections information
  • Large sections duplicated in another format
• Crawl annually, before and after redesign, and on day of major event
Our Policies: Intranets
• Individually appraised based upon content
• Generally block crawlers, so permanent records must be transferred via FTP, server-to-server transfer, or external drive
• Will be restricted as appropriate
Our Policies: Social Media Accounts
• Will capture most accounts one time to show they existed and how they were used
• Will crawl, use export tool, take screenshots, or a combo to best capture account
• Will not be made immediately available online to mitigate violations of terms of service
Our Policies: Social Media Accounts
• Must include or link to Smithsonian’s Terms of Use; no capture otherwise
http://www.si.edu/Termsofuse
Our Policies: Social Media Accounts
• After first capture, account will be appraised annually; significant original content will be captured again
Our Policies: Blogs
• Permanent records
• Crawl annually unless there is no link to Smithsonian’s terms of use
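The policy slides above can be read as a small decision procedure. The following is a minimal sketch of that reading, not the Archives' actual workflow: the `WebPresence` class, its field names, and the returned strings are all hypothetical, and the slides' hedges ("most accounts", "generally") are flattened into simple rules.

```python
from dataclasses import dataclass


@dataclass
class WebPresence:
    # All field names are hypothetical; they paraphrase conditions on the slides.
    kind: str  # "public_website", "intranet", "social_media", or "blog"
    links_terms_of_use: bool = True
    already_captured: bool = False
    significant_original_content: bool = False


def capture_decision(p: WebPresence) -> str:
    """Return a rough capture decision following the policy slides."""
    if p.kind == "public_website":
        return "crawl annually, around redesigns, and on major event days"
    if p.kind == "intranet":
        # Intranets generally block crawlers, so records are transferred directly.
        return "appraise individually; transfer permanent records directly"
    if p.kind not in ("social_media", "blog"):
        raise ValueError(f"unknown kind: {p.kind}")
    if not p.links_terms_of_use:
        # Blogs and social media accounts need a link to the Terms of Use.
        return "no capture"
    if p.kind == "blog":
        return "crawl annually"
    # Social media: the slides say "most" accounts are captured once;
    # this sketch treats that as every account.
    if not p.already_captured:
        return "capture once"
    if p.significant_original_content:
        return "capture again"
    return "no further capture this year"


print(capture_decision(WebPresence("social_media", already_captured=True)))
```

A real appraisal involves judgment that a function like this cannot encode; the sketch only shows how the Terms of Use requirement gates every blog and social media capture.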
Questions?
Jennifer Wright
Archives and Information
Management Team Leader
wrightjm@si.edu
http://www.siarchives.si.edu/
SAA 2013 Session 408
Original Smithsonian Home
Page, launched May 8, 1995

Editor's Notes

  1. By this definition, any official web presence maintained by Smithsonian units is considered a record and subject to appraisal by the Archives.
  2. The Smithsonian also has two directives governing its web presence that give the Archives specific responsibilities.
  3. An organization’s web presence may be larger than you realize.
  4. Not to mention iTunes, Pinterest, UStream, FourSquare, Instagram, Tumblr, Google+, Wikis, Vine, Vimeo, and many others. That’s a lot of data to be captured, preserved, and stored over the long haul. We need to make sure we’re not capturing more than is necessary.
  5. There are also technical and legal issues affecting appraisal.
  6. We’ve found that one policy doesn’t fit every situation, and we’ve developed general policies for different types of web presences.
  7. Annually is our goal, but we’re still working up to that frequency.
  8. On the left is my favorite example of original content. On April 30, 2012, the National Zoo live-tweeted from the artificial insemination of our giant panda. On the right is an excerpt from the Smithsonian Magazine’s Twitter feed. It simply tweets teasers and links to its blog posts and other web content. The account has immediate marketing value, but not long-term significance.