Evaluating the SiteStory Transactional Web Archive with the ApacheBench Tool
Justin F. Brunelle
Michael L. Nelson
Lyudmila Balakireva
Robert Sanderson
Herbert Van de Sompel
TPDL 2013, September 24, 2013
Assessing the Quality of Web Archives
Michael L. Nelson
Scott G. Ainsworth, Justin F. Brunelle,
Mat Kelly, Hany SalahEldeen,
Michele C. Weigle
Digital Preservation Meeting, July 22-23, 2014
Washington DC
Profiling Web Archive Coverage for Top-Level Domain and Content LanguageMichael Nelson
Profiling Web Archive Coverage for Top-Level Domain and Content Language
Ahmed AlSum, Michele C. Weigle, Michael L. Nelson,
Herbert Van de Sompel
TPDL 2013
September 22-26, 2013
Valletta, Malta
On the Change in Archivability of Websites Over TimeMichael Nelson
On the Change in Archivability of Websites Over Time
Mat Kelly, Justin F. Brunelle, Michele C. Weigle, and Michael L. Nelson
Old Dominion University
TPDL 2013, Valletta, Malta, September 23, 2013
Who and What Links to the Internet ArchiveMichael Nelson
Who and What Links to the Internet Archive
Yasmin AlNoamany, Ahmed AlSum, Michele C. Weigle, Michael L. Nelson
TPDL 2013, September 25, 2013
Best Student Paper Award Winner
The evolution of the Web should move forward in an upward spiral that cylces between guiding values, engineering and science. Guiding values should comprise social values as well as system principles that further stabilization and growth of the Web. Principles I will talk about will include social inclusion, connectedness and fairness. Example efforts improve Web access for disabled, critically access Web structures and Web growth, and try to transfer knowledge about previously found patterns of Web growth to analogous cases.
The Memento Protocol and Research Issues With Web ArchivingMichael Nelson
Michael L. Nelson
Old Dominion University
Web Science & Digital Libraries Research Group
www.cs.odu.edu/~mln/
University of Virginia Colloquium
2016-09-12
Assessing the Quality of Web Archives
Michael L. Nelson
Scott G. Ainsworth, Justin F. Brunelle,
Mat Kelly, Hany SalahEldeen,
Michele C. Weigle
Digital Preservation Meeting, July 22-23, 2014
Washington DC
Profiling Web Archive Coverage for Top-Level Domain and Content LanguageMichael Nelson
Profiling Web Archive Coverage for Top-Level Domain and Content Language
Ahmed AlSum, Michele C. Weigle, Michael L. Nelson,
Herbert Van de Sompel
TPDL 2013
September 22-26, 2013
Valletta, Malta
On the Change in Archivability of Websites Over TimeMichael Nelson
On the Change in Archivability of Websites Over Time
Mat Kelly, Justin F. Brunelle, Michele C. Weigle, and Michael L. Nelson
Old Dominion University
TPDL 2013, Valletta, Malta, September 23, 2013
Who and What Links to the Internet ArchiveMichael Nelson
Who and What Links to the Internet Archive
Yasmin AlNoamany, Ahmed AlSum, Michele C. Weigle, Michael L. Nelson
TPDL 2013, September 25, 2013
Best Student Paper Award Winner
The evolution of the Web should move forward in an upward spiral that cylces between guiding values, engineering and science. Guiding values should comprise social values as well as system principles that further stabilization and growth of the Web. Principles I will talk about will include social inclusion, connectedness and fairness. Example efforts improve Web access for disabled, critically access Web structures and Web growth, and try to transfer knowledge about previously found patterns of Web growth to analogous cases.
The Memento Protocol and Research Issues With Web ArchivingMichael Nelson
Michael L. Nelson
Old Dominion University
Web Science & Digital Libraries Research Group
www.cs.odu.edu/~mln/
University of Virginia Colloquium
2016-09-12
Web Archiving Activities of ODU’s Web Science and Digital Library Research G...Michael Nelson
Michael L. Nelson
@phonedude_mln
Michele C. Weigle
@weiglemc
National Symposium on Web Archiving Interoperability
2017-02-21
Many projects joint with LANL
Funding from NSF, IMLS, NEH, and AMF
InterPlanetary Wayback: The Next Step Towards Decentralized Web ArchivingSawood Alam
InterPlanetary Wayback (IPWB) facilitates permanence and collaboration in web archives by disseminating the contents of WARC files into the InterPlanetary File System (IPFS) network. IPFS is a peer-to-peer content-addressable file system that inherently allows deduplication and facilitates opt-in replication. IPWB splits the header and payload of WARC response records before disseminating into IPFS to leverage the deduplication, builds a CDXJ index with references to the IPFS hashes returns, and combines the headers and payload from IPFS at the time of replay. We also explore the possibility of an index-free, fully decentralized collaborative web archiving system as the next step.
Profiling Web Archival Voids for Memento RoutingSawood Alam
Slides of the paper presentation, "Profiling Web Archival Voids for Memento Routing", for JCDL 2021.
Authors: Sawood Alam, Michele C. Weigle, Michael L. Nelson
Preprint: https://arxiv.org/abs/2108.03311
Recording: https://youtu.be/ImJWkndNoS8
To the Rescue of the Orphans of Scholarly CommunicationMartin Klein
To the Rescue of the Orphans of Scholarly Communication
presentation at CNI Spring 2017 meeting
Herbert Van de Sompel
http://orcid.org/0000-0002-0715-6126
Michael L. Nelson
http://orcid.org/0000-0003-3749-8116
Martin Klein
http://orcid.org/0000-0003-0130-2097
Presentation for PIDapalooza 2016. PIDs need to be used to achieve their intended persistence. Our research (reported at WWW2016, see http://arxiv.org/1602.09102) found that a disturbing percentage of references to papers that have DOIs actually use the landing page HTTP URI instead of the DOI HTTP URI. The problem is likely related to tools used for collecting references such as bookmarks and reference managers. These select the landing page URI instead of the DOI URI because the former is what's available in the address bar. It can safely be assumed that the same problem exists for other types of PIDs. The net result is that the true potential of PIDs is not realized. In order to ameliorate this problem we propose a Signposting pattern for PIDs (http://signposting.org/identifier/). It consists of adding a Link header to HTTP HEAD/GET responses for all resources identified by a DOI, including the landing page and content resources such as "the PDF" and "the dataset". The Link header contains a link, which points with the "identifier" relation type to the DOI HTTP URI. When such a link is available, tools can automatically discover and use the DOI URI instead of the other URIs (landing page, PDF, dataset) associated with the DOI-identified object.
Altitude San Francisco 2018: Programming the EdgeFastly
Programming the edge
Second floor
Andrew Betts
Principal Developer Advocate, Fastly
Hide abstract
Through our support for running your own code on our edge servers, Fastly's network offers you a platform of unparalleled speed, reliability and efficiency to which you can delegate a surprising amount of logic that has traditionally been in the application layer. In this workshop, you'll implement a series of advanced edge solutions, and learn how to apply these patterns to your own applications to reduce your origin load, dramatically improve performance, and make your applications more secure.
Santa Fe Complex
March 13, 2009
Martin Klein, Frank McCown,
Joan Smith, Michael L. Nelson
Department of Computer Science
Old Dominion University
Norfolk VA
Intelligent web crawling
Denis Shestakov, Aalto University
Slides for tutorial given at WI-IAT'13 in Atlanta, USA on November 20th, 2013
Outline:
- overview of web crawling;
- intelligent web crawling;
- open challenges
This slide deck provides an overview of proposals to use HTTP Links as a means to address some long standing problems related to scholarly resources on the web.
An Introduction to Linked Data for Librarians (2018-06-28)Cliff Landis
Presented to the Special Libraries Association Georgia Chapter for their Spring Luncheon. This presentation gives advice for librarians on how to get started exploring and implementing linked data.
Scraping with Python for Fun and Profit - PyCon India 2010Abhishek Mishra
Tim Berners-Lee - On the Next Web talks about open, linked data. Sweet may the future be, but what if you need the data entangled in the vast web right now?
Mostly inspired from author's work on SpojBackup, this talk familiarizes beginners with the ease and power of web scraping in Python. It would introduce basics of related modules - Mechanize, urllib2, BeautifulSoup, Scrapy, and demonstrate simple examples to get them started with.
Short presentation on how to start performance testing. Going from phase of requirement gathering to the writing your first test scripts and using performance test tool. Glance over JMeter - open source performance testing tool.