This document summarizes interviews conducted with web archiving practitioners about their experiences curating COVID-19 collections. The interviews covered which content was included in the collections, how long collecting was continued, challenges faced, and how the content was made accessible for research. Regarding content, practitioners focused on their national domains and languages while trying to capture a global phenomenon. Collecting continued until the WHO declared an end to the pandemic. Challenges included technical and resource limitations as well as defining the national scope on social media. Collections were made accessible online or on-site with some restrictions. The document provides insights into early pandemic web archiving practices across multiple countries and organizations.
Introduction to British Library digital resources for social scientistsjohnkayebl
John Kaye and Peter Webster from the British Library gave a presentation on December 7th 2012 about the library's digital resources for social scientists. They discussed the library's collections in various formats, online services and resources relevant to social sciences like the ESRC online resource and oral history collections. They also covered the UK Web Archive and web archiving strategies.
The British Library aims to make its vast collection accessible through digitization and partnerships. It contains materials in all formats across all subjects from the UK and beyond. The Library's vision is to provide a "one stop shop" for digital content through initiatives like digitizing manuscripts and newspapers, partnering on projects like EthOS and Digi Islam, and creating a digital library program to preserve digital materials forever. The Library hopes to work with other institutions to improve discovery and access to knowledge through innovative new systems and services.
IFLA ARL Webinar Series: Academic Library Services during Covid 19IFLAAcademicandResea
Slides used by speakers at the IFLA ARL Webinar, Academic Library Services during COVID-19, held on 22 July 2020. The Webinar features 10 speakers from around the world, who share their institutional and national experiences during this COVID 19 period.
Reaching People - the new National Library of Scotland Strategy 2020-2025CILIPScotland
The document outlines the National Library of Scotland's new 2020-2025 strategy to better connect with audiences through five strategic priorities: safeguarding collections, improving access, engaging audiences, supporting learning and research, and developing the organization. The strategy aims to make the library more inclusive, responsive, and relevant for current and future generations through initiatives like expanding digital access, community outreach, and partnerships.
Diving into Digital: Small Steps and Big Returns in Digital Archivingdri_ireland
Keynote address by Dr. Natalie Harrower, Director of the Digital Repository of Ireland, to the Forum des Archivistes Nationaux (FAN) meeting in Abu Dhabi, May 2019.
Through a new Audiovisual Think Tank, visionary experts in the AV cultural heritage sector are working together to map out our shared strategic priorities and put into place a research and action agenda to shape the coming decade. The AV Think Tank looks to represent major AV archives and digital cultural heritage professionals from across the globe and closely connects these key players to work collectively at the forefront of the sector in consultation with the wider community. Initiated and actively supported by Sound and Vision, the AV Think Tank aims to lay the groundwork for an AV archiving sector that enables more long-term use of, learning with, and education through AV materials.
Connected Open Heritage - John Andersson; Executive Director, Wikimedia SverigeRCAHMW
Gwella strwythur a chwiliadwyedd gwybodaeth am dreftadaeth ddiwylliannol ddisymud ledled y byd yw nod y prosiect Treftadaeth Agored Gysylltiedig. Buom yn gweithio i gynnwys gwybodaeth wedi’i chyd-destunoli ar Wikipedia; delweddau newydd a hanesyddol ar Wikimedia Commons; a data strwythuredig ar Wikidata.
Yn ystod y cyflwyniad byddwn ni’n siarad am rai o’r heriau a’r gwersi a ddysgwyd wrth brosesu casgliad unigryw o setiau ddata o 50 o wledydd, gweithio i ryddhau setiau data newydd, darganfod ffyrdd o gadw’r casgliad yn gyfoes, a chreu methodolegau i’w gwneud hi’n bosibl i roi’r data ar Wikidata ar ffurf safonedig a strwythuredig.
Improving the structure and searchability of immovable cultural heritage information from around the world is what the Connected Open Heritage project has been trying to achieve. We have worked to include contextualized information on Wikipedia; new and historical images on Wikimedia Commons; and structured data on Wikidata.
During the presentation we will talk about some of the challenges and lessons gathered while working with processing a world-unique collection of datasets from 50 countries, working to release new datasets, finding ways to keep it up-to-date, and creating the methodologies for the data to be added to Wikidata in a standardized and structured form.
The document summarizes meetings of the High Level Expert Group on Digital Libraries. It discusses the following:
1) At its 5th meeting, the group adopted final reports on digital preservation, orphan works, and public-private partnerships. A memorandum of understanding on orphan works was also signed.
2) At its 4th meeting, the group discussed progress on launching a European digital library and endorsed a report on public-private partnerships and access to scientific data.
3) At its 3rd meeting, the group endorsed an advisory report on copyright issues and discussed access to scientific research and public-private collaboration.
Introduction to British Library digital resources for social scientistsjohnkayebl
John Kaye and Peter Webster from the British Library gave a presentation on December 7th 2012 about the library's digital resources for social scientists. They discussed the library's collections in various formats, online services and resources relevant to social sciences like the ESRC online resource and oral history collections. They also covered the UK Web Archive and web archiving strategies.
The British Library aims to make its vast collection accessible through digitization and partnerships. It contains materials in all formats across all subjects from the UK and beyond. The Library's vision is to provide a "one stop shop" for digital content through initiatives like digitizing manuscripts and newspapers, partnering on projects like EthOS and Digi Islam, and creating a digital library program to preserve digital materials forever. The Library hopes to work with other institutions to improve discovery and access to knowledge through innovative new systems and services.
IFLA ARL Webinar Series: Academic Library Services during Covid 19IFLAAcademicandResea
Slides used by speakers at the IFLA ARL Webinar, Academic Library Services during COVID-19, held on 22 July 2020. The Webinar features 10 speakers from around the world, who share their institutional and national experiences during this COVID 19 period.
Reaching People - the new National Library of Scotland Strategy 2020-2025CILIPScotland
The document outlines the National Library of Scotland's new 2020-2025 strategy to better connect with audiences through five strategic priorities: safeguarding collections, improving access, engaging audiences, supporting learning and research, and developing the organization. The strategy aims to make the library more inclusive, responsive, and relevant for current and future generations through initiatives like expanding digital access, community outreach, and partnerships.
Diving into Digital: Small Steps and Big Returns in Digital Archivingdri_ireland
Keynote address by Dr. Natalie Harrower, Director of the Digital Repository of Ireland, to the Forum des Archivistes Nationaux (FAN) meeting in Abu Dhabi, May 2019.
Through a new Audiovisual Think Tank, visionary experts in the AV cultural heritage sector are working together to map out our shared strategic priorities and put into place a research and action agenda to shape the coming decade. The AV Think Tank looks to represent major AV archives and digital cultural heritage professionals from across the globe and closely connects these key players to work collectively at the forefront of the sector in consultation with the wider community. Initiated and actively supported by Sound and Vision, the AV Think Tank aims to lay the groundwork for an AV archiving sector that enables more long-term use of, learning with, and education through AV materials.
Connected Open Heritage - John Andersson; Executive Director, Wikimedia SverigeRCAHMW
Gwella strwythur a chwiliadwyedd gwybodaeth am dreftadaeth ddiwylliannol ddisymud ledled y byd yw nod y prosiect Treftadaeth Agored Gysylltiedig. Buom yn gweithio i gynnwys gwybodaeth wedi’i chyd-destunoli ar Wikipedia; delweddau newydd a hanesyddol ar Wikimedia Commons; a data strwythuredig ar Wikidata.
Yn ystod y cyflwyniad byddwn ni’n siarad am rai o’r heriau a’r gwersi a ddysgwyd wrth brosesu casgliad unigryw o setiau ddata o 50 o wledydd, gweithio i ryddhau setiau data newydd, darganfod ffyrdd o gadw’r casgliad yn gyfoes, a chreu methodolegau i’w gwneud hi’n bosibl i roi’r data ar Wikidata ar ffurf safonedig a strwythuredig.
Improving the structure and searchability of immovable cultural heritage information from around the world is what the Connected Open Heritage project has been trying to achieve. We have worked to include contextualized information on Wikipedia; new and historical images on Wikimedia Commons; and structured data on Wikidata.
During the presentation we will talk about some of the challenges and lessons gathered while working with processing a world-unique collection of datasets from 50 countries, working to release new datasets, finding ways to keep it up-to-date, and creating the methodologies for the data to be added to Wikidata in a standardized and structured form.
The document summarizes meetings of the High Level Expert Group on Digital Libraries. It discusses the following:
1) At its 5th meeting, the group adopted final reports on digital preservation, orphan works, and public-private partnerships. A memorandum of understanding on orphan works was also signed.
2) At its 4th meeting, the group discussed progress on launching a European digital library and endorsed a report on public-private partnerships and access to scientific data.
3) At its 3rd meeting, the group endorsed an advisory report on copyright issues and discussed access to scientific research and public-private collaboration.
The minutes from the ALISS AGM on July 1st are summarized. Sally Patalong was re-elected as chair. The committee and finances were reported on. Events from the past year were recapped, including conferences, visits, and workshops. Membership increased slightly from 122 to 130 members. Upcoming activities include a workshop on crowd sourcing and library visits.
Digital Cultural Heritage: Experiences from British LibraryNora McGregor
Slides from seminar on Digital Cultural Heritage given to UCL Institute of Sustainable Heritage's two programmes: the MSc Sustainable Heritage and the MRes Science and Engineering in Arts, Heritage and Archaeology.
Slides from seminar on Digital Cultural Heritage given to UCL Institute of Sustainable Heritage's two programmes: the MSc Sustainable Heritage and the MRes Science and Engineering in Arts, Heritage and Archaeology.
A presentation about the JISC Mass Digitization project "Rhyfel Byd 1914-1918 a’r profiad Cymreig / Welsh experience of World War One 1914-1918". Talk at the Strategic Content Alliance World War One roundtable meeting, 27th March 2012.
By Dr. Petra Hauke,
IFLA Environment, Sustainability and Libraries Section (ENSULIB)
Netzwerk Grüne Bibliothek (German Green Library Network)
Presented at the ENSULIB Satellite Meeting in Cork, Ireland, July 2022
This document summarizes a training provided by the British Library on digital scholarship. It discusses how the British Library is digitizing its vast collection to make it more accessible online. It also describes the Digital Scholarship Training Programme launched in 2012 to train library staff on digital tools and methods. Key courses covered topics like digitization, data visualization, crowdsourcing, and programming. The training aims to empower staff to innovatively use digital resources and collaborate on projects.
Presentation given at Digital Humanities Research Colloquium, 18 October 2017.
Abstract: On 11 October 2017 UCC's Office of the Vice President for Teaching & Learning hosted a #nextgenspaces learning event. One of the speakers, Prof. Stephen Heppell, noted the following: ‘next generation is here and it means business’ and ‘students are going to a world of surprises.’ With these points in mind how do libraries prepare for the library of the future? In this presentation I discuss traditional expertise and how this is adapted for the uncharted territories of the future.
The British Library is the national library of the United Kingdom. It contains over 170 million items across all formats, including books, journals, newspapers, maps, stamps, sound recordings, patents, and websites archived from the UK domain. The library receives a copy of all publications produced in the UK and Ireland. It has extensive digital collections amounting to over 1 petabyte of data. The library is actively involved in digitization projects and uses digital methods like optical character recognition, crowdsourcing, and computational analysis to study and enrich its collections.
Social Contract Archaeology: a business case for the futureDigVentures Ltd
This document discusses how crowdfunding and crowdsourcing can help address challenges facing archaeology due to budget cuts. It presents DigVentures and the Flag Fen Lives project as examples. Flag Fen Lives was the first crowdfunded archaeological excavation in Europe, raising over £27k. It provided training, satisfied archaeological objectives, and increased visitor numbers at the Flag Fen site. The project engaged contributors through varied participation levels from social media to on-site involvement. It demonstrated how a value-led, socially contracted approach can benefit archaeology's triple bottom line of economic, social and environmental value.
The OpenGLAM community: promoting free & open access to digital cultural heritage | Lieke Ploeger, Open Knowledge Foundation at http://books2ebooks.eu/eod2014
Stewardship of the Digital Scholarly Record and Digital Published HeritageNASIG
This presentation discusses how The Keepers Registry and the network of Keepers is attempting to tackle the issue of digital preservation for electronic serials specifically. First identifying the scope of the problem being addressed, it moves on to the successes, in preservation and in measuring that preservation, before moving on to the challenges still to be surmounted. It touches upon some of the specific cases on which this preservation is focussing, including legal deposit and regional library consortia, as well as engagement with OA journals. It finishes with the broader plan of action to help allow the Keepers to accomplish their digital preservation goals, laid out in the statement they issued last August, calling upon all stakeholders in the world of scholarly communication, notably both publishers and research libraries, and setting actions they can take to help in this mission.
Speaker: Ted Westervelt, Library of Congress
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...Neil Beagrie
“Digital Preservation: Setting the Course for a Decade of Change” , a conference keynote from 2007, available now on Slideshare is the ninth of 12 presentations I’ve selected to mark 20 years in Digital Preservation. The remainder will be published at monthly intervals over 2015.
This presentation was the opening keynote to a conference in 2007 held by the Belgian Association of Documentation (BDA) to celebrate its 60th anniversary. It dates from my time at the British Library.
The conference theme was "Europe facing the challenge of the long term conservation of digitalised archives". My keynote synthesised many of the topics I was focussing on at the time (and have featured in some of my earlier slide shares in this series) including encouraging University libraries to engage more actively with research data management in the sciences, to begin developing digital special collections of individuals, and to support international efforts to ensure continuing access and preservation of e-Journals as part of the scholarly record. In addition, given the European focus I briefly covered some of the major European initiatives in digital preservation at that time.
I have selected this presentation as one of the 12 in this series, not only as it is synthesising these key themes but also because it includes some thoughts on whether digital preservation needed to be evolution or revolution (or a bit of both) for libraries and archives.
The National Library of Sweden has collected printed material since 1661 and also collects TV, radio, movies, music and games. It began legally collecting websites in 2015 but current laws do not fully cover the changing digital landscape. Interactive content and material on international platforms are not included. The library conducted a study on revising legal deposit acts to better address digital and interactive media. It collected social media, websites and other election material from Swedish political parties and leaders for the 2018 election to learn from implementing collection of digital content. Experiences from this project will inform the library's final report on revising legal deposit laws.
Collaborative Communities - Cross Sectoral Library Initiatives in ScotlandCILIP Ireland
This document discusses collaborative library initiatives in Scotland across various sectors. It highlights that there are 492 public libraries, 19 higher education institution libraries, and over 2,400 school libraries in Scotland. Partnerships have been formed between these different library sectors through organizations like CILIP in Scotland to advocate for libraries and conduct staff training. Recent initiatives include a national school library strategy, innovation funding awards, and partnerships between universities and public libraries to share collections and facilities.
The document discusses the partnership between The Children's Museum of Indianapolis and Wikimedia to improve Wikipedia articles related to the museum. It provides examples of other museums that have participated in Wikipedian in Residence programs, including the British Museum, Smithsonian Institution, and National Archives. It outlines the benefits of collaborating with Wikipedia, such as reaching wider audiences, engaging volunteers, and improving online information. Strategies discussed include hosting edit-a-thons, backstage tours, student programs, multilingual projects using QR codes, and connecting with educators. Next steps mentioned are supporting the GLAM-Wiki community and diversifying engagement.
John Scally: The National Library of Scotland: A future vision for allCILIPScotland
The document summarizes a presentation given at the CILIPS Scotland Conference in Dundee on June 1, 2015 about the National Library of Scotland's future vision. It discusses the library's origins, collections of over 24 million items, legal deposit functions, digital collections, usage statistics, funding challenges, and strategic priorities for 2015-2020 which include preserving collections, increasing digital access, encouraging research, education, public engagement, and developing the library as a destination.
The Digital Research & Curator Team at the British Library was formed in 2010 to support digital scholarship. Their mission is to develop innovative models for digital scholarship using digital content and technologies. Some of their main activities include staff training, promoting digital scholarship at the library, curating digital research data, and engaging with users. They offer various training courses, organize discussions on digital topics, and support digital collections and services at the library.
Gauditz & Kunze, Web archives as research data FINAL.pptxWARCnet
(1) The document discusses the challenges of archiving research data from a sociological study of COVID-19 protests in Germany. As protests have moved online, archiving now requires web archiving but the small archives lacks resources and expertise for this.
(2) The study collected qualitative data like screenshots and videos from social media to understand participants' views and connections. This creates issues around consent, preservation formats, and technical support.
(3) Future research should involve archives earlier to address legal and data management requirements to ensure data can be preserved and accessed long-term. An interdisciplinary dialogue is needed to solve open questions around archiving digital research data.
Gauditz & Kunze, Web archives as research data FINAL.pptxWARCnet
The document discusses the challenges and opportunities at the intersection of social science research and archiving. It describes a research project on societal norms in COVID protests that collected data from social media sites. This raised issues for archiving the data long-term due to the variety of file formats, changing consent over time, and technical challenges. The researchers and archives are learning lessons about legal/ethical concerns, research data management practices, and the need for interdisciplinary collaboration between social scientists and archivists.
More Related Content
Similar to 20221018_Panel_Covid_WARCnet_closing_conference.pdf
The minutes from the ALISS AGM on July 1st are summarized. Sally Patalong was re-elected as chair. The committee and finances were reported on. Events from the past year were recapped, including conferences, visits, and workshops. Membership increased slightly from 122 to 130 members. Upcoming activities include a workshop on crowd sourcing and library visits.
Digital Cultural Heritage: Experiences from British LibraryNora McGregor
Slides from seminar on Digital Cultural Heritage given to UCL Institute of Sustainable Heritage's two programmes: the MSc Sustainable Heritage and the MRes Science and Engineering in Arts, Heritage and Archaeology.
Slides from seminar on Digital Cultural Heritage given to UCL Institute of Sustainable Heritage's two programmes: the MSc Sustainable Heritage and the MRes Science and Engineering in Arts, Heritage and Archaeology.
A presentation about the JISC Mass Digitization project "Rhyfel Byd 1914-1918 a’r profiad Cymreig / Welsh experience of World War One 1914-1918". Talk at the Strategic Content Alliance World War One roundtable meeting, 27th March 2012.
By Dr. Petra Hauke,
IFLA Environment, Sustainability and Libraries Section (ENSULIB)
Netzwerk Grüne Bibliothek (German Green Library Network)
Presented at the ENSULIB Satellite Meeting in Cork, Ireland, July 2022
This document summarizes a training provided by the British Library on digital scholarship. It discusses how the British Library is digitizing its vast collection to make it more accessible online. It also describes the Digital Scholarship Training Programme launched in 2012 to train library staff on digital tools and methods. Key courses covered topics like digitization, data visualization, crowdsourcing, and programming. The training aims to empower staff to innovatively use digital resources and collaborate on projects.
Presentation given at Digital Humanities Research Colloquium, 18 October 2017.
Abstract: On 11 October 2017 UCC's Office of the Vice President for Teaching & Learning hosted a #nextgenspaces learning event. One of the speakers, Prof. Stephen Heppell, noted the following: ‘next generation is here and it means business’ and ‘students are going to a world of surprises.’ With these points in mind how do libraries prepare for the library of the future? In this presentation I discuss traditional expertise and how this is adapted for the uncharted territories of the future.
The British Library is the national library of the United Kingdom. It contains over 170 million items across all formats, including books, journals, newspapers, maps, stamps, sound recordings, patents, and websites archived from the UK domain. The library receives a copy of all publications produced in the UK and Ireland. It has extensive digital collections amounting to over 1 petabyte of data. The library is actively involved in digitization projects and uses digital methods like optical character recognition, crowdsourcing, and computational analysis to study and enrich its collections.
Social Contract Archaeology: a business case for the futureDigVentures Ltd
This document discusses how crowdfunding and crowdsourcing can help address challenges facing archaeology due to budget cuts. It presents DigVentures and the Flag Fen Lives project as examples. Flag Fen Lives was the first crowdfunded archaeological excavation in Europe, raising over £27k. It provided training, satisfied archaeological objectives, and increased visitor numbers at the Flag Fen site. The project engaged contributors through varied participation levels from social media to on-site involvement. It demonstrated how a value-led, socially contracted approach can benefit archaeology's triple bottom line of economic, social and environmental value.
The OpenGLAM community: promoting free & open access to digital cultural heritage | Lieke Ploeger, Open Knowledge Foundation at http://books2ebooks.eu/eod2014
Stewardship of the Digital Scholarly Record and Digital Published HeritageNASIG
This presentation discusses how The Keepers Registry and the network of Keepers is attempting to tackle the issue of digital preservation for electronic serials specifically. First identifying the scope of the problem being addressed, it moves on to the successes, in preservation and in measuring that preservation, before moving on to the challenges still to be surmounted. It touches upon some of the specific cases on which this preservation is focussing, including legal deposit and regional library consortia, as well as engagement with OA journals. It finishes with the broader plan of action to help allow the Keepers to accomplish their digital preservation goals, laid out in the statement they issued last August, calling upon all stakeholders in the world of scholarly communication, notably both publishers and research libraries, and setting actions they can take to help in this mission.
Speaker: Ted Westervelt, Library of Congress
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...Neil Beagrie
“Digital Preservation: Setting the Course for a Decade of Change” , a conference keynote from 2007, available now on Slideshare is the ninth of 12 presentations I’ve selected to mark 20 years in Digital Preservation. The remainder will be published at monthly intervals over 2015.
This presentation was the opening keynote to a conference in 2007 held by the Belgian Association of Documentation (BDA) to celebrate its 60th anniversary. It dates from my time at the British Library.
The conference theme was "Europe facing the challenge of the long term conservation of digitalised archives". My keynote synthesised many of the topics I was focussing on at the time (and have featured in some of my earlier slide shares in this series) including encouraging University libraries to engage more actively with research data management in the sciences, to begin developing digital special collections of individuals, and to support international efforts to ensure continuing access and preservation of e-Journals as part of the scholarly record. In addition, given the European focus I briefly covered some of the major European initiatives in digital preservation at that time.
I have selected this presentation as one of the 12 in this series, not only as it is synthesising these key themes but also because it includes some thoughts on whether digital preservation needed to be evolution or revolution (or a bit of both) for libraries and archives.
The National Library of Sweden has collected printed material since 1661 and also collects TV, radio, movies, music and games. It began legally collecting websites in 2015 but current laws do not fully cover the changing digital landscape. Interactive content and material on international platforms are not included. The library conducted a study on revising legal deposit acts to better address digital and interactive media. It collected social media, websites and other election material from Swedish political parties and leaders for the 2018 election to learn from implementing collection of digital content. Experiences from this project will inform the library's final report on revising legal deposit laws.
Collaborative Communities - Cross Sectoral Library Initiatives in ScotlandCILIP Ireland
This document discusses collaborative library initiatives in Scotland across various sectors. It highlights that there are 492 public libraries, 19 higher education institution libraries, and over 2,400 school libraries in Scotland. Partnerships have been formed between these different library sectors through organizations like CILIP in Scotland to advocate for libraries and conduct staff training. Recent initiatives include a national school library strategy, innovation funding awards, and partnerships between universities and public libraries to share collections and facilities.
The document discusses the partnership between The Children's Museum of Indianapolis and Wikimedia to improve Wikipedia articles related to the museum. It provides examples of other museums that have participated in Wikipedian in Residence programs, including the British Museum, Smithsonian Institution, and National Archives. It outlines the benefits of collaborating with Wikipedia, such as reaching wider audiences, engaging volunteers, and improving online information. Strategies discussed include hosting edit-a-thons, backstage tours, student programs, multilingual projects using QR codes, and connecting with educators. Next steps mentioned are supporting the GLAM-Wiki community and diversifying engagement.
John Scally: The National Library of Scotland: A future vision for allCILIPScotland
The document summarizes a presentation given at the CILIPS Scotland Conference in Dundee on June 1, 2015 about the National Library of Scotland's future vision. It discusses the library's origins, collections of over 24 million items, legal deposit functions, digital collections, usage statistics, funding challenges, and strategic priorities for 2015-2020 which include preserving collections, increasing digital access, encouraging research, education, public engagement, and developing the library as a destination.
The Digital Research & Curator Team at the British Library was formed in 2010 to support digital scholarship. Their mission is to develop innovative models for digital scholarship using digital content and technologies. Some of their main activities include staff training, promoting digital scholarship at the library, curating digital research data, and engaging with users. They offer various training courses, organize discussions on digital topics, and support digital collections and services at the library.
Similar to 20221018_Panel_Covid_WARCnet_closing_conference.pdf (20)
Gauditz & Kunze, Web archives as research data FINAL.pptxWARCnet
(1) The document discusses the challenges of archiving research data from a sociological study of COVID-19 protests in Germany. As protests have moved online, archiving now requires web archiving but the small archives lacks resources and expertise for this.
(2) The study collected qualitative data like screenshots and videos from social media to understand participants' views and connections. This creates issues around consent, preservation formats, and technical support.
(3) Future research should involve archives earlier to address legal and data management requirements to ensure data can be preserved and accessed long-term. An interdisciplinary dialogue is needed to solve open questions around archiving digital research data.
Gauditz & Kunze, Web archives as research data FINAL.pptxWARCnet
The document discusses the challenges and opportunities at the intersection of social science research and archiving. It describes a research project on societal norms in COVID protests that collected data from social media sites. This raised issues for archiving the data long-term due to the variety of file formats, changing consent over time, and technical challenges. The researchers and archives are learning lessons about legal/ethical concerns, research data management practices, and the need for interdisciplinary collaboration between social scientists and archivists.
2022 Visit Royal Danish Library Ditte Laursen.pdfWARCnet
The Royal Danish Library collects, preserves, and provides access to Denmark's cultural heritage in physical and digital forms. It supports research at several Danish universities and supplements services offered by public libraries. As a research institution, it increases knowledge of Danish cultural heritage. The library collects legal deposit of broadcasts, e-books, newspapers, music, videos, and websites on .dk domains or those targeting Danish audiences. Its digital collections include over 9 petabytes of data collected daily, including over 1.3 million registered .dk websites archived since the 1990s. The library provides search of its web archive and datasets for research.
20221015 introduction to panel Ditte Laursen.pdfWARCnet
This document discusses challenges and successes in creating and working with web archives as data sources. It includes three presentations on projects analyzing web archives from the Danish web. The Danish web archive collects Danish websites under the country's legal deposit legislation, totaling over 878 terabytes captured from 2000 to 2021. Working with web archives presents challenges for both librarians managing the archives and researchers analyzing the data. Effective collaboration between libraries and universities is needed to address these challenges and make the most of web archives as a research infrastructure.
The document summarizes a collaboration between several UK universities to create an archive of health-related websites from the UK domain. The project aims to collect 10,000 targets, explore ethical collection practices, and increase use of web archives in research. It discusses challenges around defining health information, describing content, addressing privacy and consent issues, and facilitating responsible research access. The collaboration is led by the National Library of Scotland and involves archivists, analysts and engineers from Edinburgh, Oxford and Cambridge universities.
WARCnet conference - Mapping social media archiving initiatives.pptxWARCnet
The document summarizes research on mapping social media archiving initiatives. It finds that most initiatives originate from national libraries and archives, with a geographical imbalance reflecting economic and political divides. While most initiatives started between 2010-2020, gaps remain in archived platforms and completeness due to challenges in collecting certain platforms like Facebook and underrepresentation of non-English language and non-Western initiatives. The researcher argues for greater awareness of these gaps and considering how outsider initiatives could help broaden collections.
The document summarizes research analyzing how streaming services have evolved online and been archived by the Danish web archive. It presents methods used to analyze terms related to streaming from 2003-2021, finding over 100 million documents for "stream*" in Danish and English. It also maps gaps in collecting 50 prominent Danish streaming services by searching their websites and app store pages in the archive. The research validates the library's goal to update tools to better collect content from streaming aggregators and individual publishers.
Maemura_WARCnet_Developing Datasheets for Archived Web Datasets.pdfWARCnet
The document discusses the results of a study on the effects of a new drug on memory and cognitive function in older adults. The double-blind study involved giving either the new drug or a placebo to 100 volunteers aged 65-80 over a 6 month period. Testing showed those receiving the drug experienced statistically significant improvements in short-term memory retention and processing speed compared to the placebo group.
The document shows images of three major libraries in Australia: the National Library of Australia, the Mitchell Library which is part of the State Library of New South Wales, and the State Library Victoria.
Millward - We cannot put this off any longer - upload.pptxWARCnet
This document discusses the use of web archives in historical research. It notes that while historians have been slow to adopt web archives as sources, literature on using web archives directly is growing. However, more work needs to be done using web archives as one source among many to study history. The document presents some examples of how web archives could be used to study the history of British social security and medical certification. It also discusses some of the methodological and ethical challenges of using private information from web archives in historical research.
Reporting from a Short-Term Network Stay at the BnF and INAWARCnet
The document summarizes the author's short-term network stay at the Bibliothèque national de France (BnF) and Institut National de l’Audiovisuel (INA) from 19-22 April 2022. At the BnF, the author learned about their digital legal deposit process and tools for collecting web content and social media, including Twitter, Instagram, YouTube, and TikTok. At the INA, the author saw their collection of over 16,000 websites and social media accounts, and their custom browsing tool WebMédia. Key themes discussed were the challenges of archiving ever-changing social media and limitations of access to platform data. The visit provided valuable insights into social media
The Working Group 6 meeting discussed the future of the WARCnet network after 2022/23 and two scenarios for sustaining it:
1. Sustaining the network without funding by establishing a management committee, international advisory board, annual member fees, and holding biannual meetings but providing no reimbursement for travel.
2. Keeping the network active with funding by exploring grant opportunities such as RI-STAR, COST action, and national networking schemes and asking university research offices about network grants. Next steps included mapping research projects and one-on-one consultations on research support.
The WARCnet Code Book of web archive data formatsWARCnet
The document discusses two projects of Working Group 5, which aims to discuss and formulate data formats for web archive data. Project 1 focuses on developing a shared data vocabulary for requesting archived web data. It provides examples of data requests, including for the contents of web archives, seed URLs and crawl policies, links within archived pages, and metadata on datasets. Project 2 involves creating a glossary of terms used in web archive research. The next steps outlined are to collect existing relevant vocabularies, hold events to identify terms and definitions, and develop the glossary further.
This document discusses using WebDriver to scrape websites semi-automatically by controlling a browser. WebDriver allows controlling a browser programmatically to load pages, interact with elements, and extract data. While slower than other scraping methods, it can handle dynamic sites better. The example application uses WebDriver to scrape defaced website mirrors, taking screenshots to quickly scan captures. It opens the browser, loads pages, and increments IDs to continue scraping until reaching a target or being banned for the day.
The document outlines discussions from three groups at a workshop. Group 1 discussed ways to keep a research network active through activities like workshops, training, small grants, and continuing collaboration. Group 2 identified potential funding sources for future work, including various European grants and fellowships. Group 3 brainstormed research topics the network could explore, such as methodologies for digital infrastructure, social media, museums, and issues around diversity, news, and conspiracies.
The document discusses a data wrangling experiment to create datasets from the Rijksmuseum collection and web archive data for research purposes. A group of researchers from different universities aim to develop standardized code books and controlled vocabularies to structure the data and enable interlinking across collections. They discuss techniques like SPARQL and identifiers in Wikidata to retrieve and organize machine-readable data for future studies of body postures in artworks and web archives.
What’s in a URL? Analysing COVID-19 web archive collectionsWARCnet
The document discusses the scope and quality of datasets from various national web archives that collected COVID-19 related web content. It finds that most collections had a national focus, except for the International Internet Preservation Consortium (IIPC) collection. However, the data is incomplete, with only 6 of 44 European countries represented. It also finds limitations in using top-level domains to determine country of origin. More data is needed to gain further insights into cultural biases and the international representation in these archived collections.
This document outlines the work of WG2 on transnational events. It lists the participants and their affiliations with various libraries. It then lists the sections to be covered, including the reasons for the collection, its scope, framing, accessibility, partnerships and uses. It provides an analysis of various presentations, workshops, datathons and interviews conducted by the group members on topics like the COVID-19 crisis and how information spreads virally. It also references ongoing and future projects between members' institutions on analyzing pandemic-related web archives.
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPRAHUL
This Dissertation explores the particular circumstances of Mirzapur, a region located in the
core of India. Mirzapur, with its varied terrains and abundant biodiversity, offers an optimal
environment for investigating the changes in vegetation cover dynamics. Our study utilizes
advanced technologies such as GIS (Geographic Information Systems) and Remote sensing to
analyze the transformations that have taken place over the course of a decade.
The complex relationship between human activities and the environment has been the focus
of extensive research and worry. As the global community grapples with swift urbanization,
population expansion, and economic progress, the effects on natural ecosystems are becoming
more evident. A crucial element of this impact is the alteration of vegetation cover, which plays a
significant role in maintaining the ecological equilibrium of our planet.Land serves as the foundation for all human activities and provides the necessary materials for
these activities. As the most crucial natural resource, its utilization by humans results in different
'Land uses,' which are determined by both human activities and the physical characteristics of the
land.
The utilization of land is impacted by human needs and environmental factors. In countries
like India, rapid population growth and the emphasis on extensive resource exploitation can lead
to significant land degradation, adversely affecting the region's land cover.
Therefore, human intervention has significantly influenced land use patterns over many
centuries, evolving its structure over time and space. In the present era, these changes have
accelerated due to factors such as agriculture and urbanization. Information regarding land use and
cover is essential for various planning and management tasks related to the Earth's surface,
providing crucial environmental data for scientific, resource management, policy purposes, and
diverse human activities.
Accurate understanding of land use and cover is imperative for the development planning
of any area. Consequently, a wide range of professionals, including earth system scientists, land
and water managers, and urban planners, are interested in obtaining data on land use and cover
changes, conversion trends, and other related patterns. The spatial dimensions of land use and
cover support policymakers and scientists in making well-informed decisions, as alterations in
these patterns indicate shifts in economic and social conditions. Monitoring such changes with the
help of Advanced technologies like Remote Sensing and Geographic Information Systems is
crucial for coordinated efforts across different administrative levels. Advanced technologies like
Remote Sensing and Geographic Information Systems
9
Changes in vegetation cover refer to variations in the distribution, composition, and overall
structure of plant communities across different temporal and spatial scales. These changes can
occur natural.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
How to Make a Field Mandatory in Odoo 17Celine George
In Odoo, making a field required can be done through both Python code and XML views. When you set the required attribute to True in Python code, it makes the field required across all views where it's used. Conversely, when you set the required attribute in XML views, it makes the field required only in the context of that particular view.
Reimagining Your Library Space: How to Increase the Vibes in Your Library No ...Diana Rendina
Librarians are leading the way in creating future-ready citizens – now we need to update our spaces to match. In this session, attendees will get inspiration for transforming their library spaces. You’ll learn how to survey students and patrons, create a focus group, and use design thinking to brainstorm ideas for your space. We’ll discuss budget friendly ways to change your space as well as how to find funding. No matter where you’re at, you’ll find ideas for reimagining your space in this session.
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxEduSkills OECD
Iván Bornacelly, Policy Analyst at the OECD Centre for Skills, OECD, presents at the webinar 'Tackling job market gaps with a skills-first approach' on 12 June 2024
This document provides an overview of wound healing, its functions, stages, mechanisms, factors affecting it, and complications.
A wound is a break in the integrity of the skin or tissues, which may be associated with disruption of the structure and function.
Healing is the body’s response to injury in an attempt to restore normal structure and functions.
Healing can occur in two ways: Regeneration and Repair
There are 4 phases of wound healing: hemostasis, inflammation, proliferation, and remodeling. This document also describes the mechanism of wound healing. Factors that affect healing include infection, uncontrolled diabetes, poor nutrition, age, anemia, the presence of foreign bodies, etc.
Complications of wound healing like infection, hyperpigmentation of scar, contractures, and keloid formation.
1. The online archives of COVID-19
An oral history of born-digital collecting practices during the pandemic
WARCnet Closing Conference
Aarhus, 18 October 2022
Friedel Geeraert, Jane Winters, Nicola Bingham, Niels Brügger, Frédéric Clavert, Sophie Gebeil, Federico Nanni,
Caroline Nyvang, Valérie Schafer, Helle Strandgaard Jensen, Karin de Wild
2. Behind the scenes of born-digital COVID-19 collections
WARCnet Closing Conference
Aarhus, 18 October 2022
Friedel Geeraert
3. • In-depth interviews with web archiving practitioners about curating
COVID-19 collections
• Denmark, France, Hungary, Iceland, Luxembourg, Switzerland, the UK,
The Netherlands, the US, Australia, Singapore and the IIPC
collaborative collection
4. ● Which content is included?
○ How do you delimit a global phenomenon on the national level?
● How long is the collecting continued?
● What were the biggest challenges?
● How is the content made accessible for research?
5. Which
content?
How do you archive a global event on a national level?
• Focus on the national domain (.nl, .fr, .dk, …)
• Focus on the national language(s)
• Information about the country
• Information published by national citizens/organisations
• Information published in the country
7. Websites Twitter Facebook Instagram YouTube Tiktok Podcasts Reddit Twitch
Swiss national library
National library of Iceland
National Library of Luxembourg
National Library of France
National Library of Hungary
Royal Danish Library
IIPC collaborative collection
UK Web Archive
Institut national de l’Audiovisuel
(FR)
Royal Library of The Netherlands
Library of Congress
National Library Board (Singapore)
National Library of Australia
Archived Not archived
8.
9. WHEN?
January -
National
Library of
France (topical
news series)
26/01 -Royal
Danish Library
(cartoon)
13/02 - IIPC collaborative
collection
29/02 – Royal Library of The
Netherlands
February
● National Library of
Hungary
● National Library Board
(Singapore)
● National Library of
Australia
Early March - UK Web Archive
06/03 - Swiss National Library
12/03 - Royal Danish Library
13/03 - Institut national de
l’Audiovisuel
16/03 - National Library of France
16/03 - National Library of Luxembourg
24/03 - Library of Congress
Late March - Icelandic national library
JANUARY FEBRUARY MARCH 2020
10. Challenges
• Technical challenges related to capturing certain content
• Financial and human resources limitations
• Defining the national sphere on social media
• Permissions
• Required a sustained effort over several years
• Delimiting the COVID-19 collection since the pandemic touched
upon every aspect of life
• Identifying gaps in collections: overlooked themes / groups
• COVID-denial websites or controversial content
11. Inclusivity
• Limited number of curators leading to subjectivity
• Overlooked themes / groups
• Web archives remain largely unknown
12. Curating within web
archiving team
Internal network of
curators throughout the
institution
External network of
curators
Recommendations from
the public
Swiss national library
National library of Iceland
National Library of Luxembourg
National Library of France
National Library of Hungary
Royal Danish Library
IIPC collaborative collection
UK Web Archive
Institut national de l’Audiovisuel
(FR)
Royal Library of The Netherlands
Library of Congress
National Library Board
(Singapore)
National Library of Australia
Yes No
13. ‘We … launched a call for participation at the end of March and there too
we got a very good response from the media and a few communities that we
would not have otherwise thought of suggested their websites … For
example, the Muslim community and shoura.lu. I hadn't thought of looking
for religious communities. The Muslim community posted online information
and recommendations for its members about services in mosques, religious
holidays, etc. Based on this suggestion, we then looked more closely at other
religious communities.’
Ben Els (Bibliothèque nationale du Luxembourg) interviewed by Valérie Schafer
14. ‘We also collaborated with the Digital Heritage Network in The
Netherlands, together we initiated a call for heritage institutions
to help build a national collection about the coronavirus and its
effects in The Netherlands.’
Peter de Bode (Royal Library of The Netherlands) interviewed by Karin de Wild and Ismini Kyritsis
15. ‘Web archive Switzerland is a collaboration between the Swiss
National Library and 30 Swiss institutions, mostly libraries and
archives.’
Barbara Signori (Swiss National Library) interviewed by Friedel Geeraert
16. ‘The NLA continues to maintain partner arrangements for the
selective component [...] of the larger web archiving program. The
partner organisations include: the state libraries of Victoria, New
South Wales, Queensland, South Australia and Western Australia;
the Library and Archives of the Northern Territory; the Australian
War Memorial and, the National Gallery of Australia.’
Paul Koerbin (National Library of Australia) interviewed by Olga Holownia and Friedel
Geeraert
17. ‘Because there were some subject matter experts that were not on
this team, we did a lot of consultations with experts across the
Library. [...] We did a general analysis after a year or so of collecting
and were able to identify other gaps. Then we’d say: “This month,
or this week, this is the gap area we’re going to focus on, or let’s
focus on these groups”.’
Jennifer Harbster (Library of Congress) interviewed by Olga Holownia and Friedel Geeraert
18. How do you
handle fake
news and
controversial
topics?
• Exclude from collection but keep a record
• Include
• Include but contextualise
19. ‘I was particularly concerned about representativeness,
choosing sites that reflected all viewpoints. That was all
the more important because there were highly
controversial topics like chloroquine …’
David Benoist (National Library of France) interviewed by Sophie Gebeil and Valérie Schafer
20. ‘We’re not collecting them because of their authority; we are
collecting them because they were an example of misinformation.
The solution was to go to one of these big aggregators that listed
them all - NewsGuard - so that is in the archive and so there is
content that represents the misinformation.’
Jennifer Garbster (Library of Congress) interviewed by Olga Holownia and Friedel Geeraert
21. ‘It was quite an extreme right-wing website that had an anti-vaccination
policy [and said] that the coronavirus was a made up pandemic. It looked like
it was quite factual and verified information. …
So the decision was taken to not add this website to the collection, because
there is a risk that somebody might look at that article … [and that] it could
potentially cause danger to health. … We kept a record that the website had
been nominated. We recorded what the content of the website was and why
we decided to not include it in the collection.’
Nicola Bingham (IIPC collection) interviewed by Friedel Geeraert
22. ACCESS
• Collections freely available online
• IIPC collaborative collection
• Icelandic web archive (vefsafn.is)
• National Library of Australia
• Collections partly available online (permission granted), partly on
site
• Library of Congress
• UK Web Archive
• National Library Board of Singapore
• Collections available on site (and in other partner institutions)
• National Library of France + regional libraries
• Institut National de l’Audiovisuel (France) + regional libraries
• Royal Library of The Netherlands
• National Library of Luxembourg
• Swiss National Library
• Royal Danish Library: remote access & data dumps
• Published seed lists / metadata
26. Wearing three hats: the archivist researcher’s perspective on collecting
COVID-19
WARCnet Closing Conference
Aarhus, 18 October 2022
Nicola Bingham
27.
28.
29. What was collected?
● Mainly websites or sections of websites relevant to COVID-19
● Limited social media due to technical difficulties and ethical
concerns
● c. 1000 Twitter accounts of public figures and official agencies
● English language also Welsh and Scottish Gaelic
● Capture frequency is on a site-by-site basis (“one-off”, daily,
weekly, monthly)
● Focus is only on UK due to Legal Deposit Regulations
○ Websites hosted on UK TLDs and published in the UK
○ Extensive news coverage gets global perspective
○ Include diaspora communities e.g. Chinese in the UK
● Collaboration with other archives e.g. IIPC
● When to end collecting?
○ When WHO declares and end to pandemic
○ Continue related collections separately, e.g. Cost-of-Living
Crisis
30. UK Web Archive Covid-19 Collection in use
Datathon undertaken by Working Group 2, January 2021
- Outputs included “Chicken and Egg paper”
- Learning outcomes for the archivist
- The importance of contextual information, such as selection
decisions, where are the gaps and technical limitations, what
couldn’t we archive but wanted to, what can we do with the
data? Legal situation of exporting data.
‘Covid Stories’ Learning resource at British Library
- Creative response to collection items e.g. archived websites,
oral history interviews with NHS workers, broadcast news and
radio
https://cc.au.dk/fileadmin/dac/Projekter/WARCnet/Aasma
n_et_al_Chicken_and_Egg.pdf
31.
32. • 34 IIPC Members contributed nominations
• c.2000 nominations from the public
• Collection scope
• Published information prioritised rather than
social media
• Specify seeds at appropriate section of website
• Deprioritise rich media websites (data budget
concerns)
33. - 13, 855 seeds nominated by IIPC members
- 2,018 seeds nominated by public
- 2,411 = number of crawls run
- 16, 000 = total number of seeds archived
- 72 million = number of documents archived
- 5.8 TB = total data archived
- 180 Top Level Domains
- 70 languages
- 145 countries represented in Collection (see
map)
Link to collection
https://archive-it.org/collections/13529
IIPC CDG Novel Coronavirus
(COVID-19) stats October 2022
34. • IIPC Research Working Group worked with
Bibliothecca Alexandrina to develop services that
offer additional functionality beyond conventional
web archive playback to accommodate research
use cases.
• Solrwayback: Danish Web Archive
• Linkgate: data service, data extraction tool and
visualization front end for scalable temporal graph
visualisation for web archive research. Project
leads: Bibliothecca Alexandrina and National
Library of New Zealand
• AWAC2 (Analysing Web Archives of the COVID
Crisis through the IIPC Novel Coronavirus Dataset)
WARCnet network: Susan Aasman, Niels Brügger,
Frédéric Clavert, Karin De Wild, Sophie Gebeil,
Valérie Schafer, Joshgun Sirajzade
IIPC CDG Novel Coronavirus (COVID-19) researcher use of collections
35. Some highlights
98% of respondents describe themselves as professionals
Library and Archive are the most popular organisation type (82%)
68% are at National organisational level
After the UK (29%), Belgium is the most represented (22%)
Most respondents are Archivists (43%)
27% of respondents have 100-250 members of staff
68% had a COVID-19 special collection of web materials and/or social
media
70% say the initiative came from staff
52% collaborated with partners
72% added descriptive metadata
Scope was determined by subject/theme for 65%
64% have not stopped collecting content
71% collected WARC data
WG2 WARCnet survey
COVID-19 WEB COLLECTIONS
● June-September 2022
● Survey of European GLAM
organisations on Covid-19
collecting
● Analysis and results
forthcoming (2023)
36. Let’s talk about web archiving … three institutions, many possibilities
WARCnet Closing Conference
Aarhus, 18 October 2022
Valérie Schafer
38. •To document the rather “invisible” work, the shadows, hidden infrastructures and agencies
•To document the collections
•To better understand curation of collections, as well as perimeters, geographical coverage,
temporalities, size, teams, inclusiveness, participation…
•To take advantage of the WARCnet project which is gathering researchers and web
archivists, and involve everybody within WG2 (scholars from several countries)
•Already an expertise through the Terrorist Attacks collections + study of governance of Web
Archives + “Do Web archives have politics?” with F. Musiani …
•Special collections are special …
•National specificities
40. …and differences
- scope and perimeters
- experience (and notably in live collections)
- relation to social networks
- formats and methods of crawling (API, IA, …)
- time and organisation of the interview
44. Analysing web archives
- Temporalities to be retrieved
- New content
- Representativeness
- Preparation of a research
- Over-representations and silences
52. •Documentation of hidden infrastructures
•Documentation for future historians
•OH as both a tool and an object of study
(and notably for scalable reading)
53. Reading web archivists’ interviews at a distance
WARCnet Closing Conference
Aarhus, 18 October 2022
Jane Winters (on behalf of Frédéric Clavert)
54.
55.
56.
57.
58.
59.
60.
61. Key
Libraries:
01 Denmark
02 France
03 Hungary
04 IIPC (International)
05 Switzerland
06 UK
07 Iceland
08 Luxembourg
Sections:
01 The reasons for the special
collection
02 The scope of the collection
03 The framing of the collection
04 Accessibility & searchability
05 Partnerships and uses
Analysis by Frédéric Clavert,
C2
DH
66. Web archive histories and scalable reading
WARCnet Closing Conference
Aarhus, 18 October 2022
Helle Strandgaard Jensen
67. The advantages of Scalable Reading
Digital Humantites and the separation of ‘the digital’ and ‘the
humanities’
Staying in conversation with existing research (methods?)
Connecting close and distant reading (using distant reading as context
for individual data points and systematic selection of cases)