This document summarizes the digitization of Hansard, the official record of UK parliamentary debates. Key points include: Hansard was digitized by scanning nearly 3 million pages from 1803-2005; digitization enables improved access, preservation and usability; ongoing costs include hosting, storage and digital preservation; a digitization policy framework was developed to ensure consistency; and a web interface was created allowing faceted searching of the digitized Hansard texts.
IIIF at europeana, IIIF conference, Vatican, 2017Nuno Freire
The presentation will start with the current status of the work at Europeana in discovery of IIIF cultural heritage resources, with the particular focus of metadata aggregation. It will cover the ongoing research activities and the operational procedures for ingestion of IIIF resources.
The presentation will follow with the plans of further activities, also in relation to the IIIF Discovery Technical Specification Group, and a discussion of cooperation possibilities in this context.
Developing a national digital library stapel - meijers 20160302Enno Meijers
In 2015, the Koninklijke Bibliotheek (KB) became legally responsible for the digital infrastructure of the Dutch public libraries.
The KB wants to offer a platform where people and information come together. Their most important task for the years to come is the development of a national digital library - together with their partners in the network.
In this session, representatives from the KB will present their approach towards the Dutch digital library infrastructure. They will address some issues and welcome input from colleague librarians that are facing the same challenges.
IIIF at europeana, IIIF conference, Vatican, 2017Nuno Freire
The presentation will start with the current status of the work at Europeana in discovery of IIIF cultural heritage resources, with the particular focus of metadata aggregation. It will cover the ongoing research activities and the operational procedures for ingestion of IIIF resources.
The presentation will follow with the plans of further activities, also in relation to the IIIF Discovery Technical Specification Group, and a discussion of cooperation possibilities in this context.
Developing a national digital library stapel - meijers 20160302Enno Meijers
In 2015, the Koninklijke Bibliotheek (KB) became legally responsible for the digital infrastructure of the Dutch public libraries.
The KB wants to offer a platform where people and information come together. Their most important task for the years to come is the development of a national digital library - together with their partners in the network.
In this session, representatives from the KB will present their approach towards the Dutch digital library infrastructure. They will address some issues and welcome input from colleague librarians that are facing the same challenges.
Investigating the PROMISE of a Belgian web archive Sally Chambers
Presentation held (remotely) at: The "Web Archiving: Best Practices for Digital Cultural Heritage" international conference is organized by The National Library of Israel and the Open Media and Information Lab (OMILab) at the Open University of Israel. (http://webarchiving2018.nli.org.il)
The Belgian web is not currently systematically archived. As a result, there is a considerable risk that a significant portion of Belgian contemporary history will be lost forever. To prevent this, the Belgian Science Policy Office (BELSPO) funded the PROMISE (Preserving Online Multiple Information: towards a Belgian Strategy) project The aim of PROMISE is to: (i) identify current best practices in web-archiving (ii) pilot web-archiving in Belgium, including access (and use) for scientific research, and (iii) make recommendations for a sustainable web-archiving service for Belgium. This paper will present the current status of the PROMISE project, including the latest results.
The Digital Heritage Network in the Netherlands is working on a Linked Data based approach for improving the visibility of Digital Heritage information.
Clare Lanigan - Presentation to IES Studentsdri_ireland
Presentation given by Clare Lanigan, DRI Education and Outreach Manager, to students of the School of Information and Library Science, University of North Carolina, at the Institute for the International Education of Students (IES) Abroad centre in Rathmines, Dublin, on 1 June 2017.
Slides of the presentations gives as part of the Europeana Research panel "Cultural Heritage Data for Research: A Europeana Research Panel" at DH Benelux 2017 in Utrecht.
20180705 challanges for researchers in digital humanities liber 2018 lille(rw)LIBIS
Presentation of Roxanne Wyns (LIBIS - KU Leuven Bibliotheken) at LIBER 2018 Challenges for Researchers in the Digital Humanities: custom development vs. sustainable research infrastructures.
Presentation given by Vassilis Tzouvaras
National Technical University of Athens, Greece
LoCloud Conference
Sharing local cultural heritage online with LoCloud services
Amersfoort, Netherlands
5 February 2016
Ariadne Training Workshop
Ljubljana, Slovenia
21 January 2016
Presentation by:
Holly Wright, Archaeology Data Service (ADS)
and
Kater Fernie, 2 Culture Associates
The Wellcome Trust is examining the possibility of a cloud platform for the storage and delivery of digitised artefacts. This platform is intended for the Trust's own use as well as others. A version of this presentation with embedded notes and video can be viewed on Google docs: http://bit.ly/1GRKqN4 or PowerPoint online: http://bit.ly/1CwGsrE
EDF2014: Franck Cotton & Kamel Gadouche, France: TeraLab - A Secure Big Data...European Data Forum
Selected Talk of Franck Cotton, Technology Advisor, Institut National de la Statistique et des Etudes Economiques, France & Kamel Gadouche, Director, Centre d'Accès Sécurisé aux Données / Groupe des Ecoles Nationales d'Economie et Statistique, France at the European Data Forum 2014, 19 March 2014 in Athens, Greece: TeraLab - A Secure Big Data Platform, Description And Use Cases
Investigating the PROMISE of a Belgian web archive Sally Chambers
Presentation held (remotely) at: The "Web Archiving: Best Practices for Digital Cultural Heritage" international conference is organized by The National Library of Israel and the Open Media and Information Lab (OMILab) at the Open University of Israel. (http://webarchiving2018.nli.org.il)
The Belgian web is not currently systematically archived. As a result, there is a considerable risk that a significant portion of Belgian contemporary history will be lost forever. To prevent this, the Belgian Science Policy Office (BELSPO) funded the PROMISE (Preserving Online Multiple Information: towards a Belgian Strategy) project The aim of PROMISE is to: (i) identify current best practices in web-archiving (ii) pilot web-archiving in Belgium, including access (and use) for scientific research, and (iii) make recommendations for a sustainable web-archiving service for Belgium. This paper will present the current status of the PROMISE project, including the latest results.
The Digital Heritage Network in the Netherlands is working on a Linked Data based approach for improving the visibility of Digital Heritage information.
Clare Lanigan - Presentation to IES Studentsdri_ireland
Presentation given by Clare Lanigan, DRI Education and Outreach Manager, to students of the School of Information and Library Science, University of North Carolina, at the Institute for the International Education of Students (IES) Abroad centre in Rathmines, Dublin, on 1 June 2017.
Slides of the presentations gives as part of the Europeana Research panel "Cultural Heritage Data for Research: A Europeana Research Panel" at DH Benelux 2017 in Utrecht.
20180705 challanges for researchers in digital humanities liber 2018 lille(rw)LIBIS
Presentation of Roxanne Wyns (LIBIS - KU Leuven Bibliotheken) at LIBER 2018 Challenges for Researchers in the Digital Humanities: custom development vs. sustainable research infrastructures.
Presentation given by Vassilis Tzouvaras
National Technical University of Athens, Greece
LoCloud Conference
Sharing local cultural heritage online with LoCloud services
Amersfoort, Netherlands
5 February 2016
Ariadne Training Workshop
Ljubljana, Slovenia
21 January 2016
Presentation by:
Holly Wright, Archaeology Data Service (ADS)
and
Kater Fernie, 2 Culture Associates
The Wellcome Trust is examining the possibility of a cloud platform for the storage and delivery of digitised artefacts. This platform is intended for the Trust's own use as well as others. A version of this presentation with embedded notes and video can be viewed on Google docs: http://bit.ly/1GRKqN4 or PowerPoint online: http://bit.ly/1CwGsrE
EDF2014: Franck Cotton & Kamel Gadouche, France: TeraLab - A Secure Big Data...European Data Forum
Selected Talk of Franck Cotton, Technology Advisor, Institut National de la Statistique et des Etudes Economiques, France & Kamel Gadouche, Director, Centre d'Accès Sécurisé aux Données / Groupe des Ecoles Nationales d'Economie et Statistique, France at the European Data Forum 2014, 19 March 2014 in Athens, Greece: TeraLab - A Secure Big Data Platform, Description And Use Cases
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
A webinar given by Jenny Mitcham and Simon Wilson to Digital Preservation Coalition members on 25th November 2015. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
Jisc Shared Service requirements presentation - 18th November 2015Jenny Mitcham
A presentation by Chris Awre and Jenny Mitcham about our requirements gathering exercise for the "Filling the Digital Preservation Gap" project to inform the requirements of Jisc's proposed shared service for RDM. The presentation was delivered on the 18th November 2015 at Jisc's shared services workshop at Aston University
This presentation was provided by Edward M. Corrado on Wednesday, June 14, during the NISO virtual event, Images: Digitization & Preservation of Special Collections in Libraries, Museums and Archives.
INNOVATION AND RESEARCH (Digital Library Information Access)Libcorpio
Innovation and research, Digital Library Information Access, LIS Education, Library and Information Science, LIS Studies, Information Management, Education and Learning, Library science, Information science, Digital Libraries, Research on Digital Libraries, DL, Innovation in libraries and publishing, Areas of Research for DL, Information Discovery, Collection Management and Preservation, Interoperability, Economic, Social and Legal Issues, Core Topics In Digital Libraries, DL Research Around The World
Digital Asset Management and Archival PreservationLAC Group
Phil Spiegel shares the basic principles, workflows, best practices and tools available, as well strategies for various types of digital/media projects.
2010 EGITF Amsterdam - Gap between GRID and HumanitiesDirk Roorda
How useful/relevant is GRID and High Performance Computing in its current form for the Humanities, especially within the European Infrastructure projects CLARIN, DARIAH and CESSDA? We need virtual use cases!
"Filling the digital preservation gap" with ArchivematicaJenny Mitcham
A presentation given by Jenny Mitcham at the iPRES conference on 6th November 2015 in Chapel Hill, North Carolina. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
This presentation was provided by
Priscilla Caplan of The Florida Center for Library Automation and Jeremy York of The University of Michigan Library, during the NISO Webinar "What It Takes To Make It Last: E-Resources Preservation" held on February 10, 2011.
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013SALCTG
An overview of Research Data Management: the research process from developing ideas to preservation of data; funder perspectives, the impact on the wider service, Data Asset Frameworks, preservation and access, and cost implications.
Presentada en la Jornada Internacional sobre Archivos Web y Depósito Legal Electrónico, en la Biblioteca Nacional de España (BNE), el día 9 de julio de 2013.
A presentation about the British Library News Media services given by Dr Luke McKernan
Lead Curator, News and Moving Image
The British Library. 20th April 2015 for an ALISS visit.
How SCIE supports the information needs of health and social care professionalsALISS
Sue Jardine, Information Specialist, How SCIE supports the information needs of health and social care professionals
Supporting Practitioners in Health and Social Care.
ALISS conference 11th February 2015
Speedy professional conversations around learning and teaching in higher educ...ALISS
Speedy professional conversations around learning and teaching in higher education via the brand new tweetchat #LTHEchat
Sue Beckingham, Sheffield Hallam University
Chrissi Nerantzi, Manchester Metropolitan University
Peter Reed, University of Liverpool
Dr David Walker, University of Sussex
3. Hansard
• the official report of debates in Parliament
• actually an unofficial private enterprise at first
• “nationalised” in 1909
• early reports written in the third person
• eventually developed into a (nearly) verbatim
account
• volumes from 1803 – 2005 were digitised
• nearly 3 million pages
4. “though not strictly verbatim, [it] is substantially
the verbatim report, with repetitions and
redundancies omitted and with obvious mistakes
corrected, but [...] on the other hand leaves out
nothing that adds to the meaning of the speech
or illustrates the argument.”
5. why digitise?
• enable preservation
• conservation is expensive
• increase access
• increase usability
• improve business processes
• re-use physical storage space
• costs have fallen significantly
• quality improving steadily
6. preservation vs. conservation
conservation
direct intervention to prevent/make good damage to
materials
preservation
a broader term than conservation. It includes all
managerial and financial considerations including
storage and accommodation provision, staffing levels,
policies, techniques, and methods involved in preserving
library and archive materials and the information
contained therein
7. preservation
• originals printed on poor quality paper
• starting to deteriorate
• reduce wear and tear from daily use
• keep in a controlled environment
• conservation is expensive
8. improve access
• internal
– extensive day to day business use across a very large
site
• public
– national heritage and birthright
– disposal by libraries
– international interest
10. costs
• costs have fallen significantly
• alternative funding models
• reduce physical storage needs
– dispose of surplus copies
– locate in less valuable space
• but beware the hidden costs…
11. ongoing costs
• developing a front-end and database
• hosting
• storing images
• digital preservation
• format migration
15. doing the work
• In house or contractor?
• method
– image only
– re-keying (single, double, triple...)
– OCR (optical character recognition)
– image plus text
– metadata capture
– manual intervention increases quality and costs!
17. OCR
• how accurate does it need to be?
• mass vs batch capture
• double or triple compare
• diminishing returns
18. QA (quality assurance)
• automate where possible
• contractor
– 100% proof reading
• client
– heavy sampling of images
– 1% sampling of text
• third party?
19. the need for a policy framework
• Hansard was the first major digitisation project in
the UK parliament
• an earlier project to digitise Local and Private
Acts captured images only
• we needed a digitisation policy for parliament to
ensure consistency and learning from
experience
20. policy aims
• ensure that individual projects:
– take into account the wider information context both
inside and outside Parliament
– deliver their target benefits
– offer value for money
• ensure the resources created can be:
– exploited fully
– used for as long as is required
21. policy scope
• publications
• photographs
• archival documents
• business records
22. policy principles
• digitisation needs to be seen as an integral part
of the information work carried out by parliament
• use of appropriate technical standards
• scan once for many purposes
• business cases should take account of all
relevant costs
23. selection criteria
• measurable user demand (for public use)
• business need (for internal use)
• the potential for learning and educational use
• cost and the availability of other resources
• technical considerations
• the uniqueness of the items
• conservation requirements
• intellectual property rights and copyright issues
• the availability of digitised versions of the same material
elsewhere
• the potential for revenue raising
• the feasibility of long-term preservation, where required
24. other aspects of the policy
• the delivery method will be planned at the outset
• the preservation master will be an
uncompressed TIFF file
• metadata will be created, to support resource
discovery, use, storage and digital preservation
• we will adopt international standards where
possible
• we will work with partners where possible
25. developing a digitisation strategy
• a project board has been created
• an integral part of an online parliamentary
history programme for parliament
• will use the criteria set out in the digitisation
policy to prioritise future digitisation work
26. practical guidelines
• guidelines have been developed for all parts of
parliament which need to create digitised assets:
– a checklist for doing the work
– glossary
– details of file formats, OCR options
– describes popular myths on costs
27. hosting
• text and images
• text only
• navigation
• search
• web 2.0
• funding models
• give it away!?
http://www.parliament.uk/publications/archives.cfm
28. developing a web interface
drivers
• keep costs down
• work closely with users
• meaningful search across a large amount of data
solution
• experimental approach
• open source
29. methodology and progress
• small team of developers from Parliamentary
ICT working closely with users (inside and
outside Parliament)
• uses “micro formats” approach
• XML is parsed into HTML before loading into the
database
• JPEGs not currently being used
• half of the data has been loaded (mainly 20th
century)
• public discussion group and issues log
34. faceted classification
• faceted approach to browsing and searching
• assignment of multiple classifications to an object
• classifications can be to be ordered in a variety of ways
• facets include
– date
– volume number
– monarch
– chamber
– content type (debates or questions)
– constituencies
– Members of Parliament
– offices held.
35. other features
• references using the standard format can be located
using the search box
HC Deb Vol 385 13 May 2002 c498
• predictable URLs
http://hansard.millbanksystems.com/commons/1941/may/07/w
• pages created for:
– individual Members of Parliament
– constituencies
– acts
– bills
– divisions
Today we’re going to say a little bit about our project to digitise Hansard, which is of course the official record of debates in Parliament. Although this isn’t records management in the traditional sense, it is about taking one of the most important records in the country if not the world, using it to create a new digital asset and fully exploiting its value as an information source rather than gathering dust on a shelf