Presentaiton at Panel "Interoperable Platforms and CLIR Initiatives: A Global Perspective" at the 2019 IIIF Conference
Göttingen, Thursday 26 June 2019
https://iiif.io/event/2019/goettingen/program/30/
Funding Central and the Association of Charitable Foundations are resources for finding funding. Census data on demographics for Telford and Wrekin, England can be found at localstats.co.uk and ukcensusdata.com. Survey monkey, Picmonkey and Piktochart are tools for creating surveys and visuals.
The document discusses locations in France including Strasbourg, Erstein, and Gijon, Spain. It mentions a transnational meeting from October 21-23, 2015 for the Erasmus+ TIPS program. It provides details on population sizes of Strasbourg and Erstein, and notes that Erstein is 17km from Strasbourg and located just across the German border. It references the European Parliament and concludes with information on Alsace chocolate.
London & Partners - MICE Presentation 2019MICEboard
The London Convention Bureau helps business event planners by finding venues, coordinating site visits and hotel rooms, and providing creative ideas for events and access to sector experts. It offers assistance for bids and free, impartial support as a first point of contact. London offers over 143,000 hotel rooms and 1,600 hotels, with 18,400 new rooms being built by 2022 across its 1,000 diverse venues. It has strong transportation links, a vast cultural offering, and is constantly reinventing itself with new spaces, areas, and initiatives.
Europeana Introduction at Creative Kick-Off event - Breandán KnowltonEuropeana
Presentation by Breandán Knowlton of Europeana at the kick-off of the Europeana Creative project in Vienna in February 2013. Outlines the Europeana Network, the Foundation and the project ecosystem.
Ignite Talk on the Exploring British Design Project given at the Europeana AGM 2015, Amsterdam, 4th November 2015.
http://pro.europeana.eu/event/europeana-annual-general-meeting-2015
4/11 ignite 1. adrian stevenson, jisc, “exploring british design”Europeana
The document discusses the Exploring British Design project which aggregates archive descriptions from the Brighton Design Archives at the University of Brighton onto the Archives Hub database. The Archives Hub data is then passed onto Archives Portal Europe and Europeana to create an online collection of British design archives accessible to the public. The project aims to exploit digital technologies to transform research in the arts and humanities as part of the UK Arts and Humanities Research Council's Digital Transformations initiative.
UK Committee on RDA, RDA Day: New Tools for the Future of Cataloguing - Jenny...CILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Presentaiton at Panel "Interoperable Platforms and CLIR Initiatives: A Global Perspective" at the 2019 IIIF Conference
Göttingen, Thursday 26 June 2019
https://iiif.io/event/2019/goettingen/program/30/
Funding Central and the Association of Charitable Foundations are resources for finding funding. Census data on demographics for Telford and Wrekin, England can be found at localstats.co.uk and ukcensusdata.com. Survey monkey, Picmonkey and Piktochart are tools for creating surveys and visuals.
The document discusses locations in France including Strasbourg, Erstein, and Gijon, Spain. It mentions a transnational meeting from October 21-23, 2015 for the Erasmus+ TIPS program. It provides details on population sizes of Strasbourg and Erstein, and notes that Erstein is 17km from Strasbourg and located just across the German border. It references the European Parliament and concludes with information on Alsace chocolate.
London & Partners - MICE Presentation 2019MICEboard
The London Convention Bureau helps business event planners by finding venues, coordinating site visits and hotel rooms, and providing creative ideas for events and access to sector experts. It offers assistance for bids and free, impartial support as a first point of contact. London offers over 143,000 hotel rooms and 1,600 hotels, with 18,400 new rooms being built by 2022 across its 1,000 diverse venues. It has strong transportation links, a vast cultural offering, and is constantly reinventing itself with new spaces, areas, and initiatives.
Europeana Introduction at Creative Kick-Off event - Breandán KnowltonEuropeana
Presentation by Breandán Knowlton of Europeana at the kick-off of the Europeana Creative project in Vienna in February 2013. Outlines the Europeana Network, the Foundation and the project ecosystem.
Ignite Talk on the Exploring British Design Project given at the Europeana AGM 2015, Amsterdam, 4th November 2015.
http://pro.europeana.eu/event/europeana-annual-general-meeting-2015
4/11 ignite 1. adrian stevenson, jisc, “exploring british design”Europeana
The document discusses the Exploring British Design project which aggregates archive descriptions from the Brighton Design Archives at the University of Brighton onto the Archives Hub database. The Archives Hub data is then passed onto Archives Portal Europe and Europeana to create an online collection of British design archives accessible to the public. The project aims to exploit digital technologies to transform research in the arts and humanities as part of the UK Arts and Humanities Research Council's Digital Transformations initiative.
UK Committee on RDA, RDA Day: New Tools for the Future of Cataloguing - Jenny...CILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Challenges to implementation - Jenny WrightCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Application Profiles in RDA - Jenny WrightCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The Official RDA Toolkit - Opportunities for Efficiency - Thurstan YoungCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The Official RDA Toolkit - Opportunities for Enrichment - Thurstan YouingCILIP MDG
The document discusses opportunities for enriching metadata in the Official RDA Toolkit. It provides background on extension plans, representative expressions, and data provenance. An example is given of recording an extension plan and representative expression for a multi-volume work. The extension plan vocabulary and representative expression elements are shown as ways to enrich RDA descriptions through structured, encoded values.
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
RDA methods, scenarios, tools - Gordon DunsireCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Poster: What’s in a name? Re-Discovering cataloguing and index through metada...CILIP MDG
In 2019 CILIP’s Cataloguing and Indexing Group changed its name to the Metadata and Discovery Group. This poster will showcase the transition of the look and feel of the group’s logo and the process of designing and new one.
Poster presented at the CILIP Metadata and Discovery Group (MDG) Conference & UKCoR RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham).
Poster: Revamping our in-house cataloguing training / Victoria Parkinson (Kin...CILIP MDG
With hybrid working and a new LMS, we are revamping our in-house cataloguing training. We are learning from our teaching librarians and using the tools we have, such as Moodle, to create cataloguing training that allows anyone with an interest to learn the basics and making the best use of face-to-face time for putting those skills into practice. Over the past eight years we’ve adapted and updated our in-house training, and I’ll also talk about how we decide what to teach colleagues, and how we try to make the best use of staff time to keep skills up when cataloguing is one of many competing priorities and shared across several teams. Between staff turnover and COVID lockdowns and service changes, we are starting almost from scratch in building a pool of staff who can catalogue the material our suppliers can’t provide records for, which is an excellent time to take stock of what our cataloguing needs are, and advocate for the importance of creating and upgrading good quality records and why we need to build these skills in-house.
Poster presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Poster: FAST : can it lighten the load, and what is the impact? / Jenny Wrigh...CILIP MDG
This poster presents the Faceted Application of Subject Terminology, giving an overview of the scheme, its advantages and potential issues, and its practical implementation. It will demonstrate that FAST is an important development for those interested in Linked Data, and the ways in which it is a useful tool for discovery in any system.
Poster presented at the CILIP Metadata and Discovery Group (MDG) Conference & UKCoR RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham).
Poster: The West Midlands Evidence Repository (WMER) : a regional collaborati...CILIP MDG
The West Midlands Evidence Repository (WMER) was born from a pre-pandemic recognition by managers of Knowledge and Library Services (KLSs) of 8 NHS Trusts in the West Midlands region of the need for a repository. This was to replace existing provision, or recognition of national priorities or local needs to record, collect, and share research, as well as potential for sharing patient information leaflets or guidelines. Some managers and services had previous experience of repositories, as well as being part of a national pilot. WMER, however, represented a new start for all to work in collaboration to establish a new service. The consortium would enable sharing of both costs and experience.
Initially, different repository suppliers were investigated by the KLS that had had a long-established repository, taking on board the experience of the group from the national pilot. The Atmire Open Repository platform was chosen as it met the consortium’s needs and had a proven track record of other collaborative repositories in the NHS. Financing was taken on by one Trust and the on-boarding was led in partnership between that Trust and the Trust that had undertaken the initial investigation.
With the initial on-boarding completed and the test server set-up, the group took a step back to ensure they worked together as a collaborative going forward. Collaborative work between the KLSs was facilitated by the formal creation of two groups, a Managers Group for overall approval and financial decision making and an Operational Group handling the setup and administration of the repository for the consortium. The Operational Group is led by the service with most experience of managing repositories and the lead of it acts as liaison between the two groups, with each group having representation from the eight organisations. Learning from other regional collaborations the Future NHS site was used as a collaborative workspace and Teams as the main means of communication.
The setup of the repository was completed on time after three months. There was initially a steep learning curve for all, especially the Operational Group who undertook this process. The group identified key metadata and metadata standards for the repository, including the use of ORCIDs and the use of Wessex Classification as a controlled vocabulary. The setup process was facilitated by the collaborative nature of the project as the variety of experience in the group was a great benefit. It should be noted support from the suppliers was specifically related to technical support only.
The collaborative nature of the project also allowed work to be shared, and tasks were given to members to be undertaken independently. However, a downside of collaborative projects is that decisions can take longer to be inclusive...
Poster presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Poster: Updating the Wessex Classification Scheme for UK health libraries : a...CILIP MDG
The Wessex Classification Scheme was created by healthcare librarians in the South West of England, and was loosely based on the US National Library of Medicine classification. The scheme is widely used in healthcare libraries across the UK, both inside and outside the NHS. Although the scheme has gone through several revisions, there has been no major update since 2015, so the Wessex Classification Scheme Oversight Group was formed in September 2022 with the support of NHS England. The group aims to bring knowledge and skills from UK health library networks to improve the scheme and offers a chance for participants to develop skills in working with classification and subject indexing, and the opportunity to network widely. By forming a working group, it ensures the longevity of the scheme and shares the maintenance work more widely.
Initially, members were asked which parts of the scheme they felt needed updating the most and sub-groups were formed for LGBTQ+ issues and gender identity (the Pride sub-group), Ethnicity and Race, and Learning Disability and Neurodiversity (the LDN sub-group) as well as a smaller team working on ‘quick and simple’ updates....
Poster presented at the CILIP Metadata and Discovery Group (MDG) Conference & UKCoR RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham).
Revamping in-house cataloguing training / Victoria Parkinson (King's College ...CILIP MDG
With hybrid working and a new LMS, we are revamping our in-house cataloguing training. We are learning from our teaching librarians and using the tools we have, such as Moodle, to create cataloguing training that allows anyone with an interest to learn the basics and making the best use of face-to-face time for putting those skills into practice. Over the past eight years we’ve adapted and updated our in-house training, and I’ll also talk about how we decide what to teach colleagues, and how we try to make the best use of staff time to keep skills up when cataloguing is one of many competing priorities and shared across several teams. Between staff turnover and COVID lockdowns and service changes, we are starting almost from scratch in building a pool of staff who can catalogue the material our suppliers can’t provide records for, which is an excellent time to take stock of what our cataloguing needs are, and advocate for the importance of creating and upgrading good quality records and why we need to build these skills in-house.
Lightning Talk presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
UK NACO funnel : progress, obstacles, and solutions / Martin Kelleher (Univer...CILIP MDG
This Lightning Talk will provide a quick update on latest progress with the now established UK NACO Funnel, which allows participating institutions to contribute to Library of Congress / PCC authority control. The presentation will include a summary of the purpose of the funnel, details of latest expansion, problems and solutions with data submission software, and further plans and collaborations.
Lightning Talk presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Ship[w]right[e]s? : the challenges of cataloguing reports from scientific exp...CILIP MDG
This document discusses the challenges of cataloguing reports from scientific expeditions, using the Challenger Reports as an example. It notes that there were 83 Challenger Reports published by 47 authors, held across 4 sites in 50 different sectional libraries, with 220 bib records created for this work. It also mentions the opportunity for the Natural History Museum to think about metadata across the entire museum collection as part of an effort to move specimens to a new location.
BFI Reuben Library : an RDA implementation story / Anastasia Kerameos (BFI Re...CILIP MDG
“From 1st January 2024, Adlib will no longer be supported or maintained by Axiell.” This statement acted as the catalyst for action, enabling the release of resources to implement significant changes to the BFI Reuben Library’s record structure, which in turn prompted a deeper look into our current cataloguing practices and future requirements.
Upgrading to Axiell Collections will allow the library to implement new RDA more fully – we had previously adopted some aspects but not all – and, importantly, it will allow us to better align our data structure with that of the organisation’s other collections, making it easier to manage and making it compatible with further planned system developments. By the time of the conference in September we will be cataloguing to an under the bonnet Work – Expression – Manifestation – Item (WEMI) record hierarchy and new cataloguing guidelines.
Having watched all the webinars available, having read every piece of documentation which seemed relevant, having spent hours reading and re-reading the contents of the RDA Toolkit we are currently working on the last stages of our application profile whilst still debating issues around putting the theory into practice, especially in the area of aggregates and diachronic works. I do not suggest I have all the answers, far from it, but by sharing the story of our journey, that of a medium sized non-academic library of specialist mostly print collections and illustrating it with practical examples I hope my presentation will be of use to others currently travelling a similar path.
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
RDA implementation at the British Library / Thurstan Young (British Library)CILIP MDG
On 23rd May 2023, the RDA Board announced that the original RDA Toolkit will be removed in May 2027. All RDA users will need to be prepared for transition to the official RDA Toolkit before then. As previously announced, a Countdown Clock will start running in May 2026, a year before the sunset date.
This paper will provide an update on the British Library’s plans for implementation of the new RDA Toolkit, following completion of the RDA Toolkit Restructure and Redesign (3R) project. It will provide an overview of the timeline and scope for implementation as well as describing the training and documentation underpinning the implementation and the support available to other institutions for their implementation.
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Community forward : developing descriptive cataloguing of rare materials (RDA...CILIP MDG
Since 2013, Resource Description and Access (RDA) has been the chief cataloguing standard used in the United States. In 2019, the RDA Steering Committee previewed a new version of the RDA Toolkit, which introduced substantial changes, such as replacing instructions with a series of options, adding new concepts such as “nomens” and “diachronic works,” and replacing the prior organisation with a broader intellectual framework. This revised Toolkit became the official RDA Toolkit in December 2020, with major cataloguing bodies planning to adopt it in the coming years. Some cataloguers have expressed concerns regarding the official RDA Toolkit, particularly around cost and training required to learn the new standard.
In response to these concerns, the RBMS RDA Editorial Group, a group of volunteers from the Association of College and Research Libraries’ Rare Books and Manuscripts Section, developed a new manual, Descriptive Cataloging of Rare Materials (RDA Edition). DCRMR is informed by core principles of community and sustainability while employing open-access publication models and infrastructure. Designed in response to community feedback, it presents instructions in cataloguing workflow order using clear language while remaining aligned to the official RDA Toolkit and RDA element sets. The manual was approved in February 2022 in its first iteration and continues to be actively developed and updated. This presentation will discuss why the editorial group created an open and free manual; the process and tools for creating the manual, including the use of GitHub to publish a cataloguing standard; and outcomes to date.
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The West Midlands Evidence Repository (WMER) : a regional collaboration proje...CILIP MDG
The West Midlands Evidence Repository (WMER) was born from a pre-pandemic recognition by managers of Knowledge and Library Services (KLSs) of 8 NHS Trusts in the West Midlands region of the need for a repository. This was to replace existing provision, or recognition of national priorities or local needs to record, collect, and share research, as well as potential for sharing patient information leaflets or guidelines. Some managers and services had previous experience of repositories, as well as being part of a national pilot. WMER, however, represented a new start for all to work in collaboration to establish a new service. The consortium would enable sharing of both costs and experience.
Initially, different repository suppliers were investigated by the KLS that had had a long-established repository, taking on board the experience of the group from the national pilot. The Atmire Open Repository platform was chosen as it met the consortium’s needs and had a proven track record of other collaborative repositories in the NHS. Financing was taken on by one Trust and the on-boarding was led in partnership between that Trust and the Trust that had undertaken the initial investigation.
With the initial on-boarding completed and the test server set-up, the group took a step back to ensure they worked together as a collaborative going forward. Collaborative work between the KLSs was facilitated by the formal creation of two groups, a Managers Group for overall approval and financial decision making and an Operational Group handling the setup and administration of the repository for the consortium. The Operational Group is led by the service with most experience of managing repositories and the lead of it acts as liaison between the two groups, with each group having representation from the eight organisations. Learning from other regional collaborations the Future NHS site was used as a collaborative workspace and Teams as the main means of communication.
The setup of the repository was completed on time after three months. There was initially a steep learning curve for all, especially the Operational Group who undertook this process. The group identified key metadata and metadata standards for the repository, including the use of ORCIDs and the use of Wessex Classification as a controlled vocabulary. The setup process was facilitated by the collaborative nature of the project as the variety of experience in the group was a great benefit. It should be noted support from the suppliers was specifically related to technical support only.
The collaborative nature of the project also allowed work to be shared, and tasks were given to members to be undertaken independently. However, a downside of collaborative projects is that decisions can take longer to be inclusive...
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Authority of assertion in repository contributions to the PID graph / George ...CILIP MDG
The principles surrounding Linked Open Data and their implementation within digital libraries are well understood. Such implementations may be challenging, but successes are now well documented and continue to demonstrate the benefits of disseminating and enriching existing metadata with improved semantics and relational associations. Often facilitated in machine-readability enhancements to metadata by harnessing serializations of the Resource Description Framework (RDF) and its reliance of URIs, these LOD approaches have ensured digital libraries, and similar GLAMR initiatives elsewhere, contribute to the growing knowledge graphs associated with the wider semantic web by declaring statements of fact about web entities. Within open scholarly ecosystems a growing use of persistent identifiers (PIDs) to define and link scholarly entities has emerged, e.g., DOIs, ORCIDs, etc. The requirement for greater URI persistence has been motivated by several developments within the scholarly space; suffice to state that, when combined with appropriate structured data, PIDs can support improvements to resource discovery, as well as facilitate contributions to the ‘PID graph’ – a scholarly data graph describing and declaring associative relations between scholarly entities.
While the increased adoption of PIDs has the potential to transform scholarship, ensuring that these PIDs are used appropriately, encoded correctly within metadata, and that all relevant relational associations between scholarly entities are declared presents challenges. This is especially true within open scholarly repositories, from where many contributions to the PID graph will be made but – unlike many LOD contexts – from where the authority to assert specific relations may not always exist. Such declarations need to demonstrate reliability and provenance and are central to the interlinking of heterogeneous textual objects, datasets, software, research instruments, equipment, and the related PIDs these items may generate, such as for people, organizations, or other abstract entities.
This paper will explore the issues that arise when levels of authority to assert are lacking or are uncertain, and review results from a related study exploring the ‘PID literacy’ of scholars...
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
ISNI : a persistent identifier for creatives and associated organizations / T...CILIP MDG
Recent interest in persistent identifiers (PIDs) within the research and library sectors is bringing discussions about the adoption of standards – such as ISNI and DOI, and other ID schemes such as ORCID, Ringgold, ROR, CrossRef, etc. – to the fore, casting a spotlight on the PIDs already in use and their relationships to one another. In its capacity as a bridging identifier and a critical component in Linked Data applications, clearly ISNI has a major part to play in these discussions.
With a view to explaining the benefits of ISNIs for researchers, academic and scholarly publishers, institutions, funders, and other stakeholders – including information about ISNI’s centrally-managed database and the curation functions carried out by its direct data contributors – ISNI-IA continues to promulgate the ISNI standard within the research and library sectors, showing not only the importance of using ISNIs, but also the strength and quality of data that can be achieved when research identifiers operate collaboratively.
This presentation will be an opportunity for those in the library and research community to learn about progress with the ISNI standard to date, within the library, research, and publishing sectors and beyond!
Attendees will learn about:
• What the ISNI standard is.
• How the ISNI standard interacts with other identifiers.
• The benefits of ISNIs for the research sector.
• The level of adoption across the sectors that ISNI represents (including the library, music, publishing, research, and entertainment sectors).
• Upcoming ISNI projects (including the National Library of Finland’s ongoing project in collaboration with 5 prominent Copyright Management Organizations).
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Challenges to implementation - Jenny WrightCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Application Profiles in RDA - Jenny WrightCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The Official RDA Toolkit - Opportunities for Efficiency - Thurstan YoungCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The Official RDA Toolkit - Opportunities for Enrichment - Thurstan YouingCILIP MDG
The document discusses opportunities for enriching metadata in the Official RDA Toolkit. It provides background on extension plans, representative expressions, and data provenance. An example is given of recording an extension plan and representative expression for a multi-volume work. The extension plan vocabulary and representative expression elements are shown as ways to enrich RDA descriptions through structured, encoded values.
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
RDA methods, scenarios, tools - Gordon DunsireCILIP MDG
“The RDA Day is programmed by the UK Committee on RDA. Using activities and games throughout informative presentations, the RDA Day will inform and engage metadata practitioners and managers on a content standard which integrates well with the metadata needs of the 21st century”
Paper presented on the UKCoR RDA Day during the Metadata & Discovery Group Conference (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Poster: What’s in a name? Re-Discovering cataloguing and index through metada...CILIP MDG
In 2019 CILIP’s Cataloguing and Indexing Group changed its name to the Metadata and Discovery Group. This poster will showcase the transition of the look and feel of the group’s logo and the process of designing and new one.
Poster presented at the CILIP Metadata and Discovery Group (MDG) Conference & UKCoR RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham).
Poster: Revamping our in-house cataloguing training / Victoria Parkinson (Kin...CILIP MDG
With hybrid working and a new LMS, we are revamping our in-house cataloguing training. We are learning from our teaching librarians and using the tools we have, such as Moodle, to create cataloguing training that allows anyone with an interest to learn the basics and making the best use of face-to-face time for putting those skills into practice. Over the past eight years we’ve adapted and updated our in-house training, and I’ll also talk about how we decide what to teach colleagues, and how we try to make the best use of staff time to keep skills up when cataloguing is one of many competing priorities and shared across several teams. Between staff turnover and COVID lockdowns and service changes, we are starting almost from scratch in building a pool of staff who can catalogue the material our suppliers can’t provide records for, which is an excellent time to take stock of what our cataloguing needs are, and advocate for the importance of creating and upgrading good quality records and why we need to build these skills in-house.
Poster presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Poster: FAST : can it lighten the load, and what is the impact? / Jenny Wrigh...CILIP MDG
This poster presents the Faceted Application of Subject Terminology, giving an overview of the scheme, its advantages and potential issues, and its practical implementation. It will demonstrate that FAST is an important development for those interested in Linked Data, and the ways in which it is a useful tool for discovery in any system.
Poster presented at the CILIP Metadata and Discovery Group (MDG) Conference & UKCoR RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham).
Poster: The West Midlands Evidence Repository (WMER) : a regional collaborati...CILIP MDG
The West Midlands Evidence Repository (WMER) was born from a pre-pandemic recognition by managers of Knowledge and Library Services (KLSs) of 8 NHS Trusts in the West Midlands region of the need for a repository. This was to replace existing provision, or recognition of national priorities or local needs to record, collect, and share research, as well as potential for sharing patient information leaflets or guidelines. Some managers and services had previous experience of repositories, as well as being part of a national pilot. WMER, however, represented a new start for all to work in collaboration to establish a new service. The consortium would enable sharing of both costs and experience.
Initially, different repository suppliers were investigated by the KLS that had had a long-established repository, taking on board the experience of the group from the national pilot. The Atmire Open Repository platform was chosen as it met the consortium’s needs and had a proven track record of other collaborative repositories in the NHS. Financing was taken on by one Trust and the on-boarding was led in partnership between that Trust and the Trust that had undertaken the initial investigation.
With the initial on-boarding completed and the test server set-up, the group took a step back to ensure they worked together as a collaborative going forward. Collaborative work between the KLSs was facilitated by the formal creation of two groups, a Managers Group for overall approval and financial decision making and an Operational Group handling the setup and administration of the repository for the consortium. The Operational Group is led by the service with most experience of managing repositories and the lead of it acts as liaison between the two groups, with each group having representation from the eight organisations. Learning from other regional collaborations the Future NHS site was used as a collaborative workspace and Teams as the main means of communication.
The setup of the repository was completed on time after three months. There was initially a steep learning curve for all, especially the Operational Group who undertook this process. The group identified key metadata and metadata standards for the repository, including the use of ORCIDs and the use of Wessex Classification as a controlled vocabulary. The setup process was facilitated by the collaborative nature of the project as the variety of experience in the group was a great benefit. It should be noted support from the suppliers was specifically related to technical support only.
The collaborative nature of the project also allowed work to be shared, and tasks were given to members to be undertaken independently. However, a downside of collaborative projects is that decisions can take longer to be inclusive...
Poster presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Poster: Updating the Wessex Classification Scheme for UK health libraries : a...CILIP MDG
The Wessex Classification Scheme was created by healthcare librarians in the South West of England, and was loosely based on the US National Library of Medicine classification. The scheme is widely used in healthcare libraries across the UK, both inside and outside the NHS. Although the scheme has gone through several revisions, there has been no major update since 2015, so the Wessex Classification Scheme Oversight Group was formed in September 2022 with the support of NHS England. The group aims to bring knowledge and skills from UK health library networks to improve the scheme and offers a chance for participants to develop skills in working with classification and subject indexing, and the opportunity to network widely. By forming a working group, it ensures the longevity of the scheme and shares the maintenance work more widely.
Initially, members were asked which parts of the scheme they felt needed updating the most and sub-groups were formed for LGBTQ+ issues and gender identity (the Pride sub-group), Ethnicity and Race, and Learning Disability and Neurodiversity (the LDN sub-group) as well as a smaller team working on ‘quick and simple’ updates....
Poster presented at the CILIP Metadata and Discovery Group (MDG) Conference & UKCoR RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham).
Revamping in-house cataloguing training / Victoria Parkinson (King's College ...CILIP MDG
With hybrid working and a new LMS, we are revamping our in-house cataloguing training. We are learning from our teaching librarians and using the tools we have, such as Moodle, to create cataloguing training that allows anyone with an interest to learn the basics and making the best use of face-to-face time for putting those skills into practice. Over the past eight years we’ve adapted and updated our in-house training, and I’ll also talk about how we decide what to teach colleagues, and how we try to make the best use of staff time to keep skills up when cataloguing is one of many competing priorities and shared across several teams. Between staff turnover and COVID lockdowns and service changes, we are starting almost from scratch in building a pool of staff who can catalogue the material our suppliers can’t provide records for, which is an excellent time to take stock of what our cataloguing needs are, and advocate for the importance of creating and upgrading good quality records and why we need to build these skills in-house.
Lightning Talk presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
UK NACO funnel : progress, obstacles, and solutions / Martin Kelleher (Univer...CILIP MDG
This Lightning Talk will provide a quick update on latest progress with the now established UK NACO Funnel, which allows participating institutions to contribute to Library of Congress / PCC authority control. The presentation will include a summary of the purpose of the funnel, details of latest expansion, problems and solutions with data submission software, and further plans and collaborations.
Lightning Talk presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Ship[w]right[e]s? : the challenges of cataloguing reports from scientific exp...CILIP MDG
This document discusses the challenges of cataloguing reports from scientific expeditions, using the Challenger Reports as an example. It notes that there were 83 Challenger Reports published by 47 authors, held across 4 sites in 50 different sectional libraries, with 220 bib records created for this work. It also mentions the opportunity for the Natural History Museum to think about metadata across the entire museum collection as part of an effort to move specimens to a new location.
BFI Reuben Library : an RDA implementation story / Anastasia Kerameos (BFI Re...CILIP MDG
“From 1st January 2024, Adlib will no longer be supported or maintained by Axiell.” This statement acted as the catalyst for action, enabling the release of resources to implement significant changes to the BFI Reuben Library’s record structure, which in turn prompted a deeper look into our current cataloguing practices and future requirements.
Upgrading to Axiell Collections will allow the library to implement new RDA more fully – we had previously adopted some aspects but not all – and, importantly, it will allow us to better align our data structure with that of the organisation’s other collections, making it easier to manage and making it compatible with further planned system developments. By the time of the conference in September we will be cataloguing to an under the bonnet Work – Expression – Manifestation – Item (WEMI) record hierarchy and new cataloguing guidelines.
Having watched all the webinars available, having read every piece of documentation which seemed relevant, having spent hours reading and re-reading the contents of the RDA Toolkit we are currently working on the last stages of our application profile whilst still debating issues around putting the theory into practice, especially in the area of aggregates and diachronic works. I do not suggest I have all the answers, far from it, but by sharing the story of our journey, that of a medium sized non-academic library of specialist mostly print collections and illustrating it with practical examples I hope my presentation will be of use to others currently travelling a similar path.
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
RDA implementation at the British Library / Thurstan Young (British Library)CILIP MDG
On 23rd May 2023, the RDA Board announced that the original RDA Toolkit will be removed in May 2027. All RDA users will need to be prepared for transition to the official RDA Toolkit before then. As previously announced, a Countdown Clock will start running in May 2026, a year before the sunset date.
This paper will provide an update on the British Library’s plans for implementation of the new RDA Toolkit, following completion of the RDA Toolkit Restructure and Redesign (3R) project. It will provide an overview of the timeline and scope for implementation as well as describing the training and documentation underpinning the implementation and the support available to other institutions for their implementation.
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Community forward : developing descriptive cataloguing of rare materials (RDA...CILIP MDG
Since 2013, Resource Description and Access (RDA) has been the chief cataloguing standard used in the United States. In 2019, the RDA Steering Committee previewed a new version of the RDA Toolkit, which introduced substantial changes, such as replacing instructions with a series of options, adding new concepts such as “nomens” and “diachronic works,” and replacing the prior organisation with a broader intellectual framework. This revised Toolkit became the official RDA Toolkit in December 2020, with major cataloguing bodies planning to adopt it in the coming years. Some cataloguers have expressed concerns regarding the official RDA Toolkit, particularly around cost and training required to learn the new standard.
In response to these concerns, the RBMS RDA Editorial Group, a group of volunteers from the Association of College and Research Libraries’ Rare Books and Manuscripts Section, developed a new manual, Descriptive Cataloging of Rare Materials (RDA Edition). DCRMR is informed by core principles of community and sustainability while employing open-access publication models and infrastructure. Designed in response to community feedback, it presents instructions in cataloguing workflow order using clear language while remaining aligned to the official RDA Toolkit and RDA element sets. The manual was approved in February 2022 in its first iteration and continues to be actively developed and updated. This presentation will discuss why the editorial group created an open and free manual; the process and tools for creating the manual, including the use of GitHub to publish a cataloguing standard; and outcomes to date.
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The West Midlands Evidence Repository (WMER) : a regional collaboration proje...CILIP MDG
The West Midlands Evidence Repository (WMER) was born from a pre-pandemic recognition by managers of Knowledge and Library Services (KLSs) of 8 NHS Trusts in the West Midlands region of the need for a repository. This was to replace existing provision, or recognition of national priorities or local needs to record, collect, and share research, as well as potential for sharing patient information leaflets or guidelines. Some managers and services had previous experience of repositories, as well as being part of a national pilot. WMER, however, represented a new start for all to work in collaboration to establish a new service. The consortium would enable sharing of both costs and experience.
Initially, different repository suppliers were investigated by the KLS that had had a long-established repository, taking on board the experience of the group from the national pilot. The Atmire Open Repository platform was chosen as it met the consortium’s needs and had a proven track record of other collaborative repositories in the NHS. Financing was taken on by one Trust and the on-boarding was led in partnership between that Trust and the Trust that had undertaken the initial investigation.
With the initial on-boarding completed and the test server set-up, the group took a step back to ensure they worked together as a collaborative going forward. Collaborative work between the KLSs was facilitated by the formal creation of two groups, a Managers Group for overall approval and financial decision making and an Operational Group handling the setup and administration of the repository for the consortium. The Operational Group is led by the service with most experience of managing repositories and the lead of it acts as liaison between the two groups, with each group having representation from the eight organisations. Learning from other regional collaborations the Future NHS site was used as a collaborative workspace and Teams as the main means of communication.
The setup of the repository was completed on time after three months. There was initially a steep learning curve for all, especially the Operational Group who undertook this process. The group identified key metadata and metadata standards for the repository, including the use of ORCIDs and the use of Wessex Classification as a controlled vocabulary. The setup process was facilitated by the collaborative nature of the project as the variety of experience in the group was a great benefit. It should be noted support from the suppliers was specifically related to technical support only.
The collaborative nature of the project also allowed work to be shared, and tasks were given to members to be undertaken independently. However, a downside of collaborative projects is that decisions can take longer to be inclusive...
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
Authority of assertion in repository contributions to the PID graph / George ...CILIP MDG
The principles surrounding Linked Open Data and their implementation within digital libraries are well understood. Such implementations may be challenging, but successes are now well documented and continue to demonstrate the benefits of disseminating and enriching existing metadata with improved semantics and relational associations. Often facilitated in machine-readability enhancements to metadata by harnessing serializations of the Resource Description Framework (RDF) and its reliance of URIs, these LOD approaches have ensured digital libraries, and similar GLAMR initiatives elsewhere, contribute to the growing knowledge graphs associated with the wider semantic web by declaring statements of fact about web entities. Within open scholarly ecosystems a growing use of persistent identifiers (PIDs) to define and link scholarly entities has emerged, e.g., DOIs, ORCIDs, etc. The requirement for greater URI persistence has been motivated by several developments within the scholarly space; suffice to state that, when combined with appropriate structured data, PIDs can support improvements to resource discovery, as well as facilitate contributions to the ‘PID graph’ – a scholarly data graph describing and declaring associative relations between scholarly entities.
While the increased adoption of PIDs has the potential to transform scholarship, ensuring that these PIDs are used appropriately, encoded correctly within metadata, and that all relevant relational associations between scholarly entities are declared presents challenges. This is especially true within open scholarly repositories, from where many contributions to the PID graph will be made but – unlike many LOD contexts – from where the authority to assert specific relations may not always exist. Such declarations need to demonstrate reliability and provenance and are central to the interlinking of heterogeneous textual objects, datasets, software, research instruments, equipment, and the related PIDs these items may generate, such as for people, organizations, or other abstract entities.
This paper will explore the issues that arise when levels of authority to assert are lacking or are uncertain, and review results from a related study exploring the ‘PID literacy’ of scholars...
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
ISNI : a persistent identifier for creatives and associated organizations / T...CILIP MDG
Recent interest in persistent identifiers (PIDs) within the research and library sectors is bringing discussions about the adoption of standards – such as ISNI and DOI, and other ID schemes such as ORCID, Ringgold, ROR, CrossRef, etc. – to the fore, casting a spotlight on the PIDs already in use and their relationships to one another. In its capacity as a bridging identifier and a critical component in Linked Data applications, clearly ISNI has a major part to play in these discussions.
With a view to explaining the benefits of ISNIs for researchers, academic and scholarly publishers, institutions, funders, and other stakeholders – including information about ISNI’s centrally-managed database and the curation functions carried out by its direct data contributors – ISNI-IA continues to promulgate the ISNI standard within the research and library sectors, showing not only the importance of using ISNIs, but also the strength and quality of data that can be achieved when research identifiers operate collaboratively.
This presentation will be an opportunity for those in the library and research community to learn about progress with the ISNI standard to date, within the library, research, and publishing sectors and beyond!
Attendees will learn about:
• What the ISNI standard is.
• How the ISNI standard interacts with other identifiers.
• The benefits of ISNIs for the research sector.
• The level of adoption across the sectors that ISNI represents (including the library, music, publishing, research, and entertainment sectors).
• Upcoming ISNI projects (including the National Library of Finland’s ongoing project in collaboration with 5 prominent Copyright Management Organizations).
Paper presented at the Metadata & Discovery Group Conference & RDA Day (6th - 8th Sept 2023 at IET Austin Court, Birmingham)
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
Build applications with generative AI on Google CloudMárton Kodok
We will explore Vertex AI - Model Garden powered experiences, we are going to learn more about the integration of these generative AI APIs. We are going to see in action what the Gemini family of generative models are for developers to build and deploy AI-driven applications. Vertex AI includes a suite of foundation models, these are referred to as the PaLM and Gemini family of generative ai models, and they come in different versions. We are going to cover how to use via API to: - execute prompts in text and chat - cover multimodal use cases with image prompts. - finetune and distill to improve knowledge domains - run function calls with foundation models to optimize them for specific tasks. At the end of the session, developers will understand how to innovate with generative AI and develop apps using the generative ai industry trends.
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"sameer shah
Embark on a captivating financial journey with 'Financial Odyssey,' our hackathon project. Delve deep into the past performance of two companies as we employ an array of financial statement analysis techniques. From ratio analysis to trend analysis, uncover insights crucial for informed decision-making in the dynamic world of finance."
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
End-to-end pipeline agility - Berlin Buzzwords 2024Lars Albertsson
We describe how we achieve high change agility in data engineering by eliminating the fear of breaking downstream data pipelines through end-to-end pipeline testing, and by using schema metaprogramming to safely eliminate boilerplate involved in changes that affect whole pipelines.
A quick poll on agility in changing pipelines from end to end indicated a huge span in capabilities. For the question "How long time does it take for all downstream pipelines to be adapted to an upstream change," the median response was 6 months, but some respondents could do it in less than a day. When quantitative data engineering differences between the best and worst are measured, the span is often 100x-1000x, sometimes even more.
A long time ago, we suffered at Spotify from fear of changing pipelines due to not knowing what the impact might be downstream. We made plans for a technical solution to test pipelines end-to-end to mitigate that fear, but the effort failed for cultural reasons. We eventually solved this challenge, but in a different context. In this presentation we will describe how we test full pipelines effectively by manipulating workflow orchestration, which enables us to make changes in pipelines without fear of breaking downstream.
Making schema changes that affect many jobs also involves a lot of toil and boilerplate. Using schema-on-read mitigates some of it, but has drawbacks since it makes it more difficult to detect errors early. We will describe how we have rejected this tradeoff by applying schema metaprogramming, eliminating boilerplate but keeping the protection of static typing, thereby further improving agility to quickly modify data pipelines without fear.
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
8. Strategies
• Global correction and downgrade for miscoded records.
• System fix for newly-added records:
– Code as new
– Remove 035, 015, 016, etc.
– Assign new 040 field with $b = eng
– Warning field.
• Documentation
• Automated warnings for records with non-English 040 $b.
• Frequent reminders.
• Individual followup.
9. Not quite there yet
015 __ |a 10,B19 |2 dnb
016 7_ |a 1001983610 |2 DE-101
035 __ |a (OCoLC)614493721
040 __ |c GWDNB |d HEBIS |d OCLCQ |d UkOxU
082 04 |a 320.9624 |2 22/ger
084 __ |a 320 |2 sdnb
24500 |a [Title redacted] / |c ed. by the Heinrich Böll
Foundation. [Alex de Waal ... Ed.: Robert Furlong ;
Bernd Herrmann].
260 __ |a Berlin : |b Heinrich-Böll-Stiftung, |c 2010.
300 __ |a 123 S. : |b ill., Kt. ; |c 24 cm.
10. Why does it matter?
• English-language catalogues should be
English-language.
• Different cataloguing rules, name authorities,
LCSH - may impair searching.
• Records may be rejected by OCLC.
• If records are not rejected by
OCLC, other agencies will get
unsuitable records.
11. Shock II: ‘Unresolved’ records
• Many records
– had an OCLC system number in 035; but
– had been edited to be used for a different resource.
• If our records are rejected, WorldCat users will
not know that we have the resources.
• If our records slip through and cluster with the
wrong master records, WorldCat users will
think that we have copies of resources which
we do not have.
12. ‘Same’ record – different resources
035 |a (OCoLC)19508660
1001 |a Varennes, Claude de.
24513
|a Le voyage de France, dressé pour l'instruction & commodité tant des françois que des
estrangers.
250 |a 4. ed., corr. & augm.
260 |a Roven, |b J. Caillové, |c 1647.
300 |a 10 preliminary leaves, 60, 304, [29] pages
035 |a (OCoLC)19508660
1001 |a Varennes, Claude de.
24513
|a Le voyage de France, dressé pour l'instruction & commodité des François &
Estrangers. : |b Avec une description des Chemins, pour aller & venir par tout le Monde.
Tres-necessaire aux Voyageurs. / |c Corrigé & augmenté par le Sieur Du Verdier,
Historiographe du Roy..
260
|a A Paris, : |b Chez Michel Bobin, au troisiesme Pillier de la Grand'Salle du Palais, à
l'Esperance., |c 1655..
300 |a 8 unnumbered pages, folded map, 492, 4 unnumbered pages : |b illustrations; |c 8°.
13. Innovation means ...
... caring and sharing
... with the power of machines.
Image by VectorCharacters.net
Hello. I’m Bernadette O’Reilly, and I work for the Bodleian Libraries in Oxford. My job is to keep up with developments in international cataloguing standards and provide training, documentation, advice and a bit of quality control ...
... not just for the Bodleian’s own cataloguers but for all the cataloguers in Oxford University’s Libraries Information System (OLIS).
So I actually work with [click] nearly 200 cataloguers [click] in nearly 100 libraries scattered over several miles, half of which (the libraries in the colleges and some departments) [click] are completely independent of the Bodleian.
I can’t be looking over everyone’s shoulders – I actually work a mile and a half from this pretty bit, in the industrial estate beyond the top right-hand corner - so quality control depends on reports, when we know what to look out for, and otherwise on serendipity – just happening to notice records which indicate misunderstandings or knowledge gaps.
So sometimes I get surprises.
All the big surprises in the last few years, the drop-everything-to-fight-this-fire surprises, have involved the kind of data which is used for managing records both internally and externally – the elements which control things like whether a record should be edited, or upgraded, or overwritten, or exported to COPAC or WorldCat and, particularly, how our exported records will be handled by the receiving agency. Fires of this kind have proved particularly difficult to fight – smouldering unnoticed for years, taking hold in several locations at once and with a trick of breaking out afresh when we hope they have been damped down.
I would like to share what we have learned from a couple of these surprises, both of which relate to importing and exporting records.
We know that the problems are not confined to the Bodleian. Some of our faulty records were actually copied from other databases with the problems already present – from OCLC, RLUK and even LC and BNB.
The problems indicate that our cataloguing culture has not kept up with our the changing cataloguing environment. It is very clear that our cataloguers find it much easier to appreciate the importance of good descriptive data and access points than to appreciate the importance of the kinds of codes and identifiers which are primarily intended to be read by machines for automated processes. But this data is absolutely key to our ability to meet the growing challenges of dwindling resources and the growing opportunities of linked, machine-actionable data.
I am not at all intending to criticise my colleagues (or anyone else’s). When I was first trained to catalogue just 11 years ago I did not need to know anything much beyond AACR2, MARC21, LCSH and our local software. Now our cataloguers are expected to make best use of a vast and varied network of data and processes, both local and external. It’s a big ask. It is not enough just to design and document ‘efficient’ new streamlined workflows and algorithms – they will not work efficiently until we win the hearts and minds of all our colleagues to care about all the kinds of data which the workflows and algorithms use.
You could say that these issues are about passport control [click for passport control image].
Off to Paris next week, but whoops! I forgot to renew my passport. No problem – I’ll borrow my son’s. Of course, he is [click] male, fair-haired, six feet tall and half my age, while I am .. different [click], so I’ll have to make a few adjustments.
What could possibly go wrong? [Click for jail image].
Well, I might not get to travel at all.
If the Border Agency is cutting corners that day, I might get through. But that could cause another set of problems. Records might then indicate that my son is in France, while I am still in the UK. If someone really needed to get hold of one of us, they might get the wrong person, or no one at all.
Because, although I changed the descriptive bits on the passport, it still has a serial number which is associated uniquely with my son, and so goes with his description, not mine.
Nowadays catalogue records are travellers too.
Many agencies continually import records from WorldCat, RLUK, the Library of Congress, the British Library and others, and export their own records back to WorldCat and RLUK.
And records, like human travellers, have identities and histories, which are enshrined in various serial numbers and other identifiers. These associate them with a particular resource, and by implication with key aspects of its description, e.g. title, content, date of issue, physical format.
For instance, this record from OCLC’s WorldCat has
- [click] in the 035 field, its OCLC system number; this will be preserved if the record is downloaded to another database
- [click] in the 016, a national bibliography number; other records might have a BNB number in 015 and/or a Library of Congress control number in 050
- [click] in 040, codes of the agencies which created or edited the record, giving it a history.
Nowadays records even have something like nationalities, the language-of-cataloguing-agency recorded in 040 $b[click] , in this case French. This, of course, is not the language of the resource, but the language used in the non-transcribed elements of the record, e.g. for physical description [click] and notes. WorldCat now has records from many different language communities, and, to allow users to find records in their own language, has ruled that a different language-of-cataloguing means a different record. That means that the identifying numbers in a record’s 035 fields are associated not just with the resource catalogued but also with the language in which it is catalogued. And, of course, national bibliography numbers are always associated with a particular language.
Shock the first:
This one hit us in the summer of 2012, just when we really did not want to have to think about anything except RDA implementation. A record got itself noticed because it did not use Unicode and so displayed oddly in our system [click]. Its 040 $b and $e showed that it was created by a German-language agency to German standards [click] (Regelwerk fur Alphabetische Katalogisierung Wissenschaftliche Bibliotheken), and in other respects it was quite unsuitable for our use – physical description in German, no English-language subject headings – but that was not the shock. Initially we thought that it was more or less a one-off, and all we needed to do was to make sure that everyone realised that OCLC included non-English records, and knew what to look out for and avoid.
The shock was when we were told by several managers that their cataloguers could not manage without downloading non-English records, and had been downloading them for years. (Particularly cataloguers who dealt with foreign-language resources and art resources.) In fact, there were already several thousand records in OLIS coded as non-English. We just hadn‘t noticed before. Many had been more or less fully translated and adapted to English-language standards, but were still coded as French, German, Dutch, Spanish, Italian....
... while a few had not been translated or adapted at all. And that is how CM met CD in our catalogue [click] – ‘met’ is Dutch for ‘with’, so the CD is just accompanying material. (I whited out the title to avoid embarrassing any individual cataloguers.)
Our cataloguers had not been specifically warned to look out for foreign-language records, and many of them were unaware of the significance of the code in 040 $b. It had never occurred to me or my immediate colleagues that anyone might be tempted to use a record which had funny 300 fields and used unfamiliar standards and authorities and subject headings, rather than cataloguing originally. That was a failure on my own part to engage with this new source of data, because when original cataloguing would involve lots of diacritics or nonroman scripts and transliteration, adapting even a foreign-language record starts to look good.
So we had to get ourselves sorted fast.
[click] For the thousands of existing OLIS records with non-English 040 $b, we could only do a rough global correction, because there was so much variety as to how far they had been checked and/or translated; so all those records had to be downgraded.
[click] A brilliant systems colleague designed a fix to make it possible to use downloaded foreign-language records safely in future. It recodes the records correctly as new English-language records, and adds a warning field which reminds cataloguers that they still have to check MARC, RDA, LCSH, etc. themselves. The field should not be deleted until this is done, and meanwhile it prevents export.
Problem solved?
Actually, it has taken a stupendous effort to get through to (almost) all our cataloguers the importance of using the fix and making the checks: not just documentation [click], automated warnings [click] and general reminders [click] but in many cases person-to-person [click]. Communicating, providing context, mentioning the problems which might arise, so that it stops looking like a technicality. (This was obviously more difficult for us because our cataloguers are so scattered.)
Even very professional people could find it really hard to grasp that changing the language code and clearing out all the record’s old identifiers was as important as translating the data. After four years things are much better, but still a few people assume that they can make the problems go away by making little edits to the elements which trigger warning messages rather than using the proper fix and procedures.
In this record the cataloguer deleted the 040 $a and $b, thus losing the ‘ger’ code, to make the warnings go away; but it still has the 035 field which identifies it with a German-language record in WorldCat.
(In this case the physical description [click] is still in German, but that kind of error is much less common.)
Does it really matter?
- It’s obvious that people should be careful about what they download and translate it if necessary. Users expect English-language catalogues to be in English, so the records ought to be thoroughly translated.
- And it’s not just a matter of translating. Foreign-language records often use different cataloguing rules, different versions of MARC, different name authorities and different subject systems, so all of that ought to be checked as well, to maintain effective indexing.
- But there are potentially much greater problems if records are translated and are then exported again, particularly to WorldCat, without being given both the correct language-of-cataloguing code and a new identity. If they still have OCLC’s original system numbers in 035, OCLC’s deduplication algorithm will expect to cluster them with the original master records. But the language of cataloguing code, if it has been changed, or the translated terms in the physical description, etc., will preclude matching, so the records will probably be rejected as anomalous. In consequence, people searching WorldCat will not know that we have copies of the resources.
- If some records with unchanged language codes do slip into OCLC, they will be a snare and a delusion. Perhaps a Spanish-language agency wants to harvest Spanish-language records, so selects by language of cataloguing, 040$b=spa; but they might well end up with a proportion of records which have actually been translated into English.
Rather like if I try to travel on my son’s passport [click]: either I don’t get to travel at all, or I slip through the checks and then people who want to find my son get me instead.
Shock the second
After our last system migration it took a while to get our export to OCLC going again. When we eventually exported the backlog, several thousand records were rejected. Many for the language-related reasons I have just been talking about, some because of technicalities such as nonroman script in the wrong fields; but very many simply because of other types of mismatch
Lots of the records [click] had been derived from OCLC and still had the original system number but had been edited in ways that made them seem to be for different resources.
OCLC’s matching algorithm could not determine whether the records should be assimilated to existing master records or treated as new, and so rejected them entirely.
To be fair, sometimes they really were for the same resource but had been improved out of recognition, e.g. if a brief record had been edited to antiquarian standard – and in that case the only solution is to get the OCLC record improved; but most often they really had been edited to be used for a different resource – a different edition, a different date, a microform record used for a print resource, a single-part record extended for a multipart resource…
Again, what was startling was that this was being done not just occasionally by oversight, but quite regularly by experienced and careful cataloguers. One OCLC record had been downloaded 4 times, and adapted for 4 separate editions of a work, so we had 4 different OLIS records all with the same OCLC number in the 035 field.
An example: here are the key fields of an OCLC record; and here [click] are the key fields in an OLIS record derived from it.
The OCLC number in 035 is retained in the OLIS record, but the edition statement, place, publisher, date and physical description are all completely different, showing that it is being used for a quite different resource, although in other respects it is a very good record. In fact, it was trying to travel on another record’s passport [click], and was caught.
This particular hearts –and-minds campaign has only been running for 9 months, and most people have got the point now, but still people sometimes forget.
Basically, this is all about sharing and caring, but [click] on a scale so large that it can only be managed by mechanised processes [click]. By exchanging data on a really large scale we keep our operations viable, enrich our services and support other information providers. We have to be able to select and manage data on the basis of machine-readable markers; and so we have to be able to trust each other to apply these markers accurately.
Which means that cataloguers have to care about how they talk to machines.
Have you encountered similar problems in your own libraries?
Is there a future for encoding levels or other indicators of completeness and finality?
Does anyone know of plans for record management data in Bibframe or similar?