This document provides best practices for designing, creating, and managing digital library projects. It discusses what constitutes a good digital library and how they should be structured. Digital projects are undertaken to enhance accessibility, promote collaboration, and support research. Projects involve digitizing collections of objects and associated metadata. Standards are important to consider for file formats, descriptive metadata, and technical specifications. Pilot testing is emphasized to refine requirements. Projects can be carried out in-house or outsourced. Ongoing preservation is critical to address issues like bit rot, format obsolescence, and media decay. Common causes of IT project failure include poor planning, unclear goals and communications.
Practical steps towards digital preservation at institutional levelsChris Rusbridge
This document discusses practical steps for digital preservation at institutional levels. It addresses questions of who is responsible for digital preservation, when preservation activities should take place, and what materials should be preserved. Responsibilities could fall to national libraries, membership organizations, or commercial entities. Institutions should focus on preserving local scholarly works, learning materials, records, and experimental data in both the short and long term by planning requirements and formats. An example system called CDocS that automatically collects metadata and produces documents in XML is also outlined.
Just Digitise It by Daniel Wilksch (Coordinator Digital Projects, Public Record Office Victoria). Presented at the 2015 Community Heritage Grants (CHG) Preservation and Collection Management Training Workshops.
How to Face the Challenges of Web Archiving? The Experiences of a Small Libra...Liber2012
The document discusses two web archiving projects conducted as a collaboration between the National Library of Ireland and Internet Memory. The projects involved archiving Irish political websites during the 2011 general election and presidential election. The collaboration allowed both organizations to gain experience in web archiving despite lacking technical expertise. It provided a valuable record of online campaign information that no other organization was collecting. The projects demonstrated the benefits of partnerships for conducting rapid web archives of important historical materials.
The document discusses the creation of an LSE Digital Library. It outlines why a digital library is needed to support the institution's collections and research mission. It describes conducting an audit of collections and risks, exploring options through research of other institutions, and developing a proposal. The proposal included an implementation plan with development of staff skills, technical infrastructure, and a phased approach starting with preservation and moving to management and access over time. Roles and responsibilities were defined for different teams to work collaboratively on the digital library's development.
Flexible Design for Simple Digital Library Tools and ServicesLighton Phiri
I gave a talk today at this year (2013)'s South African Institute for Computer Scientists and Information Technologists ---SAICSIT--- conference, held in East London, South Africa. The pre-print is here [1], and the actual publisher copy is here [2]. I used the same bastardised version [3] of the Torino Beamer theme.
[1] http://goo.gl/PmHRVo
[2] http://goo.gl/ipQlgw
[3] http://blog.barisione.org/2007-09/torino-a-pretty-theme-for-latex-beamer
IAML Antwerp 2014 From historical collections to metadataKaren McAulay
ABSTRACT: From Historical Collections to Metadata: a case study in Scottish Musical Inheritance, paper by Dr Karen E McAulay, Music & Academic Services Librarian, Royal Conservatoire of Scotland.
The contemporary librarian is more than ever before a conduit for making historical material available to scholars and performers alike. The challenge in today’s world is not only to augment the crucial early manuscripts and publications with appropriate electronic versions, but to provide added value by enriching them with contextual and interpretative information.
The 3-year AHRC-funded project, Bass Culture in Scottish Musical Traditions, seeks to address this set of issues in Scottish bagpipe and fiddle music. Involving the Universities of Glasgow and Cambridge and the Royal Conservatoire of Scotland, it will provide a substantial web resource of sources and their interpretation, engaging with musicians working in a number of traditions to develop historically-informed practices. The aim is to enable musicians to have an understanding of the structures underpinning Scottish fiddle and pipe music, enriching the traditions with a deeper, more widespread appreciation of the diversity of their roots.
The metadata requirements of the two repertoires have similarities and divergences; flexibility is needed to apply suitable metadata across both. Compatibility with pre-existing approaches is also a prerequisite.
By October 2014, we will be building the web resource; this conference would provide a perfect opportunity to share an innovative collaboration between musicology, librarianship and web development.
The Long Beach Public Library has digitized parts of its photo, yearbook, and city directory collections to create an online digital archive accessible through the library catalog called Encore. The digital archive contains over 50 directories, 70 yearbooks, and 700 photos documenting the history of Long Beach. It features items like a 1928 creamery ad, photos of Babe Ruth and Eleanor Roosevelt visiting Long Beach, and images of the city council from different eras. Library staff encourage exploring the collections through the Encore search tool or on the digital archive website and providing feedback to continue expanding access to Local history resources.
DESIGN AND IMPLEMENTATION OF A DIGITAL ARCHIVE OF LEARNING OBJECTS FOR REMOTE...Khalid Md Saifuddin
Digital libraries are fast expanding into the role of independent educational entities that aspire not only to complement traditional classroom teaching, but also allow remote access to Learning Objects. Multifaceted roles of Learning Objects can be realized only if the course content and the related content management system are versatile enough to be captured into any individual’s learning needs. This paper presents the design & implementation of such a Digital Archive, which will facilitate the use/re-use, store & bookmark of Learning Objects focusing to learning needs.
Practical steps towards digital preservation at institutional levelsChris Rusbridge
This document discusses practical steps for digital preservation at institutional levels. It addresses questions of who is responsible for digital preservation, when preservation activities should take place, and what materials should be preserved. Responsibilities could fall to national libraries, membership organizations, or commercial entities. Institutions should focus on preserving local scholarly works, learning materials, records, and experimental data in both the short and long term by planning requirements and formats. An example system called CDocS that automatically collects metadata and produces documents in XML is also outlined.
Just Digitise It by Daniel Wilksch (Coordinator Digital Projects, Public Record Office Victoria). Presented at the 2015 Community Heritage Grants (CHG) Preservation and Collection Management Training Workshops.
How to Face the Challenges of Web Archiving? The Experiences of a Small Libra...Liber2012
The document discusses two web archiving projects conducted as a collaboration between the National Library of Ireland and Internet Memory. The projects involved archiving Irish political websites during the 2011 general election and presidential election. The collaboration allowed both organizations to gain experience in web archiving despite lacking technical expertise. It provided a valuable record of online campaign information that no other organization was collecting. The projects demonstrated the benefits of partnerships for conducting rapid web archives of important historical materials.
The document discusses the creation of an LSE Digital Library. It outlines why a digital library is needed to support the institution's collections and research mission. It describes conducting an audit of collections and risks, exploring options through research of other institutions, and developing a proposal. The proposal included an implementation plan with development of staff skills, technical infrastructure, and a phased approach starting with preservation and moving to management and access over time. Roles and responsibilities were defined for different teams to work collaboratively on the digital library's development.
Flexible Design for Simple Digital Library Tools and ServicesLighton Phiri
I gave a talk today at this year (2013)'s South African Institute for Computer Scientists and Information Technologists ---SAICSIT--- conference, held in East London, South Africa. The pre-print is here [1], and the actual publisher copy is here [2]. I used the same bastardised version [3] of the Torino Beamer theme.
[1] http://goo.gl/PmHRVo
[2] http://goo.gl/ipQlgw
[3] http://blog.barisione.org/2007-09/torino-a-pretty-theme-for-latex-beamer
IAML Antwerp 2014 From historical collections to metadataKaren McAulay
ABSTRACT: From Historical Collections to Metadata: a case study in Scottish Musical Inheritance, paper by Dr Karen E McAulay, Music & Academic Services Librarian, Royal Conservatoire of Scotland.
The contemporary librarian is more than ever before a conduit for making historical material available to scholars and performers alike. The challenge in today’s world is not only to augment the crucial early manuscripts and publications with appropriate electronic versions, but to provide added value by enriching them with contextual and interpretative information.
The 3-year AHRC-funded project, Bass Culture in Scottish Musical Traditions, seeks to address this set of issues in Scottish bagpipe and fiddle music. Involving the Universities of Glasgow and Cambridge and the Royal Conservatoire of Scotland, it will provide a substantial web resource of sources and their interpretation, engaging with musicians working in a number of traditions to develop historically-informed practices. The aim is to enable musicians to have an understanding of the structures underpinning Scottish fiddle and pipe music, enriching the traditions with a deeper, more widespread appreciation of the diversity of their roots.
The metadata requirements of the two repertoires have similarities and divergences; flexibility is needed to apply suitable metadata across both. Compatibility with pre-existing approaches is also a prerequisite.
By October 2014, we will be building the web resource; this conference would provide a perfect opportunity to share an innovative collaboration between musicology, librarianship and web development.
The Long Beach Public Library has digitized parts of its photo, yearbook, and city directory collections to create an online digital archive accessible through the library catalog called Encore. The digital archive contains over 50 directories, 70 yearbooks, and 700 photos documenting the history of Long Beach. It features items like a 1928 creamery ad, photos of Babe Ruth and Eleanor Roosevelt visiting Long Beach, and images of the city council from different eras. Library staff encourage exploring the collections through the Encore search tool or on the digital archive website and providing feedback to continue expanding access to Local history resources.
DESIGN AND IMPLEMENTATION OF A DIGITAL ARCHIVE OF LEARNING OBJECTS FOR REMOTE...Khalid Md Saifuddin
Digital libraries are fast expanding into the role of independent educational entities that aspire not only to complement traditional classroom teaching, but also allow remote access to Learning Objects. Multifaceted roles of Learning Objects can be realized only if the course content and the related content management system are versatile enough to be captured into any individual’s learning needs. This paper presents the design & implementation of such a Digital Archive, which will facilitate the use/re-use, store & bookmark of Learning Objects focusing to learning needs.
Digar - Digital archive of the National Library / Estonian Design Awards 2014Designawards
The National Library’s Archive was limited largely to physical objects. Therefore the younger generation of readers used to digital media had little or no access its contents. Research among schoolchildren revealed they consume a lot of digital media for infotainment purposes. By digitalising recommended literature archives, it is possible to change actual reading habits and lead young readers towards a wider educational range of information.
Noblessner / Estonian Design Awards 2014Designawards
Port Noblessner served as an example for designing a universal service model for small marinas. A model that would meet the expectations and needs of any yacht owner and sailor.
The document discusses the changing role of information professionals like librarians and archivists in the digital era. It notes that information is increasingly digital rather than physical, requiring new skills from professionals. Specifically, it recommends that professionals develop generic skills like communication and problem solving, managerial skills like planning and finance, and technical professional skills like digitization, metadata standards, and information technology. The skills allow professionals to continue effectively managing and providing access to information in new digital formats and environments.
David Riecks, Stock Artist's Alliance, presentation from VRA 28 Atlanta.
Making a Case for Photo Metadata from "Embedded Metadata: Share, Deliver, Preserve" session.
Planning and Managing Digital Library & Archive Projectsac2182
The document provides an overview of a workshop on developing and managing digital library and archive projects. It includes the workshop schedule, introductions from attendees, strategies for success, managing born-digital assets and digitized content, infrastructure requirements, and considerations for digital preservation over the long-term.
The document provides an overview of metadata and how it can be used. It discusses different types of metadata including structural, administrative, and descriptive metadata. It also covers how to create metadata by determining content types and attributes, and identifying functionality. Standards like Dublin Core, RDF/RDFa and Schema.org are examined as sources for metadata fields. The workshop teaches best practices for applying metadata to improve search, browsing and other functions.
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
A webinar given by Jenny Mitcham and Simon Wilson to Digital Preservation Coalition members on 25th November 2015. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
A Digitization Primer for Botanical and Horticultural LibrariansChris Freeland
The document provides guidance on developing a digitization project for botanical and horticultural libraries. It discusses why institutions digitize materials, including making resources broadly available while preserving originals. A survey of attendees found most had little experience with digitization projects and were interested in digitizing various material formats, including photographs, printed works, and herbarium specimens. The document outlines topics to cover in developing a project, including audience and goals, ownership and copyright, preservation, metadata, scanning and sustainability.
This document discusses the importance of developers to higher education. It summarizes several projects funded by JISC that aim to support and connect developers, including Common Repositories Interfaces Group (CRIG), Wisdom of CRIG (WOCRIG), Developer Community Supporting Innovation (DevCSI), and Developer Days (dev8D). It argues that connecting developers leads to rapid innovation, knowledge transfer, and representation of developers' needs. Challenges include sustainability, perceptions of developers' value, and ensuring diversity among participants.
Text and Non-textual Objects: Seamless access for scientists
Uwe Rosemann (German National Library of Science and Technology (TIB), Germany)
The European High Level Expert Group on Scientific data has formulated the challenges for a scientific infrastructure to be reached by 2030: “Our vision is a scientific e-infrastructure that supports seamless access, use, re-use, and trust of data. In a sense, the physical and technical infrastructure becomes invisible and the data themselves become the infrastructure – a valuable asset, on which science, technology, the economy and society can advance”.
Here, “data” is not restricted to primary data but also includes all non-textual material (graphs, spectra, videos, 3D-objects etc.).
The German National Library of Science and Technology (TIB) has developed a concept for a national competence center for non-textual materials which is now founded by the German State and by the German Federal Countries. The center has to perform the task: developing solutions and services together with the scientific community to make such data available, citable, sharable and usable, including visual search tools and enhanced content-based retrieval.
With solutions such as DataCite and modular development for extraction, indexing and visual searching of new scientific metadata, TIB will accept the challenge. And will make all data accessible to its users fast, convenient and easy to use.
The paper shows what special tools are developed by TIB in the context of scientific AV-media, 3D-objects and research data.
Moving an Archive from Tape to Disk: A Case-Study at ICPSRBryan Beecher
This document summarizes ICPSR's efforts to transition its archive from tape to disk storage between 2006-2008. It describes ICPSR's mission to collect and preserve social science data. In 2006, ICPSR digitally preserved objects on tape but lacked automation. A new plan automated processes, moved all digital content from tape to disk by 2007, and discarded unnecessary paper records. This transition reduced costs while improving access and preservation for ICPSR staff. While progress was made, further work is still needed on a proper digital preservation system and long-term storage of larger digital objects and restricted-access materials.
Slides from keynote talk at Dawson Day 2012 (slightly revised)
Contains stats on LSE Library collection trends and overview on how we made the case for LSE Digital Library and how we are progressing with implementation.
The document discusses the GAMA project, which aimed to consolidate digital media art archives across Europe into a single platform for discovery and access called Europeana. It provided statistics on the number of artists, works, languages and other content included in GAMA. The project brought together various organizations to develop requirements, specifications, interfaces and tools to aggregate and expose the content while accounting for the diverse needs and characteristics of different archives. An iterative design process was used to test and refine the system based on best practices and user feedback.
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004Rose Holley
The document summarizes a project to digitize the neglected anthropology photographic archive at the University of Auckland. Over 50,000 deteriorating negatives from 1950s-present were selected for scanning. A database was designed to preserve the images and improve access. Through a partnership with the library and anthropology department, 5,000 images were scanned over 2 years for $80,000. The digitization was successful in preserving parts of the unique collection and making it publicly searchable online.
2010 EGITF Amsterdam - Gap between GRID and HumanitiesDirk Roorda
The document discusses the potential role of grid computing technologies in supporting humanities research. It outlines several European research infrastructure projects that aim to apply such technologies, including CLARIN, DARIAH, CESSDA and SHARE. While these projects initially saw grid as enabling workflows and sustainable tools/data, they found the technology required significant customization and expertise. The document argues that humanities research involves small, linked data analyzed through both automated and human-guided methods. It proposes that "virtual use cases" outlining generic, infrastructure-level tasks could help bridge the gap between such research and complex technologies like the grid.
The document summarizes the development of Scripted, a lightweight browser-based code editor. It discusses observations that heavy IDEs are not ideal for JavaScript development and speed is essential. Two prototypes were created - Orion and Scripted. Scripted focused on speed, code awareness through static analysis, and module system comprehension. Near term goals include improved content assistance and a plugin model. Long term goals include debugging integration and support for additional languages.
Presentation on the Warsaw Conference on National Bibliographies August 2012nw13
An up date on the conference held at the National Library of Poland in August 2012 on the challenges facing national bibliographic services in the digital age. The presentation was made at the IFLA WLIC Conference as part of the IFLA Bibliography Standing Committee section of the conference.
Digital practice guidelines : the new generation presented by Scott WajonPublicLibraryServices
The document provides guidance on best practices for collecting and preserving born digital materials. It discusses analyzing file formats, generating checksums and file inventories, and copying files to archival storage while documenting the technical steps to ensure the integrity and provenance of digital collections over time. Key tools mentioned include antivirus software, file format identification tools like DROID, technical registries like PRONOM, and checksum verification for digital preservation of born digital materials according to established standards and guidelines.
Chemical Databases and Open Chemistry on the DesktopMarcus Hanwell
The modern chemist has access to large databases containing both experimental and calculated data. The power of HPC resources continues to increase, with more practitioners having routine access to powerful computational chemistry tools. This places an increasingly high burden on users to assimilate these resources into their workflow in order to effectively utilize resources. The creation of an open, extensible application framework that puts computational tools, data, and domain specific knowledge at the fingertips of chemists is increasingly important. A data-centric approach to chemistry, storing all data in a searchable database, will empower users to efficiently collaborate, innovate, and push the frontiers of research. Providing an open, user-friendly and extensible application will open up new tools to experimental chemists, while providing computational chemists the ability to address greater challenges. Additionally, by distributing experimental and computational data across the research community, incorporating cheminformatics analytics techniques, and providing visual search for chemical structures, the workflow of both groups can be significantly improved. This requires suitable data formats for data exchange, and databases with appropriate APIs for querying, and uploading data in order to effectively share. This talk will discuss recent progress made in developing a suite of open chemistry applications on the desktop. The applications can query online databases, such as the NIH structure resolver service, download and manipulate structures, and prepare input files for standalone computational chemistry codes. Another application developed to submit jobs, monitor and retrieve results from HPC resources will also be shown, and a desktop chemistry database browser. The Quixote project aims to establish standards for data exchange in computational chemistry, along with data repositories for organizations. Establishing these standards is important to promote open, reproducible chemistry, and their integration into user-friendly desktop applications will promote their integration in the standard workflow of researchers.
The document provides an overview of CONTENTdm 6 digital collection management software. It discusses the key features and functionality of CONTENTdm including its ability to store, manage and provide access to digital collections. It also describes the newly redesigned end-user interface and improved website configuration tools in CONTENTdm 6. The presentation concludes with a demonstration and discusses next steps for the software.
Digar - Digital archive of the National Library / Estonian Design Awards 2014Designawards
The National Library’s Archive was limited largely to physical objects. Therefore the younger generation of readers used to digital media had little or no access its contents. Research among schoolchildren revealed they consume a lot of digital media for infotainment purposes. By digitalising recommended literature archives, it is possible to change actual reading habits and lead young readers towards a wider educational range of information.
Noblessner / Estonian Design Awards 2014Designawards
Port Noblessner served as an example for designing a universal service model for small marinas. A model that would meet the expectations and needs of any yacht owner and sailor.
The document discusses the changing role of information professionals like librarians and archivists in the digital era. It notes that information is increasingly digital rather than physical, requiring new skills from professionals. Specifically, it recommends that professionals develop generic skills like communication and problem solving, managerial skills like planning and finance, and technical professional skills like digitization, metadata standards, and information technology. The skills allow professionals to continue effectively managing and providing access to information in new digital formats and environments.
David Riecks, Stock Artist's Alliance, presentation from VRA 28 Atlanta.
Making a Case for Photo Metadata from "Embedded Metadata: Share, Deliver, Preserve" session.
Planning and Managing Digital Library & Archive Projectsac2182
The document provides an overview of a workshop on developing and managing digital library and archive projects. It includes the workshop schedule, introductions from attendees, strategies for success, managing born-digital assets and digitized content, infrastructure requirements, and considerations for digital preservation over the long-term.
The document provides an overview of metadata and how it can be used. It discusses different types of metadata including structural, administrative, and descriptive metadata. It also covers how to create metadata by determining content types and attributes, and identifying functionality. Standards like Dublin Core, RDF/RDFa and Schema.org are examined as sources for metadata fields. The workshop teaches best practices for applying metadata to improve search, browsing and other functions.
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
A webinar given by Jenny Mitcham and Simon Wilson to Digital Preservation Coalition members on 25th November 2015. It describes work underway in the "Filling the Digital Preservation Gap" project using Archivematica to preserve research data
A Digitization Primer for Botanical and Horticultural LibrariansChris Freeland
The document provides guidance on developing a digitization project for botanical and horticultural libraries. It discusses why institutions digitize materials, including making resources broadly available while preserving originals. A survey of attendees found most had little experience with digitization projects and were interested in digitizing various material formats, including photographs, printed works, and herbarium specimens. The document outlines topics to cover in developing a project, including audience and goals, ownership and copyright, preservation, metadata, scanning and sustainability.
This document discusses the importance of developers to higher education. It summarizes several projects funded by JISC that aim to support and connect developers, including Common Repositories Interfaces Group (CRIG), Wisdom of CRIG (WOCRIG), Developer Community Supporting Innovation (DevCSI), and Developer Days (dev8D). It argues that connecting developers leads to rapid innovation, knowledge transfer, and representation of developers' needs. Challenges include sustainability, perceptions of developers' value, and ensuring diversity among participants.
Text and Non-textual Objects: Seamless access for scientists
Uwe Rosemann (German National Library of Science and Technology (TIB), Germany)
The European High Level Expert Group on Scientific data has formulated the challenges for a scientific infrastructure to be reached by 2030: “Our vision is a scientific e-infrastructure that supports seamless access, use, re-use, and trust of data. In a sense, the physical and technical infrastructure becomes invisible and the data themselves become the infrastructure – a valuable asset, on which science, technology, the economy and society can advance”.
Here, “data” is not restricted to primary data but also includes all non-textual material (graphs, spectra, videos, 3D-objects etc.).
The German National Library of Science and Technology (TIB) has developed a concept for a national competence center for non-textual materials which is now founded by the German State and by the German Federal Countries. The center has to perform the task: developing solutions and services together with the scientific community to make such data available, citable, sharable and usable, including visual search tools and enhanced content-based retrieval.
With solutions such as DataCite and modular development for extraction, indexing and visual searching of new scientific metadata, TIB will accept the challenge. And will make all data accessible to its users fast, convenient and easy to use.
The paper shows what special tools are developed by TIB in the context of scientific AV-media, 3D-objects and research data.
Moving an Archive from Tape to Disk: A Case-Study at ICPSRBryan Beecher
This document summarizes ICPSR's efforts to transition its archive from tape to disk storage between 2006-2008. It describes ICPSR's mission to collect and preserve social science data. In 2006, ICPSR digitally preserved objects on tape but lacked automation. A new plan automated processes, moved all digital content from tape to disk by 2007, and discarded unnecessary paper records. This transition reduced costs while improving access and preservation for ICPSR staff. While progress was made, further work is still needed on a proper digital preservation system and long-term storage of larger digital objects and restricted-access materials.
Slides from keynote talk at Dawson Day 2012 (slightly revised)
Contains stats on LSE Library collection trends and overview on how we made the case for LSE Digital Library and how we are progressing with implementation.
The document discusses the GAMA project, which aimed to consolidate digital media art archives across Europe into a single platform for discovery and access called Europeana. It provided statistics on the number of artists, works, languages and other content included in GAMA. The project brought together various organizations to develop requirements, specifications, interfaces and tools to aggregate and expose the content while accounting for the diverse needs and characteristics of different archives. An iterative design process was used to test and refine the system based on best practices and user feedback.
Born Again: The Digitisation of the Anthropology Photographic Archive. 2004Rose Holley
The document summarizes a project to digitize the neglected anthropology photographic archive at the University of Auckland. Over 50,000 deteriorating negatives from 1950s-present were selected for scanning. A database was designed to preserve the images and improve access. Through a partnership with the library and anthropology department, 5,000 images were scanned over 2 years for $80,000. The digitization was successful in preserving parts of the unique collection and making it publicly searchable online.
2010 EGITF Amsterdam - Gap between GRID and HumanitiesDirk Roorda
The document discusses the potential role of grid computing technologies in supporting humanities research. It outlines several European research infrastructure projects that aim to apply such technologies, including CLARIN, DARIAH, CESSDA and SHARE. While these projects initially saw grid as enabling workflows and sustainable tools/data, they found the technology required significant customization and expertise. The document argues that humanities research involves small, linked data analyzed through both automated and human-guided methods. It proposes that "virtual use cases" outlining generic, infrastructure-level tasks could help bridge the gap between such research and complex technologies like the grid.
The document summarizes the development of Scripted, a lightweight browser-based code editor. It discusses observations that heavy IDEs are not ideal for JavaScript development and speed is essential. Two prototypes were created - Orion and Scripted. Scripted focused on speed, code awareness through static analysis, and module system comprehension. Near term goals include improved content assistance and a plugin model. Long term goals include debugging integration and support for additional languages.
Presentation on the Warsaw Conference on National Bibliographies August 2012nw13
An up date on the conference held at the National Library of Poland in August 2012 on the challenges facing national bibliographic services in the digital age. The presentation was made at the IFLA WLIC Conference as part of the IFLA Bibliography Standing Committee section of the conference.
Digital practice guidelines : the new generation presented by Scott WajonPublicLibraryServices
The document provides guidance on best practices for collecting and preserving born digital materials. It discusses analyzing file formats, generating checksums and file inventories, and copying files to archival storage while documenting the technical steps to ensure the integrity and provenance of digital collections over time. Key tools mentioned include antivirus software, file format identification tools like DROID, technical registries like PRONOM, and checksum verification for digital preservation of born digital materials according to established standards and guidelines.
Chemical Databases and Open Chemistry on the DesktopMarcus Hanwell
The modern chemist has access to large databases containing both experimental and calculated data. The power of HPC resources continues to increase, with more practitioners having routine access to powerful computational chemistry tools. This places an increasingly high burden on users to assimilate these resources into their workflow in order to effectively utilize resources. The creation of an open, extensible application framework that puts computational tools, data, and domain specific knowledge at the fingertips of chemists is increasingly important. A data-centric approach to chemistry, storing all data in a searchable database, will empower users to efficiently collaborate, innovate, and push the frontiers of research. Providing an open, user-friendly and extensible application will open up new tools to experimental chemists, while providing computational chemists the ability to address greater challenges. Additionally, by distributing experimental and computational data across the research community, incorporating cheminformatics analytics techniques, and providing visual search for chemical structures, the workflow of both groups can be significantly improved. This requires suitable data formats for data exchange, and databases with appropriate APIs for querying, and uploading data in order to effectively share. This talk will discuss recent progress made in developing a suite of open chemistry applications on the desktop. The applications can query online databases, such as the NIH structure resolver service, download and manipulate structures, and prepare input files for standalone computational chemistry codes. Another application developed to submit jobs, monitor and retrieve results from HPC resources will also be shown, and a desktop chemistry database browser. The Quixote project aims to establish standards for data exchange in computational chemistry, along with data repositories for organizations. Establishing these standards is important to promote open, reproducible chemistry, and their integration into user-friendly desktop applications will promote their integration in the standard workflow of researchers.
The document provides an overview of CONTENTdm 6 digital collection management software. It discusses the key features and functionality of CONTENTdm including its ability to store, manage and provide access to digital collections. It also describes the newly redesigned end-user interface and improved website configuration tools in CONTENTdm 6. The presentation concludes with a demonstration and discusses next steps for the software.
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
An invited talk to 40+ directors of national libraries worldwide at the annual ExLibris member meeting at IFLA (Helsinki, Finland) on August 15th, 2012.
Using Omeka as a Gateway to Digital Projectslibrarianrafia
Digital Frontiers 2015 https://digital-frontiers.org/ Presentation on Omeka 9/18/2015
Presenters: Jeff Downing, Lynn Johnson, and Derek Reece (Digital Projects Librarians) and Rafia Mirza (Digital Humanities Librarian)
The document discusses long-term preservation of enhanced publications in the DRIVER infrastructure. DRIVER aims to make European research openly accessible and interoperable by aggregating outputs into a virtual open knowledge base. This will enable services for researchers. Challenges to archiving enhanced publications include distributed ownership and deposit, perceptions of preservation, and special handling needs. Recommendations include developing guidelines for repositories on metadata and digital curation best practices to support long-term preservation of these publications.
A community of developers stimulating innovation in uk higher educationDevCSI
This document provides an overview of the DevCSI project, which aims to stimulate innovation in UK higher education by supporting a community of developers. It discusses the types of developers involved, including opportunistic, engineers, and connected developers. It outlines events held by DevCSI to bring developers together, such as hack days and challenges, and how these help developers build skills and solutions. It also discusses how supporting local developers can benefit institutions by empowering users and enabling local innovation. Finally, it shares some statistics on DevCSI's events and community outreach.
Similar to Digital projects best practices [xxxiii reunión nacional de archivos 201111] (20)
Digitization of the Tuol Sleng Genocide Museum ArchivesFrederick Zarndt
This document provides a summary of a report on a project to preserve, digitize, index and host archives from the Tuol Sleng Genocide Museum in Cambodia. The project aims to spread an objective vision of history by digitizing over 400,000 pages of materials related to the Khmer Rouge regime and Tuol Sleng prison. Key aspects of the project include training museum staff, digitizing the materials to high standards, creating searchable databases and indexes, and developing a public-facing website with crowd-sourcing capabilities to engage the Cambodian people. Challenges include the fragile materials, limited local skills and equipment, and ensuring the work is done to completion within a tight timeline and budget.
2017 Born Digital Legal Deposit Policies and PracticesFrederick Zarndt
This document summarizes the key details and findings of a survey conducted in 2014 and 2017 on born digital legal deposit policies and practices. The 2014 survey was sent to 20 national libraries and received responses from 17 libraries. It found that legal deposit laws varied widely, with Nordic countries leading in digital content capture while many others made no provision for digital. Only 7 countries addressed deposit of born-digital content. To update the survey, the authors expanded their team in 2017 and broadened the survey reach. The document reviews 17 previous related surveys from 2005-2016 on topics like audiovisual preservation, e-legal deposit, web archiving, and digital news preservation. It provides context on the goals and questions of each prior survey.
In 2015, three of the authors (Zarndt, McCain, Carner) surveyed the born digital content legal deposit policies and practices in 18 different countries and presented the results of the survey at the 2015 International News Media Conference hosted by the National Library of Sweden in Stockholm, Sweden, April 2015.
As a first step, the authors reviewed previous surveys about legal deposit and digital preservation. The authors updated and streamlined the 2015 survey in order to assess progress in creating or improving national policies and in implementing practices for preserving born digital content. The current survey consists of as many as 20 questions; which questions are asked depends on the respondent’s previous answers.
More than 50 countries and states in Australia, Germany and USA, participated in the survey. The survey closed at the end of November 2017. The authors expect to repeat the survey periodically in order to assess progress in developing born digital legal policy and implementing the policy in practice.
What did you say? interculture communication [20160308 phnom penh]Frederick Zarndt
The single biggest problem in communication is the illusion it has taken place. George Bernard Shaw, Irish playwright, co-founder of London School of Economics, and Nobel Prize in Literature (1925).
Projects are about communication, communication, and communication. B. Elenbass in "Staging a project: Are you setting your project up for success?"
What one says to compatriots in face-to-face conversation is often misunderstood; imagine the possibilities for misunderstandings with someone from halfway around the world, natively speaking another language, and living in a different culture! In such circumstances how can you be sure that your collocutor has understood you in face-to-face (hard), telephone (harder), and email (hardest) conversations? Without being fully present in the conversation -- mindfully aware -- whether it's face-to-face, by Skype or phone, or through email, successful communication is difficult, even more so for intercultural communication.
The ubiquity of English facilitates basic communication, but its use as a common language frequently disguises cultural differences. Furthermore, to say that English (or any other language) can be ambiguous, is an understatement. But regardless of language, clear communication is essential for success in any collaborative undertaking whether done by a small co-located group or by a globally dispersed team.
This tutorial teaches mindful communication and describes frameworks useful in understanding cultural differences and gives real-life examples of misunderstandings due to such differences. Expect to take away practical tools to understand your own cultural biases and in-class practice mindful communication with your colleagues from other cultures as well as your own. You will also learn about frameworks for understanding other cultures based on work by Geert Hofstede, Fons Trompenaars, and others as well as on the presenter's own experiences.
Coronado public library digital newspapers workshop local partnerships [oct 2...Frederick Zarndt
Using digitized historical newspapers for genealogical research
Brian Geiger, California Digital Newspaper Collection
Frederick Zarndt, IFLA Governing Board
1. Introductory remarks: Who we are; focus on freely available collections and especially those that allow researchers to create accounts; numerous sites they can pay to access but we won’t spend much time on them
2. Only small percentage of surviving newspapers have been digitized
3. How newspapers are digitized. Focusing especially on OCR, if it’s not OCR’ed well it’s not discoverable
4. How Coronado newspapers were digitized. CDNC’s work with the public library, Coronado Public Library’s work with the publisher, the process of scanning the film and processing the images, etc.
5. Free vs. Pay. 2 kinds of digitized newspaper archives: 1) publicly funded and available for free, 2) commercial sites you pay to access. Dozens or even hundreds of public sites, from small institutional to national.
6. Google won’t always get you what you want
7. Basic search using Elephind: What elephind is. Search “Abraham Lincoln” and explain what they see. Described “facets”
8. CDNC advanced search
9. Collecting What You Find: Right-click features in the CDNC
10. Collecting What You Find: CDNC user accounts
11. Interacting with Content: CDNC
12. Interacting with Content: Tagging and commenting in CDNC
Coronado public library digital newspapers workshop [Oct 2016]Frederick Zarndt
Using digitized historical newspapers for genealogical research
Brian Geiger, California Digital Newspaper Collection
Frederick Zarndt, IFLA Governing Board
1. Introductory remarks: Who we are; focus on freely available collections and especially those that allow researchers to create accounts; numerous sites they can pay to access but we won’t spend much time on them
2. Only small percentage of surviving newspapers have been digitized
3. How newspapers are digitized. Focusing especially on OCR, if it’s not OCR’ed well it’s not discoverable
4. How Coronado newspapers were digitized. CDNC’s work with the public library, Coronado Public Library’s work with the publisher, the process of scanning the film and processing the images, etc.
5. Free vs. Pay. 2 kinds of digitized newspaper archives: 1) publicly funded and available for free, 2) commercial sites you pay to access. Dozens or even hundreds of public sites, from small institutional to national.
6. Google won’t always get you what you want
7. Basic search using Elephind: What elephind is. Search “Abraham Lincoln” and explain what they see. Described “facets”
8. CDNC advanced search
9. Collecting What You Find: Right-click features in the CDNC
10. Collecting What You Find: CDNC user accounts
11. Interacting with Content: CDNC
12. Interacting with Content: Tagging and commenting in CDNC
What did you say? mindful interculture communication [201608 icgse]Frederick Zarndt
The single biggest problem in communication is the illusion it has taken place. George Bernard Shaw, Irish playwright, co-founder of London School of Economics, and Nobel Prize in Literature (1925).
Projects are about communication, communication, and communication. B. Elenbass in "Staging a project: Are you setting your project up for success?"
What one says to compatriots in face-to-face conversation is often misunderstood; imagine the possibilities for misunderstandings with someone from halfway around the world, natively speaking another language, and living in a different culture! In such circumstances how can you be sure that your collocutor has understood you in face-to-face (hard), telephone (harder), and email (hardest) conversations? Without being fully present in the conversation -- mindfully aware -- whether it's face-to-face, by Skype or phone, or through email, successful communication is difficult, even more so for intercultural communication.
The ubiquity of English facilitates basic communication, but its use as a common language frequently disguises cultural differences. Furthermore, to say that English (or any other language) can be ambiguous, is an understatement. But regardless of language, clear communication is essential for success in any collaborative undertaking whether done by a small co-located group or by a globally dispersed team.
This tutorial teaches mindful communication and describes frameworks useful in understanding cultural differences and gives real-life examples of misunderstandings due to such differences. Expect to take away practical tools to understand your own cultural biases and in-class practice mindful communication with your colleagues from other cultures as well as your own. You will also learn about frameworks for understanding other cultures based on work by Geert Hofstede, Fons Trompenaars, and others as well as on the presenter's own experiences.
Here Today, Gone within a Month: The Fleeting Life of Digital NewsFrederick Zarndt
In 1989 on the shores of Montana’s beautiful Flathead Lake, the owners of the weekly newspaper the Bigfork Eagle started TownNews.com to help community newspapers with developing technology. TownNews.com has since evolved into an integrated digital publishing and content management system used by more than 1600 newspaper, broadcast, magazine, and web-native publications in North America. TownNews.com is now headquartered on the banks of the mighty Mississippi river in Moline Illinois.
Not long ago Marc Wilson, CEO of TownNews.com, noticed that of the 220,000+ e-edition pages posted on behalf of its customers at the beginning of the month, 210,000 were deleted by month’s end.
What? The front page story about a local business being sold to an international corporation that I read online September 1 will be gone by September 30? As well as the story about my daughter’s 1st place finish in the district field and track meet?
A 2014 national survey by the Reynolds Journalism Institute (RJI) of 70 digital-only and 406 hybrid (digital and print) newspapers conclusively showed that newspaper publishers also do not maintain archives of the content they produce. RJI found a dismal 12% of the “hybrid” newspapers reported even backing up their digital news content and fully 20% of the “digital-only” newspapers reported that they are backing up none of their content. Educopia Institute’s 2012 and 2015 surveys with newspapers and libraries concur, and further demonstrate that the longstanding partner to the newspaper—the library—likewise is neither collecting nor preserving this digital content.
This leaves us with a bitter irony, that today, one can find stories published prior to 1922 in the Library of Congress’s Chronicling America and other digitized, out-of-copyright newspaper collections but cannot, and never will be able to, read a story published online less than a month ago.
In this paper we look at how much news is published online that is never published in print or on more permanent media. We estimate how much online news is or will soon be forever lost because no one preserves it: not publishers, not libraries, not content management systems, and not the Internet Archive. We delve into some of the reasons why this content is not yet preserved, and we examine the persistent challenges of digital preservation and of digital curation of this content type. We then suggest a pathway forward, via some initial steps that journalists, producers, legislators, libraries, distributors, and readers may each take to begin to rectify this historical loss going forward.
Here Today, Gone within a Month: The Fleeting Life of Digital NewsFrederick Zarndt
This document discusses the fleeting lifespan of digital news content. It notes that TownNews.com, which hosts digital content for over 1600 publications, found that 210,000 of the 220,000 digital news pages from the beginning of a month were deleted by the end of the month. Surveys have shown that few digital news producers actively preserve their content, with only 12% of hybrid print-digital newspapers backing up content and 20% of digital-only newspapers backing no content up at all. As a result, much recent digital news content is lost to researchers and the historical record. The document examines challenges to preserving born-digital news and suggests stakeholders like journalists, legislators, libraries and readers should take initial steps to address this problem
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
That news publication has changed dramatically since the advent of the Internet and the Web is no news to anyone. There are many examples of established news organizations that have either stopped printing newspapers or shifted to publishing news on websites or through social media such as Facebook and Twitter. There are even more examples of new news organizations that have never printed news on paper and are digital only.
To the authors’ knowledge, every country has one or more legal deposit organizations tasked with preserving news for future generations. Legal deposit laws in some countries have been amended to include news that may never be instantiated on paper (born digital news). However, legal deposit laws are by no means universally amended and, even when such amendments have been made, their embodiment in practice varies widely.
As a follow-on to the paper Missing links: The digital news preservation discontinuity (http://www.ifla.org/node/8933) presented in August 2014 at IFLA News Media section satellite conference at the ITU Library in Geneva, Switzerland, the authors have surveyed cultural heritage organizations (libraries) around the world about their respective national born digital legal deposit policies and practices. We share the survey results and consider the ramifications of inadequate born digital news preservation policies and practice to future generations.
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
That news publication has changed dramatically since the advent of the Internet and the Web is no news to anyone. There are many examples of established news organizations that have either stopped printing newspapers or shifted to publishing news on websites or through social media such as Facebook and Twitter. There are even more examples of new news organizations that have never printed news on paper and are digital only.
To the authors’ knowledge, every country has one or more legal deposit organizations tasked with preserving news for future generations. Legal deposit laws in some countries have been amended to include news that may never be instantiated on paper (born digital news). However, legal deposit laws are by no means universally amended and, even when such amendments have been made, their embodiment in practice varies widely.
As a follow-on to the paper Missing links: The digital news preservation discontinuity (http://www.ifla.org/node/8933) presented in August 2014 at IFLA News Media section satellite conference at the ITU Library in Geneva, Switzerland, the authors have surveyed cultural heritage organizations (libraries) around the world about their respective national born digital legal deposit policies and practices. We share the survey results and consider the ramifications of inadequate born digital news preservation policies and practice to future generations.
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...Frederick Zarndt
In all of its many flavors, crowdsourcing works. It works for cultural heritage organizations too. During this presentation we look at various aspects of crowdsourced OCR text correction, commenting, and tagging for digitized historical newspapers at the National Library of Australia’s Trove, the California Digital Newspaper Collection (CDNC), and at the Cambridge Public Library in Cambridge Massachusetts as well as the astounding number of historical birth, death, marriage, census, and other records transcribed by “crowd” volunteers at Family Search. Some aspects include: demographics, experiences, motivation, quality, preferred data, economics and marketing. You will see that crowd sourcing is not only feasible but also practical and desirable. You will wonder why your own cultural heritage organization hasn't begun its own crowdsourcing project!
20131019 digital collections - if you build them will anyone visit [library 2...Frederick Zarndt
This document discusses digital historical newspaper collections in libraries and their visibility on the internet. It finds that while libraries spend significant resources digitizing collections, the collections receive little internet traffic and have poor search engine results. Some key points made include:
- Historical newspaper collections are among the most used collections in libraries with digital texts but receive low percentages of overall website traffic.
- Searching sample collections for information on the Gallipoli campaign yields few or no results from the library collections in the first 100 Google/Google News search results.
- Simple changes like adding XML sitemaps and adjusting robots.txt files can significantly increase search engine indexing and traffic for digital collections.
20130903 what did you say? interculture communication [hamburg]Frederick Zarndt
This document discusses intercultural communication and misunderstandings. It provides quotes and principles about the importance of effective communication to build understanding between people from different cultures and avoid assumptions. It notes that a lack of communication or poor communication can lead to more assumptions and misunderstandings.
201308 wlic standards committee zarndt et al the alto editorial board collabo...Frederick Zarndt
The document provides information about the ALTO Editorial Board, which maintains the ALTO (Analyzed Layout and Text Object) XML standard for describing page layout and content metadata. The board was established in 2009 and is comprised of members from libraries and organizations around the world. The board aims to promote ALTO usage and ensure the standard evolves to meet emerging needs. Meeting agendas, procedures, and examples of board members' motivations for participation are presented.
Session 1 - Intro to Robotic Process Automation.pdfUiPathCommunity
👉 Check out our full 'Africa Series - Automation Student Developers (EN)' page to register for the full program:
https://bit.ly/Automation_Student_Kickstart
In this session, we shall introduce you to the world of automation, the UiPath Platform, and guide you on how to install and setup UiPath Studio on your Windows PC.
📕 Detailed agenda:
What is RPA? Benefits of RPA?
RPA Applications
The UiPath End-to-End Automation Platform
UiPath Studio CE Installation and Setup
💻 Extra training through UiPath Academy:
Introduction to Automation
UiPath Business Automation Platform
Explore automation development with UiPath Studio
👉 Register here for our upcoming Session 2 on June 20: Introduction to UiPath Studio Fundamentals: https://community.uipath.com/events/details/uipath-lagos-presents-session-2-introduction-to-uipath-studio-fundamentals/
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor IvaniukFwdays
At this talk we will discuss DDoS protection tools and best practices, discuss network architectures and what AWS has to offer. Also, we will look into one of the largest DDoS attacks on Ukrainian infrastructure that happened in February 2022. We'll see, what techniques helped to keep the web resources available for Ukrainians and how AWS improved DDoS protection for all customers based on Ukraine experience
In the realm of cybersecurity, offensive security practices act as a critical shield. By simulating real-world attacks in a controlled environment, these techniques expose vulnerabilities before malicious actors can exploit them. This proactive approach allows manufacturers to identify and fix weaknesses, significantly enhancing system security.
This presentation delves into the development of a system designed to mimic Galileo's Open Service signal using software-defined radio (SDR) technology. We'll begin with a foundational overview of both Global Navigation Satellite Systems (GNSS) and the intricacies of digital signal processing.
The presentation culminates in a live demonstration. We'll showcase the manipulation of Galileo's Open Service pilot signal, simulating an attack on various software and hardware systems. This practical demonstration serves to highlight the potential consequences of unaddressed vulnerabilities, emphasizing the importance of offensive security practices in safeguarding critical infrastructure.
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving
Manufacturing custom quality metal nameplates and badges involves several standard operations. Processes include sheet prep, lithography, screening, coating, punch press and inspection. All decoration is completed in the flat sheet with adhesive and tooling operations following. The possibilities for creating unique durable nameplates are endless. How will you create your brand identity? We can help!
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
Introduction of Cybersecurity with OSS at Code Europe 2024Hiroshi SHIBATA
I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems.
The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS.
Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application.
I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.
Discover top-tier mobile app development services, offering innovative solutions for iOS and Android. Enhance your business with custom, user-friendly mobile applications.
Must Know Postgres Extension for DBA and Developer during MigrationMydbops
Mydbops Opensource Database Meetup 16
Topic: Must-Know PostgreSQL Extensions for Developers and DBAs During Migration
Speaker: Deepak Mahto, Founder of DataCloudGaze Consulting
Date & Time: 8th June | 10 AM - 1 PM IST
Venue: Bangalore International Centre, Bangalore
Abstract: Discover how PostgreSQL extensions can be your secret weapon! This talk explores how key extensions enhance database capabilities and streamline the migration process for users moving from other relational databases like Oracle.
Key Takeaways:
* Learn about crucial extensions like oracle_fdw, pgtt, and pg_audit that ease migration complexities.
* Gain valuable strategies for implementing these extensions in PostgreSQL to achieve license freedom.
* Discover how these key extensions can empower both developers and DBAs during the migration process.
* Don't miss this chance to gain practical knowledge from an industry expert and stay updated on the latest open-source database trends.
Mydbops Managed Services specializes in taking the pain out of database management while optimizing performance. Since 2015, we have been providing top-notch support and assistance for the top three open-source databases: MySQL, MongoDB, and PostgreSQL.
Our team offers a wide range of services, including assistance, support, consulting, 24/7 operations, and expertise in all relevant technologies. We help organizations improve their database's performance, scalability, efficiency, and availability.
Contact us: info@mydbops.com
Visit: https://www.mydbops.com/
Follow us on LinkedIn: https://in.linkedin.com/company/mydbops
For more details and updates, please follow up the below links.
Meetup Page : https://www.meetup.com/mydbops-databa...
Twitter: https://twitter.com/mydbopsofficial
Blogs: https://www.mydbops.com/blog/
Facebook(Meta): https://www.facebook.com/mydbops/
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
What is an RPA CoE? Session 1 – CoE VisionDianaGray10
In the first session, we will review the organization's vision and how this has an impact on the COE Structure.
Topics covered:
• The role of a steering committee
• How do the organization’s priorities determine CoE Structure?
Speaker:
Chris Bolin, Senior Intelligent Automation Architect Anika Systems
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
2. how’s and what’s of a
digital archive / library
• what is a (good) digital library ?
• how should a digital library be designed ?
• how should a digital library be created ?
• how is a digital library measured ?
• how should a digital project be executed ?
• how should a digital library or a digital project be
managed ?
2
3. why a digital project?
• to enhance accessibility of the content in libraries
and archives
• to increase collaboration and cooperation between
libraries and archives around the world
• to promote research
• to provide opportunities for entrepreneurs
3
8. digital projects overview
• collections: organized groups of digital
objects
• objects: digital materials
• metadata: information about objects and
collections
8
11. assess
• select the collection or content
• define the goals
• identify the users
• identify ownership and legal risks
• identify applicable standards
• evaluate capabilities
11
12. design: standards
• METS XML for descriptive, structural, technical,
and administrative metadata
• descriptive metadata
• Metadata Object Description Standard
(MODS) selected metadata from MARC
• Dublin Core fundamental group of text
elements for describing and cataloging
• technical metadata
• ALTO for OCR text
• PREMIS for digital preservation
• MIX for images
12
14. design: access
• user community
• user interface (UI)
• search
• authentication and user
management
• digital object presentation
• portability
• administration
14
15. implement: pilot
create requirements and acceptance criteria
repeat
{
digitize (small) pilot batch
test data against acceptance criteria
adjust requirements and acceptance criteria
}
until (no more adjustments are necessary)
digitize more data
NB: pilot batches are VERY VERY important!!
15
16. implement: in-house
reasons for in-house production
• collection cannot be moved
• collection is badly organized
• digitization must be done slowly over a long
period
• digitization is very simple
16
17. implement: outsource
reasons for outsourced production
• originals can’t be scanned in-house because…
• equipment is too expensive
• output data is beyond staff experience
• labor is too expensive
• large volume of work in a short time
• insufficient space, infrastructure, or staff
17
19. implement: crowd sourcing
• FamilySearch.org
• National Library of Australia
Newspapers Digitisation Program
• Library and Archives Canada
• Wikipedia
19
20. measure: acceptance criteria
• automatic quality checks
• is the digital object complete?
• is the digital object verifiable?
• is the digital object uncorrupted?
• manual quality checks
• does the metadata meet accuracy
specifications?
• does the text meet accuracy
specifications?
• is the image quality satisfactory?
20
21. measure: image quality
“…images which are ultimately to be viewed by human
beings, the only “correct” method of quantifying visual image
quality is through subjective evaluation. in practice,
however, subjective evaluation is usually too inconvenient,
time-consuming and expensive…”
“…best way to assess the quality of an image is to look at it
because human eyes are the ultimate viewers of most
images…”
Zhou Wang and Hamid R. Sheikh. Image Quality Assessment: From Error Visibility to Structural Similarity.
IEEE Transactions on Image Processing. April 2004
Zhou Wang, Alan Bovick, and Ligang Lu. Why is image quality assessment so difficult? IEEE Transactions
on Image Processing. April 2004
21
22. measure: use
• who is using the collection?
• what is the collection being used for?
• how many page views per day / week /
month?
• how long do visitors to the collection stay?
• how many repeat visitors to the collection?
22
23. preserve
• bit rot
• format obsolescence
• media obsolescence / decay
• migration to new media or hardware
• standards obsolescence
23
24. preserve: bit rot
gradual decay of …
• storage media because of media quality
• storage media because of improper storage
• data due to random events (bit-flip,
• software due to interface changes
• software due to non-obvious or inadvertent
configuration changes
24
25. preserve: media decay
a report by NIST and the Library of Congress says
that
• virtually all CD-Rs tested indicated an
estimated life expectancy beyond 15 years
• only 47 percent of recordable DVDs indicated
an estimated life expectancy beyond 15 years,
some had a life expectancy as short as 1.9 years
• in practice actual lifetimes may be considerably
shorter
25
26. preserve: media obsolescence
• 5 ¼” floppy disks
• 8 track tapes
• 3 ½” floppy disks
• ZIP drives
• CD-R, CD-RW, Blu-Ray
• microfilm
26
27. preserve: migration
• file format changes
• file name differences: case sensitive /
insensitive
• extended file attributes
• file permissions
• soft links / hard links
27
31. the problem
the 2009 CHAOS Report (The Standish Group)
reports that of all software projects surveyed, 44%
are “challenged”, 24% failed, and only 32%
succeeded
31
32. the problem
Roger Sessions estimates that the worldwide cost
of IT failure is USD $500 billion per month
Roger Sessions: CTO of ObjectWatch. He has written seven books including
Simple Architectures for Complex Enterprises and many articles. He is a
founding member of the Board of Directors of the International Association of
Software Architects. 32
33. the problem
in a recent survey of 1230 IT professionals
conducted by Embarcadero Technologies, 2 of the
3 biggest project challenges cited by the IT pros
are “poor planning” and “poor or no requirements”
33
34. the problem
in a March 2007 web poll conducted by the
Computing Technology Industry Association "nearly
28 percent of the more than 1,000 respondents
singled out poor communications as the number one
cause of project failure"
34
35. the problem
in a white paper written for Project Perfect by Taimour al
Neimat, he lists
• poor planning
• unclear goals and objectives
• objectives changing during the project
• unrealistic time or resource estimates
• lack of executive support and user involvement
• failure to communicate and act as a team
• inappropriate skills
as primary causes for the failure of complex IT projects
35
36. the problem
a recent tender from an (anonymous) government agency
• project to convert ~ 170,000 text images to xml
• value of project ~ USD $180,000
• 19 pages of definitions, governing law, proposal
evaluation criteria, contractual conditions, instructions
about tender response format, etc
• technical requirements description? < 1 page
• data acceptance criteria? “a high level of accuracy”
36
37. the problem
a recent program established by a prominent national
library
• digitize more than 20 million text pages
• high level image and xml requirements
• value of work awarded? > USD $5,000,000
• after award of work, technical requirements
expand to 43+ pages from ~3 pages
• acceptance criteria? added as an afterthought
and not well defined
37
38. the problem
typical tender evaluation criteria in priority order
1. understanding of requirements
2. reputation of service bureau
3. price
38
45. communication
“projects are about
communication, communication,
and communication”
Elenbass,
B.
(2000).
“Staging
a
Project:
Are
You
Se>ng
Your
Project
Up
for
Success?”.
Proceedings
of
the
Project
Management
InsItute
Annual
Seminars
&
Symposiums.
45
46. references
• METS, MODS, ALTO, PRISM, etc :
http://www.loc.gov/standards
• OAIS : http://public.ccsds.org/publications/RefModel.aspx
• NISO standards and guidelines :
http://www.niso.org/publications/rp
• good practice guides : http://www.ukoln.ac.uk
• And many, many more
46
47. preguntas?
Frederick Zarndt
frederick@frederickzarndt.com
This work is licensed under the Creative Commons
Attribution-ShareAlike (CC by SA)
License. To view a copy of this license visit
http://creativecommons.org/licenses/by-sa/3.0/
47
Editor's Notes
digital libraries are (relatively) new. best practices are still (rapidly) evolving. computing technologies, storage media, communication protocols, and standards are changing.iArchives story.
this talk will probably not give you answers but rather a bunch of questions that you should ask as you undertake a digitization project. it will also give you a list of things to do before, during, after a digitization project, but not tell you how to do them.mention communications, requirements, acceptance criteria
primarily to enhance access. access to a digital collection is not restricted to 1 user in 1 place. now it is possible for many users in many places to concurrently access the collection.may also be to preserve a deteriorating collection
digital collections are similar to analog collections – books, newspapers, magazines, photographs, records – only in digital form. digital collections differ from analog collections in that they are more flexible.A digital collection consists of digital objects that are selected and organized to facilitate their discovery, access, and use.Digital objects, metadata, and the user interface together create the user experience of a collection.
A digital object represents a discrete unit and is comprised of a digital file or files as well as descriptive metadata. Digital objects begin life in one of two ways: As a digitized file produced as a surrogate for materials that exist in analog format.As a "born digital" entity, with no analog counterpart.digital objects are either digital surrogates for analog objects or born digital objects scanned text, scanned photos born digital text, digital photos archived websites census records, land records
metadata is similar to a card catalog but more flexible. richer descriptive and administrative metadata. may contain data about the digital objects themselves.metadata is structured information associated with an object for purposes of discovery, description, use, management, and preservation.
phases implies separation / sequential. not necessarily sequential! more about this later…
digital collection users may be different from analog collection users (genealogists)digital collection users may be different from analog collection users (genealogists)copyright holders are generally not happy about digital surrogates! know Turkish / EU copyright law! collaborate with copyright holder if possible.examples: Singapore, Australia, USA
METS XML since version 1.1 ~2001. administered by LOC but developed by libraries around the world. METS editorial board. METS now at version 1.9METSsections:header, descriptive, administrative, files, structural map (heart of METSstructural links (between elements of structural map), behaviorMARC not often used with digital collections. replaced by MODS (administered by LOC) and / or Dublin Core (administered by OCLC)
TIFF since 1986. last update (version 6.0) 1992. now under control of AdobeJPEG2000 since 2000. intended to supersede JPEG.PDF, PDF/A under control of Adobe. PDF/A subset of PDF version 1.4 and an ISO standard. latest PDF version is 1.7. In 2008 Adobe granted a royalty-free rights for all patents owned by Adobe that are necessary to make, use, sell and distribute PDF compliant implementationslook for open, community developed, tried and tested standard formats
Crowdsourcing is a distributed problem-solving and production model. Problems are broadcast to an unknown group of solvers in the form of an open call for solutions. Users—also known as the crowd—typically form into online communities, and the crowd submits solutions. The crowd also sorts through the solutions, finding the best ones.FamilySearch documents are drawn primarily from a collection of 2.4 million microfilms made of historical documents from 110 countries.130,000+ volunteers from around the world. Records based data.Australia NDP 5,800,000 newspaper pages online. 50,000,000+ lines of newspaper text corrected, 2,000,000+ per month in 2011.Wikipedia founded 2001. 90,000 active contributors. Website ranks 6th in the world usage according to Alexa. Editions in 282 languages.
Recognizing that MARC is no longer fit for the purpose, work with the library and other interested communities to specify and implement a carrier for bibliographic information that is capable of representing the full range of data of interest to libraries, and of facilitating the exchange of such data both within the library community and with related communities.