The document discusses the Unified Digital Format Registry (UDFR), a semantic registry for digital preservation representation information. The UDFR aims to unify the functions and holdings of PRONOM and the Global Digital Format Registry (GDFR) into an open source, publicly accessible knowledge base. It will provide reliable information about file formats to help with digital preservation activities like identification, validation, risk assessment, and transformation. The UDFR uses semantic web technologies like RDF, OWL, and SKOS to model format information so it can be understood by both people and machines.
The document discusses two projects, VidiVideo and IM3I, that aimed to develop automatic metadata extraction and semantic video search engines. VidiVideo created a system that automatically annotates videos with over 1000 semantic concepts and provides desktop and web-based search interfaces. IM3I provided tools for audio-visual annotation, indexing services, and specialized search interfaces to enable new ways of interacting with multimedia archives. Both projects achieved state-of-the-art performance in object and concept recognition contests.
Host Identification and Location Decoupling a Comparison of ApproachesAntonio Marcos Alberti
The increasing proliferation of mobile devices with Internet access contributed to clarify some important limitations of TCP/IP stack regarding mobility, multihoming, traceability and security. In its original design, Internet IP addresses were overloaded to simultaneously support host identification (ID) and location (Loc). As a consequence, application functionality can be affected when IP addresses are changed to update mobile nodes location. This dual functionality causes many problems in the current Internet, especially in supporting mobility. To deal with this limitations several solutions based on the idea of ID/Loc splitting have been proposed. In this position paper we present and compare some of them, summarizing their main features and limitations. We also identify opportunities and challenges for future research in the area as well as expected impacts/relations with other Future Internet aspects.
The first presentation describing the Next Generation Localisation scenario, based on a self-configurable, scenario independent, service-oriented architecture (SOA) framework, with distributed and component-based services that are
extensible and accessible, with
Localisation Knowledge as backbone, and a clear open source IP in place. This idea was later developed into the Service-oriented architecture solution (SOLAS) at the University of Limerick, and its IP transferred exclusively to The Rosetta Foundation which initiated an open source project for SOLAS. SOLAS now powers the Translation Commons or TROMMONS on trombones.org, the language services 'dating' site for nonmarket translation and localization.
CTM Program: Terminology, Vocabularies, and BIMnovacsi
Development of controlled vocabularies by CSI, CSC, Netherlands, Norway, and others to establish an international buildingSMART Dictionary (IFD), its importance and benefits to practitioners, interoperability, and BIM development for the AEC Industry.
The document provides an overview of ICANN and its New gTLD Program. It explains that ICANN is responsible for managing domain names and IP addresses globally in a multi-stakeholder process. The New gTLD Program will introduce new generic top-level domains, including internationalized domain names, to promote innovation and competition on the internet. The application process for the new gTLDs will begin in January 2012 and involve evaluation and potential objections before delegation of any new top-level domains.
As more and more artifacts of our cultural heritage and scholarly work product become digitally created and disseminated, long-term preservation becomes increasingly challenging for archives, libraries and other memory organizations. To help these institutions fulfill their mission, software manufacturers and service providers have developed specialized digital preservation solutions for a variety of scholarly and cultural materials. These solutions utilize many of the current standards and best practices for assuring the authenticity, findability, accessibility and usability of digital content.
NISO is pleased to host a 90-minute webinar that will present an overview of several digital preservation standards, describe how preservation is distinct from general repository maintenance, and describe preservation applications in multiple environments.
The document describes the Annotation Ontology (AO) which was developed based on Annotea for annotating different types of documents like text, images, audio, and video. AO defines different types of selectors to specify the context within documents being annotated. It also defines annotation types that are subclasses of the main Annotation type like Comment, Erratum, Question, and Explanation. AO supports grouping annotations into sets and combining annotation sets for specific documents. It aims to integrate with Annotea and SIOC standards for annotations.
The document discusses building new clinical information systems that enable intra- and inter-operability. It provides background on Paolo Ciccarese, including his experience developing ontologies and semantic technologies to improve clinical decision support, workflows, and data integration across healthcare organizations. Ciccarese advocates for knowledge-based clinical systems that leverage formal medical terminologies and ontologies to optimize processes and support continuity of care.
The document discusses two projects, VidiVideo and IM3I, that aimed to develop automatic metadata extraction and semantic video search engines. VidiVideo created a system that automatically annotates videos with over 1000 semantic concepts and provides desktop and web-based search interfaces. IM3I provided tools for audio-visual annotation, indexing services, and specialized search interfaces to enable new ways of interacting with multimedia archives. Both projects achieved state-of-the-art performance in object and concept recognition contests.
Host Identification and Location Decoupling a Comparison of ApproachesAntonio Marcos Alberti
The increasing proliferation of mobile devices with Internet access contributed to clarify some important limitations of TCP/IP stack regarding mobility, multihoming, traceability and security. In its original design, Internet IP addresses were overloaded to simultaneously support host identification (ID) and location (Loc). As a consequence, application functionality can be affected when IP addresses are changed to update mobile nodes location. This dual functionality causes many problems in the current Internet, especially in supporting mobility. To deal with this limitations several solutions based on the idea of ID/Loc splitting have been proposed. In this position paper we present and compare some of them, summarizing their main features and limitations. We also identify opportunities and challenges for future research in the area as well as expected impacts/relations with other Future Internet aspects.
The first presentation describing the Next Generation Localisation scenario, based on a self-configurable, scenario independent, service-oriented architecture (SOA) framework, with distributed and component-based services that are
extensible and accessible, with
Localisation Knowledge as backbone, and a clear open source IP in place. This idea was later developed into the Service-oriented architecture solution (SOLAS) at the University of Limerick, and its IP transferred exclusively to The Rosetta Foundation which initiated an open source project for SOLAS. SOLAS now powers the Translation Commons or TROMMONS on trombones.org, the language services 'dating' site for nonmarket translation and localization.
CTM Program: Terminology, Vocabularies, and BIMnovacsi
Development of controlled vocabularies by CSI, CSC, Netherlands, Norway, and others to establish an international buildingSMART Dictionary (IFD), its importance and benefits to practitioners, interoperability, and BIM development for the AEC Industry.
The document provides an overview of ICANN and its New gTLD Program. It explains that ICANN is responsible for managing domain names and IP addresses globally in a multi-stakeholder process. The New gTLD Program will introduce new generic top-level domains, including internationalized domain names, to promote innovation and competition on the internet. The application process for the new gTLDs will begin in January 2012 and involve evaluation and potential objections before delegation of any new top-level domains.
As more and more artifacts of our cultural heritage and scholarly work product become digitally created and disseminated, long-term preservation becomes increasingly challenging for archives, libraries and other memory organizations. To help these institutions fulfill their mission, software manufacturers and service providers have developed specialized digital preservation solutions for a variety of scholarly and cultural materials. These solutions utilize many of the current standards and best practices for assuring the authenticity, findability, accessibility and usability of digital content.
NISO is pleased to host a 90-minute webinar that will present an overview of several digital preservation standards, describe how preservation is distinct from general repository maintenance, and describe preservation applications in multiple environments.
The document describes the Annotation Ontology (AO) which was developed based on Annotea for annotating different types of documents like text, images, audio, and video. AO defines different types of selectors to specify the context within documents being annotated. It also defines annotation types that are subclasses of the main Annotation type like Comment, Erratum, Question, and Explanation. AO supports grouping annotations into sets and combining annotation sets for specific documents. It aims to integrate with Annotea and SIOC standards for annotations.
The document discusses building new clinical information systems that enable intra- and inter-operability. It provides background on Paolo Ciccarese, including his experience developing ontologies and semantic technologies to improve clinical decision support, workflows, and data integration across healthcare organizations. Ciccarese advocates for knowledge-based clinical systems that leverage formal medical terminologies and ontologies to optimize processes and support continuity of care.
Ontology-based concept mapping in Plant SciencesKaty Jordan
The document discusses a research project that aimed to identify threshold concepts in plant sciences through interviews with tutors. Six threshold concepts were identified: water potential, electrical potential, gene for gene hypothesis, circadian clock, species area curves, and photoprotection. The researchers developed concept maps and ontologies of the key concepts, such as photosynthesis and photoprotection, to enhance student learning outcomes and engagement.
The document discusses ontology driven annotation. It describes what semantic web and annotation are, including annotating documents with canonical references to entities from an ontology with metadata. It provides examples of existing Wikipedia annotators and how they work. It also discusses using an academic and technical ontology to annotate documents with information like project titles, course prerequisites, professor names, etc. Sample AQL scripts are given to extract such information.
This document outlines Kiel Hume's personal brand manifesto and strategic communication plan. It establishes his brand statement of being an energetic PR professional focused on today's evolving communications landscape. His objectives are to leverage his status as an aspiring PR professional to find volunteer opportunities and internships in 2010, find employment with an agency to gain experience from 2010-2011, and build his skills to become a valued asset from 2011-2013. His strategies include establishing social media platforms to explore trends, build integrity by staying up-to-date, and learn from innovative campaigns. His tactics are to integrate online networks for maximum visibility, deliver thoughtful content, engage inspiring contacts, and follow-up on new connections.
Kiel Hume presents his personal brand manifesto as a PR professional focused on strategic communications across evolving digital landscapes. He aims to maximize learning through internships in 2010 before finding agency employment in 2011-2013. His strategy is to establish social media platforms delivering unique industry insights and trends to build integrity and value. Tactics include integrating online networks for engagement and delivering thoughtful content to generate interest from mentors and new contacts. He acknowledges strengths in writing and customer service but little work experience outside academia.
The Gene Ontology (GO) provides a controlled vocabulary for describing gene and gene product attributes across species. It consists of three ontologies covering biological processes, molecular functions, and cellular components. GO terms are organized into a directed acyclic graph structure and can have relationships like "is_a" and "part_of". Genes are annotated with GO terms to capture functional information, which is shared across species to facilitate research. While useful, the GO has some limitations like unclear reasoning principles and lack of validation procedures.
The document discusses the anatomy of a bird house, including recommended materials like cypress, red cedar, and redwood wood while avoiding fiberglass and pressure-treated wood. Acceptable exterior decor includes rough-hewn finishes or lead-free paint in white, tan, gray or dull green, while interiors should remain untreated. Hardware should be galvanized nails and hinges, and the company ensures quality control through feedback at every step of design, manufacturing and from customers.
HFCommunity: A Tool to Analyse the Hugging Face Hub CommunityAdem Ait
The document describes HFCommunity, a tool for analyzing the Hugging Face community hub. It extracts data from the Hugging Face platform and stores it in a database to enable metrics and visualizations about repositories, files, versions, community discussions, and dependencies. Some example metrics include the number of files in a typical repository or number of repositories with discussions. The tool is intended to provide insights into the Hugging Face community and support further natural language processing analysis.
A strategic view of document and digital object managementDerek Keats
A strategic view of document and digital object management for the University of the Witwatersrand, Johannesburg. This was a presentation given to senior managers who have an interest in enterprise digital asset management at Wits.
The Role of OAIS Representation Information in the Digital Curation of Crysta...ManjulaPatel
A presentation given by Manjula Patel (UKOLN, University of Bath) and Simon Coles (EPSRC NCS, University of Southampton) at the 5th IEEE International Conference on e-Science
9-11th December 2009, Oxford, UK
The document discusses using speech technology to help disclose and provide access to audiovisual archives by automatically generating time-stamped content descriptions. It faces challenges from a large backlog of undisclosed analog material with minimal descriptions. The approach involves digitizing content, adding metadata, and using speech recognition to generate content descriptions when transcripts are not available. This would allow online retrieval of archive fragments and reduce human effort for annotation. The project tests this approach on the Radio Rijnmond archives of over 60,000 hours of broadcasts.
This document discusses Archivematica, an open-source digital preservation system. It summarizes that Archivematica [1] allows users to process digital objects from ingest to access according to ISO standards, [2] creates preservation packages (AIPs) using microservices, and [3] relies on format policies to guide normalization and preservation of file formats.
Tackling File Characterization and Analysis in ArchivematicaCourtney Mumma
Courtney C. Mumma's presentation discusses Archivematica, an open-source digital preservation system. Archivematica uses a two-pronged approach to preservation planning: normalization on ingest and preserving the original file. Normalization relies on format policies that specify actions, tools, and settings for files of different formats. The Format Policy Registry allows users to select and customize normalization tools, formats, and properties for different file types.
Digital Presentation Best Practices: Lessons Learned From Across the PondULB - Bibliothèques
Digital Presentation Best Practices: Lessons Learned From Across the Pond. Slavko Manojlovich (Associate University Librarian (IT) / Manager, Digital Archives Initiative Memorial University St Johns Canada) and Benoit Pauwels (Head, Library Automation Team, Université libre de Bruxelles Belgium)
Digital Preservation Best Practices: Lessons Learned From Across the PondBenoit Pauwels
Digital Preservation Best Practices: Lessons Learned From Across the Pond. Slavko Manojlovich (Associate University Librarian (IT) / Manager, Digital Archives Initiative Memorial University St Johns Canada) and Benoit Pauwels (Head, Library Automation Team, Université libre de Bruxelles Belgium)
This document provides an overview of digital libraries, including definitions, benefits, limitations, components, standards, and challenges. It defines a digital library as a collection of information stored and accessed electronically, extending the functions of a traditional library digitally. Benefits include improved access and searchability, easier information sharing and preservation. Emerging technologies discussed include metadata standards, XML, and protocols like OAI-PMH for metadata harvesting. Common digital library software includes DSpace, Greenstone, and EPrints. Challenges involve digitization, description, legal issues, presentation of heterogeneous resources, and economic sustainability.
This document provides an overview of digital libraries, including definitions, benefits, limitations, components, standards, and challenges. It defines a digital library as a collection of information stored and accessed electronically, extending the functions of a traditional library digitally. Benefits include improved access, information sharing, and preservation, while limitations include technological obsolescence and rights management. Key components discussed include digital objects, metadata, and tools like DSpace and Greenstone for developing digital libraries. Emerging standards around identifiers, encoding, and metadata are also summarized.
This tutorial, given at CALIBER 2009 at Pondicherry University, aims at portraying the unlimited potential of a select set of Web based applications for libraries and information centers in a real world perspective by trying out open source solutions. It attempts to unleash and demystify the plethora of features and functionalities of some of the popular library management, digital library as well as OA Archive/Harvester applications such as KOHA, Greenstone, DSpace, OAI Harvester, Drupal etc..
This document provides an overview of the EVIA Digital Archive project which aims to digitize, annotate, and provide access to 150 hours of video from 15 ethnomusicologists. It describes the development timeline, technical tools and standards used, and interfaces for searching, browsing, and playing videos. Key phases included planning from 2001-2002, development from 2003-2005, and a 2004 summer institute where contributors segmented and annotated 10 hours of newly digitized video.
This document discusses research on detecting deception in real-time audio and video streams. It outlines challenges in synchronizing, capturing, indexing and analyzing multiple streams. It proposes using MPEG-7 semantic annotations to generate knowledge bases for analysis. The research tests infrastructure for capturing, storing and retrieving segmented streams in SQL Server 2008. It also demonstrates prototype avatar animation controlled by Python scripts. Further studies are needed on the visual concept models and detection analysis engine.
This document provides an introduction to digital preservation. It discusses challenges such as hardware and software obsolescence, storage media decay, and loss of information over time. Standards like the UNESCO Charter on Digital Preservation are mentioned, which emphasize the importance of preserving digital heritage. The heterogeneity of digital materials, formats, and metadata are issues that must be addressed. Approaches to preservation like migration, emulation, and normalization are outlined. The importance of preservation policies, metadata, tools, legal issues, and trusted repositories are also summarized.
ISA online service to make it easier for Public Administrations to find ICT s...Joao Frade
The document discusses how the European Commission's Interoperability Solutions for European Public Administrations (ISA) program is working to make it easier for public administrations to find and reuse ICT specifications by developing the Asset Description Metadata Schema (ADMS) and an ADMS-enabled federation of semantic asset repositories. The ADMS provides a common way to describe specifications using metadata. The federation will provide a single point of access and promote finding specifications from different publishers through adding valuable metadata described using ADMS.
This document discusses the application of RFID technology in libraries. It begins with an introduction to RFID and how it can automate library processes. It then discusses the benefits of RFID for libraries, staff, and patrons, including faster circulation, easier inventory management, and improved patron services. The document also covers RFID standards relevant to libraries, such as ISO 18000-3 and NCIP. It provides recommendations on RFID implementation, including a phased approach and considerations for vendor selection. Overall, the document aims to provide librarians with information on utilizing RFID technology in their libraries.
Ontology-based concept mapping in Plant SciencesKaty Jordan
The document discusses a research project that aimed to identify threshold concepts in plant sciences through interviews with tutors. Six threshold concepts were identified: water potential, electrical potential, gene for gene hypothesis, circadian clock, species area curves, and photoprotection. The researchers developed concept maps and ontologies of the key concepts, such as photosynthesis and photoprotection, to enhance student learning outcomes and engagement.
The document discusses ontology driven annotation. It describes what semantic web and annotation are, including annotating documents with canonical references to entities from an ontology with metadata. It provides examples of existing Wikipedia annotators and how they work. It also discusses using an academic and technical ontology to annotate documents with information like project titles, course prerequisites, professor names, etc. Sample AQL scripts are given to extract such information.
This document outlines Kiel Hume's personal brand manifesto and strategic communication plan. It establishes his brand statement of being an energetic PR professional focused on today's evolving communications landscape. His objectives are to leverage his status as an aspiring PR professional to find volunteer opportunities and internships in 2010, find employment with an agency to gain experience from 2010-2011, and build his skills to become a valued asset from 2011-2013. His strategies include establishing social media platforms to explore trends, build integrity by staying up-to-date, and learn from innovative campaigns. His tactics are to integrate online networks for maximum visibility, deliver thoughtful content, engage inspiring contacts, and follow-up on new connections.
Kiel Hume presents his personal brand manifesto as a PR professional focused on strategic communications across evolving digital landscapes. He aims to maximize learning through internships in 2010 before finding agency employment in 2011-2013. His strategy is to establish social media platforms delivering unique industry insights and trends to build integrity and value. Tactics include integrating online networks for engagement and delivering thoughtful content to generate interest from mentors and new contacts. He acknowledges strengths in writing and customer service but little work experience outside academia.
The Gene Ontology (GO) provides a controlled vocabulary for describing gene and gene product attributes across species. It consists of three ontologies covering biological processes, molecular functions, and cellular components. GO terms are organized into a directed acyclic graph structure and can have relationships like "is_a" and "part_of". Genes are annotated with GO terms to capture functional information, which is shared across species to facilitate research. While useful, the GO has some limitations like unclear reasoning principles and lack of validation procedures.
The document discusses the anatomy of a bird house, including recommended materials like cypress, red cedar, and redwood wood while avoiding fiberglass and pressure-treated wood. Acceptable exterior decor includes rough-hewn finishes or lead-free paint in white, tan, gray or dull green, while interiors should remain untreated. Hardware should be galvanized nails and hinges, and the company ensures quality control through feedback at every step of design, manufacturing and from customers.
HFCommunity: A Tool to Analyse the Hugging Face Hub CommunityAdem Ait
The document describes HFCommunity, a tool for analyzing the Hugging Face community hub. It extracts data from the Hugging Face platform and stores it in a database to enable metrics and visualizations about repositories, files, versions, community discussions, and dependencies. Some example metrics include the number of files in a typical repository or number of repositories with discussions. The tool is intended to provide insights into the Hugging Face community and support further natural language processing analysis.
A strategic view of document and digital object managementDerek Keats
A strategic view of document and digital object management for the University of the Witwatersrand, Johannesburg. This was a presentation given to senior managers who have an interest in enterprise digital asset management at Wits.
The Role of OAIS Representation Information in the Digital Curation of Crysta...ManjulaPatel
A presentation given by Manjula Patel (UKOLN, University of Bath) and Simon Coles (EPSRC NCS, University of Southampton) at the 5th IEEE International Conference on e-Science
9-11th December 2009, Oxford, UK
The document discusses using speech technology to help disclose and provide access to audiovisual archives by automatically generating time-stamped content descriptions. It faces challenges from a large backlog of undisclosed analog material with minimal descriptions. The approach involves digitizing content, adding metadata, and using speech recognition to generate content descriptions when transcripts are not available. This would allow online retrieval of archive fragments and reduce human effort for annotation. The project tests this approach on the Radio Rijnmond archives of over 60,000 hours of broadcasts.
This document discusses Archivematica, an open-source digital preservation system. It summarizes that Archivematica [1] allows users to process digital objects from ingest to access according to ISO standards, [2] creates preservation packages (AIPs) using microservices, and [3] relies on format policies to guide normalization and preservation of file formats.
Tackling File Characterization and Analysis in ArchivematicaCourtney Mumma
Courtney C. Mumma's presentation discusses Archivematica, an open-source digital preservation system. Archivematica uses a two-pronged approach to preservation planning: normalization on ingest and preserving the original file. Normalization relies on format policies that specify actions, tools, and settings for files of different formats. The Format Policy Registry allows users to select and customize normalization tools, formats, and properties for different file types.
Digital Presentation Best Practices: Lessons Learned From Across the PondULB - Bibliothèques
Digital Presentation Best Practices: Lessons Learned From Across the Pond. Slavko Manojlovich (Associate University Librarian (IT) / Manager, Digital Archives Initiative Memorial University St Johns Canada) and Benoit Pauwels (Head, Library Automation Team, Université libre de Bruxelles Belgium)
Digital Preservation Best Practices: Lessons Learned From Across the PondBenoit Pauwels
Digital Preservation Best Practices: Lessons Learned From Across the Pond. Slavko Manojlovich (Associate University Librarian (IT) / Manager, Digital Archives Initiative Memorial University St Johns Canada) and Benoit Pauwels (Head, Library Automation Team, Université libre de Bruxelles Belgium)
This document provides an overview of digital libraries, including definitions, benefits, limitations, components, standards, and challenges. It defines a digital library as a collection of information stored and accessed electronically, extending the functions of a traditional library digitally. Benefits include improved access and searchability, easier information sharing and preservation. Emerging technologies discussed include metadata standards, XML, and protocols like OAI-PMH for metadata harvesting. Common digital library software includes DSpace, Greenstone, and EPrints. Challenges involve digitization, description, legal issues, presentation of heterogeneous resources, and economic sustainability.
This document provides an overview of digital libraries, including definitions, benefits, limitations, components, standards, and challenges. It defines a digital library as a collection of information stored and accessed electronically, extending the functions of a traditional library digitally. Benefits include improved access, information sharing, and preservation, while limitations include technological obsolescence and rights management. Key components discussed include digital objects, metadata, and tools like DSpace and Greenstone for developing digital libraries. Emerging standards around identifiers, encoding, and metadata are also summarized.
This tutorial, given at CALIBER 2009 at Pondicherry University, aims at portraying the unlimited potential of a select set of Web based applications for libraries and information centers in a real world perspective by trying out open source solutions. It attempts to unleash and demystify the plethora of features and functionalities of some of the popular library management, digital library as well as OA Archive/Harvester applications such as KOHA, Greenstone, DSpace, OAI Harvester, Drupal etc..
This document provides an overview of the EVIA Digital Archive project which aims to digitize, annotate, and provide access to 150 hours of video from 15 ethnomusicologists. It describes the development timeline, technical tools and standards used, and interfaces for searching, browsing, and playing videos. Key phases included planning from 2001-2002, development from 2003-2005, and a 2004 summer institute where contributors segmented and annotated 10 hours of newly digitized video.
This document discusses research on detecting deception in real-time audio and video streams. It outlines challenges in synchronizing, capturing, indexing and analyzing multiple streams. It proposes using MPEG-7 semantic annotations to generate knowledge bases for analysis. The research tests infrastructure for capturing, storing and retrieving segmented streams in SQL Server 2008. It also demonstrates prototype avatar animation controlled by Python scripts. Further studies are needed on the visual concept models and detection analysis engine.
This document provides an introduction to digital preservation. It discusses challenges such as hardware and software obsolescence, storage media decay, and loss of information over time. Standards like the UNESCO Charter on Digital Preservation are mentioned, which emphasize the importance of preserving digital heritage. The heterogeneity of digital materials, formats, and metadata are issues that must be addressed. Approaches to preservation like migration, emulation, and normalization are outlined. The importance of preservation policies, metadata, tools, legal issues, and trusted repositories are also summarized.
ISA online service to make it easier for Public Administrations to find ICT s...Joao Frade
The document discusses how the European Commission's Interoperability Solutions for European Public Administrations (ISA) program is working to make it easier for public administrations to find and reuse ICT specifications by developing the Asset Description Metadata Schema (ADMS) and an ADMS-enabled federation of semantic asset repositories. The ADMS provides a common way to describe specifications using metadata. The federation will provide a single point of access and promote finding specifications from different publishers through adding valuable metadata described using ADMS.
This document discusses the application of RFID technology in libraries. It begins with an introduction to RFID and how it can automate library processes. It then discusses the benefits of RFID for libraries, staff, and patrons, including faster circulation, easier inventory management, and improved patron services. The document also covers RFID standards relevant to libraries, such as ISO 18000-3 and NCIP. It provides recommendations on RFID implementation, including a phased approach and considerations for vendor selection. Overall, the document aims to provide librarians with information on utilizing RFID technology in their libraries.
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...Dataconomy Media
Francisco Webber is the CEO and Founder of Cortical.io, a company that develops Natural Language Processing solutions for Big Text Data. Francisco’s medical background in genetics combined with over two decade’s of experience in Information Technology, inspired him to create a groundbreaking technology, called Semantic Folding, which is based on the latest findings on the way the human neocortex processes information.
This document discusses preserving audiovisual content in digital files. It covers topics like digital preservation, file formats, encodings, compression methods, and challenges like format obsolescence. The document recommends strategies like using uncompressed formats for audio and saving video in its native format. It also discusses tools that can help with tasks like format identification, validation and metadata extraction to support preservation of large amounts of audiovisual content.
RUresearch: Supporting the Management and Preservation of Research Data - Ale...ASIS&T
RUresearch: Supporting the Management and Preservation of Research Data
Aletia Morgan
Presentation at Research Data Access & Preservation Summit
22 March 2012
Putting it all together for digital assetsJon Morley
This document discusses the objectives and components of a digital asset pipeline implemented by the LDS Church Library to digitize and provide access to its collections. The pipeline ingests physical collections, generates metadata using tools like EAD and Rosetta, and delivers digital content to users through systems like Primo. The pipeline allows management of large collections across multiple library systems in a standardized way. Challenges include complex configurations and difficulties with batch ingest and metadata updates. Successes include ingesting over 57,000 books and providing consistent user experiences for 700,000 digitized files.
Similar to Dlf 2011UDFR-a-semantic-registry-for-format-representation-information-v1 (20)
Managing the Digitization of Large Press ArchivesDLFCLIR
From the 2014 DLF Forum in Atlanta, GA.
Session Leaders
Bassem Elsayed, Bibliotheca Alexandrina
Ahmed Samir, Bibliotheca Alexandrina
Managing the digitization of press material is quite a challenge; not only in terms of quantity, but also in terms of text and material quality, designing the workflow system which organizes the operations, and handling the metadata. This challenge has been the focus of the Bibliotheca Alexandrina’s digitization work during the past year in the course of its partnership with the Center for Economic, Judicial, and Social Study and Documentation (CEDEJ). Having more than 800,000 pages of press articles to be digitally preserved and publicly accessed, triggered an inevitable need to design a workflow that can manage such a massive collection and handle its attributes proficiently. The deployment of this endeavor required simultaneous intervention of four main aspects; data analysis of the collection, developing a digitization workflow for the collection at hand, implementing and installing the necessary software tools for metadata entry, and finally, publishing the digital archive online for researchers and public access.
The presentation will demonstrate the workflow system which is being implemented to manage this massive press collection, which has yielded to date more than 400,000 pages. It will shed some light on the BA’s Digital Assets Factory (DAF), which is the nucleus upon which the digitization process of CEDEJ collection has been built. Additionally, the presentation will discuss the tools implemented for ingesting data into the digitization process starting form indexing until the creation of batches that are ingested into the system. The outflow will also be discussed in terms of organizing and grouping multipart press clips, in addition to the reviewing, validation and correction of the output. Light will also be shed on the challenges encountered to associate the accessible online archive with a powerful search engine supporting multidimensional search while maintaining a user-friendly navigation experience.
The document discusses migrating a current transcription project to the Scripto/Omeka platform to enhance functionality and streamline workflows. It describes setting up the platform with a LAMP server and necessary software. Challenges include tracking progress, exporting collections from another system into the new transcription system, and designing the transcription interface. Next steps include separating functionality from the Omeka core, improving progress tracking, documentation, data export, and other tasks.
This document discusses the Public Knowledge Project's (PKP) transition from open source software to community-sourced software. It outlines PKP's governance structure, financial support, development partners and recent software releases. Specifically, it describes the new Open Monograph Press software and plans for usability assessments of the Open Journal Systems and Open Monograph Press software conducted by the California Digital Library and University of British Columbia.
Kevin Livingston presents on biomedical annotation. Biomedical researchers are interested in understanding their data in the context of curated databases and literature. PubMed has grown exponentially, with over 2,600 new entries per day and over 132 million total annotations. Annotation can be used for applications that are gene-centric or document-centric. The CRAFT corpus contains over 560,000 tokens and 21,000 sentences from 67 articles annotated with over 100,000 concepts from biomedical ontologies. Compositional annotation allows capturing complex semantic relationships and provenance of annotations.
Introducing NYU to Digital Scholarship: A faculty-library partnershipDLFCLIR
The document outlines NYU's tiered service model for introducing digital scholarship to faculty through a library-faculty partnership. It describes challenges like varying skill levels and unsupported technologies. It also details workshops held on topics like turning CVs digital and text mining. The document discusses assessing needs, comparing to other university programs, and ensuring partner faculty needs are met.
Collaborative Service Models: Building Support for Digital ScholarshipDLFCLIR
This document discusses two collaborative service models at Cornell University Library: Digital Consulting & Production Services (DCAPS) and the Research Data Management Service Group (RDMSG). DCAPS provides digitization and digital publishing services, while RDMSG assists with research data management. Both aim to support digital scholarship through consultation, training, and project management. They face challenges in sustaining operations and demonstrating impact through metrics and follow-up with clients.
The document summarizes a presentation about ArchivesSpace, an open source archives management tool. It discusses ArchivesSpace's development process, technical framework, community involvement opportunities, and plans for ongoing support through the organizational home of LYRASIS. ArchivesSpace aims to provide a flexible, scalable, and standards-compliant tool for archival description and management.
This document discusses casting digital humanities projects and work as library services. It proposes that the library can offer several core services to support digital humanities, including: 1) facilitating research and reuse of digitized materials, 2) providing infrastructure and consultation, and 3) partnering on initiatives to advance the digital humanities field. Framing this work as services could help with scalability and sustainability.
Katherine Kott Slides for DLF PM Group 2011DLFCLIR
Managing relationships with vendors on development projects requires careful planning and ongoing management. Key aspects include selecting the right vendor fit, establishing clear expectations through contracts, and defining roles, responsibilities and processes up front. Cultural differences between libraries and vendors regarding priorities, tools and work practices can lead to challenges and need to be addressed. Regular communication through meetings and status updates helps facilitate collaboration and resolve issues that arise. Learning from each other can strengthen outcomes, but misaligned expectations or lack of coordination can still cause problems.
1. The document discusses using a Scope of Work document instead of a traditional project charter to initiate projects in a library setting. It outlines the benefits of using a Scope of Work, such as allowing more time for approvals and adding deliverable dates.
2. There is diversity in how organizations define the differences between a project charter and scope statement. The document analyzes how the PMBOK Guide has evolved its definitions and processes related to project scope over different editions.
3. Using a Scope of Work instead of a charter has worked well given the library's predictable projects, experienced teams, and modest deadlines. However, challenges may arise from growing project needs and choices that require more formal approvals.
This document summarizes a presentation on the Hypatia platform, which was developed to help archivists manage, preserve, and provide access to digital archival materials. Key points include:
- Hypatia is an open source software based on Hydra and Fedora that aims to be a repository solution for digital archives.
- It grew out of the Archives Information Management System (AIMS) project and leverages the Hydra framework.
- The presentation covered Hypatia's functional requirements gathering, data models, demonstration of capabilities, and plans for future development and community involvement.
Strategies for Effective Upskilling is a presentation by Chinwendu Peace in a Your Skill Boost Masterclass organisation by the Excellence Foundation for South Sudan on 08th and 09th June 2024 from 1 PM to 3 PM on each day.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
Chapter wise All Notes of First year Basic Civil Engineering.pptxDenish Jangid
Chapter wise All Notes of First year Basic Civil Engineering
Syllabus
Chapter-1
Introduction to objective, scope and outcome the subject
Chapter 2
Introduction: Scope and Specialization of Civil Engineering, Role of civil Engineer in Society, Impact of infrastructural development on economy of country.
Chapter 3
Surveying: Object Principles & Types of Surveying; Site Plans, Plans & Maps; Scales & Unit of different Measurements.
Linear Measurements: Instruments used. Linear Measurement by Tape, Ranging out Survey Lines and overcoming Obstructions; Measurements on sloping ground; Tape corrections, conventional symbols. Angular Measurements: Instruments used; Introduction to Compass Surveying, Bearings and Longitude & Latitude of a Line, Introduction to total station.
Levelling: Instrument used Object of levelling, Methods of levelling in brief, and Contour maps.
Chapter 4
Buildings: Selection of site for Buildings, Layout of Building Plan, Types of buildings, Plinth area, carpet area, floor space index, Introduction to building byelaws, concept of sun light & ventilation. Components of Buildings & their functions, Basic concept of R.C.C., Introduction to types of foundation
Chapter 5
Transportation: Introduction to Transportation Engineering; Traffic and Road Safety: Types and Characteristics of Various Modes of Transportation; Various Road Traffic Signs, Causes of Accidents and Road Safety Measures.
Chapter 6
Environmental Engineering: Environmental Pollution, Environmental Acts and Regulations, Functional Concepts of Ecology, Basics of Species, Biodiversity, Ecosystem, Hydrological Cycle; Chemical Cycles: Carbon, Nitrogen & Phosphorus; Energy Flow in Ecosystems.
Water Pollution: Water Quality standards, Introduction to Treatment & Disposal of Waste Water. Reuse and Saving of Water, Rain Water Harvesting. Solid Waste Management: Classification of Solid Waste, Collection, Transportation and Disposal of Solid. Recycling of Solid Waste: Energy Recovery, Sanitary Landfill, On-Site Sanitation. Air & Noise Pollution: Primary and Secondary air pollutants, Harmful effects of Air Pollution, Control of Air Pollution. . Noise Pollution Harmful Effects of noise pollution, control of noise pollution, Global warming & Climate Change, Ozone depletion, Greenhouse effect
Text Books:
1. Palancharmy, Basic Civil Engineering, McGraw Hill publishers.
2. Satheesh Gopi, Basic Civil Engineering, Pearson Publishers.
3. Ketki Rangwala Dalal, Essentials of Civil Engineering, Charotar Publishing House.
4. BCP, Surveying volume 1
How to Manage Your Lost Opportunities in Odoo 17 CRMCeline George
Odoo 17 CRM allows us to track why we lose sales opportunities with "Lost Reasons." This helps analyze our sales process and identify areas for improvement. Here's how to configure lost reasons in Odoo 17 CRM
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
This presentation includes basic of PCOS their pathology and treatment and also Ayurveda correlation of PCOS and Ayurvedic line of treatment mentioned in classics.
1. Unified Digital Format Registry
a semantic registry for digital preservation
Digital Library Federation Forum
Baltimore, October 31-November 2, 2011
UDFR: A Semantic Registry for Format
Representation Information
Lisa Dawn Colvin
Abhishek Salve
Stephen Abrams
UC Curation Center
California Digital Library
2. Unified Digital Format Registry
a semantic registry for digital preservation
Outline
What
Why
How
When
3. Unified Digital Format Registry
a semantic registry for digital preservation
Why formats?
“Format” is the dividing line between bits and
information
ffd8ffe000104a46 SOI
4946000102010083 APP0 JFIF 1.2
00830000ffed0fb0 APP13 IPTC
50686f746f73686f APP2 ICC
7020332e30003842 DQT
494d03e90a507269 SOF0 183x512
6e7420496e666f00 DRI
Syntax Semantics
0000007800000000 DHT
0048004800000000 SOS
02f40240ffeeffee ECS0
0306025203470528 RST0
03fc000200000048 ECS1
00480000000002d8 RST1
0228000100000064 ECS2
0000000100030... ...
4. Unified Digital Format Registry
a semantic registry for digital preservation
Why formats?
There are many necessary preservation activities that
can be usefully performed on bits qua bits
But to preserve information you most act on
formatted bits and know what those formats mean
• Preservation of syntax and semantics
5. Unified Digital Format Registry
a semantic registry for digital preservation
Unified Digital Format Registry
“A reliable, publicly accessible, and sustainable
knowledge base of file format representation
information for use by the digital preservation
community”
• “Unification” of the function and holdings of PRONOM
and GDFR
http://www.nationalarchives.gov.uk/PRONOM
http://gdfr.info/
• Open source platform / GPL
• Semantic wiki
• Funded by the Library of Congress
6. Unified Digital Format Registry
a semantic registry for digital preservation
Timeline
PRONOM – National Archives [UK], 2002
http://www.nationalarchives.gov.uk/PRONOM
“ready access to reliable technical information about the
nature of electronic records”
JHOVE – Harvard, 2003
http://hul.harvard.edu/jhove
“digital object validation and characterization”
GDFR – Harvard/OCLC, 2006
http://gdfr.info/
“a distributed and replicated registry of format information
populated and vetted by experts and enthusiasts world-
wide”
7. Unified Digital Format Registry
a semantic registry for digital preservation
Timeline
UDFR – Ad hoc stakeholder community, 2009
• Resolve PRONOM IPR issues and develop a community-
supported open source solution
• Advance beyond legacy RDBMS and XML database
technology
UDFR – CDL, January 2011
http://udfr.org/
“a semantic registry for digital preservation”
• Stakeholder meeting, April 2011
• Beta release, November 2011
• Production release, January 2012
8. Unified Digital Format Registry
a semantic registry for digital preservation
Representation information
What you need to know about something in order to
exploit that thing meaningfully [OAIS/ISO 14720]
Information that lets you answer important
preservation questions
• What format is it?
• What are its significant properties?
• Is it valid?
• Is it at risk?
• How can I render/play/read it?
• What can it be transformed into?
• And how?
9. Unified Digital Format Registry
a semantic registry for digital preservation
Why semantic?
Everyone wants to say something about everything
• The semantic web lets anyone say anything about
anything
• Understandable to both people and machines
10. Unified Digital Format Registry
a semantic registry for digital preservation
Data modeling
Abstract Controlled
Base Vocabulary …
holder
dependency
holder creator
owner Abstract product Abstract
Process IPR Agent Holding Digest
Product Signature
maintainer reference
embodies ipr specification file
digest
Abstract External
Software Hardware Media Document File
Format Signature
Internal
Signature
input / output signature
Character Compression
Assessment Grammar File Format
Encoding Algorithm
grammar
assessment
11. Unified Digital Format Registry
a semantic registry for digital preservation
Provenance
“Trust, but verify”
• Complete change history
at the assertion level,
including
– Who made the assertion, and when?
– Confidence based on personal and institutional
reputation
• Imprimatur by technically knowledgeable
reviewers
12. Unified Digital Format Registry
a semantic registry for digital preservation
Ontologies
Prefixu Namespace
udfrs http://udfr.org/onto#
udfr http://udfr.org/udfr/
dc http://purl.org/dc/elements/1.1/
dcterms http://purl.org/dc/terms/
foaf http://xmls.com/foaf/0.1/
owl http://www.w3.org/2002/07/owl#
pronom http://reference.data.gov.uk/technical-registry/
rdf http://www.w3.org/1999/02/22-rdf-syntax-ns#
rdfs http://www.w3.org/2000/01/rdf-schema#
skos http://www.w3.org/2004/02/skos/core#
xds http://www.w3.org/2001/XMLSchema#
13. Unified Digital Format Registry
a semantic registry for digital preservation
Technology stack
HTTP / SPARQL
JavaScript / CSS
Ontowiki Erfurt / RDFAuthor
http://aksw.org/Projects/Erfurt
http://ontowiki.net/
https://github.com/AKSW/RDFauthor
Zend framework Virtuoso 4store
http://www.zend.com/ http://virtuoso.openlinksw.com/
PHP RDF
http://www.php.net/ http://www.w3.org/RDF
Apache httpd
http://httpd.apache.org/
14. Unified Digital Format Registry
a semantic registry for digital preservation
Initial population
Export from PRONOM
• Working with TNA to identify appropriate subset
• Transform to cross-walk modeling differences
15. Unified Digital Format Registry
a semantic registry for digital preservation
Licensing
Code is available under GPLv3
http://www.gnu.org/copyleft/gpl.html
• Hosted on BitBucket
http://www.bitbucket.org/udfr
Data is contributed and available under CC-BY
http://creativecommons.org/licenses/by/3.0/
• Consistent with UK open government license applicable
to PRONOM data
http://www.nationalarchives.gov.uk/doc/open-government-licence
17. Unified Digital Format Registry
a semantic registry for digital preservation
Lessons learned
People with semantic experience are scarce
Too much time evaluating/prototyping potential
technology choices
More difficulty than anticipated integrating disparate
open source products
0.x software is often numbered that for a reason
Feature lists aren’t (always)
18. Unified Digital Format Registry
a semantic registry for digital preservation
Lessons learned
Availability of a worldwide selection of products is a
good thing (except when you don’t read German)
• Excellent support from AKWS/Universität Leipzig
Modeling differences
• RDF (non-)standards
VM deployment
• Disparate IT organizations supporting dev/prod instances
19. Unified Digital Format Registry
a semantic registry for digital preservation
Next steps
Long-term governance and operational support
Technical maintenance and enhancement
Replication/synchronization
Building contributor and reviewer communities
20. Unified Digital Format Registry
a semantic registry for digital preservation
For more information
UDFR UC3
http://udfr.org/ http://www.cdlib.org/uc3
http://bitbucket.org/udfr uc3@ucop.edu
Stephen Abrams Mark Reyes
PRONOM Lisa Colvin Abhishek Salve
http://www.nationalarchives.gov.uk/PRONOM Patricia Cruse Tracy Seneca
Scott Fisher Joan Starr
GDFR Erik Hetzner Carly Strasser
http://gdfr.info/ Greg Janée Marisa Strong
John Kunze Adrian Turner
OntoWiki Margaret Low
David Loy
Perry Willett
http://ontowiki.net/Projects/OntoWiki
Virtuoso
http://www.openlinksw.com/dataspace/dav/wiki/Main/VOSRDFWP
Agile Knowledge and Semantic Web (AKSW), Universität Leipzig
http://aksw.org/
Editor's Notes
Edward Burne-Jones (British, 1833-1898)The Days of Creation: the First Day, 1870-1876Watercolor and gouache, 102.2×35.5 cmFogg Art Museum, Harvard University, 1943.454Bequest of Grenville L. Winthrop
Move from necessity to sufficiencySyntax -- http://www.flickr.com/photos/afeeld/4322852401Philosophy dictionary definition – http://botox4thebrain.com
The age of the democratization of expressionShout! – Mark Wheadon, http://www.flickr.com/photos/mark_wheadon/2557902153Robots! – Jere Keys, http://www.flickr.com/photos/tyreseus/527207577
Gorbachev and Reagan -- AFP/Getty Images, http://www.britannica.com/bps/media-view/121436/1/0/0
Leaning Tower of Pisa – Stephen and Claire Farnsworth, http://www.flickr.com/photos/the_farnsworths/2623592483
WAAAAAAY too many plugs – Isaac Lee, http://www.flickr.com/photos/ikelee/12680878Checklist -- http://www.flickr.com/photos/adesigna/4090782772
Square peg in a round hole -- http://www.flickr.com/photos/21664580@N04/2095574414Tug of war -- http://www.flickr.com/photos/toffehoff/244870161 / http://www.flickr.com/photos/toffehoff/244870160
Legislature – Mike Refund, http://www.flickr.com/photos/deltamike/3358213826Wrench – Ed Platt, http://www.flickr.com/photos/philentropist/176054470Obama inauguration crowd – Brett Farmiloe, http://www.flickr.com/photos/pursuethepassion/3220803117