1. The document discusses modeling the content of digital libraries and information systems. It proposes modeling all entities as resources related through relationships. This allows modeling of complex information objects, composites, versions, and annotations.
2. Key constructs proposed in the model include modeling all entities as resources related through relationships. This includes information objects, actors, functions, as well as concepts and ideas. The model also distinguishes between works, manifestations, and items through relationships.
3. Metadata is defined as any data used to discover, interpret, or use information objects. Metadata resides in the relationships between resources rather than in the resources themselves. This allows flexible modeling of provenance, context, and other usage data.
Tutorial at OAI5 (cern.ch/oai5). Abstract: This tutorial will provide a practical overview of current practices in modelling complex or compound digital objects. It will examine some of the key scenarios around creating complex objects and will explore a number of approaches to packaging and transport. Taking research papers, or scholarly works, as an example, the tutorial will explore the different ways in which these, and their descriptive metadata, can be treated as complex objects. Relevant application profiles and metadata formats will be introduced and compared, such as Dublin Core, in particular the DCMI Abstract Model, and MODS, alongside content packaging standards, such as METS MPEG 21 DIDL and IMS CP. Finally, we will consider some future issues and activities that are seeking to address these. The tutorial will be of interest to librarians and technical staff with an interest in metadata or complex objects, their creation, management and re-use.
The document discusses digital libraries, including their architecture and design. It defines a digital library as a collection of documents available electronically on the internet or CD-ROM. Digital libraries use technology to break down traditional rules for archives by describing archived materials individually and allowing for reproduction. The document also discusses different types of metadata, including structural and descriptive metadata, and different metadata schemes.
Semantic technologies for the Internet of Things PayamBarnaghi
The document discusses semantic technologies for the Internet of Things. It describes how sensor data in the IoT is time-dependent, continuous, and variable quality. Semantic annotations and machine-interpretable formats like XML and RDF are needed to make the data interoperable. Ontologies provide formal definitions of concepts and relationships in a domain that enable machines to process IoT data and enable autonomous device interactions. The document outlines approaches to semantically describe sensor observations and measurements using XML, RDF graphs, and adding domain concepts and logical rules with ontologies.
The lecture covers several key topics in database systems including:
1. An overview of database concepts such as data models, normalization, data integrity restrictions, query optimization and processing, and SQL.
2. Parallel processing of data and recovery methods.
3. Database design and development including object-relational mapping technologies.
4. Distributed, parallel and heterogeneous databases including definitions and examples of each.
This document provides an overview of key concepts in data science and big data, including:
- Data science involves extracting knowledge and insights from structured, semi-structured, and unstructured data.
- The data value chain describes the process of acquiring data, analyzing it, curating it for storage, and using it.
- Big data is characterized by its volume, velocity, variety, and veracity. Hadoop is an open-source framework that allows distributed processing of large datasets across computer clusters.
The document discusses the future of repositories and how they may evolve based on emerging technologies and trends. It examines how repositories could adopt aspects of Web 2.0 like social features, tagging, and improved interfaces. It also explores the potential impact of semantic web technologies through richer metadata standards like the Scholarly Works Application Profile (SWAP). The document suggests repositories should focus on making content available on the web rather than just placing it in repositories to better support open access goals.
Keynote presentation delivered at ELAG 2013 in Gent, Belgium, on May 29 2013. Discusses Research Objects and the relationship to work my team has been involved in during the past couple of years: OAI-ORE, Open Annotation, Memento.
Tutorial at OAI5 (cern.ch/oai5). Abstract: This tutorial will provide a practical overview of current practices in modelling complex or compound digital objects. It will examine some of the key scenarios around creating complex objects and will explore a number of approaches to packaging and transport. Taking research papers, or scholarly works, as an example, the tutorial will explore the different ways in which these, and their descriptive metadata, can be treated as complex objects. Relevant application profiles and metadata formats will be introduced and compared, such as Dublin Core, in particular the DCMI Abstract Model, and MODS, alongside content packaging standards, such as METS MPEG 21 DIDL and IMS CP. Finally, we will consider some future issues and activities that are seeking to address these. The tutorial will be of interest to librarians and technical staff with an interest in metadata or complex objects, their creation, management and re-use.
The document discusses digital libraries, including their architecture and design. It defines a digital library as a collection of documents available electronically on the internet or CD-ROM. Digital libraries use technology to break down traditional rules for archives by describing archived materials individually and allowing for reproduction. The document also discusses different types of metadata, including structural and descriptive metadata, and different metadata schemes.
Semantic technologies for the Internet of Things PayamBarnaghi
The document discusses semantic technologies for the Internet of Things. It describes how sensor data in the IoT is time-dependent, continuous, and variable quality. Semantic annotations and machine-interpretable formats like XML and RDF are needed to make the data interoperable. Ontologies provide formal definitions of concepts and relationships in a domain that enable machines to process IoT data and enable autonomous device interactions. The document outlines approaches to semantically describe sensor observations and measurements using XML, RDF graphs, and adding domain concepts and logical rules with ontologies.
The lecture covers several key topics in database systems including:
1. An overview of database concepts such as data models, normalization, data integrity restrictions, query optimization and processing, and SQL.
2. Parallel processing of data and recovery methods.
3. Database design and development including object-relational mapping technologies.
4. Distributed, parallel and heterogeneous databases including definitions and examples of each.
This document provides an overview of key concepts in data science and big data, including:
- Data science involves extracting knowledge and insights from structured, semi-structured, and unstructured data.
- The data value chain describes the process of acquiring data, analyzing it, curating it for storage, and using it.
- Big data is characterized by its volume, velocity, variety, and veracity. Hadoop is an open-source framework that allows distributed processing of large datasets across computer clusters.
The document discusses the future of repositories and how they may evolve based on emerging technologies and trends. It examines how repositories could adopt aspects of Web 2.0 like social features, tagging, and improved interfaces. It also explores the potential impact of semantic web technologies through richer metadata standards like the Scholarly Works Application Profile (SWAP). The document suggests repositories should focus on making content available on the web rather than just placing it in repositories to better support open access goals.
Keynote presentation delivered at ELAG 2013 in Gent, Belgium, on May 29 2013. Discusses Research Objects and the relationship to work my team has been involved in during the past couple of years: OAI-ORE, Open Annotation, Memento.
The document discusses key concepts in relational data models including entities, attributes, relationships, and constraints. It provides examples of each concept and explains how they are the basic building blocks used to structure data in a relational database. Specific types of entities, attributes, relationships and their properties are defined, such as one-to-one, one-to-many, and many-to-many relationships. Overall, the document serves as an introduction to fundamental concepts in relational data modeling.
The document discusses strategies for modeling and publishing open government data as linked data. It outlines a process that includes identifying data, modeling exemplar records, naming resources with URIs, describing resources with vocabularies, converting data to RDF, and publishing and maintaining the data. The key steps are to focus on modeling real-world objects without consideration for specific applications, take an iterative approach, and be forgiving of imperfect initial models. Content management systems and wiki systems are not optimal for structured linked data, so a linked data management system like Callimachus is recommended.
The document provides an overview of knowledge graphs and the metaphactory knowledge graph platform. It defines knowledge graphs as semantic descriptions of entities and relationships using formal knowledge representation languages like RDF, RDFS and OWL. It discusses how knowledge graphs can power intelligent applications and gives examples like Google Knowledge Graph, Wikidata, and knowledge graphs in cultural heritage and life sciences. It also provides an introduction to key standards like SKOS, SPARQL, and Linked Data principles. Finally, it describes the main features and architecture of the metaphactory platform for creating and utilizing enterprise knowledge graphs.
- euroCRIS is an international organization that promotes interoperability in research through the CERIF data model. CERIF is a comprehensive data model that can represent all aspects of research information using linking entities.
- CRIS systems use the CERIF data model to store and exchange research information. CERIF allows complex relationships between research objects like people, projects, publications to be represented.
- Future developments include using CRIS for research data management, multidimensional researcher profiles, and building research information infrastructures based on interoperability standards like CERIF.
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
Through the use of Linked Data (LD), Libraries, Archives and Museums (LAMs) have the potential to expose their collections to a larger audience and to allow for more efficient user searches. Despite this, relatively few LAMs have invested in LD projects and the majority of these display limited interlinking across datasets and institutions. A survey was conducted to understand Information Professionals' (IPs') position with regards to LD, with a particular focus on the interlinking problem. The survey was completed by 185 librarians, archivists, metadata cataloguers and researchers. Results indicated that, when interlinking, IPs find the process of ontology and property selection to be particularly challenging, and LD tooling to be technologically complex and unsuitable for their needs.
Our research is focused on developing an authoritative interlinking framework for LAMs with a view to increasing IP engagement in the linking process. Our framework will provide a set of standards to facilitate IPs in the selection of link types, specifically when linking local resources to authorities. The framework will include guidelines for authority, ontology and property selection, and for adding provenance data. A user-interface will be developed which will direct IPs through the resource interlinking process as per our framework. Although there are existing tools in this domain, our framework differs in that it will be designed with the needs and expertise of IPs in mind. This will be achieved by involving IPs in the design and evaluation of the framework. A mock-up of the interface has already been tested and adjustments have been made based on results. We are currently working on developing a minimal viable product so as to allow for further testing of the framework. We will present our updated framework, interface, and proposed interlinking solutions.
Here are the identities, states, and behaviors of the given objects:
LightBulb
Identity: Serial number
State: Wattage, color temperature, material
Behavior: Turn on, turn off, glow, illuminate
Car
Identity: Vehicle identification number (VIN)
State: Make, model, year, color, mileage
Behavior: Drive, accelerate, brake, turn
BankAccount
Identity: Account number
State: Balance, account type, owner name(s)
Behavior: Deposit, withdraw, pay bills, earn interest
The document discusses database concepts including:
1) The key concepts of a database including data, information, fields, records, files, and how a database improves over traditional file-based systems.
2) The functions of a database management system (DBMS) including database development, application development, and maintenance.
3) The database development process including planning, requirement specification, conceptual design, logical design, and physical design.
The Web of Linked Open Data, or LOD, is the most relevant achievement of the Semantic Web. Initially proposed by Tim Berners-Lee in a seminal paper published in Scientific American in 2001, the Semantic Web envisions a web where software agents can interact with large volumes of structured, easy to process data. It is now when users have at our disposal the first, mature results of this vision. Among them, and probably the most significant ones, are the different LOD initiatives and projects that publish open data in standard formats like RDF.
This presentation provides an overview and comparison of different LOD initiatives in the area of patent information, and analyses potential opportunities for building new information services based on largely available datasets of patent information. Information is based on different interviews conducted with innovation agents and on the analysis of professional bibliography and current implementations.
LOD opportunities are not only restricted to information aggregators, but also to end-users and innovation agents that need to face with the difficulties of dealing with large amounts of data. In both cases, the opportunities offered by LOD need to be assessed, as LOD has just become a standard, universal method to distribute, share and access data.
This document provides an overview of metadata, including:
1) Definitions of metadata from various sources, describing it as data that describes other data or information resources.
2) The main types of metadata - descriptive, processing, administrative, and semantic. Descriptive metadata retrieves information, processing metadata processes information, and administrative metadata manages information.
3) How metadata can be created automatically by tools or manually by people. Metadata schemes provide a formal structure to identify a discipline's knowledge and link it to information resources.
This document summarizes a workshop on metadata and digital libraries. It discusses the objectives of library systems and how they impact metadata. The workshop introduces Dublin Core metadata and examines how functional requirements inform system design and metadata decisions. Participants analyze sample metadata and use cases to understand these concepts. The summary highlights that system objectives guide metadata, and functional requirements defined through use cases specify required system behaviors and metadata.
An introduction to repository reference modelsJulie Allinson
Presentation at CETIS Metadata and Digital Repositories SIG Meeting, 1st March 2006, HE Academy, York. Julie Allinson, Digital Repositories Support Officer, UKOLN, University of Bath
Lecture Basic HTML tags. Beginning Web Site Design Stanford University Contin...Anwar Patel
A database management system (DBMS) allows users to create, maintain, and interact with databases by providing an interface to manage stored data and tools to manipulate and retrieve it efficiently. A relational database management system (RDBMS) organizes and stores data using a relational model with tables containing rows and columns. The relational model represents data entities as tables with attributes as columns and relationships as primary and foreign keys. The three main types of relationships are one-to-one, one-to-many, and many-to-many.
This document outlines the key concepts of database systems and the entity-relationship model. It defines what a database is, the advantages of using database management systems, and common DBMS architectures. It also explains the core components of the entity-relationship model including entities, attributes, relationships, keys, and degrees of relationships. The document provides examples and definitions to illustrate database and ER model concepts.
Data science involves analyzing structured and unstructured data to extract knowledge and insights. It is a multidisciplinary field that employs techniques from mathematics, statistics, computer science, and information science. Data scientists process and analyze large amounts of data to derive value from it. The data value chain describes the process of acquiring, storing, analyzing, curating, and using data. It involves transforming raw data into useful information that can be applied to business decisions and activities. Big data is large, diverse data that is too complex for traditional data processing due to its high volume, velocity, and variety.
Data Communities - reusable data in and outside your organization.Paul Groth
Description
Data is a critical both to facilitate an organization and as a product. How can you make that data more usable for both internal and external stakeholders? There are a myriad of recommendations, advice, and strictures about what data providers should do to facilitate data (re)use. It can be overwhelming. Based on recent empirical work (analyzing data reuse proxies at scale, understanding data sensemaking and looking at how researchers search for data), I talk about what practices are a good place to start for helping others to reuse your data. I put this in the context of the notion data communities that organizations can use to help foster the use of data both within your organization and externally.
Data science involves analyzing structured, semi-structured, and unstructured data to extract knowledge and insights. It employs techniques from fields like statistics, computer science, and information science. Data scientists possess strong skills in programming, statistics, data modeling, and machine learning. The data processing lifecycle involves data acquisition, analysis, curation, storage, and exploration. Big data is characterized by its volume, velocity, and variety. Technologies like Hadoop use clustered computing and distributed storage like HDFS to efficiently process and store large amounts of structured and unstructured data.
The document provides definitions and explanations of key concepts in database management systems. It discusses:
- The purpose of a DBMS is to solve problems with file processing systems like data redundancy, inconsistency, difficult data access and isolation, and integrity and concurrency issues.
- Data abstraction and levels of abstraction hide complexity from users through physical, logical, and view levels.
- A DBMS provides an environment for convenient and efficient data retrieval and storage.
- Data independence allows changes to schema definitions without affecting other levels.
METADATA: A PRACTICE AND ITS SERVICES TOWARDS DIGITAL ENVIRONMENTVikas Bhushan
The document provides an overview of metadata, including its definition, purpose, types and examples. It discusses metadata records and conceptual models like FRBR. It also covers approaches and standards towards metadata, including schemas like Dublin Core and approaches to ensure interoperability. Finally, it outlines the research landscape around metadata architecture, modeling and semantics and concludes with the importance of metadata in organizing digital information and enabling discovery.
This document discusses the properties of solids, liquids, and gases. It explains that in solids, particles are fixed in place with strong intermolecular forces, while in liquids they are closer together but with more motion. In gases, particles are far apart with minimal intermolecular forces and take the shape and volume of their container. It also describes evaporation as a process where molecules gain energy to break intermolecular bonds and become a gas, and condensation as the opposite where gases release energy and form bonds to become liquids. Dynamic equilibrium is reached when the rates of evaporation and condensation are equal in a closed system.
This document provides an overview of the CATIA V5 software, including:
- A course schedule covering part design, assembly modeling, wireframes, drafting and finite element analysis.
- Descriptions of the CATIA applications for product structure, part design, assembly design, and other tools.
- Information on how to interact with and get help within the CATIA interface.
The document discusses key concepts in relational data models including entities, attributes, relationships, and constraints. It provides examples of each concept and explains how they are the basic building blocks used to structure data in a relational database. Specific types of entities, attributes, relationships and their properties are defined, such as one-to-one, one-to-many, and many-to-many relationships. Overall, the document serves as an introduction to fundamental concepts in relational data modeling.
The document discusses strategies for modeling and publishing open government data as linked data. It outlines a process that includes identifying data, modeling exemplar records, naming resources with URIs, describing resources with vocabularies, converting data to RDF, and publishing and maintaining the data. The key steps are to focus on modeling real-world objects without consideration for specific applications, take an iterative approach, and be forgiving of imperfect initial models. Content management systems and wiki systems are not optimal for structured linked data, so a linked data management system like Callimachus is recommended.
The document provides an overview of knowledge graphs and the metaphactory knowledge graph platform. It defines knowledge graphs as semantic descriptions of entities and relationships using formal knowledge representation languages like RDF, RDFS and OWL. It discusses how knowledge graphs can power intelligent applications and gives examples like Google Knowledge Graph, Wikidata, and knowledge graphs in cultural heritage and life sciences. It also provides an introduction to key standards like SKOS, SPARQL, and Linked Data principles. Finally, it describes the main features and architecture of the metaphactory platform for creating and utilizing enterprise knowledge graphs.
- euroCRIS is an international organization that promotes interoperability in research through the CERIF data model. CERIF is a comprehensive data model that can represent all aspects of research information using linking entities.
- CRIS systems use the CERIF data model to store and exchange research information. CERIF allows complex relationships between research objects like people, projects, publications to be represented.
- Future developments include using CRIS for research data management, multidimensional researcher profiles, and building research information infrastructures based on interoperability standards like CERIF.
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
Through the use of Linked Data (LD), Libraries, Archives and Museums (LAMs) have the potential to expose their collections to a larger audience and to allow for more efficient user searches. Despite this, relatively few LAMs have invested in LD projects and the majority of these display limited interlinking across datasets and institutions. A survey was conducted to understand Information Professionals' (IPs') position with regards to LD, with a particular focus on the interlinking problem. The survey was completed by 185 librarians, archivists, metadata cataloguers and researchers. Results indicated that, when interlinking, IPs find the process of ontology and property selection to be particularly challenging, and LD tooling to be technologically complex and unsuitable for their needs.
Our research is focused on developing an authoritative interlinking framework for LAMs with a view to increasing IP engagement in the linking process. Our framework will provide a set of standards to facilitate IPs in the selection of link types, specifically when linking local resources to authorities. The framework will include guidelines for authority, ontology and property selection, and for adding provenance data. A user-interface will be developed which will direct IPs through the resource interlinking process as per our framework. Although there are existing tools in this domain, our framework differs in that it will be designed with the needs and expertise of IPs in mind. This will be achieved by involving IPs in the design and evaluation of the framework. A mock-up of the interface has already been tested and adjustments have been made based on results. We are currently working on developing a minimal viable product so as to allow for further testing of the framework. We will present our updated framework, interface, and proposed interlinking solutions.
Here are the identities, states, and behaviors of the given objects:
LightBulb
Identity: Serial number
State: Wattage, color temperature, material
Behavior: Turn on, turn off, glow, illuminate
Car
Identity: Vehicle identification number (VIN)
State: Make, model, year, color, mileage
Behavior: Drive, accelerate, brake, turn
BankAccount
Identity: Account number
State: Balance, account type, owner name(s)
Behavior: Deposit, withdraw, pay bills, earn interest
The document discusses database concepts including:
1) The key concepts of a database including data, information, fields, records, files, and how a database improves over traditional file-based systems.
2) The functions of a database management system (DBMS) including database development, application development, and maintenance.
3) The database development process including planning, requirement specification, conceptual design, logical design, and physical design.
The Web of Linked Open Data, or LOD, is the most relevant achievement of the Semantic Web. Initially proposed by Tim Berners-Lee in a seminal paper published in Scientific American in 2001, the Semantic Web envisions a web where software agents can interact with large volumes of structured, easy to process data. It is now when users have at our disposal the first, mature results of this vision. Among them, and probably the most significant ones, are the different LOD initiatives and projects that publish open data in standard formats like RDF.
This presentation provides an overview and comparison of different LOD initiatives in the area of patent information, and analyses potential opportunities for building new information services based on largely available datasets of patent information. Information is based on different interviews conducted with innovation agents and on the analysis of professional bibliography and current implementations.
LOD opportunities are not only restricted to information aggregators, but also to end-users and innovation agents that need to face with the difficulties of dealing with large amounts of data. In both cases, the opportunities offered by LOD need to be assessed, as LOD has just become a standard, universal method to distribute, share and access data.
This document provides an overview of metadata, including:
1) Definitions of metadata from various sources, describing it as data that describes other data or information resources.
2) The main types of metadata - descriptive, processing, administrative, and semantic. Descriptive metadata retrieves information, processing metadata processes information, and administrative metadata manages information.
3) How metadata can be created automatically by tools or manually by people. Metadata schemes provide a formal structure to identify a discipline's knowledge and link it to information resources.
This document summarizes a workshop on metadata and digital libraries. It discusses the objectives of library systems and how they impact metadata. The workshop introduces Dublin Core metadata and examines how functional requirements inform system design and metadata decisions. Participants analyze sample metadata and use cases to understand these concepts. The summary highlights that system objectives guide metadata, and functional requirements defined through use cases specify required system behaviors and metadata.
An introduction to repository reference modelsJulie Allinson
Presentation at CETIS Metadata and Digital Repositories SIG Meeting, 1st March 2006, HE Academy, York. Julie Allinson, Digital Repositories Support Officer, UKOLN, University of Bath
Lecture Basic HTML tags. Beginning Web Site Design Stanford University Contin...Anwar Patel
A database management system (DBMS) allows users to create, maintain, and interact with databases by providing an interface to manage stored data and tools to manipulate and retrieve it efficiently. A relational database management system (RDBMS) organizes and stores data using a relational model with tables containing rows and columns. The relational model represents data entities as tables with attributes as columns and relationships as primary and foreign keys. The three main types of relationships are one-to-one, one-to-many, and many-to-many.
This document outlines the key concepts of database systems and the entity-relationship model. It defines what a database is, the advantages of using database management systems, and common DBMS architectures. It also explains the core components of the entity-relationship model including entities, attributes, relationships, keys, and degrees of relationships. The document provides examples and definitions to illustrate database and ER model concepts.
Data science involves analyzing structured and unstructured data to extract knowledge and insights. It is a multidisciplinary field that employs techniques from mathematics, statistics, computer science, and information science. Data scientists process and analyze large amounts of data to derive value from it. The data value chain describes the process of acquiring, storing, analyzing, curating, and using data. It involves transforming raw data into useful information that can be applied to business decisions and activities. Big data is large, diverse data that is too complex for traditional data processing due to its high volume, velocity, and variety.
Data Communities - reusable data in and outside your organization.Paul Groth
Description
Data is a critical both to facilitate an organization and as a product. How can you make that data more usable for both internal and external stakeholders? There are a myriad of recommendations, advice, and strictures about what data providers should do to facilitate data (re)use. It can be overwhelming. Based on recent empirical work (analyzing data reuse proxies at scale, understanding data sensemaking and looking at how researchers search for data), I talk about what practices are a good place to start for helping others to reuse your data. I put this in the context of the notion data communities that organizations can use to help foster the use of data both within your organization and externally.
Data science involves analyzing structured, semi-structured, and unstructured data to extract knowledge and insights. It employs techniques from fields like statistics, computer science, and information science. Data scientists possess strong skills in programming, statistics, data modeling, and machine learning. The data processing lifecycle involves data acquisition, analysis, curation, storage, and exploration. Big data is characterized by its volume, velocity, and variety. Technologies like Hadoop use clustered computing and distributed storage like HDFS to efficiently process and store large amounts of structured and unstructured data.
The document provides definitions and explanations of key concepts in database management systems. It discusses:
- The purpose of a DBMS is to solve problems with file processing systems like data redundancy, inconsistency, difficult data access and isolation, and integrity and concurrency issues.
- Data abstraction and levels of abstraction hide complexity from users through physical, logical, and view levels.
- A DBMS provides an environment for convenient and efficient data retrieval and storage.
- Data independence allows changes to schema definitions without affecting other levels.
METADATA: A PRACTICE AND ITS SERVICES TOWARDS DIGITAL ENVIRONMENTVikas Bhushan
The document provides an overview of metadata, including its definition, purpose, types and examples. It discusses metadata records and conceptual models like FRBR. It also covers approaches and standards towards metadata, including schemas like Dublin Core and approaches to ensure interoperability. Finally, it outlines the research landscape around metadata architecture, modeling and semantics and concludes with the importance of metadata in organizing digital information and enabling discovery.
This document discusses the properties of solids, liquids, and gases. It explains that in solids, particles are fixed in place with strong intermolecular forces, while in liquids they are closer together but with more motion. In gases, particles are far apart with minimal intermolecular forces and take the shape and volume of their container. It also describes evaporation as a process where molecules gain energy to break intermolecular bonds and become a gas, and condensation as the opposite where gases release energy and form bonds to become liquids. Dynamic equilibrium is reached when the rates of evaporation and condensation are equal in a closed system.
This document provides an overview of the CATIA V5 software, including:
- A course schedule covering part design, assembly modeling, wireframes, drafting and finite element analysis.
- Descriptions of the CATIA applications for product structure, part design, assembly design, and other tools.
- Information on how to interact with and get help within the CATIA interface.
This document provides an overview of developing tools using the Channel Access Server (CAS) library. It discusses the server-side and client-side responsibilities, the server application programming interface (API) including key classes like caServer and casPV, and how to implement functions like reading/writing PVs asynchronously. General design goals like not blocking client requests and reference counting of data descriptors are also covered.
The document traces the history and development of computers from the 1930s to present day, including the creation of the earliest computers, the transition from vacuum tubes to integrated circuits, and the rise of personal computers, servers, and the Internet. It discusses how computers have evolved from room-sized mainframes to devices embedded in everyday objects. The Internet started as a small network and transformed into a worldwide system with billions of users. The document outlines the four phases of the information age from institutional to collaborative computing.
The document provides an overview of XML (eXtensible Markup Language) and web servers. It discusses why XML was created, its basic rules and structure, validation using DTDs and schemas, and parsers like SAX and DOM. Advantages of XML include being text-based, modular, and able to display data differently in various media. Disadvantages include being more complex than HTML and a historical lack of applications and browser support. The document also defines a web server's primary function of storing, processing and delivering web pages via HTTP. Common features, path translation, kernel vs user-mode implementations, and current market share leaders are summarized.
This document discusses resource sharing, library networks, and consortia. It begins by explaining the three phases of development: 1) several libraries come together for mutual benefit through resource sharing and cooperation, 2) libraries are linked through networks using information and communication technologies, and 3) libraries form consortia to jointly acquire and share e-resources. It then provides details on the need for and objectives of resource sharing, areas of resource sharing such as interlibrary loans and cooperative acquisition, and how networks can facilitate resource sharing. Examples of important international and Indian networks are also summarized. Finally, the document outlines the key features, need for, and objectives of library consortia.
This document defines key concepts related to information processing systems. It discusses how data is collected and organized to become useful information. An information processing system accepts and stores data, processes it into information, and outputs it. The three main components are hardware, software, and peopleware. Hardware includes the tangible computer parts like the CPU, memory, and input/output devices. Software includes operating systems and applications. Peopleware refers to the users. The document also defines units of data measurement and types of computer hardware, software, files, and storage devices.
This document provides an overview of operating system concepts from the 9th edition of the textbook "Operating System Concepts" by Silberschatz, Galvin and Gagne. It discusses operating system services, structures, system calls, system programs, design and implementation. The key topics covered include user interfaces, process management, file systems, devices, resource allocation, protection and various system call examples.
The document discusses various DOS commands used to manage files and directories in MS-DOS. It describes internal commands that are preloaded and external commands stored on disks. Some key commands covered include DIR to list files, MD to create directories, CD to change directories, COPY to copy files, DEL to delete files, and FORMAT to format disks.
- Tim Paterson designed MS-DOS in 1980 while working for Seattle Computer Products. He wrote it in 8086 assembly language to be simple, fast and efficient.
- MS-DOS 1.0 was released by IBM in 1981. Microsoft then took over development and updated it through several versions until replacing it with Windows 95 in 1994.
- DOS used internal commands like COPY, DIR, DEL at the command prompt to manage files, directories and hardware. It had a basic file structure with 8.3 naming and supported wildcards for file searching.
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.pptHenry Hollis
The History of NZ 1870-1900.
Making of a Nation.
From the NZ Wars to Liberals,
Richard Seddon, George Grey,
Social Laboratory, New Zealand,
Confiscations, Kotahitanga, Kingitanga, Parliament, Suffrage, Repudiation, Economic Change, Agriculture, Gold Mining, Timber, Flax, Sheep, Dairying,
Gender and Mental Health - Counselling and Family Therapy Applications and In...PsychoTech Services
A proprietary approach developed by bringing together the best of learning theories from Psychology, design principles from the world of visualization, and pedagogical methods from over a decade of training experience, that enables you to: Learn better, faster!
How to Manage Reception Report in Odoo 17Celine George
A business may deal with both sales and purchases occasionally. They buy things from vendors and then sell them to their customers. Such dealings can be confusing at times. Because multiple clients may inquire about the same product at the same time, after purchasing those products, customers must be assigned to them. Odoo has a tool called Reception Report that can be used to complete this assignment. By enabling this, a reception report comes automatically after confirming a receipt, from which we can assign products to orders.
This presentation was provided by Racquel Jemison, Ph.D., Christina MacLaughlin, Ph.D., and Paulomi Majumder. Ph.D., all of the American Chemical Society, for the second session of NISO's 2024 Training Series "DEIA in the Scholarly Landscape." Session Two: 'Expanding Pathways to Publishing Careers,' was held June 13, 2024.
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...indexPub
The recent surge in pro-Palestine student activism has prompted significant responses from universities, ranging from negotiations and divestment commitments to increased transparency about investments in companies supporting the war on Gaza. This activism has led to the cessation of student encampments but also highlighted the substantial sacrifices made by students, including academic disruptions and personal risks. The primary drivers of these protests are poor university administration, lack of transparency, and inadequate communication between officials and students. This study examines the profound emotional, psychological, and professional impacts on students engaged in pro-Palestine protests, focusing on Generation Z's (Gen-Z) activism dynamics. This paper explores the significant sacrifices made by these students and even the professors supporting the pro-Palestine movement, with a focus on recent global movements. Through an in-depth analysis of printed and electronic media, the study examines the impacts of these sacrifices on the academic and personal lives of those involved. The paper highlights examples from various universities, demonstrating student activism's long-term and short-term effects, including disciplinary actions, social backlash, and career implications. The researchers also explore the broader implications of student sacrifices. The findings reveal that these sacrifices are driven by a profound commitment to justice and human rights, and are influenced by the increasing availability of information, peer interactions, and personal convictions. The study also discusses the broader implications of this activism, comparing it to historical precedents and assessing its potential to influence policy and public opinion. The emotional and psychological toll on student activists is significant, but their sense of purpose and community support mitigates some of these challenges. However, the researchers call for acknowledging the broader Impact of these sacrifices on the future global movement of FreePalestine.
How Barcodes Can Be Leveraged Within Odoo 17Celine George
In this presentation, we will explore how barcodes can be leveraged within Odoo 17 to streamline our manufacturing processes. We will cover the configuration steps, how to utilize barcodes in different manufacturing scenarios, and the overall benefits of implementing this technology.
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...TechSoup
Whether you're new to SEO or looking to refine your existing strategies, this webinar will provide you with actionable insights and practical tips to elevate your nonprofit's online presence.
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
DLF-JCDL2007ExpandedKoeln.ppt
1. 1
Digital Library Content Model
Dagobert Soergel
College of Information Studies
University of Maryland
Department of Library and Information Studies
University at Buffalo
2. 2
The Problem
Digital libraries must
1. Store a wide variety of often complex information objects
and display these objects on different platforms.
This requires modeling information objects, their internal
structure, and relationships among them.
2. Provide data that support discovery, interpretation, use,
and management of information objects.
This requires a good metadata model
3. Support annotation of information objects.
Annotations turn out to be surprisingly diverse.
An annotation my refer to only a part of an information object.
This requires an elegant model that can deal with many cases.
3. 3
Purpose of the talk
To reexamine a number of basic notions regarding
the content of a digital library (or, more generally, any
information system) to achieve sound definitions
Developed in the framework of the
DELOS Digital Library Reference Model
a framework for describing digital libraries, their content,
users, and functions and, for each, their qualities and
associated policies
4. 4
Premisses
• Modeling the content domain is complex and much
thinking is muddled
• Need to be able to handle both “data” and “documents”
• Any reference model
• needs to be abstract and must not commit to any
particular standard or design decision
• rather, it must provide a framework for specifying
the commitments of any particular DL
(or information system)
5. 5
Issues
0 Scope of this talk and modeling constructs
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
6. 6
Scope of this talk
• A reference model for a broadly conceived digital
library will be able to model most any information
system, thus will be useful very broadly.
• The focus on digital libraries is in the application,
especially the type of collection, to which the model is
applied.
7. 7
Scope: level of abstraction
• The reference model should stay on an abstract level. It should
not require specific standards but rather
allow for plugging in any standard, such as RDA or DC.
• A DL should indicate to the users what standard it uses
for things like time, place, type of relationship, type of resource
• The reference model should
not require design choices but rather
provide a framework for specifying design choices,
such as selectivity of the collection. A DL will then indicate
whether its collection is selective or fully inclusive
8. 8
Modeling constructs
• The reference model should be based on an
entity-relationship model (E-R model).
• Second-order logic: relationship instances are resources that
can in turn be related to anything.
Apply pragmatically for useful navigation and common-sense
inferences; stay away from types of reasoning that run into
problems with second order logic.
• Must add mechanisms for indicating the degree of precision or
the degree of certainty of statements.
9. 9
Issues
1a Content in the overall context of
a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
10. 10
Content in the overall context of a
DL reference model
• Resources
• Structured data
• Unstructured data, text
• Uses of data
11. 11
Everything is a resource
W3C definition
A resource is anything that can be identified or named.
Any resource is represented by a resource identifiern
Resource includes
● external (non-digital) objects or events and
● digital object or event,
wherever that digital object or event may reside or occur.
Same as topic in topic maps
In an E-R model, entity types, entity instances (entity values),
relationship types, and relationship instances are all resources
In RDA: Resource restricted to information object.
Advantages of broader definition will become clear.
12. 12
Structured data = statements
Resource 1 <relationship> Resource 2
SoftwareModule <createdBy> LegalEntity
SoftwareModule <annotatedBy> Information object
Event <happenedIn> (Date1, Date2)
Multi-way relationships, frames
Statements are information objects, that is, they are
resources that can in turn be related to anything
Statement also called proposition or assertions (or fact)
13. 13
More on structured data
Data consist of statements about resources.
Such statements can be conceived as relationship instances
in which the resource in focus occupies one argument slot.
A simple statement using a binary relationship or a
multi-way relationship (a frame instance with slots filled)
(objects in an object-oriented database)
Drug treatment frame instance
Drug Taxoteer
treatsDisease Cancer, estrogen-negative
inPopulationGroup Elderly
hasSuccessRate 55%
14. 14
More on structured data
Slot fillers are also known as data values.
A data value makes sense only when it is seen in relation to
one or more resources, for example as a slot filler in a frame.
Examples
The value 55% makes sense only in the right context, such as in
the success slot of a drug treatment frame
The value 185 cm makes sense only if we know it is the height of
a person or the length of a pair of skis.
15. 15
There are two ways to communicate such statements.
1. Structured data:
One learns what one wants to know about the resource in
focus immediately from a relationship instance.
Hamlet <authoredBy> Shakespeare
The drug treatment frame on Taxoteer
The actual data of interest are represented in a database
16. 16
There are two ways to communicate such statements.
2. Unstructured data:
One needs to extract what one wants to know from a text or
image that is related to the resource in focus.
Shakespeare schrieb den Hamlet im Jahre 1625
Hamlet wurde von Shakespeare verfasst
Taxoteer ist effektiv in der Behandlung von Krebsen die keine
Rezeptoren fuer Estrogen haben. In aelteren Personen liegt
die Erfolgsrate bei 50%.
The data of interest are stored in what is commonly known as
document.
17. 17
Functions of data
Data about a resource may serve any of the following functions:
• learn about the resource and its various characteristics
• learn about the history and context of the resource
• learn how to use the resource
• manage the resource
• preserve the resource
The sections about metadata (roughly: data about an information object)
will specialize this list
18. 18
Relationship as the
basic modeling construct
Important principle:
Many concepts in a DL reference model are best modeled
based on relationships rather than based on entities
For example, “annotation-hood”
resides not in an information object but in the relationship
InformationObjectA <annotates> InformatioObjectB
InformationObject B <annotatedBy> InformationObjectA
19. 19
Resource type examples
• Information objects
Incl. documents, data streams, databases, queries and their
results (virtual information objects, such as database reports,
virtual collections)
• Actors that can search for, create, and manage resources
• Functions and services
• Software modules
• Policies
• Languages
• Ideas, concepts
20. 20
Inheritance
Many reference model constructs are specified at the level of resource.
They inherit down to the different resource types, especially
information objects
For example, the following statement types are valid for Resource
Resource <identifiedBy> Identifier
Resource <characterizedBy> QualityParameter
Resource <regulatetBy> Policy
Therefore, they are also valid for InformationObject or Actor or Policy
21. 21
Issues
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
22. 22
Information objects 1
1. A formal relationship instance
(such a row in a table or a structured data record)
2. A document (written or spoken text, image, sound) from which a
human reader can learn about the resource in focus or about the
relationships among several resources.
Information extraction: document → formal relationship instances.
A collection of information objects is in turn an information object
• a table in a relational database = a collection of rows, each
representing a relationship instance or a collection of relationship
instances
• a collection of documents
23. 23
Information objects 2
An information object may be a close representation of an
external object or event, for example
• An image (photograph or painting) of a building. There may be
many such images taken from different angles etc.
• A video recording of a soccer game. There may be several
such video recordings, each capturing different scenes, or
capturing the same scene from different angles, or following
different players, etc. These are different information objects
representing the same external event.
24. 24
Real world objects, concepts, ideas
To provide full access to the information objects it contains,
a digital library must manage data about any kind of object
(real world objects, concepts, ideas) in its subject domain.
Why?
1. The DL may represent data in the form of a database
2. Users look for information objects that deal with or are
digital representations of any kind of object.
This idea underlies Topic Maps which were originally designed to
improve access to documents by relating the topics discussed in
these documents.
25. 25
Real world objects, concepts, ideas
Examples (these are all resources)
• People (focus of biographical reference tools)
• Organizations (focus of organization directories)
• Events (focus of developing "event gazetteers")
• Places (focus of gazetteers)
• Dates
• Mathematical theorems (focus of mathematical encyclopedias)
• Concepts, ideas
• Problems and proposed solutions
• Computer programs (focus of software directories or libraries)
The reference model should have a more complete list and indicate
sources dealing with these
26. 26
Issues
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and
relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
28. 28
Work
Intellectual or artistic entity, as the abstract essence or as a text,
image, or piece of music.
Range:
• A basic story or theme
• the story of Faust
• the myth of the Great Flood
• A text telling the story, such as
• Goethe's Faust
• the account of the Great Flood in the Bible (original Hebrew)
• the account of the same myth in another culture
• A specific version of the account in the Hebrew Bible
a Latin translation of the account in the Hebrew Bible
29. 29
Manifestation
A specific rendering of a work by means of a graphical image or
sound, taken in the abstract; the idea of such a rendering.
Examples:
• The text of Goethe's Faust printed in a particular typeface and layout
A performance at which the text is recited also renders the text but is
more properly considered a separate, but related, work.
• A specific score of a given version of Schubert's Fifth.
A performance of that version of Schubert’s Fifth also renders the
piece of music but is considered a separate, but related, work.
Also the rendering of a work in the form of digital storage that can be
transformed to a graphical image or sound, again taken as the abstract
pattern of digital signals.
30. 30
Item, individual copy
The embodiment of a manifestation in a physical object
We can perceive the content of an manifestation only through an
individual copy of it (unless we have memorized the visual expression
manifest in a manifestation and can conjure it up from memory).
There are works that have only one manifestation of which there is
only one copy.
31. 31
Relationships among information objects
The story of Faust <dealsWith> Pact with the devil
The story of Faust <isToldIn> Marlow’s Faust
The story of Faust <isToldIn> Goethe’s Faust
Goethe’s Faust <authoredBy> Goethe, Johann Wolfgang von
Goethe’s Faust <hasManifestation> R1231
R1231 <publishedBy> Cotta
R1231 <hasDate> 1871
R1232 <isCopyOf> R1231
R1232 <ownedBy> (HRieth, 1896, 1956)
R1232 <ownedBy> (DSoergel, 1956, *)
32. 32
Hierarchical inheritance
• Data about a work inherit to all works below it along <isToldIn>,
<hasVersion> etc. Therefore
Goethe' Faust <dealsWith> Pact with the devil
• Data about a work inherit to all its manifestations. Therefore
R1231 <authoredBy> Goethe, Johann Wolfgang von
• Data about a manifestation inherit to all its items
• Hierarchical inheritance increases efficiency
• More efficient catalog input
• More efficient catalog storage
• More efficient representation and reading of search results
33. More relationships
R271 The man I killed, by Michael Halliday
R519 The man I killed, play by Christopher Wern
R519 <isBasedOn> R271
R315 Handbook of commercial geography, by Robert
Chisholm
R783 Chisholm's handbook of commercial geography,
entirely rewritten by L. Dudley Stamp and S.
Carter Gilmour.
R783 <entirelyRewrittenFrom> R315
33
34. 34
Relationship to FRBR
Notes on Terminology
• The FRBR distinction between work and expression should
be rethought. It is unclear and consequently poorly
understood, and it may not be necessary. Just have work.
The intuition FRBR tries to capture in this distinction is better
handled through relationships among works as defined here.
• Following FRBR I use the term manifestation.
Other term: edition (in the sense of German Ausgabe),
but edition also means German Auflage,
so use of the term edition can be confusing.
• It would be nice to be able to use graphic expression as a
synonym for rendering, but to avoid any further confusion with
FRBR it is best not to use the term expression at all.
36. 36
Issues
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects /
resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
37. 37
Composite information objects / resources
Examples
• Book divided into chapters, sections, paragraphs, words (XML
Document Object Model, DOM or TEI)
Each part can be seen as a separate information object
• Movie with images, soundtrack, close captions, script, all
coordinated (MPEG-7)
• A medical record with patient data, test data, images, live
monitoring data streams, diagnoses, drugs prescribed, etc.
38. 38
Composite information objects / resources
Abstractly:
Each component is a separate information object,
composition expressed through relationships
In practice:
Many document models for composite (or compound)
documents supporting presentation
DL needs to allow specification, for each document,
of the particular document model used
39. 39
Issues
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
40. 40
Identifying information objects
1 Initial definition upon entry into the digital library.
2 Definition on the spot
Examples
Annotate a specific segment of a text document or a region
of an image or sound document or
Anchor an annotation to a specific location in a document.
The segment or anchor is a new information object that is
included in the original information object, and this new
information object is linked with any of several annotation
relationships to a new information object created by the user.
Related to composite objects. More on this under annotation
41. 41
Issues
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance,
context, usage
3 Annotation
42. 42
Data about information objects
Metadata =
data about information objects
if used for discovering, interpreting, and using information objects
Relate information objects to other types of resources. Examples:
InformationObject <hasCreator> Actor
InformationObject <dealsWith> Actor
InformationObject <containsText> Text (or, more specifically Word)
Relate a word in a text to the concept that is the meaning in which the
word is used in this particular position.
InformationObjectA <hasAbstract> InformationObjectB
InformationObjectA <hasCriticalCommentary> InformationObjectC
InformationObjectD <hasSupportiveCommentary>InformationObjectC
43. 43
More on defining metadata
The “metadata-hood” of an information object does not reside in the
information object, but in its relationship to another information
object and, more specifically, in its use
A piece of data
is used as metadata
if it is used for the purpose of discovering, interpreting, and
using information objects, which then give the ultimate data
wanted.
The same piece of data may fill the ultimate need to of the user in
one situation and be used as metadata in another situation.
44. 44
Not metadata
• Data about resources that are not information objects are not
metadata even if they are similar in form.
• Data about information objects are not always used as metadata.
For example, using author data to count a faculty members
publications or citation data to compute impact
• Extensive discussion of what exactly is the definition of metadata is
not a good use of resources. A system should provide the data that
are useful to a user for whatever purpose; what each piece of data is
called is less important.
45. 45
Metadata typologies
Metadata (and data in general) can be divided into categories from
several perspectives, and within each perspective there exist
several approaches.
Some examples of how to categorize metadata
• by purposes or use. Since the same unit of metadata can be
used for several purposes, the resulting categories overlap.
• by source, for example, extracted, assigned by cataloger,
assigned by user (social tagging), from usage tracking
• by intrinsic characteristics, for example data about
provenance or about the format of the information object
46. 46
Some metadata uses
A Learn about information objects and interpret them; this includes
A1 Learn about the identity and characteristics of information objects
(descriptive metadata)
A2 Learn about the history and other features of the context of the
information object (contextual metadata)
B Learn how to use an information object, including
B1 Learn how to gain legal access (access and rights metadata)
B2 Learn how to gain technical access to the information object
(what machinery and software is needed to access the
information object for a given purpose, such as assimilation by a
person or processing by a computer program)
C Manage information objects (administrative metadata), in particular
C1 Manage the preservation of information objects
(preservation metadata).
47. 47
Usage data
Data on usage of resources
and on usage rights, usage history, future use / preservation important
for discovering, interpreting, and using resources as well as managing
resources
Some of these data can be collected automatically
If the resource in question is an information object, this kind of data is
often used as metadata
48. 48
Issues
1a Content in the overall context of a DL reference model
1b Modeling information objects
1c Levels, versions, and relationships
1d Composite information objects / resources
1e Resource identifiers
2 Metadata, including provenance, context, usage
3 Annotation
49. 49
Annotation
InformationObjectA <annotatedBy> InformationObjectB
InformationObjectB may be created on the spot in order to annotate A
(InformationObjectB and the annotation relationship have the same
author) or B may preexist (the annotation relationship between A and B
is introduced by a third party)
Specific type of annotation expressed by specializing the annotatedBy
relationship, for example
InformationObjectA <criticizedBy> InformationObjectB
InformationObjectA <hasCriticalCommentary> InformationObjectC
InformationObjectD <hasSupportiveCommentary> InformationObjectC
InformationObjectE <isPartOfSpeech> PartOfSpeech
Annotation-hood is in the relationship, not in the information object
50. 50
Annotation
Annotation-hood is in the relationship, not in the information object
There is a wide range of relationship types that are called annotations.
Linguists think of annotations differently than scholars making
comments on a text.
Rather than trying to define exactly what “annotation” means, the
reference model should include a comprehensive list of relationship
types that might be considered annotation by somebody so that
anybody can define their meaning of annotation by giving the
appropriate subset of annotation relationship types.
The same thought applies to metadata, discussed on a later slide.
51. 51
Special resource types for annotations
Some annotations require special types of resources.
Examples
Annotate a text with part-of-speech indications
annotated resource : a one-word fragment of the text
annotating resource: a value from a list of parts of speech
Annotate a text with meaning for word sense disambiguation
annotated resource : a word or phrase in the text
annotating resource: a value from a list of meanings defined in some way
Annotation through underlining or other marks
annotated resource : a fragment of text or other information object
annotating resource: a pair (sign, meaning), e.g. (underline, important) or
(?, check this out) or (X, nonsense)
The annotated resource and the annotating resource may be very short
52. 52
Annotation and metadata
Metadata and annotation data overlap, and different
communities and individuals have different definitions of what is
included in metadata and what is included in annotations.
The precise nature of a unit of data about an information
object is determined by the relationship type and the resource
that is linked to. The interpretation of each type of data is in the
eye of the beholder.
Need an inventory of relationship types (a type of ontology)
For example, the CIDOC Content Reference Model (CIDOC/CRM)
is an inventory of broad relationship types.
In such an inventory, one could indicate who considers a given
relationship type as usable as metadata and/or as belonging to
annotation.
53. 53
Take-home message 1
The entity-relationship model (E-R model) provides
the unifying principle for a digital library content model
The E-R model allows representation of structured data of
any complexity on a conceptual level.
Defining relationships between information objects handles
• Modeling information objects
• Levels, versions, and relationships
• Composite information objects / resources
• Metadata
• Annotation
Many notions are captured better through
relationships than fine distinctions of entity types
54. 54
Take-home message 2
Any reference model
• needs to be abstract and must not commit to any
particular standard or design decision
• rather, it must provide a framework for specifying
the commitments of any particular DL
(or information system)
A reference model provides
a systematic framework for description and analysis,
not a prescription
57. 57
Construction process
• Need to be sure all applicable concepts from various
sources such as the 5S model and FRBR/CRM are
included, either in the skeleton model or in a list of
values / choices, as appropriate
• There is still work to be done to pull reference model
subject matter out of the reference architecture
document, and vice versa.
58. 58
Construction process
• We should have an online version of the reference
model document with the following properties
• Links to discussion of issues and underlying
rationale, capturing some of the discussion in the
group.
• Links from the reference model to the appropriate
section of the reference architecture
• The Wiki page may not quite do it.
59. 59
There are two ways to communicate such statements.
1. One learns what one wants to know about the resource in
focus immediately from a relationship instance.
Hamlet <authoredBy> Shakespeare
The drug treatment frame on Taxoteer
The actual data of interest are represented in a database that
captures these statements (relationship instances), such as
a collection of Prolog statements
a relational database
an object-oriented database
2. One needs to consult an information object that is related to
the resource in focus.
Shakespeare schrieb den Hamlet im Jahre 1625
Hamlet wurde von Shakespeare verfasst
Taxoteer ist effektiv in der Behandlung von Krebsen die keine
Rezeptoren fuer Estrogen haben. In aelteren Personen liegt die
Erfolgsrate bei 50%
60. • The DL designer must decide how to identify
the new resource that is a part of an existing
resource and
the new text object created by the annotator
and how to store the link between these two
information objects
60
61. 61
Identifying information objects
Architecture issues
Definition on the spot, options
(1) use completely independent identifiers and store the relationship explicitly
(2) use dependent identifiers
The part of a document can be identified by
document identifier followed by information that uniquely identifies the part.
The part relation is implied by the structure of the identifier.
The annotation information object could be identified by
the identifier of the resource being annotated followed by a short string that identifies the nth
annotation of this resource (like a footnote).
The relationship between the resource and the resource annotating it would be implied by
the identifier (however, the specific type of the annotation relationship would not be captured
this way).
The resource that annotates still can be referenced from any other context.
Implicit representation
Embedded annotations: The annotation is embedded in the document, linked to a point in a
text that is identified only by the place of the annotation. This could be converted to an
explicit representation.
62. 62
Some metadata uses
This is a specialization of the functions of data given above
A learn about other data, that is, information objects, and understand them; this includes
A1 learn about the identity and characteristics of information objects
(descriptive metadata)
A2 learn about the history and other features of the context of the
information object (contextual metadata)
B learn how to use an information object (source of data), including
B1 learn how to gain legal access to the information object
(access and use rights metadata)
B2 learn how to gain technical access to the information object
(what machinery and software is needed to access the information object
for a given purpose, such as assimilation by a person or processing by a
computer program)
C manage information objects (administrative metadata), in particular
C1 manage the preservation of information objects (preservation metadata).
63. 63
Metadata in the reference model
When describing a DL using the reference model,
need to be able to indicate any typology of metadata
used in the DL