The importance of metadata for datasets: The DCAT-AP European standardGiorgia Lodi
The presentation was delivered for a course at the University of Bologna. It presents DCAT-AP and the Italian extension DCAT-AP_IT. It includes a discussion on the new version of DCAT and DCAT-AP
The LODE-BD Recommendations present a reference tool that assist bibliographic data providers in selecting appropriate encoding strategies according to their needs in order to facilitate metadata exchange by, for example, constructing crosswalks between their local data formats and widely-used formats or even with a Linked Data representation. The LODE-BD Recommendations aim to address two questions: how to encode bibliographic data hosted by diverse open repositories for the purpose of exchanging data across data providers; and how to encode these data as Linked Open Data (LOD) - enabled bibliographic data.
The core component of the LODE-BD Recommendations report contains a set of recommended decision-making trees for common properties used in describing a bibliographic resource instance (article, monograph, thesis, conference paper, presentation material, research report, learning object, etc. - in print or electronic format). Each decision tree is delivered with various acting points and the matching encoding suggestions, usually with multiple options.
LODE-BD is a part of a series of LODE recommendations overarching a wide range of resource types including the encoding of value vocabularies used in describing agents, places, and topics in bibliographic data.
LOD2 plenary meeting in Paris: presentation of WP5: State of Play: Linked Data Visualization, Browsing and Authoring, by Renaud Delbru (National University of Ireland, Galway).
Beyond Seamless Access: Meta-data In The Age of Content IntegrationNew York University
This was an example of meta-data research that I did before Dot-COM bubble hit the East Coast in 2000. Much of what we envisioned for content integration shaped the meta-data movement for today. Its full potentials have not reached yet, e.g. the level of intelligent data for semantic apps, personalized delivery, interactive and bidirectional-linking services, repurposed services, etc. It's the first of its kind weaving content from scholarly publications (particularly in the context of formal and informal communications) with library mission critical applications in authority control, meta-data, directory services, ILS, ILL, knowledge-base for site map, etc.
HDL - Towards A Harmonized Dataset Model for Open Data PortalsAhmad Assaf
The Open Data movement triggered an unprecedented amount of data published in a wide range of domains. Governments and corporations around the world are encouraged to publish, share, use and integrate Open Data. There are many areas where one can see the added value of Open Data, from transparency and self-empowerment to improving efficiency, effectiveness and decision making. This growing amount of data requires rich metadata in order to reach its full potential. This metadata enables dataset discovery, understanding, integration and maintenance. Data portals, which are considered to be datasets' access points, offer metadata represented in different and heterogenous models. In this paper, we first conduct a unique and comprehensive survey of seven metadata models: CKAN, DKAN, Public Open Data, Socrata, VoID, DCAT and Schema.org. Next, we propose a Harmonized Dataset modeL (HDL) based on this survey. We describe use cases that show the benefits of providing rich metadata to enable dataset discovery, search and spam detection
The importance of metadata for datasets: The DCAT-AP European standardGiorgia Lodi
The presentation was delivered for a course at the University of Bologna. It presents DCAT-AP and the Italian extension DCAT-AP_IT. It includes a discussion on the new version of DCAT and DCAT-AP
The LODE-BD Recommendations present a reference tool that assist bibliographic data providers in selecting appropriate encoding strategies according to their needs in order to facilitate metadata exchange by, for example, constructing crosswalks between their local data formats and widely-used formats or even with a Linked Data representation. The LODE-BD Recommendations aim to address two questions: how to encode bibliographic data hosted by diverse open repositories for the purpose of exchanging data across data providers; and how to encode these data as Linked Open Data (LOD) - enabled bibliographic data.
The core component of the LODE-BD Recommendations report contains a set of recommended decision-making trees for common properties used in describing a bibliographic resource instance (article, monograph, thesis, conference paper, presentation material, research report, learning object, etc. - in print or electronic format). Each decision tree is delivered with various acting points and the matching encoding suggestions, usually with multiple options.
LODE-BD is a part of a series of LODE recommendations overarching a wide range of resource types including the encoding of value vocabularies used in describing agents, places, and topics in bibliographic data.
LOD2 plenary meeting in Paris: presentation of WP5: State of Play: Linked Data Visualization, Browsing and Authoring, by Renaud Delbru (National University of Ireland, Galway).
Beyond Seamless Access: Meta-data In The Age of Content IntegrationNew York University
This was an example of meta-data research that I did before Dot-COM bubble hit the East Coast in 2000. Much of what we envisioned for content integration shaped the meta-data movement for today. Its full potentials have not reached yet, e.g. the level of intelligent data for semantic apps, personalized delivery, interactive and bidirectional-linking services, repurposed services, etc. It's the first of its kind weaving content from scholarly publications (particularly in the context of formal and informal communications) with library mission critical applications in authority control, meta-data, directory services, ILS, ILL, knowledge-base for site map, etc.
HDL - Towards A Harmonized Dataset Model for Open Data PortalsAhmad Assaf
The Open Data movement triggered an unprecedented amount of data published in a wide range of domains. Governments and corporations around the world are encouraged to publish, share, use and integrate Open Data. There are many areas where one can see the added value of Open Data, from transparency and self-empowerment to improving efficiency, effectiveness and decision making. This growing amount of data requires rich metadata in order to reach its full potential. This metadata enables dataset discovery, understanding, integration and maintenance. Data portals, which are considered to be datasets' access points, offer metadata represented in different and heterogenous models. In this paper, we first conduct a unique and comprehensive survey of seven metadata models: CKAN, DKAN, Public Open Data, Socrata, VoID, DCAT and Schema.org. Next, we propose a Harmonized Dataset modeL (HDL) based on this survey. We describe use cases that show the benefits of providing rich metadata to enable dataset discovery, search and spam detection
Bengkel Metadata, RDA & Hyperlink PUiTM 2010
Anjuran : BPBPT PTAR
Tarikh : 6 April 2010
Tempat : Bilik Seminar PTAR 1
Penceramah : En. Goh Keng Yew
Jawatan: Technical Director (Paradigm System Berhad) System Engineer
Managing Electronic Collections in Alma presented at the 2016 GaCOMO in Athens as part of the Pre-Conference sponsored by TSIG and the Cataloging Functional Group of GIL.
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...Avinash Ramineni
Enterprises have been rapidly adopting data lakes as a complement or replacement of data warehouses. Many of the Data lake implementations are ignoring the inherent drawbacks and limitations of Data Lakes and ending up as data swamps with little or no benefit to the businesses. In this session we will go through some of challenges and the key aspects that need to be considered for successful Data lake implementations.
RDBMS gave us table schemas. A table schema, which is an essential metadata component, gave us the power to validate data types, and enforce constraints. In the age of varying data and schema-less data stores, how can we enforce these rules and how can we leverage metadata (even in RDBMS) to empower data validity, code checks, and automation.
This is a brief background into Big data (data lake) to put in context the importance of metadata from a governance perspective and more especially in todays heterogeneous big data platforms.
Data Lake for the Cloud: Extending your Hadoop ImplementationHortonworks
As more applications are created using Apache Hadoop that derive value from the new types of data from sensors/machines, server logs, click-streams, and other sources, the enterprise "Data Lake" forms with Hadoop acting as a shared service. While these Data Lakes are important, a broader life-cycle needs to be considered that spans development, test, production, and archival and that is deployed across a hybrid cloud architecture.
If you have already deployed Hadoop on-premise, this session will also provide an overview of the key scenarios and benefits of joining your on-premise Hadoop implementation with the cloud, by doing backup/archive, dev/test or bursting. Learn how you can get the benefits of an on-premise Hadoop that can seamlessly scale with the power of the cloud.
Over the last 10 years, the status of educational metadata, specifically
as they relate to learning objects, has changed radically. In the heady
days of the turn of the millennium, learning objects and their metadata
were seen as being destined to forever change the shape and form of
learning; by 2007, this approach has been derisively labeled "industrialist", and said to be quickly running "out of steam."
Given such drastic changes, how are metadata to be understood in the context
of different and emerging approaches to online learning resources, above
all those labelled 'open'? This question will be the focus of this presentation by Dr. Norm Friesen, Canada Research Chair in E-Learning Practices at Thompson Rivers University and Director of the CanCore
Initiative. Dr. Friesen will consider a number of alternatives to "learning object metadata." Dr. Friesen will conclude his
presentation by discussing an alternative that incorporates both high tech and ow tech approaches --one that is currently under
development as a multi-part international standard under the auspices of ISO (the
International Organization for Standardization).
This is a presentation on HTML from a series of works on the concept of library automation, a training course for Library and Information Science students
Panel presentation to a graduate class at the University of Arizona School of Information Resources and Library Science. Invited by Dr. Jana Bradley. July 2006.
The Rhizomer Semantic Content Management SystemRoberto García
The Rhizomer platform is a Content Management System (CMS) based on a Resource Oriented Approach (RESTful) and Semantic Web technologies. It achieves a great level of flexibility and provides sophisticated content management services. All content is described using semantic metadata semi-automatically extracted from multimedia content, which enriches the browsing experience and enables semantic queries. A usable user interface is built on top of the CMS in order to facilitate the interaction with content and enhance it with the information provided by the associated semantic metadata. As an application scenario of the platform, its use in a media company where audio content is managed and its speech transcript semantically annotated is described.
Part 4 of tutorials at DC2008, Berlin. (International Conference on Dublin Core and Metadata Applications). See also part 1-3 by Jane Greenberg, Pete Johnston, and Mikael Nilsson on DC history, concepts, and other schemas. This part focuses on practical issues.
Html for beginners. A basic information of html for beginners. A more depth coverage of html and css will be covered in the future presentations. visit my sites http://technoexplore.blogspot.com and http://hotjobstuff.blogspot.com for some other important presentations.
Tazkirah disampaikan oleh Ustaz Hj. Januri Salimin dari CITU pada Program Tazkirah @PTAR pada 28 Jan 2011
http://tazkirahptar.blogspot.com/2011/01/tazkirah-januari.html
More from Bhgn. Pembangunan Bibliografik & Pemprosesan Teknikal (6)
1. METADATA: AN INTRODUCTION Presented by: Hamidahbt. HJ. A. Rahman Senior Lecturer Faculty of Information Management UiTMPuncakPerdana Campus 40150 Shah Alam SELANGOR DARUL EHSAN 4/5/2010 1
2. Metadata is structured data which describes the characteristics of a resource. It shares many similar characteristics to the cataloguing that takes place in libraries, museums and archives. The term "meta" derives from the Greek word denoting a nature of a higher order or more fundamental kind. A metadata record consists of a number of pre-defined elements representing specific attributes of a resource, and each element can have one or more values. 4/5/2010 2 What is metadata?
3. What is metadata? Structured data about resources Library catalogues Abstracting and indexing services Archival finding aids Museum documentation Community information Carriers: MARC, HTML, SGML, XML 4/5/2010 3
6. Types of metadata Descriptive Metadata: to serve the purposes of discovery (how one finds a resource), identification (how a resource can be distinguished from other, similar resources), and selection (how to determine that a resource fills a particular need, for example, for the DVD version of a video recording) 4/5/2010 6
7. Types of metadata Administrative Metadata: is information intended to facilitate the management of resources. It can include such information as when and how an object was created,who is responsible for controlling access to or archiving the content, what control or processing activities have been performed in relation to it, and what restrictions on access or use apply. 4/5/2010 7
8. Types of metadata Structural Metadata (SM): can be thought of as the glue that holds compound digital objects together. A book, for example, may have many chapters, each consisting of a set of pages, each page represented by a separate digital file. Structural metadata (SM) is required to record the relationships between physical files and pages, between pages and chapters, and between chapters and the book as a whole. Presentation software uses SM to display Tables of contents and to deliver such functions as going directly to a requested chapter, or to turn pages forward or backward in order. 4/5/2010 8
9. Why use metadata? Metadata provides the essential link between the information creator and the information user. 4/5/2010 9
17. Metadata may be deployed in a number of ways: Embedding the metadata in the Web page by the creator or their agent using META tags in the HTML coding of the page As a separate HTML document linked to the resource it describes In a database linked to the resource. The records may either have been directly created within the database or extracted from another source, such as Web pages. 4/5/2010 11
21. EAD (Encoded Archives Description) IMS (IMS Global Learning Consortium) AGLS (Australian Government Locator Service) 4/5/2010 12
22. The data will be unusable, unless the encoding scheme understands the semantics of the metadata schema. The encoding allows the metadata to be processed by a computer program. Important schemes include: HTML(Hyper-Text Markup Language) SGML (Standard Generalised Markup Language) XML (eXtensible Markup Language) RDF (Resource Description Framework) MARC (MAchine Readable Cataloging) MIME (Multipurpose Internet Mail Extensions) 4/5/2010 13
23. Markup languages SGML- Standard Generalised Markup Language - controls document formatting for publication XML- Extensible Markup Language - “next generation” SGML HTML- Hyper Text Markup Language - SGML subset, controls display of web pages Tags (usually paired) structure text into elements e.g. headings, paragraphs, lists, etc. <title> </title> <p> </p> <li> </li> 4/5/2010 14
29. MARC – elements 1XX Main entry 2XX Title, SR, edition, publication 3XX Physical description 4XX Series 5XX Notes 6XX Subject access 7XX Added entries 8XX Added entries for series 9XX References and local fields 4/5/2010 16
35. Elements – XML reference name and tag4/5/2010 17
36.
37.
38. ONIX record <ISBN> 0123456789 </ISBN> <DistinctiveTitle> Alice in Wonderland </Distinctive Title> <Contributor> <ContributorRole> Author </ContributorRole> <PersonNameInverted> Carroll, Lewis </PersonNameInverted> </Contributor> <PublisherName> Collins </PublisherName> <PublicationDate> 2000 </Publication Date> 4/5/2010 19
39. Dublin Core - structure Simple resource discovery DCMES – Dublin Core Metadata Element Set HTML the most common ‘carrier’ Comprises 15 elements with element qualifiers element encoding schemes optional/mandatory elements Application profiles 4/5/2010 20
40. Dublin Core - elements Title Creator Subject Description Publisher Contributor Date Resource Type Format Resource Identifier Source Language Relation Coverage Rights 4/5/2010 21
41. Dublin Core - record <Title> Alice in Wonderland </Title> <Creator> Lewis Carroll </Creator> <Subject> <LCSH> Fiction </LCSH> </Subject> <Publisher> Project Gutenberg </Publisher> <Date> 2000 </Date> <Format> ASCII file via FTP </Format> <Identifier> http://promo.net/pg/….. </Identifier> 4/5/2010 22
45. EAD - elements <eadheader> [id + bibliographic inf. for finding aid] <archdesc> [data on a body of archival materials] <did> [container, physical description, physical location, repository, date and title of unit] <admininfo> [biography, scope, access, arrangement] <controlaccess> [name, place, genre, subject, title] </archdesc> 4/5/2010 25
47. EAD record - <archdesc> <archdesc> collection <did> <abstract> A collection of materials in and about shorthand collected by Sir Isaac Pitman and James Pitman</abstract> </did> <controlaccess> <subject encodinganalog=“MARC650”> Shorthand </subject> </controlaccess> </archdesc> </ead> 4/5/2010 27
48. Collection Description Schema developed May 2000 Access version for RSLP – summer 2001 Web version for Reveal – spring 2002 General attributes Subject Dates Associated agents External relationships 4/5/2010 28
50. Coll. Desc. - record Title: Pitman Collection Strength: Shorthand – national collection Phys. Desc: Printed texts and manuscripts Lang: English, Spanish, Esperanto, …… Access: Written request to the Librarian, Bath Univ. Accrual: passive, deposit Location: The Library, Bath University, Bath Subject: Shorthand, Sir Isaac Pitman Owner: Pitman Publishing Co. Catalogue: Bath University OPAC 4/5/2010 30
51. M21 Community Information Same principles as MARC Bibliographic Leader individual/organization/program/event/other Fixed fields 001-008, 010-099 fixed fields 007 disability facilities 008 special aspects Variable fields 4/5/2010 31
52. M21 Comm. Inf. - elements 1XX Name 2XX Title and Address 3XX Physical description 4XX Series (for events) 5XX Notes 6XX Subject access 7XX Added entries8XX Other variable fields 4/5/2010 32
53. M21 Comm. Inf. - record 110 $a CILIP 245 $a CILIP HQ 247 $a LA HQ $f 19?? - 2002 270 $a 7 Ridgmount St, London, WC1E 7AE $k 020 7255 0505 $m info@cilip.org.uk $r 9am to 6pm 311 $a Ewart Room $d seats 50 $g £100 per day 312 $a Overhead projector $f £10 per day 581 $a Library + Information Update 856 $a http://www.cilip.org.uk 4/5/2010 33
60. How does one create metadata? DC-dot - http://www.ukoln.ac.uk/metadata/dcdot/. This service will retrieve a Web page and automatically generate Dublin Core metadata, either as HTML tags or as RDF/XML, suitable for embedding in the section of the page. DCmeta - http://www.dstc.edu.au/RDU/MetaWeb/generic_tool.html. Developed by Tasmania Online. It is based on SuperNoteTab text-editor and can be customised. HotMeta - http://www.dstc.edu.au/Research/Projects/hotmeta/. A package of software, including metadata editor, repository and search engine. 4/5/2010 35