EDF2014: Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universid...European Data Forum
Selected Talk of Daniel Vila-Suero, Researcher, Ontology Engineering Group, Universidad Politecnica de Madrid, Spain at the European Data Forum 2014, 19 March 2014 in Athens, Greece: 3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
1. 19/03/2014
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
Daniel Vila-Suero (1), Victor Rodríguez-Doncel (1), Asunción Gómez-Pérez (1), Philipp Cimiano (2), John P. McCrae (2), and Guadalupe Aguado-de-Cea (1)
(1) Ontology Engineering Group, Facultad de Informática, UPM. Madrid, Spain
{dvila, vrodriguez, asun, lupe}@fi.upm.es
(2) Forschungsbau Intelligente Systeme (FBIIS). Universität Bielefeld. Bielefeld, Germany
{cimiano, jmccrae}@cit-ec.uni-bielefeld.de
2. Context: Lider project
• Ecosystem of linguistic resources (corpora, lexico-semantic data, etc.) as Linked Data and NLP services to support content analytics.
Join us!
http://lider-project.eu
Linked Data for Language Technologies Community Group (LD4LT)
3. Licensing Linked Data, why?
Open Data / Proprietary Data
- Gain visibility
- Encourage re-use
- Protect your data
- Enable ways to track usage
- Think about new business models
4. How open is the LOD cloud?
[1] Rodriguez-Doncel, Victor et al., 2013. Rights declaration in Linked Data. In Proc. of the 3rd Int. Workshop on Consuming Linked Data, O. Hartig et al. (Eds.), CEUR vol. 1034 (2013)
5. How open is the LOD cloud?
• 338 datasets in :
[1] Rodriguez-Doncel, Victor et al., 2013. Rights declaration in Linked Data. In Proc. of the 3rd Int. Workshop on Consuming Linked Data, O. Hartig et al. (Eds.), CEUR vol. 1034 (2013)
6. Linguistic Linked Data
Language resources as Linked Data:
- Lexica
- Language descriptions
- Corpora
- …
Linguistic LOD (LLOD) cloud
1 "Open Data and Linguistics" working group, Open Knowledge Foundation, see more at http://linguistics.okfn.org/
9. What is 3LD?
3LD
Linguistic Linked Licensed Data
Language resources such as:
- Lexica
- Corpora
- Dictionaries ..
10. What is 3LD?
3LD
Linguistic Linked Licensed Data
Linguistic data as Linked Data using RDF and standard data models (vocabularies):
- Lexica
- Corpora ..
NIF: NLP Interchange Format
11. What is 3LD?
3LD
Linguistic Linked Licensed Data
Linguistic Linked Data published along with a machine-readable license.
ODRL: Open Digital Rights Language
NIF: NLP Interchange Format
16. Guideline: Licensing models & mechanisms
Step 1: Add "rights" metadata in the dataset description (e.g., VoID, DCAT).
Step 2: Use standard predicates to declare "rights" statements (e.g., Dublin Core terms: dc:rights, dct:license).
Step 3: Is a standard license available?
- 3a (Yes): Use the URI of a standard license, e.g., CC0.
- 3b (No): Use a rights declaration language, e.g., ODRL.
ODRL: Open Digital Rights Language
DCAT: Data Catalog Vocabulary
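The licensing guideline can be sketched in code. The following Python snippet is a minimal illustration, not the authors' tooling. It renders a Turtle dataset description (step 1) and attaches the rights statement with dct:license (step 2). It uses the URI of a standard license when one is available (step 3a) and otherwise falls back to the URI of an ODRL policy (step 3b). The function name describe_dataset, the example.org URIs, and the choice to link the ODRL policy through dct:license are assumptions made for illustration only.

```python
from typing import Optional

# URI of the CC0 public domain dedication, the standard license named in the guideline.
CC0 = "http://creativecommons.org/publicdomain/zero/1.0/"

PREFIXES = (
    "@prefix dcat: <http://www.w3.org/ns/dcat#> .\n"
    "@prefix dct: <http://purl.org/dc/terms/> .\n"
)


def describe_dataset(
    dataset_uri: str,
    title: str,
    standard_license: Optional[str] = None,
    odrl_policy: Optional[str] = None,
) -> str:
    """Render a Turtle DCAT description that carries a rights statement.

    Hypothetical helper: the title is inserted without escaping, so it must
    not contain double quotes or backslashes.
    """
    # Step 3a: prefer the URI of a standard license when one is available.
    # Step 3b: otherwise point to a policy written in a rights language (ODRL).
    rights_uri = standard_license or odrl_policy
    if rights_uri is None:
        raise ValueError("provide a standard license URI or an ODRL policy URI")
    # Steps 1 and 2: dataset description with a standard predicate (dct:license).
    return (
        f"{PREFIXES}\n"
        f"<{dataset_uri}> a dcat:Dataset ;\n"
        f'    dct:title "{title}" ;\n'
        f"    dct:license <{rights_uri}> .\n"
    )


if __name__ == "__main__":
    # Open dataset published under a standard license (3a).
    print(describe_dataset("http://example.org/dataset/lexicon", "Example lexicon",
                           standard_license=CC0))
    # Restricted dataset that points to a hypothetical ODRL policy (3b).
    print(describe_dataset("http://example.org/dataset/corpus", "Example corpus",
                           odrl_policy="http://example.org/policy/1"))
```

Keeping the decision in one place means a publisher only has to answer the question in step 3. The resulting description always carries exactly one machine-readable rights statement.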
17. Demo: Conditional access to Linked Data
• Prototype developed at the Ontology Engineering Group.
• A license-aware Linked Data server and a data policies and licenses manager.
• Using Web standards (DCAT descriptions, SPARQL constructs, ODRL RDF policies, etc.)
Victor Rodríguez-Doncel
vrodriguez@fi.upm.es
18. Demo: Use case
• Spanish geographical data: administrative units, geopositions, links to DBpedia
1 Browse the data (user)
2 Set policies for parts of the dataset (admin)
3 Gain access to the restricted data (user)
27. Gain access to restricted data (user)
<http://localhost:99/ldr/policy/ee32f675-ccae-4ca9-a544-3c07abf0b16e>
a <http://www.w3.org/ns/odrl/2/Policy> , <http://www.w3.org/ns/odrl/2/Set>;
<http://www.w3.org/2000/01/rdf-schema#comment>
"Individual triples are available upon payment of 1 euro cent" ;
<http://www.w3.org/ns/odrl/2/permission>….
The work I will present today is a collaboration between the Ontology Engineering Group at Universidad Politécnica de Madrid and Universität Bielefeld, but it is also the result of many discussions among the partners of the EU project Lider.
But what is Lider? Lider is a support and coordination action whose goal is to set the pathway for the creation of an ecosystem of linguistic Linked Data and NLP services to support enterprise content analytics in Europe. A crucial step towards this is to listen to industry and the community, so please join us in the newly created W3C Linked Data for Language Technologies Community Group.
In these discussions with the community there are several recurring topics, such as data modelling, quality, and provenance, but one of them seems especially relevant, and it will be the main topic of this talk.
The main outcomes of the project will be a roadmap for the EU, several guidelines to help data publishers and consumers, a reference architecture, and an industrial community.
As you might have guessed from the title, the topic is licensing Linked Data, and in particular Linguistic Linked Data. But why is this important?
Whether you are publishing Open Data or data under more restrictive terms, you and your potential data consumers will benefit from providing a license along with your data.
In the case of Open Data …
For data under more restrictive terms of use …
Given that everyone seems to agree on this, what is the current practice?
In 2013, members of my group performed a study of the so-called Linked Data cloud, and the results were a bit surprising (or maybe not).
Although there are many green areas (licenses such as public domain or those that require attribution), you can see several red and orange areas corresponding to restrictive licenses, and a considerable mass of grey which corresponds to
If you are interested you can read the paper, but as you can see in this graph, almost 50% of the datasets are published either under unspecified licenses or even closed licenses.
Going back to our topic, Linguistic Linked Data: as you might be aware, a new cloud of Linguistic Linked Data has recently emerged with the support of the Open Data and Linguistics working group. One can look at this cloud from several perspectives (language, type of resource, data models, or quality), but what about the licenses? How open is this cloud?
In this case we found that it is certainly more open than the LOD cloud, although there is still around 13 percent with unspecified or restrictive licenses. Additionally, this cloud has been selected and curated by a working group, but what will happen when the scope gets broader and includes resources from ELRA or META-SHARE, for example?
This concern is why we came up with the concept of 3LD, which stands for Linguistic Linked Licensed Data.