The document discusses semantic web technologies including linked data and SPARQL. It describes how the semantic web allows sharing and connecting data across applications through common data formats and languages to describe relationships between data and real world objects. Linked data follows principles like using URIs to identify resources and HTTP URIs to look up related data through dereferenceable links, enabling exploration of a web of connected data.
The document provides methodological guidelines for publishing linked data. It introduces linked data and its key principles of using URIs, HTTP URIs, providing useful information through standards like RDF and SPARQL, and including links between data. The rest of the document outlines guidelines for publishing linked data, including identifying data sources, modeling vocabularies by reusing existing ones, generating RDF data from sources, generating URIs, publishing and linking the RDF data, enabling discovery through mechanisms like CKAN and Sitemaps, and tools that can help with each step of the process.
Lecture 3: Data Formats on the Social Web (2013)Lora Aroyo
This document discusses data formats on the social web. It describes vocabularies used to define terms for exchanging and integrating data between applications. In particular, it describes FOAF (Friend of a Friend), an ontology used for publishing personal profile data and social relationships as linked RDF documents. FOAF allows integrating factual information about people with other human-oriented content on the web. The document also briefly mentions challenges of cleaning and transforming messy social web data into consistent formats.
The document discusses the history and evolution of the World Wide Web from its inception in the late 1980s to present day. It outlines key milestones like the creation of HTTP, HTML, the first web browser in 1991. It then describes how the web grew from static informational sites to dynamic platforms that could support real-time content updates and user-generated content. The trends of more content from diverse sources and a shift from static to dynamic websites that can be easily updated are discussed as factors that necessitated the development of dynamic websites.
This document discusses HathiTrust, a digital repository managed by over 60 partner institutions, as a potential repository for government documents. It provides an overview of HathiTrust, noting that it contains over 10 million volumes, including over 2.9 million public domain volumes. The document estimates that around 300,000 titles in HathiTrust, or 4% of the total, are US government documents, with 80% of those being in the public domain. It compares the ability to find full text of documents between HathiTrust and Google Books. The document outlines who can access what within HathiTrust and how content can be searched, discovered, and loaded into local library catalogs. It acknowledges challenges around searching, copyright statuses, and metadata
This presentation gives a brief overview on achievements and challenges of the Data Web and describes different aspects of using the Semantic Data Wiki OntoWiki for Linked Data management.
Linked data and Semantic Web Applications for LibrariesVikas Bhushan
This document provides an overview of linked data and semantic applications for libraries. It discusses the key concepts of linked data including URIs, RDF, ontologies and the semantic web. It then outlines some of the core technologies that enable linked data such as XML, RDF, RDF Schema and OWL. The document also reviews current trends and implementations of linked data including projects from DBPedia, BBC, Google and OCLC. Finally, it discusses why linked data is important for libraries by allowing library resources to be more discoverable on the web and enabling new applications using library metadata.
These slides were originally a tutorial presented for the SIG preceding the May 2009 meeting of the PRISM Forum.
They attempt to give a survey of the technologies, tools, and state of the world with respect to the Semantic Web as of the first half of 2009.
The document provides methodological guidelines for publishing linked data. It introduces linked data and its key principles of using URIs, HTTP URIs, providing useful information through standards like RDF and SPARQL, and including links between data. The rest of the document outlines guidelines for publishing linked data, including identifying data sources, modeling vocabularies by reusing existing ones, generating RDF data from sources, generating URIs, publishing and linking the RDF data, enabling discovery through mechanisms like CKAN and Sitemaps, and tools that can help with each step of the process.
Lecture 3: Data Formats on the Social Web (2013)Lora Aroyo
This document discusses data formats on the social web. It describes vocabularies used to define terms for exchanging and integrating data between applications. In particular, it describes FOAF (Friend of a Friend), an ontology used for publishing personal profile data and social relationships as linked RDF documents. FOAF allows integrating factual information about people with other human-oriented content on the web. The document also briefly mentions challenges of cleaning and transforming messy social web data into consistent formats.
The document discusses the history and evolution of the World Wide Web from its inception in the late 1980s to present day. It outlines key milestones like the creation of HTTP, HTML, the first web browser in 1991. It then describes how the web grew from static informational sites to dynamic platforms that could support real-time content updates and user-generated content. The trends of more content from diverse sources and a shift from static to dynamic websites that can be easily updated are discussed as factors that necessitated the development of dynamic websites.
This document discusses HathiTrust, a digital repository managed by over 60 partner institutions, as a potential repository for government documents. It provides an overview of HathiTrust, noting that it contains over 10 million volumes, including over 2.9 million public domain volumes. The document estimates that around 300,000 titles in HathiTrust, or 4% of the total, are US government documents, with 80% of those being in the public domain. It compares the ability to find full text of documents between HathiTrust and Google Books. The document outlines who can access what within HathiTrust and how content can be searched, discovered, and loaded into local library catalogs. It acknowledges challenges around searching, copyright statuses, and metadata
This presentation gives a brief overview on achievements and challenges of the Data Web and describes different aspects of using the Semantic Data Wiki OntoWiki for Linked Data management.
Linked data and Semantic Web Applications for LibrariesVikas Bhushan
This document provides an overview of linked data and semantic applications for libraries. It discusses the key concepts of linked data including URIs, RDF, ontologies and the semantic web. It then outlines some of the core technologies that enable linked data such as XML, RDF, RDF Schema and OWL. The document also reviews current trends and implementations of linked data including projects from DBPedia, BBC, Google and OCLC. Finally, it discusses why linked data is important for libraries by allowing library resources to be more discoverable on the web and enabling new applications using library metadata.
These slides were originally a tutorial presented for the SIG preceding the May 2009 meeting of the PRISM Forum.
They attempt to give a survey of the technologies, tools, and state of the world with respect to the Semantic Web as of the first half of 2009.
The European Information Landscape
LIBER and Europeana Travel
LIBER and Early European Books
Copyright and IPR
Digital Preservation
European policy on Open Access
EU Consultation on Access to, and Preservation of, Scientific Information
LERU Roadmap Towards Open Access
The document provides an introduction to the Semantic Web by using a simplified example of combining bookstore data from different sources. It demonstrates how [1] exporting data as relations, [2] merging the relations based on identical resources, and [3] adding extra knowledge allows for more powerful queries across datasets. The key components that enable this are using URIs to identify resources, representing data as RDF triples, and classifying resources and properties with common terminologies like ontologies. This approach aims to realize a Web of Data where decentralized and heterogeneous data sources can be programmatically integrated and queried.
This document provides notes on the course "Web Technologies" for students of the B.Tech III year Information Technology program at Malla Reddy College of Engineering & Technology. It covers topics like introduction to the internet and world wide web, web browsers, HTML, CSS, JavaScript, XML, PHP, web servers, servlets, JSP, and databases. Key concepts explained include the history of the internet, components of the world wide web, common web browser types, using HTML tags and CSS for web page formatting, and introducing core web technologies like XML, PHP, Java servlets and JSP.
Enterprise Multimedia Integration and SearchOzelin
playence presentation called "Enterprise Multimedia Integration and Search" for the Future Enterprise Systems Workshop, held jointly with the Future Internet Symposium in Berlin, Germany, 20th September 2010
Intro to the Semantic Web Landscape - 2011LeeFeigenbaum
An introduction to the Semantic Web landscape as it stands near the end of 2011. Includes an introduction to the core technologies in the Semantic Web technology stack.
This material was presented at the November, 2011, Cambridge Semantic Web meetup.
This document is a presentation about Linked Open Europeana and semantics for citizens. It discusses how the Europeana Data Model (EDM) aims to make Europeana a part of Linked Open Data. EDM preserves original metadata while allowing for interoperability between datasets. It uses standards like SKOS, OAI ORE, and DCMI to semantically enrich metadata and link cultural heritage objects to external open data sources. This will allow Europeana to provide semantic exploration, context discovery, and knowledge generation for various types of citizens like tourists, teachers, and businesses.
The document provides an overview of social semantics and the social semantic web. It discusses how social data on platforms like Facebook and Twitter can be represented semantically using ontologies and vocabularies. This includes representing people with FOAF, relationships with Schema.org, content with SIOC, and behavior with OUBO. Representing social data semantically allows it to be queried, linked across platforms, and analyzed with semantic web technologies. The social semantic web aims to overcome the siloed nature of social data and enable portability of social information.
Research Data management - Importance, Good Practices, GuidanceFrank Uiterwaal
This document discusses the importance of research data management. It defines research data management as organizing data from collection through dissemination and archiving to ensure results can be verified and built upon. Good research data management practices include developing data management plans, making data FAIR (findable, accessible, interoperable, and reusable) according to common standards, and storing data in trusted repositories with support from data stewards. Following best practices helps ensure research integrity and allows others to build upon existing work.
Modern developments in the use of internet by inqilab patelInqilab Patel
Modern developments in internet use include web browsers, ISPs, wikis, social networking, tagging, blogs, digital media sharing websites, podcasts, and bit streaming. Web browsers allow users to access and view internet resources, while ISPs provide access to the internet. Wikis enable collaborative document editing. Social networking focuses on building social relations online. Tagging helps organize and classify online information. Blogs and digital media sharing websites allow users to share content. Podcasts deliver audio content via RSS feeds. Bit streaming transmits data as a continuous sequence of bits.
The document discusses the evolution of the web from Web 1.0 to the current Web 3.0. Web 1.0 began in the 1990s and allowed for mainly read-only access to information on the internet. Web 2.0 emerged in the early 2000s and enabled user-generated content and greater interactivity through technologies like blogs, wikis, mashups and social media. Web 3.0, also called the Semantic Web, aims to make web content machine-readable through metadata and technologies like RDF and OWL so that intelligent software agents can process information on behalf of users. It involves greater integration of mobile technologies as well.
1. The document discusses the opportunities and challenges of a proposed public-private e-book platform in Flanders that would provide centralized long-term storage and facilitate exploitation of digital books.
2. It notes the importance of reading and libraries for education and culture, and argues that a government initiative is needed to ensure all players have access to content as the publishing industry shifts digital.
3. The proposed platform would leverage existing library collections and services, providing broad access to digital content through a collaborative model involving publishers, distributors, and other stakeholders.
Open Data Dialog 2013 - Linked Data in EducationStefan Dietze
The document discusses opportunities and challenges of using linked data in education. It begins by outlining how linked data principles can be useful for sharing educational data by providing background knowledge and common standards and vocabularies. However, it notes that currently only a few datasets are actually reused or linked, in part due to heterogeneity in datasets, unreliable metadata, and a lack of links between datasets. The LinkedUp project aims to address these issues by collecting and profiling open educational datasets, generating links between them, and building applications and tools to help utilize the data. Key activities include developing a dataset catalog, generating topic profiles of datasets, running challenges to identify innovative applications, and engaging stakeholders.
Shaping our Future: Digitization Partnerships Across Libraries, Archives and ...UBC Library
The document discusses trends in digital information and digitization, including convergence through collaboration between libraries, archives and museums. It provides examples of collaborative digitization projects and how they help inspire new kinds of research. Open source and open access models are also discussed as important trends. Professional competencies must evolve as the roles of librarians, archivists and curators converge in the digital realm. Global organizations have a role to play in encouraging partnerships and training.
Digitised Content: How we Make It Relevant to Researchers, Teachers and StudentsLIBER Europe
This document discusses several initiatives related to digitizing content and making it accessible through Europeana. It describes the Europeana Travel project which aims to digitize over 1 million items related to travel and make them available through Europeana. It also outlines the Early European Books project to digitize European printed works between 1475-1700 and the Europeana Libraries project to provide 5 million digitized items to Europeana from European research libraries. Finally, it discusses LIBER's vision for a future research space that would provide interactive access to digitized and born-digital materials through Europeana.
This document describes an inquiry-based science teaching approach called IBST. It involves problem-based learning activities with multiple solutions, experiments, and self-regulated learning. Students work collaboratively to analyze problems, develop strategies to find information, conduct research, and write reports. Smartphones and mobile tools are used to support field work, information sharing, and real-time collaboration between student groups. The goal is to enable contextual, adaptive, and seamless learning through a personalized recommendation and search system.
There has been plenty of hype around the Semanic Web, but will we ever see the vision of intelligent agents working on our behalf? This talk introduces the concepts of the Semantic Web as envisioned by Tim Berners-Lee over 10 years ago and compares that vision to where we have come since then. It includes a discussion of implementations such as XML, RDF, OWL (web ontology language), and SPARQL. After reviewing the design principles and enabling technologies, I plan to show how these techniques can be implemented in WebGUI.
1) The document summarizes recent reforms to the English National Health Service (NHS) proposed by the UK coalition government.
2) Key aspects of the reforms include transferring around 70% of the NHS budget to groups of general practitioners (GPs), increasing hospital autonomy and competition, and expanding patient choice.
3) The reforms aim to reduce central control over the NHS and introduce more market-based incentives, but also face significant implementation challenges and risks of disruption.
The National Ecological Observatory Network (NEON) aims to quantify belowground biomass across its continental monitoring network. It will measure fine root biomass, microbial biomass, and nitrogen transformation rates at each site. These measurements will be taken from soil cores and pits to characterize belowground processes and complement aboveground measurements. The data will help improve models of nutrient cycling and predictions of ecosystem responses to climate change.
- The document discusses a new satellite TV technology called "Unicable" that allows multiple satellite receivers to be connected to a single LNB using digital signals rather than separate cables running to each receiver.
- It describes testing done on an MTI AK541-XT2BL Unicable LNB that can support four receivers. Testers were able to easily receive channels on all receivers simultaneously without interference.
- Reception quality was found to be good, even for weaker satellites. Unicable was found to be no more sensitive to cable quality than previous satellite TV technologies. However, the technology is still new and not yet supported by many receiver manufacturers.
This document analyzes and compares the cover drive shot in cricket between two international cricketers - one who plays aggressively and one who plays defensively. It describes the phases of the shot and key performance factors for both players. The aggressive player generates more momentum through a wider stance and larger bat swing, while the defensive player focuses on accuracy with a narrow stance and rapid bat movement. A study was conducted analyzing video footage of both players hitting the cover drive. Results showed the aggressive player takes more time to swing the bat but can hit harder, while the defensive player plays the shot more accurately with less risk of error. In conclusion, the analysis compared how batting style and techniques differ between offensive and defensive cricketers for the cover drive shot.
The European Information Landscape
LIBER and Europeana Travel
LIBER and Early European Books
Copyright and IPR
Digital Preservation
European policy on Open Access
EU Consultation on Access to, and Preservation of, Scientific Information
LERU Roadmap Towards Open Access
The document provides an introduction to the Semantic Web by using a simplified example of combining bookstore data from different sources. It demonstrates how [1] exporting data as relations, [2] merging the relations based on identical resources, and [3] adding extra knowledge allows for more powerful queries across datasets. The key components that enable this are using URIs to identify resources, representing data as RDF triples, and classifying resources and properties with common terminologies like ontologies. This approach aims to realize a Web of Data where decentralized and heterogeneous data sources can be programmatically integrated and queried.
This document provides notes on the course "Web Technologies" for students of the B.Tech III year Information Technology program at Malla Reddy College of Engineering & Technology. It covers topics like introduction to the internet and world wide web, web browsers, HTML, CSS, JavaScript, XML, PHP, web servers, servlets, JSP, and databases. Key concepts explained include the history of the internet, components of the world wide web, common web browser types, using HTML tags and CSS for web page formatting, and introducing core web technologies like XML, PHP, Java servlets and JSP.
Enterprise Multimedia Integration and SearchOzelin
playence presentation called "Enterprise Multimedia Integration and Search" for the Future Enterprise Systems Workshop, held jointly with the Future Internet Symposium in Berlin, Germany, 20th September 2010
Intro to the Semantic Web Landscape - 2011LeeFeigenbaum
An introduction to the Semantic Web landscape as it stands near the end of 2011. Includes an introduction to the core technologies in the Semantic Web technology stack.
This material was presented at the November, 2011, Cambridge Semantic Web meetup.
This document is a presentation about Linked Open Europeana and semantics for citizens. It discusses how the Europeana Data Model (EDM) aims to make Europeana a part of Linked Open Data. EDM preserves original metadata while allowing for interoperability between datasets. It uses standards like SKOS, OAI ORE, and DCMI to semantically enrich metadata and link cultural heritage objects to external open data sources. This will allow Europeana to provide semantic exploration, context discovery, and knowledge generation for various types of citizens like tourists, teachers, and businesses.
The document provides an overview of social semantics and the social semantic web. It discusses how social data on platforms like Facebook and Twitter can be represented semantically using ontologies and vocabularies. This includes representing people with FOAF, relationships with Schema.org, content with SIOC, and behavior with OUBO. Representing social data semantically allows it to be queried, linked across platforms, and analyzed with semantic web technologies. The social semantic web aims to overcome the siloed nature of social data and enable portability of social information.
Research Data management - Importance, Good Practices, GuidanceFrank Uiterwaal
This document discusses the importance of research data management. It defines research data management as organizing data from collection through dissemination and archiving to ensure results can be verified and built upon. Good research data management practices include developing data management plans, making data FAIR (findable, accessible, interoperable, and reusable) according to common standards, and storing data in trusted repositories with support from data stewards. Following best practices helps ensure research integrity and allows others to build upon existing work.
Modern developments in the use of internet by inqilab patelInqilab Patel
Modern developments in internet use include web browsers, ISPs, wikis, social networking, tagging, blogs, digital media sharing websites, podcasts, and bit streaming. Web browsers allow users to access and view internet resources, while ISPs provide access to the internet. Wikis enable collaborative document editing. Social networking focuses on building social relations online. Tagging helps organize and classify online information. Blogs and digital media sharing websites allow users to share content. Podcasts deliver audio content via RSS feeds. Bit streaming transmits data as a continuous sequence of bits.
The document discusses the evolution of the web from Web 1.0 to the current Web 3.0. Web 1.0 began in the 1990s and allowed for mainly read-only access to information on the internet. Web 2.0 emerged in the early 2000s and enabled user-generated content and greater interactivity through technologies like blogs, wikis, mashups and social media. Web 3.0, also called the Semantic Web, aims to make web content machine-readable through metadata and technologies like RDF and OWL so that intelligent software agents can process information on behalf of users. It involves greater integration of mobile technologies as well.
1. The document discusses the opportunities and challenges of a proposed public-private e-book platform in Flanders that would provide centralized long-term storage and facilitate exploitation of digital books.
2. It notes the importance of reading and libraries for education and culture, and argues that a government initiative is needed to ensure all players have access to content as the publishing industry shifts digital.
3. The proposed platform would leverage existing library collections and services, providing broad access to digital content through a collaborative model involving publishers, distributors, and other stakeholders.
Open Data Dialog 2013 - Linked Data in EducationStefan Dietze
The document discusses opportunities and challenges of using linked data in education. It begins by outlining how linked data principles can be useful for sharing educational data by providing background knowledge and common standards and vocabularies. However, it notes that currently only a few datasets are actually reused or linked, in part due to heterogeneity in datasets, unreliable metadata, and a lack of links between datasets. The LinkedUp project aims to address these issues by collecting and profiling open educational datasets, generating links between them, and building applications and tools to help utilize the data. Key activities include developing a dataset catalog, generating topic profiles of datasets, running challenges to identify innovative applications, and engaging stakeholders.
Shaping our Future: Digitization Partnerships Across Libraries, Archives and ...UBC Library
The document discusses trends in digital information and digitization, including convergence through collaboration between libraries, archives and museums. It provides examples of collaborative digitization projects and how they help inspire new kinds of research. Open source and open access models are also discussed as important trends. Professional competencies must evolve as the roles of librarians, archivists and curators converge in the digital realm. Global organizations have a role to play in encouraging partnerships and training.
Digitised Content: How we Make It Relevant to Researchers, Teachers and StudentsLIBER Europe
This document discusses several initiatives related to digitizing content and making it accessible through Europeana. It describes the Europeana Travel project which aims to digitize over 1 million items related to travel and make them available through Europeana. It also outlines the Early European Books project to digitize European printed works between 1475-1700 and the Europeana Libraries project to provide 5 million digitized items to Europeana from European research libraries. Finally, it discusses LIBER's vision for a future research space that would provide interactive access to digitized and born-digital materials through Europeana.
This document describes an inquiry-based science teaching approach called IBST. It involves problem-based learning activities with multiple solutions, experiments, and self-regulated learning. Students work collaboratively to analyze problems, develop strategies to find information, conduct research, and write reports. Smartphones and mobile tools are used to support field work, information sharing, and real-time collaboration between student groups. The goal is to enable contextual, adaptive, and seamless learning through a personalized recommendation and search system.
There has been plenty of hype around the Semanic Web, but will we ever see the vision of intelligent agents working on our behalf? This talk introduces the concepts of the Semantic Web as envisioned by Tim Berners-Lee over 10 years ago and compares that vision to where we have come since then. It includes a discussion of implementations such as XML, RDF, OWL (web ontology language), and SPARQL. After reviewing the design principles and enabling technologies, I plan to show how these techniques can be implemented in WebGUI.
1) The document summarizes recent reforms to the English National Health Service (NHS) proposed by the UK coalition government.
2) Key aspects of the reforms include transferring around 70% of the NHS budget to groups of general practitioners (GPs), increasing hospital autonomy and competition, and expanding patient choice.
3) The reforms aim to reduce central control over the NHS and introduce more market-based incentives, but also face significant implementation challenges and risks of disruption.
The National Ecological Observatory Network (NEON) aims to quantify belowground biomass across its continental monitoring network. It will measure fine root biomass, microbial biomass, and nitrogen transformation rates at each site. These measurements will be taken from soil cores and pits to characterize belowground processes and complement aboveground measurements. The data will help improve models of nutrient cycling and predictions of ecosystem responses to climate change.
- The document discusses a new satellite TV technology called "Unicable" that allows multiple satellite receivers to be connected to a single LNB using digital signals rather than separate cables running to each receiver.
- It describes testing done on an MTI AK541-XT2BL Unicable LNB that can support four receivers. Testers were able to easily receive channels on all receivers simultaneously without interference.
- Reception quality was found to be good, even for weaker satellites. Unicable was found to be no more sensitive to cable quality than previous satellite TV technologies. However, the technology is still new and not yet supported by many receiver manufacturers.
This document analyzes and compares the cover drive shot in cricket between two international cricketers - one who plays aggressively and one who plays defensively. It describes the phases of the shot and key performance factors for both players. The aggressive player generates more momentum through a wider stance and larger bat swing, while the defensive player focuses on accuracy with a narrow stance and rapid bat movement. A study was conducted analyzing video footage of both players hitting the cover drive. Results showed the aggressive player takes more time to swing the bat but can hit harder, while the defensive player plays the shot more accurately with less risk of error. In conclusion, the analysis compared how batting style and techniques differ between offensive and defensive cricketers for the cover drive shot.
The document discusses various privacy and security issues related to internet use. It covers topics like internet privacy laws, the Children's Online Privacy Protection Act (COPPA), gender and electronic privacy concerns like cyberstalking, the USA PATRIOT Act, cookies, spam, spyware, and more. It provides details on specific cases and legislation while outlining both benefits and criticisms of various technologies and their impact on user privacy.
The document discusses the emergence of the social web and the relationship between Web 2.0 and the Semantic Web. It describes how blogs, wikis, and social networks enabled new forms of user-generated content and social interaction online in the early 2000s. The document also explains how Semantic Web technologies could enhance Web 2.0 by enabling the standardized exchange and combination of user data and services.
The document discusses several ontologies for the social web including FOAF, SIOC, and SKOS. FOAF describes personal information and social networks. SIOC provides methods for interconnecting online communities like blogs and forums. It aims to address interoperability issues on the social web. SIOC has been adopted in over 400 sites and has the potential to become a foundational vocabulary for the semantic web.
The document discusses the semantic web, which aims to make web content machine-readable through the use of metadata, XML tags, and ontologies. It explains that the semantic web will bring structure to web page meaning to allow computers to process semantics. Key components that enable the semantic web are RDF, which describes web resources through subject-property-object triples, and ontologies, which formally define relations among terms. The ultimate goal of the semantic web is to help computers comprehend semantic documents to assist the evolution of shared human knowledge.
This document discusses the emerging field of social semantic sensor web. It describes how the proliferation of sensors embedded in devices, homes, cars, etc. can be connected to the social web and annotated with semantic technologies. This would allow machines to better understand sensor data, such as using ontologies to infer weather conditions from different sensor readings. The document outlines technologies like the SSN ontology for describing sensors and how sensor data could be attached to social media posts. Finally, it discusses potential applications in areas like disaster management, traffic reporting, and crowdsourcing health data.
The document provides an overview of semantic web analytics presented by 5th semester GIBS students. It discusses the evolution of the web from version 1.0 to 3.0, with key differences explained. Version 1.0 focused on information search, 2.0 added user participation, and 3.0 aims to be a semantic web where meaning is understood. The core technologies that make up the semantic web - RDF, SPARQL and OWL - are also summarized. The future potential of the semantic web is highlighted as making internet data machine readable and enabling computers to interpret information like humans through advances in AI and ML.
The document discusses the evolution of the web from Web 1.0 to Web 2.0 and the problems with representing meaning. It introduces semantic web as representing things rather than just documents using semantic annotations in formats like RDFa, microformats and microdata. Linked data allows complex queries across a web of data by embedding semantic annotations and using common schemas like Schema.org. Major companies are now building knowledge graphs to represent structured data from sources on a linked open web.
The document provides an overview of the deep web and digital investigations. It defines the deep web as data that is inaccessible to regular search engines but exists on the internet. This includes dynamically generated web pages, private websites requiring login, and files accessible only through direct filesystem access. The document estimates the deep web is 400-550 times larger than the surface web that is indexed by search engines. Standard digital forensic procedures can be applied to investigate the deep web, but tools may need to be adapted to handle specialized browsers and access methods used to retrieve deep web resources.
The document discusses the key concepts of Web 2.0, including how it utilizes collective intelligence through social bookmarking, tagging, wikis and collaborative filtering. It also examines how Web 2.0 applications harness the network effect to aggregate user data and benefit from increased participation. Finally, it outlines some of the design principles of Web 2.0 such as treating the web as a platform, harnessing collective intelligence, and providing rich user experiences through technologies like AJAX.
Web 3.0 aims to bring together the content of Web 1.0 and the social aspects of Web 2.0 by creating meaning from online content and social context through advances like semantic search engines that understand concepts rather than just keywords. It will be a more personalized, intelligent web where software agents can perform tasks on a user's behalf based on their needs and preferences. While definitions vary, Web 3.0 is emerging as a dynamic, customized web that filters and organizes information for individuals.
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
Richard Wallis gave a presentation on contextual computing and knowledge graphs at the SmartData 2017 conference. He discussed how knowledge graphs powered by structured data on the web are providing global context that enables new applications of cognitive and contextual computing. Schema.org plays a key role by defining a common vocabulary and enabling a web of related entities laid out as a global graph. This graph of entities delivers context on a global scale and lays the foundation for the next revolution in computing.
Talk given at Open Knowledge Foundation 'Opening Up Metadata: Challenges, Standards and Tools' Workshop, Queen Mary University of London, 13th June 2012.
Info on the event at http://openglam.org/2012/05/31/last-places-left-for-opening-up-metadata-challenges-standards-and-tools/
Научно-технический семинар «Веб-семантика: учим поисковых роботов «понимать» тексты» в петербургском офисе Яндекса, 9 октября 2012 г.
Питер Мика, старший научный сотрудник Yahoo! Research, Барселона.
This is an edited version of a talk that I gave on the 11th of February to some PhD students from the University of Utrecht at a seminar on science and communication.
Web 3.0 refers to the next stage of the internet that brings together content and social aspects through semantics. It aims to create meaning from online data by understanding context through metadata and allowing software agents to perform tasks. Key aspects include real-time information sharing, understanding user intent through semantics rather than just keywords, open collaboration, and integrating location data from mobile devices. The documents discuss prototypes like Twine/Evri that organize online content by topic and allow sharing, as well as the vision of the Semantic Web to make online information more machine-readable and useful.
The document discusses the semantic web, which aims to present web page data in a way that is understood by computers, allowing machines to search, aggregate, and combine information without human operators. It is an extension of the current web that adds new machine-readable data and metadata to existing documents to enable both automatic processing by machines and manual use by humans using technologies like RDF.
An Introduction to Semantic Web TechnologyAnkur Biswas
The document provides an overview of the semantic web and some of its key challenges. It discusses:
1) The evolution of the world wide web from a web of documents to a web of linked data through technologies like RDF, OWL, and SPARQL that add semantic meaning.
2) The vision for the semantic web is to publish machine-readable data using common formats so that information can be automatically processed by agents and integrated across sources.
3) Some challenges in realizing this vision include dealing with implicit knowledge, heterogeneous data distributions, and maintaining links and correctness over time as data changes.
Web 3.0? A look at the future of the World Wide Webrgkwml
The document discusses the history and future of the World Wide Web. It describes the evolution from Web 1.0's static pages to Web 2.0's user-generated content. Web 3.0, also called the Semantic Web, aims to add meaning and structure to online content using metadata standards. However, challenges remain regarding who will assign, maintain, and organize the metadata at a global scale. The future of an open Web also remains uncertain as control increasingly shifts to large companies.
Very basic introductory talk about the Semantic Web, given to undergraduate and posgraduate students of Universidad del Valle (Cali, Colombia) in September 2010
Learning Analytics : entre Promesses et RéalitéSerge Garlatti
Université Bretagne Pays de Loire, UTICE : LES LEARNING ANALYTICS : QUAND LE BIG DATA S’INTÉRESSE À L’ÉDUCATION.
https://utice.u-bretagneloire.fr/evenement/les-learning-analytics-quand-le-big-data-sinteresse-leducation
L’usage du numérique dans l’éducation permet d’accéder aujourd’hui à une multitude de données sur le comportement des étudiants : identité, interactions entre apprenants, interactions avec les plateformes et outils d’apprentissage, résultats aux évaluations... La collecte et l’exploitation de ces données permettent de mieux comprendre les processus d’apprentissage et ainsi d’adapter les parcours pédagogiques proposés pour en renforcer l’efficacité, mais aussi de personnaliser les apprentissages ou de développer des outils de pilotage des formations. Une communauté de chercheurs et d’enseignants se développe autour de ce que l’on appelle les learning analytics, ou l’analyse des données d’apprentissage. Ce séminaire basé sur les recherches et des retours d’expérience d’enseignants-chercheurs et de jeunes entreprises permettra de cerner les enjeux et les perspectives des learning analytics.
This document discusses modal logics and formalisms. It begins by defining classical and non-classical logics, with modal logics listed as an example of an extended logic. It then covers modal logics in more detail, defining their language and model theory using possible world semantics. Models are defined as structures consisting of possible worlds related by an accessibility relation. Truth is evaluated at possible worlds based on this relation. The document also discusses axiomatic modal logics like KT and relations between main modal systems. Finally, it notes that axioms like D, T, B, 4 and 5 are not valid in the class of all standard models.
The document discusses models and formalisms in logic. It introduces formal systems as consisting of an axiomatic theory, symbols for constructing formulas, grammar rules, axioms, inference rules, and properties like consistency. Propositional and predicate logic are examined, including their model theories, axiomatic theories, properties like completeness and soundness, and resolution principles. Normal forms and the resolution rule are defined as ways to deduce theorems in a formal system.
This document discusses modal logics and formalisms. It defines modal logics as logics that add new logical constants like necessity (□) and possibility (◇) to classical logic. It describes how modal logics can be classified based on whether they are extended logics that add new well-formed formulas or deviant logics that interpret the usual logical constants differently. The document then focuses on modal logics, defining their language and providing details on their model theory using possible world semantics. It discusses truth in possible worlds and models. It also describes several axiomatic modal systems and the relationships between them, and examines the classes of models validated by different axioms.
The document discusses formal systems and logic. It begins by introducing formal systems, which consist of an axiomatic theory, symbols for constructing formulas, grammar rules, axioms, inference rules, and theorems deduced from axioms and rules. Propositional and predicate logic are then covered, including their languages, semantics using interpretation functions, axiomatic theories, and properties like soundness and completeness. Resolution principles like normal forms and the resolution rule are also summarized. The document provides examples and explanations throughout.
SPARQL is a standardized query language for retrieving and manipulating data stored in RDF format. It was created by the RDF Data Access Working Group to provide querying of RDF stores. SPARQL supports four query forms: SELECT, CONSTRUCT, DESCRIBE, and ASK. It also defines a protocol for executing queries over HTTP. SPARQL has become a key technology for working with semantic data on the web.
The document discusses inquiry-based science teaching and scenarios for its implementation. It describes key features of inquiry-based science teaching as including ill-defined, open-ended learning activities; experiments and practical experiences; emphasis on student autonomy; and peer communication. Two scenarios are proposed: the first involves students investigating the industrial landscape around a bridge, and the second uses smartphones and tools to support collaborative work during a site visit. The document advocates for the convergence of Web 2.0 tools, ubiquitous computing, and semantic technologies to enhance student experiences with inquiry-based learning.
The document discusses future learning landscapes involving the convergence of pervasive and contextual computing, global social media, and the semantic web in technology-enhanced learning. It outlines how web 2.0 tools, mobile and ubiquitous learning, and linked data can contribute to new forms of inquiry-based science teaching. Specifically, it presents a vision of ubiquitous cloud learning environments that integrate these technologies to support adaptive, context-aware, and collaborative learning anytime, anywhere.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Zilliz
Join us to introduce Milvus Lite, a vector database that can run on notebooks and laptops, share the same API with Milvus, and integrate with every popular GenAI framework. This webinar is perfect for developers seeking easy-to-use, well-integrated vector databases for their GenAI apps.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Social web & linked data
1. Semantic Web in Action:
Web 3.0, Social Web
Linked Data, Be Aware!
S. Garlatti
10/09/09
2. Outline
¢ Semantic Web
¢ Linked Data
¢ SPARQL
page 2 Semantic Web in Action
3. Technological Issues
an Informal learning case
• I plan to ride in Europe for vacation, on a
motorcycle.
– I need to learn more about travelling on motorcycle
across Europe
• Rider’s apparel: boots, jackets, pants, gloves, armor, etc.
• Motorcycle accessories: luggages, side cases, topcases, tank
bags, GPS, etc.
• Riding issues, skills, etc.
– According to my social networks, I know there is a
teacher at Telecom Bretagne who used to ride
across Europe
page 3 Semantic Web in Action
4. Who is teaching at Telecom Bretagne and
riding a motorcycle across Europe?
page 4 Semantic Web in Action
5. Technological Issues
• Google Information retrieval
– Query : « teacher » « riding » « Europe »
« Motorbike »
• 613 000 results
• How to find out relevant resources?
• Polysemy
• Huge number of results!
page 5 Semantic Web in Action
8. Search in Picture Web Sites with
Geolocalization
page 8 Semantic Web in Action
9. Technological Issues
• Relevance of resources?
– Human interpretation
• Reuse, exchange and sharing of resources
– Computer would have the ability to get
content meaning!
– impossible?
page 9 Semantic Web in Action
10. Semantic Web
¢ The Semantic Web is a web of data.
• There is lots of data we all use every day, and it is not part of the
web.
• I can see my bank statements on the web, and my photographs, and
I can see my appointments in a calendar.
• But can I see my photos in a calendar to see what I was doing when I
took them?
• Can I see bank statement lines in a calendar?
¢ Why not?
• Because we don't have a web of data.
• Because data is controlled by applications, and each application
keeps it to itself.
Semantic Web in Action
11. Semantic Web
¢ The Semantic Web is about two things.
- It is about common formats for integration and combination of
data drawn from diverse sources
- On the contrary, the original Web is mainly concentrated on
the interchange of documents.
- It is also about language for recording how the data relates to
real world objects (meaning).
page 11 Semantic Web in Action
12. Semantic Web
¢ The Semantic Web
• That allows a person, or a machine, to start off in
one database, and then move through an unending
set of databases which are connected not by wires
but by being about the same thing.
• Provides a common framework that allows data to
be shared and reused across application, enterprise,
and community boundaries
page 12 Semantic Web in Action
13. The Semantic Web
¢ The Semantic Web is about two things.
- It is about common formats for integration and combination of
data drawn from diverse sources
- On the contrary, the original Web is mainly concentrated on
the interchange of documents.
- It is also about language for recording how the data relates to
real world objects (meaning).
page 13 Semantic Web in Action
14. The Semantic Web
¢ That allows a person, or a machine, to start off in
one database, and then move through an unending
set of databases which are connected not by wires
but by being about the same thing.
¢ Provides a common framework that allows data to
be shared and reused across application,
enterprise, and community boundaries
¢ The Semantic Web will enable machines to
COMPREHEND semantic documents and data, not
human speech and writings.
page 14 Semantic Web in Action
15. Linked Data
¢ Classic Web
• Data exposed to the web via HTML
• Web Pages Found via search engines
¢ What do we actually want?
• Use the web like a Single Global Database
• Ask Complex Queries over multiples pages / web
sites / data sources
page 15 Semantic Web in Action
16. The Semantic Web
• Resource Description
– Similar to simple sentence
• Subject Verb Complement
– Examples
• Telecom Bretagne has a president called XXX
• Telecom Bretagne is a French grande ecole
• Telecom Bretagne has a web site http://www.tele...
page 16 Semantic Web in Action
17. The Semantic Web
• Sentence meaning
– Different contexts à Different terms
• « director », « president », « Dean », …
– Meaning linked to communities of practices
• Use of common vocabularies!
• Named: Ontologies
• An ontology determine a unique meaning to verbs and
categories of subject and complement
page 17 Semantic Web in Action
18. The Semantic Web
• A little bit more formal
– Telecom Bretagne dbpprop:president XXX (en)
– Telecom Bretagne dbpprop:type French Grande Ecole (en)
– Telecom Bretagne dbpprop:website http://www.tele...
• Queries
– French Grande Ecole in which XXX is president?
• ?Grande_Ecole dbpprop:president XXX (en)
• ?Grande_Ecole dbpprop:type French Grande Ecole (en)
page 18 Semantic Web in Action
19. Linked Data
Who is teaching at Telecom Bretagne
and riding a motorcycle
across Europe?
page 19 Semantic Web in Action
20. Linked Data
¢ Trouver Tous les films
- Du genre « Thriller »
- ET classés au moins quatre par
- ET appréciés au moins par quatre de mes amis
– OU ceux qui me suivent sur
- ET dont Leonardo Di Caprio n’est pas un des acteurs ?
page 20
22. Linked Data
Web 1.0 Web 2.0 Web 3.0
Many
Web
sites
Few
large
Web
sites
Many
Web
sites
ontaining
unstructured,
are
specialized
on
containing
&
seman<cally
extual
content
specific
content
types
syndica<ng
arbitrarily
structured
content
Video
Pictures
Encyclopedic
+
ar3cles
+
Source
:
Dr. Sören Auer
AKSW, Institut für Informatik
page 22 Computer Science Department Semantic Web in Action
23. Linked Data
¢ Linked Data
• Is a term used to describe a method of exposing, sharing, and
connecting data on the Web via dereferenceable URIs.
¢ Principles (Tim Berners-Lee)
• The Semantic Web isn't just about putting data on the web. It is about making
links, so that a person or machine can explore the web of data. With linked
data, when you have some of it, you can find other, related, data.
• Like the web of hypertext, the web of data is constructed with documents on
the web. However, unlike the web of hypertext, where links are
relationships anchors in hypertext documents written in HTML, for data they
links between arbitrary things described by RDF. The URIs identify any kind of
object or concept.
Source : Wikipedia, Tim Berners Lee
page 23 Semantic Web in Action
24. Linked Data
¢ Tim Berners-Lee outlined four principles of Linked Data in
his Design Issues: Linked Data note, paraphrased along the
following lines:
• Use URIs to identify things that you expose to the Web as resources.
• Use HTTP URIs so that people can locate and look up (dereference)
these things.
• Provide useful information about the resource when its URI is
dereferenced.
• Include links to other, related URIs in the exposed data as a means of
improving information discovery on the Web.
page 24 Semantic Web in Action
25. Linked Data
¢ Linking Open Data Community Project
• The goal of the W3C Semantic Web Education and Outreach
group's Linking Open Data community project is to extend the
Web with a data commons by publishing various open datasets
as RDF on the Web and by setting RDF links between data
items from different data sources. As of October 2007, datasets
consist of over two billion RDF triples, which are interlinked by
over two million RDF links. (Wikipedia)
page 25 Semantic Web in Action
28. Linked Data
¢ DBpedia
• Is a community effort to extract structured information from
Wikipedia and to make this information available on the Web.
DBpedia allows you to ask sophisticated queries against
Wikipedia, and to link other data sets on the Web to Wikipedia
data.
¢ Resources
• http://wiki.dbpedia.org/About
• http://blog.dbpedia.org/
• http://www4.wiwiss.fu-berlin.de/dbpedia/dev/ontology.htm
page 28 Semantic Web in Action
29. Linked Data
¢ Examples
• THING à http://dbpedia.org/page/Brest%2C_France
• HTML à http://dbpedia.org/resource/Brest%2C_France
• RDF à http://dbpedia.org/data/Brest%2C_France
• http://dbpedia.org/page/%C3%89cole_nationale_sup
%C3%A9rieure_des_t%C3%A9l
%C3%A9communications_de_Bretagne
• http://dbpedia.org/page/Amy_Winehouse
• http://dbpedia.org/page/Berlin
page 29 Semantic Web in Action
30. Linked Data
¢ Dbpedia Sparql Endpoint
• Langage de requêtes SPARQL
• http://dbpedia.org/snorql/ *
• http://dbpedia.org/sparql
• Twinkle : a sparql query tool
- http://www.ldodds.com/projects/twinkle/
page 30 Semantic Web in Action
31. Linked Data
¢ FREEBASE
Website http://www.freebase.com/
http://www.freebase.com/docs/data
• View data about New York
- http://www.freebase.com/view/en/new_york
• RDF metadata about New York
- http://rdf.freebase.com/rdf/en.new_york
• Edit New York Topic
- http://www.freebase.com/edit/topic/en/new_york
• Explore New York Metadata
- http://www.freebase.com/tools/explore/en/new_york
Semantic Web in Action
page 31
32. Linked Data
¢ Semantic CrunchBase
• The free directory of technology companies,
people, and investors.
• http://cb.semsol.org/
• http://cb.semsol.org/sparql
• http://cb.semsol.org/company/techcrunch
• http://cb.semsol.org/product/facebook-platform
• http://cb.semsol.org/explore
Semantic Web in Action
33. Linked Data
¢ Les Smob : les Twitter version Web sémantique
• http://www.lespetitescases.net/smob-twitter-version-web-
semantique/
• http://smob.websemantique.org/sparql.php
• http://microplanet.sioc-project.org/sparql.php
Semantic Web in Action
34. Linked Data
¢ IMDB, Linked Movie Database http://www.linkedmdb.org/
• LinkedMDB publishes linked open data using the D2R Server.
The project aims at publishing the first open semantic web
database for movies, including a large number of interlinks to
several datasets on the open data cloud and references to
related webpages
¢ Data can be accessed using
• Web browsers: http://www.linkedmdb.org/
• Semantic Web browsers - entry point:
http://data.linkedmdb.org/all
• SPARQL clients - SPARQL endpoint:
http://data.linkedmdb.org/sparql
page 34 Semantic Web in Action
35. Linked Data
¢ Musicbrainz
• This is a database published with D2R Server. It can
be accessed using
- Your web browser http://dbtune.org/musicbrainz/
- Semantic Web browsers at URL http://dbtune.org/musicbrainz/all
- SPARQL clients at URL http://dbtune.org/musicbrainz/sparql or
In a web browser http://dbtune.org/musicbrainz/snorql/
page 35 Semantic Web in Action
36. Linked Data
¢ GovTrack.us
• A civic project to track congress
• The 2000 U.S. Census: 1 Billion RDF Triples
• http://www.rdfabout.com/demo/census/
• http://www.govtrack.us/developers/rdf.xpd
• The SPARQL endpoint is a combined endpoint for various RDF
data I play with. It answers queries about 1 billion triples,
mostly about U.S. Census data (see rdfabout.com), but also
the GovTrack data.
page 36 Semantic Web in Action
37. Linked Data
¢ GovTrack.us
• The SPARQL engine's base URL is
http://www.rdfabout.com/sparql , following (or trying to follow)
the SPARQL Protocol spec.
• The SPARQL engine is Ryan Levering's engine for SESAME,
plus my SemWeb library for C#.
• The data store is persisted in MySQL. Responses are limited to
1000 rows in the hopes that having this public won't break
anything.
page 37 Semantic Web in Action
38. Linked Data
¢ Revyu.com
• Revyu.com is a web site where you can review and rate things.
Unlike many other reviewing sites on the web, Revyu.com lets
you review and rate absolutely anything you can name
• Web access: http://revyu.com/
• SPARQL Endpoint http://revyu.com/sparql/
page 38 Semantic Web in Action
39. SPARQL: SPARQL Protocol and RDF Query
Language
¢ SPARQL
• pronounced "sparkle" [1]) is an RDF query language; its name
is a recursive acronym that stands for SPARQL Protocol and
RDF Query Language. It is standardized by the RDF Data
Access Working Group (DAWG) of the
World Wide Web Consortium, and is considered a component
of the semantic web.
• Initially released as a Candidate Recommendation in April
2006, but returned to Working Draft status in October 2006,
due to two open issues. [2] In June 2007, SPARQL advanced
to Candidate Recommendation once again. [3] On 12th
November 2007 the status of SPARQL changed into Proposed
Recommendation. [4] On 15th January 2008, SPARQL became
an official W3C Recommendation. [5]
page 39 Semantic Web in Action
40. SPARQL: SPARQL Protocol and RDF Query
Language
¢ SPARQL =
• A Query Language
• A Result Form
• An Access Protocol
page 40 Semantic Web in Action
41. SPARQL: SPARQL Protocol and RDF Query
Language
¢ The Query Language: query forms
• « Select » clause returns all or subset of the
variables bound in a query pattern match
• « Construct » returns an RDF graph constructed by
substituting variables in a set of triple templates
• « Ask » returns a boolean indicating whether a query
pattern matches
• « Describe » returns an RDF graph that describe the
resources found
page 41 Semantic Web in Action
42. SPARQL: SPARQL Protocol and RDF Query
Language
¢ «Select » equivalent to « SQL Select » returns a
regular table
Select …
From … Identify data sources to query
Where { … } The triple/graph pattern
to be matched against the
triple/graphs of RDF
A conjunction of triples
¢ PREFIX to declare the schema used in the query
page 42 Semantic Web in Action
43. SPARQL: SPARQL Protocol and RDF Query
Language
¢ PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name
FROM <http://example.org/foaf/aliceFoaf>
WHERE
{
?x foaf:name ?name
}
Result:
name
« Alice »
page 43 Semantic Web in Action
44. SPARQL: SPARQL Protocol and RDF Query
Language
¢
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX : <http://dbpedia.org/resource/>
PREFIX dbpedia2: <http://dbpedia.org/property/>
PREFIX dbpedia: <http://dbpedia.org/>
SELECT distinct ?name ?birth ?person
FROM http://dbpedia.org/
WHERE
¢ {
?person dbpedia2:birthPlace <http://dbpedia.org/resource/Berlin>.
?person dbpedia2:birth ?birth .
?person foaf:name ?name .
}
page 44 Semantic Web in Action
45. SPARQL: SPARQL Protocol and RDF Query
Language
¢ SPARQL results:
¢ namebirthperson« ":Dru_Berrymore/birth/birth_date_and_age :Dru_Berrymore
"Dru Berrymore"@de:Dru_Berrymore/birth/birth_date_and_age :Dru_Berrymore
"Walter Benjamin"@de:Berlin :Walter_Benjamin "Walter
Benjamin"@de:Germany :Walter_Benjamin
Name Birth Person
« Dru Berrymore »
page 45 Semantic Web in Action
46. SPARQL: SPARQL Protocol and RDF Query
Language
¢ SELECT distinct ?name ?person
FROM <http://dbpedia.org/>
WHERE
{
?person dbpedia2:birthPlace <http://dbpedia.org/
resource/Berlin> .
?person foaf:name ?name .
}
¢ SELECT distinct ?name ?birth ?death ?person
FROM <http://dbpedia.org/>
WHERE
{
?person dbpedia2:birthPlace <http://dbpedia.org/
resource/Berlin> .
?person dbpedia2:birth ?birth .
?person foaf:name ?name .
?person dbpedia2:death ?death.
}
page 46 Semantic Web in Action
47. SPARQL: SPARQL Protocol and RDF Query
Language
¢ A
constraint, expressed by the keyword “FILTER”,
is a restriction on solutions over the whole group
in which the filter appears
¢ PREFIX dc: <http://purl.org/dc/elements/1.1/>
PREFIX ns: <http://example.org/ns#>
SELECT ?title ?price
WHERE
{
?x ns:price ?price .
FILTER (?price < 30.5)
?x dc:title ?title .
}
page 47 Semantic Web in Action
48. SPARQL: SPARQL Protocol and RDF Query
Language
¢ “regex” matches only plain literals with no
language tag
¢ { ?x foaf:name ?name .
?x foaf:mbox ?mbox .
FILTER regex(?name, "Smith") }
¢ PREFIX dc: <http://purl.org/dc/elements/1.1/>
SELECT ?title
WHERE
{ ?x dc:title ?title
FILTER regex(?title, "web", "i" ) }
page 48 Semantic Web in Action
49. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Optional parts of the graph pattern may be
specified syntactically with the “OPTIONAL”
keyword applied to a graph pattern
¢ SELECT distinct ?name ?birth ?death ?person
FROM <http://dbpedia.org/>
WHERE
{ ?person dbpedia2:birthPlace <http://
dbpedia.org/resource/Berlin> .
?person dbpedia2:birth ?birth .
?person foaf:name ?name .
OPTIONAL {?person dbpedia2:death ?death}
}
page 49 Semantic Web in Action
50. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Matching alternative
• Pattern alternatives are syntactically specified
with the UNION keyword
¢ SELECT distinct ?name ?birth ?death ?person
WHERE {
{?person dbpedia2:birthPlace
<http://dbpedia.org/resource/Berlin> }
UNION
{?person dbpedia2:death ?death}
?person foaf:name ?name .
?person dbpedia2:birth ?birth .
}
page 50 Semantic Web in Action
51. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Sequence & Modify
• « Order By » to sort,
• « LIMIT » result number,
• « OFFSET » rank of first result
¢ SELECT distinct ?name ?person
WHERE
{
?person dbpedia2:birthPlace <http://dbpedia.org/resource/Berlin>.
?person foaf:name ?name.
}
ORDER BY ?name LIMIT 20 OFFSET 20
page 51 Semantic Web in Action
52. SPARQL: SPARQL Protocol and RDF Query
Language
¢ « Construct »
• The CONSTRUCT query form returns a single RDF
graph specified by a graph template.
- The result is an RDF graph formed by taking each query
solution in the solution sequence, substituting for the
variables in the graph template, and combining the triples into
a single RDF graph by set union.
• Useful for aggregating data from multiple sources
and merging it into a local store (from Ingenta)
page 52 Semantic Web in Action
53. SPARQL: SPARQL Protocol and RDF Query
Language
¢ PREFIX foaf: <http://xmlns.com/foaf/0.1/>
FROM <http://molene.enstb.org/mlearning09/wp-
content/plugins/wp-rdfa/foaf.php
CONSTRUCT
{
?friend a foaf:Person;
foaf:name ?name;
foaf:homepage ?home.}
WHERE
{
?person foaf:mbox <mailto:ac@enstb.com>;
foaf:knows ?friend.
?friend foaf:name ?name;
foaf:homepage ?home.}
page 53 Semantic Web in Action
54. SPARQL: SPARQL Protocol and RDF Query
Language
¢ ASK
• Returns a true/false value: test whether or not a query pattern
has a solution.
• No information is returned about the possible query solutions,
just whether or not a solution exists
• Is there data that looks like this? Do you have any information
about that? (from Ingenta)
¢ PREFIX foaf: <http://xmlns.com/foaf/0.1/>
ASK
WHERE {
?person a foaf:Person;
foaf:mbox <mailto:ab@telecom-bretagne>.
}
page 54 Semantic Web in Action
55. SPARQL: SPARQL Protocol and RDF Query
Language
¢ DESCRIBE
• The DESCRIBE form returns a single result RDF
graph containing RDF data about resources.
• CONSTRUCT but with less control
- Tell me about this or things that look like this … but you
decide what’s relevant (from Ingenta)
¢ PREFIX foaf: <http://xmlns.com/foaf/0.1/>
DESCRIBE ?friend
WHERE {
?person foaf:mbox “mailto:ab@telecom-bretagne”;
foaf:knows ?friend.}
page 55 Semantic Web in Action
56. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Applied uses (from Ingenta)
• DESCRIBE for Prototyping
- DESCRIBE <http://example.org/someResource>
- Quickly assembling Uis, Web APIs
• SELECT for Indexing
- Building an ordering over some data ORDER BY, LIMIT
page 56 Semantic Web in Action
57. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Applied uses (from Ingenta)
• CONSTRUCT for transformation and also simple
inferencing
- CONSTRUCT could be the XSLT of RDF
- Currently limited by lack of expressions in CONSTRUCT
triple templates
• ASK for validation
• ASK – DESCRIBE – CONSTRUCT Pattern:
- Probe endpoint, Grab default view of data, Refine data extraction and/or
apply transformation
page 57 Semantic Web in Action
58. SPARQL: SPARQL Protocol and RDF Query
Language
¢ SPARQL Protocol (from F. Gandon, INRIA)
• Sending queries and their results accross the web
¢ Example with HTTP binding
• GET /sparql/?query=<encoded query>
HTTP/1.1
Host: www.inria.fr
User-agent: my-sparql-client/0.1
page 58 Semantic Web in Action
59. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Example with SOAP binding (from F. Gandon)
¢ <?xml version="1.0" encoding="UTF-8"?>
<soapenv:Envelope
xmlns:soapenv="http://www.w3.org/2003/05/soap-envelope/"
xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://
www.w3.org/2001/XMLSchema-instance">
<soapenv:Body>
<query-request xmlns="http://www.w3.org/2005/09/sparql-
protocol-types/#">
<query> SELECT ?x ?p ?y WHERE {?x ?p ?y} </query>
</query-request>
</soapenv:Body>
</soapenv:Envelope>
page 59 Semantic Web in Action
60. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Access to Data on the web
• http://dbpedia.org/snorql/
• http://dbpedia.org/sparql
• http://demo.openlinksw.com/rdfbrowser2/
• http://dataviewer.zitgist.com/
• Etc.
¢ Twinkle : a sparql query tool
• http://www.ldodds.com/projects/twinkle
page 60 Semantic Web in Action
61. SPARQL: SPARQL Protocol and RDF Query
Language
¢ Resources
• http://en.wikipedia.org/wiki/SPARQL
• http://www.w3.org/TR/rdf-sparql-query/
• http://jena.sourceforge.net/ARQ/Tutorial/
• http://esw.w3.org/topic/SparqlImplementations
• http://arc.semsol.org/home
• http://virtuoso.openlinksw.com/wiki/main/Main/
page 61 Semantic Web in Action