Professor Christian Bizer erforscht technische und empirische Fragen bezüglich der Entwicklung des World Wide Webs hin zu einem globalen Datenraum. Im Rahmen des WebDataCommons-Projekts untersucht er die globale Adaption von Webseiten-Annotations-Standards wie Schema.org. Im Rahmen des DBpedia-Projekts beschäftigt er sich mit der Ableitung umfassender Multi-Domänen-Wissensbasen aus Wikipedia und dem World Wide Web.
Nokia New Asha Platform Developer TrainingAndreas Jakl
In-depth look at the new opportunities and APIs of the Nokia Asha SDK, which enables you to develop apps for the latest phones like the Nokia Asha 501.
The training materials includes a quick overview of the refreshed UX, UI development and iconography, internationalization, phone / network / SIM state detection, file selections, notifications, radio tuner, maps, gestures and porting between different touch and non-touch devices.
The developer training was held by Mopius in Budapest on May 14th and was the world's first on-site training for the new Asha platform, just a few days after the platform's release.
P esentación realizada por Ana Ramírez para el curso de Crorrientes Pedagógicas 2008 de la Universidad de Los Andes Táchira en torno al concepto de Noología expuesto por Morin en el libro Los siete sabres necesarios a la educación del futuro
Evolving the Web into a Global Dataspace – Advances and ApplicationsChris Bizer
Keynote talk at the 18th International Conference on Business Information Systems, 24-26 June 2015, Poznań, Poland
URL:
http://bis.kie.ue.poznan.pl/bis2015/keynote-speakers/
Abstract:
Motivated by Google, Yahoo!, Microsoft, and Facebook, hundreds of thousands of websites have started to annotate structured data within their pages using markup formats such as Microdata, RDFa, and Microformats. In parallel, the adoption of Linked Data technologies by government agencies, libraries, and scientific institutions has risen considerably. In his talk, Christian Bizer will give an overview of the content profile of the resulting Web of Data. He will showcase applications that exploit the Web of Data and will discuss the challenges of integrating and cleansing data from thousands of independent Web data sources.
A possible future role of schema.org for business reportingsopekmir
The presentation demonstrates a vision for the “reporting extension” that could enhance the processes related to business reporting and the role it could have for the SBR vision.
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...Chris Bizer
Slideset of the keynote talk given by Christian Bizer at the Language Data and Knowledge (LDK 2019) conference in Leipzig, Germany
Abstract of the talk:
Millions of websites have started to annotate data describing products, local business, events, jobs, places, recipes, and reviews within their HTML pages using the schema.org vocabulary. These annotations are widely used by search engines to render rich snippets within search results. Surprisingly, the annotations are hardly used by the research community. In the talk, Christian Bizer investigates the potential of schema.org annotations for being used as training data for tasks such as entity matching, information extraction, and sentiment analysis. Web pages that offer semantic annotations often also contain additional structured data in the form of HTML tables. In the second part of the talk, Christian Bizer discusses the interplay of semantic annotations and web tables for information extraction as well as the general potential of relational HTML tables for complementing knowledge bases such as DBpedia, focusing on the discovery of formerly unknown long tail entities as well as the extraction of n-ary relations.
Link to the conference website:
http://2019.ldk-conf.org/invited-speakers/
‘Edge’ Technologies: a new language of innovationDXC Eclipse
Microsoft Dynamics 365: Continue Your Transformation Journey.
‘Edge’ Technologies: a new language of innovation .
We will demystify the new language of ‘Edge’ Technologies – Common Data Service, Power Apps, Flow, Cortana Intelligence Suite, BOT Framework.
Presented by Henrik Mozart - Senior Technology Specialist, DXC Eclipse
Structured data SEO presentation WPTO Robin Macrae 2017-02-18Robin Macrae
Structured data (aka schema markup) is one of the key SEO techniques to improve search engine ranking by providing to Google, Bing, Yandex and other search engines with data describing the content in question (e.g., a product, event, person, recipe, etc.) using the schema.org types for the relevant industry or category. This presentation is a primer or guide to understanding and using structured data for SEO and semantic search.
This is my journey taken from year 2012 on wards, after graduation in my MS with major in AI. I have taken various certification courses, trainings, hands-on labs; few key ones are from Google, and Microsoft.
Nokia New Asha Platform Developer TrainingAndreas Jakl
In-depth look at the new opportunities and APIs of the Nokia Asha SDK, which enables you to develop apps for the latest phones like the Nokia Asha 501.
The training materials includes a quick overview of the refreshed UX, UI development and iconography, internationalization, phone / network / SIM state detection, file selections, notifications, radio tuner, maps, gestures and porting between different touch and non-touch devices.
The developer training was held by Mopius in Budapest on May 14th and was the world's first on-site training for the new Asha platform, just a few days after the platform's release.
P esentación realizada por Ana Ramírez para el curso de Crorrientes Pedagógicas 2008 de la Universidad de Los Andes Táchira en torno al concepto de Noología expuesto por Morin en el libro Los siete sabres necesarios a la educación del futuro
Evolving the Web into a Global Dataspace – Advances and ApplicationsChris Bizer
Keynote talk at the 18th International Conference on Business Information Systems, 24-26 June 2015, Poznań, Poland
URL:
http://bis.kie.ue.poznan.pl/bis2015/keynote-speakers/
Abstract:
Motivated by Google, Yahoo!, Microsoft, and Facebook, hundreds of thousands of websites have started to annotate structured data within their pages using markup formats such as Microdata, RDFa, and Microformats. In parallel, the adoption of Linked Data technologies by government agencies, libraries, and scientific institutions has risen considerably. In his talk, Christian Bizer will give an overview of the content profile of the resulting Web of Data. He will showcase applications that exploit the Web of Data and will discuss the challenges of integrating and cleansing data from thousands of independent Web data sources.
A possible future role of schema.org for business reportingsopekmir
The presentation demonstrates a vision for the “reporting extension” that could enhance the processes related to business reporting and the role it could have for the SBR vision.
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...Chris Bizer
Slideset of the keynote talk given by Christian Bizer at the Language Data and Knowledge (LDK 2019) conference in Leipzig, Germany
Abstract of the talk:
Millions of websites have started to annotate data describing products, local business, events, jobs, places, recipes, and reviews within their HTML pages using the schema.org vocabulary. These annotations are widely used by search engines to render rich snippets within search results. Surprisingly, the annotations are hardly used by the research community. In the talk, Christian Bizer investigates the potential of schema.org annotations for being used as training data for tasks such as entity matching, information extraction, and sentiment analysis. Web pages that offer semantic annotations often also contain additional structured data in the form of HTML tables. In the second part of the talk, Christian Bizer discusses the interplay of semantic annotations and web tables for information extraction as well as the general potential of relational HTML tables for complementing knowledge bases such as DBpedia, focusing on the discovery of formerly unknown long tail entities as well as the extraction of n-ary relations.
Link to the conference website:
http://2019.ldk-conf.org/invited-speakers/
‘Edge’ Technologies: a new language of innovationDXC Eclipse
Microsoft Dynamics 365: Continue Your Transformation Journey.
‘Edge’ Technologies: a new language of innovation .
We will demystify the new language of ‘Edge’ Technologies – Common Data Service, Power Apps, Flow, Cortana Intelligence Suite, BOT Framework.
Presented by Henrik Mozart - Senior Technology Specialist, DXC Eclipse
Structured data SEO presentation WPTO Robin Macrae 2017-02-18Robin Macrae
Structured data (aka schema markup) is one of the key SEO techniques to improve search engine ranking by providing to Google, Bing, Yandex and other search engines with data describing the content in question (e.g., a product, event, person, recipe, etc.) using the schema.org types for the relevant industry or category. This presentation is a primer or guide to understanding and using structured data for SEO and semantic search.
This is my journey taken from year 2012 on wards, after graduation in my MS with major in AI. I have taken various certification courses, trainings, hands-on labs; few key ones are from Google, and Microsoft.
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...Chris Bizer
http://iswc2016.semanticweb.org/pages/program/keynote-bizer.html
Semantic Web technologies, such as Linked Data and Schema.org, are used by a significant number of websites to support the automated processing of their content. In the talk, I will contrast the original vision of the Semantic Web with empirical findings about the adoption of Semantic Web technologies on the Web. The analysis will show areas in which data providers behave as envisioned by the Semantic Web community but will also reveal areas in which real-world adoption patterns strongly deviate. Afterwards, I will discuss the challenges that result from the current adoption situation. To address these challenges, I will exemplify entity reconciliation, vocabulary matching, and data quality assessment techniques which exploit all semantic clues that are provided while being tolerant to noise and lazy data providers.
Industry Ontologies: Case Studies in Creating and Extending Schema.org for In...MakoLab SA
The presentation introduces listeners into the details of the most important global semantic vocabulary build jointly by Google, Yahoo, Microsoft and Yandex: schema.org. It then discusses the experiences related to the creation of “hosted” extensions for the automotive industries (existing: auto.schema.org) and for the financial industries (in making: fibo.schema.org). The two extensions, built by an international team of specialists managed by MakoLab with full respect to the community processes, have two different creation strategies which will be presented and discussed.
The use cases for both vocabularies will be demonstrated. They are related to both “external” business effects (better visibility of the websites using them on the web) and “internal” effects (new kind of analytics and search capacities).
The presentation will also invite to participate to two W3C Community Groups responsible for the open communication activities around the two extensions.
A Car as a Semantic Web Thing - Motivation and DemonstrationBenjamin Klotz
Car signal data is usually hard to access, understand
and integrate for non automotive domain experts.
In this paper, we use semantic technologies for enriching
signal data in the automotive industry and access it through
Web of Things interactions. This combination allows the access
and integration of car data from the web.
We built VSSo, a Vehicle Signal ontology based on
SOSA/SSN Observations and Actuations, and generated WoT
Actions, Events and Properties, enriched with domain metadata.
We mapped VSSo to a Web of Things ontology and we
developed a Web of Things protocol binding with LwM2M,
and made an implementation in a real car.
This implementation resulted in a first working prototype,
and a number of future improvements required in order to be
compliant with automotive standards.
Interacting with Linked Data to Facilitate its SustainabilityRoberto García
Presentation about the importance of user participation for the sustainability of Linked Data publishing. It also shows an approach to automatic User Interface generation for Linked Data that then facilitates users participation.
Methods and Challenges for Metaverse Analytics.pdfSafaa Alnabulsi
Which existing methods and analytical approaches can be applied to quantitatively study metaverse?
Which challenges are associated with the quantitative investigation of metaverse and the application of those methods?
Similar to TFF2015, Christian Bizer, Uni Mannheim "Schema.org-Annotationen in Webseiten" (20)
Vortrag Online: Oliver Csendes, Österreich Werbung – Updates zur ÖW-Datenstrategie & Datenökosystem Tourismus – Was bringt das Destinationen und Betrieben? Sowie: AI-Concierge im Dataspace
Vortrag: Torsten Maier, Payback Österreich – PAYBACK als Loyaltyplattform – Möglichkeiten für den Tourismus
Innovative Ansätze zur Stärkung der Kundenbindung und -loyalität im Tourismussektor
Use-Case-Session 1 – NTT Data / Fiegl&Spielberger / The future is now: Künstliche Intelligenz, smarte Sensoren und Prognose-Dashboards läuten die nächste Ära des Tourismus ein
Eröffnungs-Vortrag: Bundesministerium für Klimaschutz, Umwelt, Energie, Mobilität, Innovation und Technologie – Aktive Mobilität & Mobilitätsmanagement
Vortrag: Oliver Csendes, Österreich Werbung – ÖW-Datenstrategie & Datenökosystem Tourismus – Data Intelligence Offensive UND Use-Case: Martin Reichhart, Österreich Werbung – Daten als Grundlage für eine nachhaltige Destinationsentwicklung
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsRosie Wells
Insight: In a landscape where traditional narrative structures are giving way to fragmented and non-linear forms of storytelling, there lies immense potential for creativity and exploration.
'Collapsing Narratives: Exploring Non-Linearity' is a micro report from Rosie Wells.
Rosie Wells is an Arts & Cultural Strategist uniquely positioned at the intersection of grassroots and mainstream storytelling.
Their work is focused on developing meaningful and lasting connections that can drive social change.
Please download this presentation to enjoy the hyperlinks!
This presentation, created by Syed Faiz ul Hassan, explores the profound influence of media on public perception and behavior. It delves into the evolution of media from oral traditions to modern digital and social media platforms. Key topics include the role of media in information propagation, socialization, crisis awareness, globalization, and education. The presentation also examines media influence through agenda setting, propaganda, and manipulative techniques used by advertisers and marketers. Furthermore, it highlights the impact of surveillance enabled by media technologies on personal behavior and preferences. Through this comprehensive overview, the presentation aims to shed light on how media shapes collective consciousness and public opinion.
TFF2015, Christian Bizer, Uni Mannheim "Schema.org-Annotationen in Webseiten"
1. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 1
Prof. Dr. Christian Bizer
Schema.org Annotations in
Webpages
Opportunities & Challenges for the
Tourism Industry
2. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 2
Hello
Professor Christian Bizer
University of Mannheim
Data and Web Science Group
Research Topics
Web Technologies
Web Data Integration
Web Mining
Evolution of the Web
3. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 3
Outline
1. Motivation for Semantic Annotations
2. Global Adoption
3. Adoption in Tourism
4. Opportunities and Challenges
4. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 4
Motivation for Semantic Annotations
Websites want to be understood.
by humans
but also by machines
Websites are hard to understand
for machines.
hinders content sharing
hinders the development of smart search engines
5. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 5
Semantic Annotations in Webpages
Possible solution: Websites help machines
to understand their content by including
semantic annotations.
<div itemscope itemtype="http://schema.org/Restaurant">
<span itemprop="name">Hill Restaurant</span>
<span itemprop="telephone">+43 1 3201111</span>
Hours: <span itemprop="openingHours">Monday-Sunday 11am - 21:30pm</span>
Categories: <span itemprop="servesCuisine"> Austrian </span>,
Price Range: <span itemprop="priceRange">€€€€</span>
</div>
6. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 6
Hotel Webpage including Semantic Annotations
<div itemtype="http://schema.org/Hotel">
< span itemprop="name">Vienna Marriott Hotel</span>
<span itemprop="address" itemscope="" itemtype="http://schema.org/PostalAddress">
<span itemprop="streetAddress">Parkring 12a</span>
<span itemprop="addressLocality">Vienna</span>
<span itemprop="postalCode">1010</span>
<span itemprop="addressCountry">Austria</span>
</span>
<span itemprop="description">Stay at Vienna Marriott Hotel, one of the elegant Vienna hotels located in
the city center at the famous Ringstrasse, opposite the city park. St. Stephen’s Square and other
attractions are within walking distance of this hotel in Vienna, Austria.</span>
<div itemprop="aggregateRating" itemscope itemtype="http://schema.org/AggregateRating">
<span itemprop="ratingValue"> 4 </span> stars -based on
<span itemprop="reviewCount"> 250 </span> reviews
<span itemprop="branchOf">Marriott International, Inc.</span>
</div>
7. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 7
Semantic Annotation Formats
Microformats
Microdata
RDFa
date back to 2003
small set of fixed formats
W3C Recommendation 2008
can represent any type of data
proposed in 2009
tries to be simpler than RDFa
8. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 8
Open Graph Protocol
allows site owners to determine how
entities are described in Facebook
relies on RDFa for embedding data into HTML pages
available since April 2010
9. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 9
Schema.org
ask site owners since 2011 to
annotate data for enriching search results.
200+ Types: Event, Organization, Person, Place, Product, Review
Encoding: Microdata or RDFa or JSON-LD
10. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 10
Usage of Schema.org Data @ Google
Rich snippets
within
search results
11. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 11
Flight Offers in Google Search Results
Annotated
webpages
directly below
Google Flights
results
12. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 12
Event Data in Google Applications
https://developers.google.com/structured-data/
13. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 13
Reviews and Ratings
14. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 14
Google Knowledge Graph
aims to describe all “relevant” things in the world
describes more than 570 million things with 18 billion facts (2014)
used to augment search results and answer fact queries
used as background knowledge for ranking search result
consists of commercial third-party data and Web data
15. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 15
Schema.org Annotations included in Knowledge Graph
16. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 16
2. Global Adoption
17. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 17
The Common Crawl
18. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 18
The Web Data Commons Project
extracts all Microformat, Microdata, RDFa data from the
Common Crawl
analyzes and provides the extracted data for download
four extractions runs so far
• 2009/2010 CC Corpus: 2.5 billion HTML pages 5.1 billion RDF triples
• 2012 CC Corpus: 3.0 billion HTML pages 7.3 billion RDF triples
• 2013 CC Corpus: 2.2 billion HTML pages 17.2 billion RDF triples
• 2014 CC Corpus: 2.0 billion HTML pages 20.4 billion RDF triples
uses 100 machines on Amazon EC2
• approx. 3000 machine/hours
(spot instances of type c1.xlarge) 550 EUR
http://www.webdatacommons.org/
19. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 19
Overall Adoption 2014
620 million HTML pages out of the 2.01 billion pages
contained in the crawl provide annotations (30%).
2.72 million pay-level-domains out of the 15.68 million
pay-level-domains covered by the crawl provide
annotations (17%)
20. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 20
Number of PLDs using the Annotation Formats
WebDataCommons, 2014:
819,990 websites (PLDs) provide Microdata annotations.
Google, 2014*:
5 million websites provide Schema.org data.
* Guha in LDOW2014 Keynote
24. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 24
Growth of Tourism-related Schema.org Classes
26. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 26
Adoption by Main Players in the Tourism Industry
27. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 27
Adoption by Booking Websites
Booking Sites ‐ Top 20 Schema:Hotel Any Class
Agoda
DaysInn
Kayak
EasyToBook
Travelocity
Priceline
Hotwire
Make my Trip
Hotel Info
Expedia
Booking.com (uses Data‐Voc)
Hotels.com
Amoma.com
Lowcostholidays
Splendia
Elvoline
Eurostars
Jovago
Onhotels
Travelrepublic
Adoption:
60 %
29. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 29
Adoption by Hotel Chains Websites
Hotel Chains ‐Top 20 Schema:Hotel Any Class
Starwood Hotels and Resorts
InterContinental Hotels Group
Marriott International
Sol Melia SA
Golden Tulip Hospitality group
Wyndham Hotel Group
Global Hyatt Corp.
Extended Stay Hotels
Mövenpick
Hilton Worldwide
Accor
Best Western
Carlson
Westmont Hospitality Group
TUI AG/TUI Hotels & Resorts
Jin Jiang International Hotels
The Rezidor Hotel Group
LQ Management LLC
Home Inns
Adoption:
45 %
30. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 30
Adoption by Singel Hotel Websites
Single Hotel Sites – Top 20 Schema:Hotel Any Class
Caesars Palace
Clarion Hotel
The Venetian Las Vegas
MGM Grand Las Vegas
First World Hotel
Disney's All‐Star Resort
Izmailovo Hotel
Wynn Las Vegas
Mandalay Bay
Luxor Las Vegas
Ambassador City Jomtien
Excalibur Hotel and Casino
Aria Resort & Casino
Bellagio Las Vegas
Circus Circus Las Vegas
Shinagawa Prince Hotel
Atlantis Paradise Island
The Mirage
Monte Carlo Resort and Casino
Estrel Hotel
Adoption:
10 %
31. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 31
4. Opportunities & Challenges
32. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 32
Opportunity 1: Search Engine Optimization
Get richer visibility in search results
Maybe get better ranking
33. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 33
Opportunity 2: Change Push to Pull Communication
Current situation:
• Information providers need to
push data into multiple channels
• multiple search engines
• multiple booking portals
Web approach
• You maintain a website
• All interested parties
crawl your data
• Today: Search engines
• Future: Also other apps?
34. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 34
Opportunity 3: Additional Data for Tourism Applications
Tourism websites / applications rely on
• hotel descriptions / offers
• reviews and rating
• some location information
Potentially relevant additional Schema.org data:
• nearby local businesses / restaurants / ski resorts
• nearby landmarks / historical buildings / museums
• nearby hospitals / libraries
• nearby events
• and ratings for all these things
High up-to-dateness of data
• as original data providers know about changes first
35. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 35
Challenge 1: Data Integration and Cleansing
For applications wanting to use the data, data integration and
cleansing are not trivial.
The schema is standardized, but
• Entity names differ
• Schema rather flat and rather low number of properties are used
• Data quality differs as the data is created by experts and rookies
36. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 36
Example for these Challenges: E-Commerce Data
Microdata
(2012)
Example Product Names:
• AppleMacBook Air MC968/A 11.6-Inch Laptop
• Apple MacBook Air 11-in, Intel Core i5 1.60GHz, 64 GB, Lion 10.7
Example Description:
• Faster Flash Storage with 64 GB Solid State Drive and USB 3.0 …
37. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 37
Classification of Offers by Product Category
We analyzed 1.9 million product offers from 9200 shops (WDC2012)
We trained classifier for 9 product categories on product descriptions
from Amazon.
Petar Petrovski, Volha Bryl, Christian Bizer: Integrating Product Data from Websites offering
Microdata Markup. 4th Workshop on Data Extraction and Object Search (DEOS2014).
38. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 38
Identity Resolution for Electronic Products
We trained a parser to extract product features.
We used Silk framework to find offers of the same product.
39. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 39
Challenge 2: Search Engines as New Competitors?
Google builds the Knowledge Graph
• including information about local businesses, points of interest, hospitals, …
Google has detailed knowledge about the user
• your search and browsing behavior
• your movement patterns via Android
Does this put Google in a good position
for recommending hotels?
Direction and outcome of new EU
anti-trust case against Google will be
interesting.
• Placement of competitors in research results
• Reuse of conent from other websites
https://maps.google.com/locationhistory/
40. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 40
Summary
Semantic annotations make it easier for machines to
understand web content.
Adoption of Schema.org annotations increases sharply.
Opportunities for tourism industry
1. Search engine optimization
2. Change push to pull communication
3. Additional data for tourism applications
Challenges for tourism industry
1. Data integration and cleansing
2. New competitors?
41. Bizer: Schema.org Annotations – Opportunities and Challenges for the Tourism Industry, 17.4.2015 Slide 41
References and Download
Papers
• Robert Meusel, Petar Petrovski and Christian Bizer: The WebDataCommons Microdata, RDFa
and Microformat Dataset Series. 13th International Semantic Web Conference (ISWC2014).
• Petar Petrovski, Volha Bryl, Christian Bizer: Integrating Product Data from Websites offering
Microdata Markup. 4th Workshop on Data Extraction and Object Search (DEOS2014).
• Christian Bizer, Kai Eckert, Robert Meusel, Hannes Mühleisen, Michael Schuhmacher, Johanna
Völker: Deployment of RDFa, Microdata, and Microformats on the Web – A Quantitative
Analysis. 12th International Semantic Web Conference (ISWC2013).
More detailed statistics on RDFa, Microdata and Microformats adoption
• http://www.webdatacommons.org/structureddata/
Download the Web Data Commons Schema.org data
• http://webdatacommons.org/structureddata/2014-12/stats/schema_org_subsets.html