Human Activities as Linked Data

Paolo Pareti
Paolo ParetiResearch Fellow at Southampton University
Integrating 
Know-How 
in the Linked Data Cloud 
Paolo Pareti, Benoit Testu, Ryutaro Ichise, 
Ewan Klein and Adam Barker 
https://w3id.org/prohow/ 
“As we all know, there is a large amount of facts available on the Web. But what about human activities or know-how? The goal of this talk is to 
tell you how this kind of knowledge can be made machine understandable and available on the Web.”
Human activities (or know-how) 
1. can be represented as Linked Data 
2. can be automatically extracted 
3. can be automatically interlinked 
4. experiment: extracted a large Linked Data dataset 
5. evaluation: our system outperforms humans 
“In particular, the presentation will focus on those five points.”
339933,,660000 
“If we ask an intelligent system this question: ‘What is the population of the capital of New Zealand?’ we would now assume it can answer this 
question correctly, by accessing knowledge bases available on the Web. But what happens if we ask a seemingly easier question: ‘What do you 
need to wash you hands?’ In this case, the system would not be able to answer.”
??? 
“This is because, to answer this question, the intelligent system would need to have some understanding of what an activity is, and maybe what 
are its requirements. This knowledge, however, is not currently available in existing knowledge bases.”
Why Know-How? 
“But actually know-how is very useful and has a lot of applications. Know-how is relevant in almost all domains, and it can be common sense 
know-how available on the Web, or maybe internal know-how of specific organizations, such as standard operating procedures. This knowledge 
also has applications in fields such as question answering, recommender systems and activity recognition.”
“Human know-how is on the Web, but why is it not accessible? First of all, this knowledge is usually represented in unstructured resources. We 
can think for example of step-by-step instructions, which are typically represented as text in natural language, 
or maybe as pictures and videos.”
? 
? 
? 
“But the most serious limitation is the fact that a single document contains only limited information. What happens if we (or a machine) does not 
understand how to do a specific step, or what a particular ingredient is. In fact, it is often the case that humans look at multiple resources to 
complete a complex task for the same time.”
Data 
“The first step for making know-how machine understandable is by using a structured representation. We can identify several entities in a 
process, such as steps, methods, requirements and outputs. We can link those entities with each other, depending on which relation exists 
between them.”
Linked Data 
“To solve the problem of the isolation of single resources, we have adopted a Linked Data representation. In this way, humans and machines can 
discover related resources when they are interested in more information about a specific entity. It is important to notice that these are not just 
links between documents, but between specific entities contained in these documents.”
“Our simple Linked Data representation of know-how is a point of contact between humans and machines. From the human perspective, know-how 
as Linked Data is a way to manage and find relevant resources which are human understandable. From the machine perspective, this data 
can be easily used for analysis, inferencing, and it can be extended to more complex representations where required.”
“So all of this is not just an idea. It is actually possible and we have run experiments and evaluated our results.”
“What do we want to achieve exactly, when we talk about machine-understandable activities? While it is true that we want to have a knowledge 
representation more powerful than simple text in a document, we cannot yet aim to have machines capable of automating all human activities. 
Therefore we need to start by reaching a first significant but realistic goal.”
“We show the usefulness of this system in a real application. A task currently done by humans is the interlinking of related know-how resources. 
In particular, the WikiHow community is actively creating such kind of links; for example between the step of a process and another set of 
instructions that explains how to do it.”
How to 
Make a Pancake 
Steps: 
1. Prepare the mix 
2. Pour the mix 
in a hot pan 
3. Cook until golden 
Make a Pancake has_step 
has_step 
has_step 
Prepare the mix Cook until golden 
Pour the mix 
in a hot pan 
“This is a simplified example (e.g. missing the relations to specify the order of the steps) of how our system generates a Linked Data 
representation of a Web document. This can be done in many ways, but when the original document has some degree of structure, this 
knowledge extraction can be done easily and accurately.”
How to 
Make a Pancake 
Steps: 
1. Prepare the mix 
2. Pour the mix 
in a hot pan 
3. Cook until golden 
Make a Pancake has_step 
requires 
requires 
has_step 
has_step 
Eggs 
Milk 
Prepare the mix Cook until golden 
Pour the mix 
in a hot pan 
Requirements: 
● Eggs 
● Milk 
● Flour 
Flour 
requires 
“On the Web, most of these resources have some degree of structure. This is because a well structured set of instructions is better understood 
by humans, even before machines. This structure usually takes form of a simple enumeration of steps, methods and requirements.”
> 200,000 
procedures 
> 2,600,000 
entities 
“WikiHow and Snapguide are two large repositories that contain well organized know-how. We have extracted the knowledge of these websites 
and obtained a large dataset of over 200,000 procedures decomposed in over 2,600,000 entities. This can be seen as a large-scale extraction of 
know-how from the Web and conversion to Linked Data.”
Hot to Install an Operating System 
create a partition 
How to Create 
a Partition 
“In order to interlink the extracted entities, we have created a system to automatically discover two kinds of links. The first kind is a functional link 
between a step and another set of instructions that explains how this step can be done.”
DBpedia Guacamole 
How to Make Guacamole How to Serve Nachos 
“The second kind of links we discovered is similar to an Input/Output link between two processes. Instead of representing it directly, we have this 
link implicitly represented by the types of the input and the output of processes. In this example, we can infer that there is an Input/Output relation 
between the two processes, as one requires the object ‘Guacamole’ while the other outputs it.”
Evaluation 
+ 16% precision 
+ ×2 number of links 
+ ×2 coverage 
+ automatic 
+ semantic links 
“Finally we evaluated the links extracted by our system against the links generated manually by the WikiHow community. The result was a 
significant improvement. Our system identified links of better quality, more in number, and better spread across all resources. All of this on top of 
being a completely automatic system which creates semantic Linked Data links, more expressive than simple html links.”
Know How as Linked Data? 
….a dream that comes true! 
● Generated a large dataset of > 200,000 
human activities as Linked Data 
● Integrated in the Linked Data Cloud 
● Outperformed the human baseline 
https://w3id.org/prohow/ 
“In conclusion, we have seen how know-how can become a new useful resource on the Linked Data Cloud. Our system automated the extraction 
and the integration of this knowledge on a large scale. Please visit this website if you are interested in this dataset or information about the 
project. This website also contains a link to an online visualization tool to explore the dataset”.
1 of 20

Recommended

Patchwork February 2013 UK by
Patchwork February 2013 UKPatchwork February 2013 UK
Patchwork February 2013 UKFutureGov
859 views25 slides
Patchwork February 2013 MAV by
Patchwork February 2013 MAVPatchwork February 2013 MAV
Patchwork February 2013 MAVFutureGov
298 views25 slides
#1NWebinar: Cracking Big Content by
#1NWebinar: Cracking Big Content#1NWebinar: Cracking Big Content
#1NWebinar: Cracking Big ContentOne North
2.7K views47 slides
The semanticweb may2001_timbernerslee by
The semanticweb may2001_timbernersleeThe semanticweb may2001_timbernerslee
The semanticweb may2001_timbernersleegrknsfk
168 views4 slides
io dance by
io danceio dance
io dancemonika hardy
816 views29 slides
SciSoftDays Talk - Howison: Spreading the work in software ecosystems by
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsJames Howison
356 views25 slides

More Related Content

What's hot

Systems Thinking workshop, given at Lean UX NYC by
Systems Thinking workshop, given at Lean UX NYCSystems Thinking workshop, given at Lean UX NYC
Systems Thinking workshop, given at Lean UX NYCjohanna kollmann
3.5K views49 slides
The Inline Interface by
The Inline InterfaceThe Inline Interface
The Inline InterfacePeter Brantley
709 views30 slides
Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?" by
Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?"Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?"
Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?"hypertext2007
677 views34 slides
Mashing up the web” - combining, fusing, creating ideas in linking web 2.0 t... by
Mashing up the web”  - combining, fusing, creating ideas in linking web 2.0 t...Mashing up the web”  - combining, fusing, creating ideas in linking web 2.0 t...
Mashing up the web” - combining, fusing, creating ideas in linking web 2.0 t...Allan Cho
389 views15 slides
We Want Our Data Now! 7 principles of democratizing data by
We Want Our Data Now! 7 principles of democratizing dataWe Want Our Data Now! 7 principles of democratizing data
We Want Our Data Now! 7 principles of democratizing dataW. David Stephenson
1.4K views17 slides
Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di... by
Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...
Grant: The Impact of Cloud, Mobile, and Managing the Changing Platforms of Di...National Information Standards Organization (NISO)
757 views54 slides

What's hot(20)

Systems Thinking workshop, given at Lean UX NYC by johanna kollmann
Systems Thinking workshop, given at Lean UX NYCSystems Thinking workshop, given at Lean UX NYC
Systems Thinking workshop, given at Lean UX NYC
johanna kollmann3.5K views
Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?" by hypertext2007
Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?"Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?"
Hypertext2007 Wendy Hall - "Whatever Happened to Hypertext?"
hypertext2007677 views
Mashing up the web” - combining, fusing, creating ideas in linking web 2.0 t... by Allan Cho
Mashing up the web”  - combining, fusing, creating ideas in linking web 2.0 t...Mashing up the web”  - combining, fusing, creating ideas in linking web 2.0 t...
Mashing up the web” - combining, fusing, creating ideas in linking web 2.0 t...
Allan Cho389 views
We Want Our Data Now! 7 principles of democratizing data by W. David Stephenson
We Want Our Data Now! 7 principles of democratizing dataWe Want Our Data Now! 7 principles of democratizing data
We Want Our Data Now! 7 principles of democratizing data
W. David Stephenson1.4K views
Open, social and linked - what do current Web trends tell us about the future... by Andy Powell
Open, social and linked - what do current Web trends tell us about the future...Open, social and linked - what do current Web trends tell us about the future...
Open, social and linked - what do current Web trends tell us about the future...
Andy Powell1.5K views
Breaking Out of the Walled Garden: Lessons Learned in Moving Library Linked D... by OCLC
Breaking Out of the Walled Garden: Lessons Learned in Moving Library Linked D...Breaking Out of the Walled Garden: Lessons Learned in Moving Library Linked D...
Breaking Out of the Walled Garden: Lessons Learned in Moving Library Linked D...
OCLC1.1K views
Where the Social Web Meets the Semantic Web. Tom Gruber by Nelson Piedra
Where the Social Web Meets the Semantic Web. Tom GruberWhere the Social Web Meets the Semantic Web. Tom Gruber
Where the Social Web Meets the Semantic Web. Tom Gruber
Nelson Piedra3.5K views
Data Big and Broad (Oxford, 2012) by James Hendler
Data Big and Broad (Oxford, 2012)Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)
James Hendler5K views
Gabor Cselle - The Future of Email by gaborcselle
Gabor Cselle - The Future of EmailGabor Cselle - The Future of Email
Gabor Cselle - The Future of Email
gaborcselle2.9K views
A LITERATURE REVIEW ON SEMANTIC WEB – UNDERSTANDING THE PIONEERS’ PERSPECTIVE by csandit
A LITERATURE REVIEW ON SEMANTIC WEB – UNDERSTANDING THE PIONEERS’ PERSPECTIVEA LITERATURE REVIEW ON SEMANTIC WEB – UNDERSTANDING THE PIONEERS’ PERSPECTIVE
A LITERATURE REVIEW ON SEMANTIC WEB – UNDERSTANDING THE PIONEERS’ PERSPECTIVE
csandit37 views
Linked Data and the Semantic Web - Mimas Seminar by Adrian Stevenson
Linked Data and the Semantic Web - Mimas SeminarLinked Data and the Semantic Web - Mimas Seminar
Linked Data and the Semantic Web - Mimas Seminar
Adrian Stevenson846 views
Cultural heritage collections in a web 2 by Lynne Thomas
Cultural heritage collections in a web 2Cultural heritage collections in a web 2
Cultural heritage collections in a web 2
Lynne Thomas513 views
Facilitating Web Science Collaboration through Semantic Markup by James Hendler
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic Markup
James Hendler2.5K views
Isle of Man open data overview by Chris Taggart
Isle of Man open data overviewIsle of Man open data overview
Isle of Man open data overview
Chris Taggart1.3K views
Semantic web and information graph by Chao-Hsuan Shen
Semantic web and information graphSemantic web and information graph
Semantic web and information graph
Chao-Hsuan Shen293 views

Viewers also liked

A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me... by
A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me...A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me...
A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me...Paolo Pareti
1K views29 slides
How to Start Using LaTeX and BibTeX by
How to Start Using LaTeX and BibTeXHow to Start Using LaTeX and BibTeX
How to Start Using LaTeX and BibTeXPaolo Pareti
1.3K views32 slides
End note reference manager2013 by
End note reference manager2013End note reference manager2013
End note reference manager2013Bettie Kock
627 views89 slides
BibTex:Bibliografía para Latex by
BibTex:Bibliografía para LatexBibTex:Bibliografía para Latex
BibTex:Bibliografía para LatexErnesto CC
655 views33 slides
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken... by
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Yun-Nung (Vivian) Chen
1.6K views14 slides
An Intelligent Assistant for High-Level Task Understanding by
An Intelligent Assistant for High-Level Task UnderstandingAn Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task UnderstandingYun-Nung (Vivian) Chen
1.8K views30 slides

Viewers also liked(6)

A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me... by Paolo Pareti
A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me...A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me...
A Linked Data Scalability Challenge: Frequently Reused Concepts Lose their Me...
Paolo Pareti1K views
How to Start Using LaTeX and BibTeX by Paolo Pareti
How to Start Using LaTeX and BibTeXHow to Start Using LaTeX and BibTeX
How to Start Using LaTeX and BibTeX
Paolo Pareti1.3K views
End note reference manager2013 by Bettie Kock
End note reference manager2013End note reference manager2013
End note reference manager2013
Bettie Kock627 views
BibTex:Bibliografía para Latex by Ernesto CC
BibTex:Bibliografía para LatexBibTex:Bibliografía para Latex
BibTex:Bibliografía para Latex
Ernesto CC655 views
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken... by Yun-Nung (Vivian) Chen
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
An Intelligent Assistant for High-Level Task Understanding by Yun-Nung (Vivian) Chen
An Intelligent Assistant for High-Level Task UnderstandingAn Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task Understanding

Similar to Human Activities as Linked Data

Knowledgebase vs Database by
Knowledgebase vs DatabaseKnowledgebase vs Database
Knowledgebase vs DatabaseCJ Jenkins
29.2K views21 slides
Research on collaborative information sharing systems by
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systemsDavide Eynard
817 views43 slides
Searching for patterns in crowdsourced information by
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSilvia Puglisi
259 views32 slides
Bootstrap Alliance Google Call to Action by
Bootstrap Alliance Google Call to ActionBootstrap Alliance Google Call to Action
Bootstrap Alliance Google Call to Actionyesheng
1K views60 slides
Knowledge Sharing over social networking systems by
Knowledge Sharing over social networking systemsKnowledge Sharing over social networking systems
Knowledge Sharing over social networking systemstanguy
986 views37 slides
Online College Information System by
Online College Information SystemOnline College Information System
Online College Information SystemJenny Mancini
3 views83 slides

Similar to Human Activities as Linked Data(20)

Knowledgebase vs Database by CJ Jenkins
Knowledgebase vs DatabaseKnowledgebase vs Database
Knowledgebase vs Database
CJ Jenkins29.2K views
Research on collaborative information sharing systems by Davide Eynard
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systems
Davide Eynard817 views
Searching for patterns in crowdsourced information by Silvia Puglisi
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
Silvia Puglisi259 views
Bootstrap Alliance Google Call to Action by yesheng
Bootstrap Alliance Google Call to ActionBootstrap Alliance Google Call to Action
Bootstrap Alliance Google Call to Action
yesheng1K views
Knowledge Sharing over social networking systems by tanguy
Knowledge Sharing over social networking systemsKnowledge Sharing over social networking systems
Knowledge Sharing over social networking systems
tanguy986 views
Online College Information System by Jenny Mancini
Online College Information SystemOnline College Information System
Online College Information System
Jenny Mancini3 views
Analyzing And On Difference Between Web 2.0 And 2.0 by Lauren Barker
Analyzing And On Difference Between Web 2.0 And 2.0Analyzing And On Difference Between Web 2.0 And 2.0
Analyzing And On Difference Between Web 2.0 And 2.0
Lauren Barker2 views
Enhancing Data Center Performance On A Cloud Environment... by Angela Gibbs
Enhancing Data Center Performance On A Cloud Environment...Enhancing Data Center Performance On A Cloud Environment...
Enhancing Data Center Performance On A Cloud Environment...
Angela Gibbs2 views
Information Organisation for the Future Web: with Emphasis to Local CIRs by inventionjournals
Information Organisation for the Future Web: with Emphasis to Local CIRs Information Organisation for the Future Web: with Emphasis to Local CIRs
Information Organisation for the Future Web: with Emphasis to Local CIRs
Cutting the trees of knowledge by irismei
Cutting the trees of knowledgeCutting the trees of knowledge
Cutting the trees of knowledge
irismei208 views
Cutting the trees of knowledge by irismei
Cutting the trees of knowledgeCutting the trees of knowledge
Cutting the trees of knowledge
irismei178 views
Learning 2.0: What happens when learning meets the read/write web by James BonTempo
Learning 2.0: What happens when learning meets the read/write webLearning 2.0: What happens when learning meets the read/write web
Learning 2.0: What happens when learning meets the read/write web
James BonTempo297 views
Analysis And Findings On Outdoor Activities by Carli Ferrante
Analysis And Findings On Outdoor ActivitiesAnalysis And Findings On Outdoor Activities
Analysis And Findings On Outdoor Activities
Carli Ferrante3 views
Tutorial Cognition - Irene by SSSW
Tutorial Cognition - IreneTutorial Cognition - Irene
Tutorial Cognition - Irene
SSSW626 views
Essay On Database by Syracuse2
Essay On DatabaseEssay On Database
Essay On Database
Syracuse229 views

Recently uploaded

Note on the Riemann Hypothesis by
Note on the Riemann HypothesisNote on the Riemann Hypothesis
Note on the Riemann Hypothesisvegafrank2
9 views20 slides
Gel Filtration or Permeation Chromatography by
Gel Filtration or Permeation ChromatographyGel Filtration or Permeation Chromatography
Gel Filtration or Permeation ChromatographyPoonam Aher Patil
12 views15 slides
Cyanobacteria as a Biofertilizer (BY- Ayushi).pptx by
Cyanobacteria as a Biofertilizer (BY- Ayushi).pptxCyanobacteria as a Biofertilizer (BY- Ayushi).pptx
Cyanobacteria as a Biofertilizer (BY- Ayushi).pptxAyushiKardam
9 views13 slides
Thin layer chromatography ( Horizontal) by
Thin layer chromatography  ( Horizontal)Thin layer chromatography  ( Horizontal)
Thin layer chromatography ( Horizontal)Poonam Aher Patil
9 views81 slides
ALGAL PRODUCTS.pptx by
ALGAL PRODUCTS.pptxALGAL PRODUCTS.pptx
ALGAL PRODUCTS.pptxRASHMI M G
7 views17 slides
Oral_Presentation_by_Fatma (2).pdf by
Oral_Presentation_by_Fatma (2).pdfOral_Presentation_by_Fatma (2).pdf
Oral_Presentation_by_Fatma (2).pdffatmaalmrzqi
8 views7 slides

Recently uploaded(20)

Note on the Riemann Hypothesis by vegafrank2
Note on the Riemann HypothesisNote on the Riemann Hypothesis
Note on the Riemann Hypothesis
vegafrank29 views
Gel Filtration or Permeation Chromatography by Poonam Aher Patil
Gel Filtration or Permeation ChromatographyGel Filtration or Permeation Chromatography
Gel Filtration or Permeation Chromatography
Cyanobacteria as a Biofertilizer (BY- Ayushi).pptx by AyushiKardam
Cyanobacteria as a Biofertilizer (BY- Ayushi).pptxCyanobacteria as a Biofertilizer (BY- Ayushi).pptx
Cyanobacteria as a Biofertilizer (BY- Ayushi).pptx
AyushiKardam9 views
Oral_Presentation_by_Fatma (2).pdf by fatmaalmrzqi
Oral_Presentation_by_Fatma (2).pdfOral_Presentation_by_Fatma (2).pdf
Oral_Presentation_by_Fatma (2).pdf
fatmaalmrzqi8 views
Geometrical qualities of the generalised Schwarzschild spacetimes by Orchidea Maria Lecian
Geometrical qualities of the generalised Schwarzschild spacetimesGeometrical qualities of the generalised Schwarzschild spacetimes
Geometrical qualities of the generalised Schwarzschild spacetimes
INTRODUCTION TO PLANT SYSTEMATICS.pptx by RASHMI M G
INTRODUCTION TO PLANT SYSTEMATICS.pptxINTRODUCTION TO PLANT SYSTEMATICS.pptx
INTRODUCTION TO PLANT SYSTEMATICS.pptx
RASHMI M G 5 views
Real Science Radio - Dr Paul Homan Climate Change.pptx by Fred Williams
Real Science Radio - Dr Paul Homan Climate Change.pptxReal Science Radio - Dr Paul Homan Climate Change.pptx
Real Science Radio - Dr Paul Homan Climate Change.pptx
Fred Williams8 views
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by Anmol Vishnu Gupta
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Towards Error-Corrected Quantum Computing with Neutral Atoms by Yuval Boger
Towards Error-Corrected Quantum Computing with Neutral AtomsTowards Error-Corrected Quantum Computing with Neutral Atoms
Towards Error-Corrected Quantum Computing with Neutral Atoms
Yuval Boger5 views
Paper Chromatography or Paper partition chromatography by Poonam Aher Patil
Paper Chromatography or Paper partition chromatographyPaper Chromatography or Paper partition chromatography
Paper Chromatography or Paper partition chromatography
AI for automated materials discovery via learning to represent, predict, gene... by Deakin University
AI for automated materials discovery via learning to represent, predict, gene...AI for automated materials discovery via learning to represent, predict, gene...
AI for automated materials discovery via learning to represent, predict, gene...
RADIATION PHYSICS.pptx by drpriyanka8
RADIATION PHYSICS.pptxRADIATION PHYSICS.pptx
RADIATION PHYSICS.pptx
drpriyanka815 views
Worldviews and their (im)plausibility: Science and Holism by JohnWilkins48
Worldviews and their (im)plausibility: Science and HolismWorldviews and their (im)plausibility: Science and Holism
Worldviews and their (im)plausibility: Science and Holism
JohnWilkins4849 views
Eukaryotic microbiology lab Dos and Donts.pptx by Prasanna Kumar
Eukaryotic microbiology lab Dos and Donts.pptxEukaryotic microbiology lab Dos and Donts.pptx
Eukaryotic microbiology lab Dos and Donts.pptx
Prasanna Kumar8 views

Human Activities as Linked Data

  • 1. Integrating Know-How in the Linked Data Cloud Paolo Pareti, Benoit Testu, Ryutaro Ichise, Ewan Klein and Adam Barker https://w3id.org/prohow/ “As we all know, there is a large amount of facts available on the Web. But what about human activities or know-how? The goal of this talk is to tell you how this kind of knowledge can be made machine understandable and available on the Web.”
  • 2. Human activities (or know-how) 1. can be represented as Linked Data 2. can be automatically extracted 3. can be automatically interlinked 4. experiment: extracted a large Linked Data dataset 5. evaluation: our system outperforms humans “In particular, the presentation will focus on those five points.”
  • 3. 339933,,660000 “If we ask an intelligent system this question: ‘What is the population of the capital of New Zealand?’ we would now assume it can answer this question correctly, by accessing knowledge bases available on the Web. But what happens if we ask a seemingly easier question: ‘What do you need to wash you hands?’ In this case, the system would not be able to answer.”
  • 4. ??? “This is because, to answer this question, the intelligent system would need to have some understanding of what an activity is, and maybe what are its requirements. This knowledge, however, is not currently available in existing knowledge bases.”
  • 5. Why Know-How? “But actually know-how is very useful and has a lot of applications. Know-how is relevant in almost all domains, and it can be common sense know-how available on the Web, or maybe internal know-how of specific organizations, such as standard operating procedures. This knowledge also has applications in fields such as question answering, recommender systems and activity recognition.”
  • 6. “Human know-how is on the Web, but why is it not accessible? First of all, this knowledge is usually represented in unstructured resources. We can think for example of step-by-step instructions, which are typically represented as text in natural language, or maybe as pictures and videos.”
  • 7. ? ? ? “But the most serious limitation is the fact that a single document contains only limited information. What happens if we (or a machine) does not understand how to do a specific step, or what a particular ingredient is. In fact, it is often the case that humans look at multiple resources to complete a complex task for the same time.”
  • 8. Data “The first step for making know-how machine understandable is by using a structured representation. We can identify several entities in a process, such as steps, methods, requirements and outputs. We can link those entities with each other, depending on which relation exists between them.”
  • 9. Linked Data “To solve the problem of the isolation of single resources, we have adopted a Linked Data representation. In this way, humans and machines can discover related resources when they are interested in more information about a specific entity. It is important to notice that these are not just links between documents, but between specific entities contained in these documents.”
  • 10. “Our simple Linked Data representation of know-how is a point of contact between humans and machines. From the human perspective, know-how as Linked Data is a way to manage and find relevant resources which are human understandable. From the machine perspective, this data can be easily used for analysis, inferencing, and it can be extended to more complex representations where required.”
  • 11. “So all of this is not just an idea. It is actually possible and we have run experiments and evaluated our results.”
  • 12. “What do we want to achieve exactly, when we talk about machine-understandable activities? While it is true that we want to have a knowledge representation more powerful than simple text in a document, we cannot yet aim to have machines capable of automating all human activities. Therefore we need to start by reaching a first significant but realistic goal.”
  • 13. “We show the usefulness of this system in a real application. A task currently done by humans is the interlinking of related know-how resources. In particular, the WikiHow community is actively creating such kind of links; for example between the step of a process and another set of instructions that explains how to do it.”
  • 14. How to Make a Pancake Steps: 1. Prepare the mix 2. Pour the mix in a hot pan 3. Cook until golden Make a Pancake has_step has_step has_step Prepare the mix Cook until golden Pour the mix in a hot pan “This is a simplified example (e.g. missing the relations to specify the order of the steps) of how our system generates a Linked Data representation of a Web document. This can be done in many ways, but when the original document has some degree of structure, this knowledge extraction can be done easily and accurately.”
  • 15. How to Make a Pancake Steps: 1. Prepare the mix 2. Pour the mix in a hot pan 3. Cook until golden Make a Pancake has_step requires requires has_step has_step Eggs Milk Prepare the mix Cook until golden Pour the mix in a hot pan Requirements: ● Eggs ● Milk ● Flour Flour requires “On the Web, most of these resources have some degree of structure. This is because a well structured set of instructions is better understood by humans, even before machines. This structure usually takes form of a simple enumeration of steps, methods and requirements.”
  • 16. > 200,000 procedures > 2,600,000 entities “WikiHow and Snapguide are two large repositories that contain well organized know-how. We have extracted the knowledge of these websites and obtained a large dataset of over 200,000 procedures decomposed in over 2,600,000 entities. This can be seen as a large-scale extraction of know-how from the Web and conversion to Linked Data.”
  • 17. Hot to Install an Operating System create a partition How to Create a Partition “In order to interlink the extracted entities, we have created a system to automatically discover two kinds of links. The first kind is a functional link between a step and another set of instructions that explains how this step can be done.”
  • 18. DBpedia Guacamole How to Make Guacamole How to Serve Nachos “The second kind of links we discovered is similar to an Input/Output link between two processes. Instead of representing it directly, we have this link implicitly represented by the types of the input and the output of processes. In this example, we can infer that there is an Input/Output relation between the two processes, as one requires the object ‘Guacamole’ while the other outputs it.”
  • 19. Evaluation + 16% precision + ×2 number of links + ×2 coverage + automatic + semantic links “Finally we evaluated the links extracted by our system against the links generated manually by the WikiHow community. The result was a significant improvement. Our system identified links of better quality, more in number, and better spread across all resources. All of this on top of being a completely automatic system which creates semantic Linked Data links, more expressive than simple html links.”
  • 20. Know How as Linked Data? ….a dream that comes true! ● Generated a large dataset of > 200,000 human activities as Linked Data ● Integrated in the Linked Data Cloud ● Outperformed the human baseline https://w3id.org/prohow/ “In conclusion, we have seen how know-how can become a new useful resource on the Linked Data Cloud. Our system automated the extraction and the integration of this knowledge on a large scale. Please visit this website if you are interested in this dataset or information about the project. This website also contains a link to an online visualization tool to explore the dataset”.