Integrating Know-How in the Linked Data Cloud
Paolo Pareti, Benoit Testu, Ryutaro Ichise, 
Ewan Klein and Adam Barker 
https://w3id.org/prohow/ 
“As we all know, a large number of facts is available on the Web. But what about human activities or know-how? The goal of this talk is to 
tell you how this kind of knowledge can be made machine understandable and available on the Web.”
Human activities (or know-how) 
1. can be represented as Linked Data 
2. can be automatically extracted 
3. can be automatically interlinked 
4. experiment: extracted a large Linked Data dataset 
5. evaluation: our system outperforms humans 
“In particular, the presentation will focus on those five points.”
“If we ask an intelligent system this question: ‘What is the population of the capital of New Zealand?’ we would now assume it can answer this 
question correctly, by accessing knowledge bases available on the Web. But what happens if we ask a seemingly easier question: ‘What do you 
need to wash your hands?’ In this case, the system would not be able to answer.”
??? 
“This is because, to answer this question, the intelligent system would need to have some understanding of what an activity is, and maybe what 
are its requirements. This knowledge, however, is not currently available in existing knowledge bases.”
Why Know-How? 
“But actually know-how is very useful and has a lot of applications. Know-how is relevant in almost all domains, and it can be common sense 
know-how available on the Web, or maybe internal know-how of specific organizations, such as standard operating procedures. This knowledge 
also has applications in fields such as question answering, recommender systems and activity recognition.”
“Human know-how is on the Web, but why is it not accessible? First of all, this knowledge is usually represented in unstructured resources. We 
can think for example of step-by-step instructions, which are typically represented as text in natural language, 
or maybe as pictures and videos.”
“But the most serious limitation is the fact that a single document contains only limited information. What happens if we (or a machine) do not 
understand how to do a specific step, or what a particular ingredient is? In fact, humans often consult multiple resources at the same time to 
complete a complex task.”
Data 
“The first step for making know-how machine understandable is by using a structured representation. We can identify several entities in a 
process, such as steps, methods, requirements and outputs. We can link those entities with each other, depending on which relation exists 
between them.”
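The entities and relations just described can be sketched as a small set of typed links. This is a minimal illustration, not the authors' data model: the `Entity` and `Relation` classes, the `has_output` relation name, and the hand-washing data are assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    """A node in the know-how graph: a task, step, requirement or output."""
    label: str


@dataclass(frozen=True)
class Relation:
    """A directed, named link between two entities."""
    subject: Entity
    predicate: str
    obj: Entity


def related(relations, subject, predicate):
    """Return the labels of all entities linked from `subject` by `predicate`."""
    return [r.obj.label for r in relations
            if r.subject == subject and r.predicate == predicate]


# Illustrative data only; it is not taken from the dataset.
wash = Entity("Wash your hands")
relations = [
    Relation(wash, "requires", Entity("Soap")),
    Relation(wash, "requires", Entity("Water")),
    Relation(wash, "has_step", Entity("Wet your hands")),
    Relation(wash, "has_step", Entity("Rub your hands with soap")),
    Relation(wash, "has_output", Entity("Clean hands")),  # hypothetical relation name
]

# "What do you need to wash your hands?" becomes a lookup on `requires`.
print(related(relations, wash, "requires"))
```

With the relations made explicit, the question a plain text document cannot answer reduces to a simple filter over the links.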
Linked Data 
“To solve the problem of the isolation of single resources, we have adopted a Linked Data representation. In this way, humans and machines can 
discover related resources when they are interested in more information about a specific entity. It is important to notice that these are not just 
links between documents, but between specific entities contained in these documents.”
“Our simple Linked Data representation of know-how is a point of contact between humans and machines. From the human perspective, know-how 
as Linked Data is a way to manage and find relevant resources which are human understandable. From the machine perspective, this data 
can be easily used for analysis and inference, and it can be extended to more complex representations where required.”
“So all of this is not just an idea. It is actually possible and we have run experiments and evaluated our results.”
“What do we want to achieve exactly, when we talk about machine-understandable activities? While it is true that we want to have a knowledge 
representation more powerful than simple text in a document, we cannot yet aim to have machines capable of automating all human activities. 
Therefore we need to start by reaching a first significant but realistic goal.”
“We show the usefulness of this system in a real application. A task currently done by humans is the interlinking of related know-how resources. 
In particular, the WikiHow community is actively creating this kind of link; for example, between a step of a process and another set of 
instructions that explains how to do it.”
How to Make a Pancake
Steps:
1. Prepare the mix
2. Pour the mix in a hot pan
3. Cook until golden
Make a Pancake has_step Prepare the mix
Make a Pancake has_step Pour the mix in a hot pan
Make a Pancake has_step Cook until golden
“This is a simplified example (e.g. missing the relations to specify the order of the steps) of how our system generates a Linked Data 
representation of a Web document. This can be done in many ways, but when the original document has some degree of structure, this 
knowledge extraction can be done easily and accurately.”
How to Make a Pancake
Requirements:
● Eggs
● Milk
● Flour
Steps:
1. Prepare the mix
2. Pour the mix in a hot pan
3. Cook until golden
Make a Pancake requires Eggs
Make a Pancake requires Milk
Make a Pancake requires Flour
Make a Pancake has_step Prepare the mix
Make a Pancake has_step Pour the mix in a hot pan
Make a Pancake has_step Cook until golden
“On the Web, most of these resources have some degree of structure. This is because a well structured set of instructions is better understood 
by humans, even before machines. This structure usually takes the form of a simple enumeration of steps, methods and requirements.”
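Because the structure is a simple enumeration, turning it into `has_step` and `requires` links can be sketched in a few lines. This is a hedged illustration rather than the authors' extractor: the function name `extract_triples`, the section headers it looks for, and the list markers it strips are assumptions about the input format.

```python
import re

# Maps a section header (lower-cased) to the relation it produces.
SECTIONS = {"steps:": "has_step", "requirements:": "requires"}

# Leading list markers to strip: "1." style numbers or a bullet character.
MARKER = re.compile(r"^(\d+\.|[●\-*])\s*")


def extract_triples(title, text):
    """Turn a how-to with 'Steps:' and 'Requirements:' sections into triples."""
    triples = []
    current = None  # relation of the section currently being read
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.lower() in SECTIONS:
            current = SECTIONS[line.lower()]
            continue
        if current is None:
            continue  # ignore text outside a known section
        item = MARKER.sub("", line)
        if item:
            triples.append((title, current, item))
    return triples


PAGE = """
Requirements:
● Eggs
● Milk
● Flour
Steps:
1. Prepare the mix
2. Pour the mix in a hot pan
3. Cook until golden
"""

for triple in extract_triples("Make a Pancake", PAGE):
    print(triple)
```

Like the simplified slide example, this sketch does not record the order of the steps; a fuller version would add a relation between consecutive steps.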
> 200,000 procedures
> 2,600,000 entities
“WikiHow and Snapguide are two large repositories that contain well organized know-how. We have extracted the knowledge of these websites 
and obtained a large dataset of over 200,000 procedures decomposed into over 2,600,000 entities. This can be seen as a large-scale extraction of 
know-how from the Web and conversion to Linked Data.”
How to Install an Operating System
create a partition
How to Create a Partition
“In order to interlink the extracted entities, we have created a system to automatically discover two kinds of links. The first kind is a functional link 
between a step and another set of instructions that explains how this step can be done.”
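The talk does not say how candidate links are scored, so the following sketch substitutes plain word overlap (Jaccard similarity) between a step label and a procedure title. The function names, the stop-word list, the 0.5 threshold, and the extra sample step are all assumptions made for illustration.

```python
STOP_WORDS = {"how", "to", "a", "an", "the"}  # illustrative stop list


def tokens(label):
    """Lower-case a label and drop a few filler words."""
    return {w for w in label.lower().split() if w not in STOP_WORDS}


def jaccard(a, b):
    """Word-overlap similarity between two labels, from 0.0 to 1.0."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def link_steps(steps, procedures, threshold=0.5):
    """Link each step to its best-matching procedure, if the match is good enough."""
    links = []
    for step in steps:
        best = max(procedures, key=lambda p: jaccard(step, p), default=None)
        if best is not None and jaccard(step, best) >= threshold:
            links.append((step, best))
    return links


steps = ["create a partition", "reboot the computer"]  # second step is made up
procedures = ["How to Create a Partition", "How to Make a Pancake"]
print(link_steps(steps, procedures))
```

Word overlap is only a stand-in: it links "create a partition" to "How to Create a Partition" but would miss paraphrases, which is where a more capable matcher is needed.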
How to Make Guacamole (outputs: DBpedia Guacamole)
How to Serve Nachos (requires: DBpedia Guacamole)
“The second kind of link we discovered is similar to an Input/Output link between two processes. Instead of representing it directly, we have this 
link implicitly represented by the types of the input and the output of processes. In this example, we can infer that there is an Input/Output relation 
between the two processes, as one requires the object ‘Guacamole’ while the other outputs it.”
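The inference described here can be sketched as a join on shared types. This is a minimal illustration under stated assumptions: the dictionaries, the function name `infer_io_links`, and the exact DBpedia URI used as the type identifier are chosen for the example and are not taken from the dataset.

```python
GUACAMOLE = "http://dbpedia.org/resource/Guacamole"  # assumed type identifier


def infer_io_links(outputs, requires):
    """Find (producer, type, consumer) triples where one process outputs
    a type that another process requires."""
    links = []
    for producer, produced in outputs.items():
        for consumer, needed in requires.items():
            if producer == consumer:
                continue  # a process does not feed itself
            for shared in sorted(produced & needed):
                links.append((producer, shared, consumer))
    return links


# Each process maps to the set of types it outputs or requires.
outputs = {"How to Make Guacamole": {GUACAMOLE}}
requires = {"How to Serve Nachos": {GUACAMOLE}}

for producer, shared, consumer in infer_io_links(outputs, requires):
    print(f"{producer} -> {consumer} via {shared}")
```

No link between the two processes is stored explicitly; it follows from both of them pointing at the same type.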
Evaluation 
+ 16% precision 
+ ×2 number of links 
+ ×2 coverage 
+ automatic 
+ semantic links 
“Finally, we evaluated the links extracted by our system against the links generated manually by the WikiHow community. The result was a 
significant improvement. Our system identified links of better quality, more in number, and better spread across all resources. All of this comes on top of 
being a completely automatic system that creates semantic Linked Data links, which are more expressive than simple HTML links.”
Know-How as Linked Data? 
... a dream that comes true! 
● Generated a large dataset of > 200,000 human activities as Linked Data 
● Integrated in the Linked Data Cloud 
● Outperformed the human baseline 
https://w3id.org/prohow/ 
“In conclusion, we have seen how know-how can become a new useful resource on the Linked Data Cloud. Our system automated the extraction 
and the integration of this knowledge on a large scale. Please visit this website if you are interested in this dataset or information about the 
project. This website also contains a link to an online visualization tool to explore the dataset.”

 
KeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceKeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data science
 
Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.
 

Human Activities as Linked Data

  • 1. Integrating Know-How in the Linked Data Cloud Paolo Pareti, Benoit Testu, Ryutaro Ichise, Ewan Klein and Adam Barker https://w3id.org/prohow/ “As we all know, there is a large amount of facts available on the Web. But what about human activities or know-how? The goal of this talk is to tell you how this kind of knowledge can be made machine understandable and available on the Web.”
  • 2. Human activities (or know-how) 1. can be represented as Linked Data 2. can be automatically extracted 3. can be automatically interlinked 4. experiment: extracted a large Linked Data dataset 5. evaluation: our system outperforms humans “In particular, the presentation will focus on those five points.”
  • 3. 393,600 “If we ask an intelligent system this question: ‘What is the population of the capital of New Zealand?’ we would now assume it can answer this question correctly, by accessing knowledge bases available on the Web. But what happens if we ask a seemingly easier question: ‘What do you need to wash your hands?’ In this case, the system would not be able to answer.”
  • 4. ??? “This is because, to answer this question, the intelligent system would need to have some understanding of what an activity is, and maybe what are its requirements. This knowledge, however, is not currently available in existing knowledge bases.”
  • 5. Why Know-How? “But actually know-how is very useful and has a lot of applications. Know-how is relevant in almost all domains, and it can be common sense know-how available on the Web, or maybe internal know-how of specific organizations, such as standard operating procedures. This knowledge also has applications in fields such as question answering, recommender systems and activity recognition.”
  • 6. “Human know-how is on the Web, but why is it not accessible? First of all, this knowledge is usually represented in unstructured resources. We can think for example of step-by-step instructions, which are typically represented as text in natural language, or maybe as pictures and videos.”
  • 7. ? ? ? “But the most serious limitation is the fact that a single document contains only limited information. What happens if we (or a machine) do not understand how to do a specific step, or what a particular ingredient is? In fact, it is often the case that humans look at multiple resources at the same time to complete a complex task.”
  • 8. Data “The first step for making know-how machine understandable is by using a structured representation. We can identify several entities in a process, such as steps, methods, requirements and outputs. We can link those entities with each other, depending on which relation exists between them.”
  • 9. Linked Data “To solve the problem of the isolation of single resources, we have adopted a Linked Data representation. In this way, humans and machines can discover related resources when they are interested in more information about a specific entity. It is important to notice that these are not just links between documents, but between specific entities contained in these documents.”
  • 10. “Our simple Linked Data representation of know-how is a point of contact between humans and machines. From the human perspective, know-how as Linked Data is a way to manage and find relevant resources which are human understandable. From the machine perspective, this data can be easily used for analysis, inferencing, and it can be extended to more complex representations where required.”
  • 11. “So all of this is not just an idea. It is actually possible and we have run experiments and evaluated our results.”
  • 12. “What do we want to achieve exactly, when we talk about machine-understandable activities? While it is true that we want to have a knowledge representation more powerful than simple text in a document, we cannot yet aim to have machines capable of automating all human activities. Therefore we need to start by reaching a first significant but realistic goal.”
  • 13. “We show the usefulness of this system in a real application. A task currently done by humans is the interlinking of related know-how resources. In particular, the WikiHow community is actively creating this kind of link; for example between the step of a process and another set of instructions that explains how to do it.”
  • 14. How to Make a Pancake. Steps: 1. Prepare the mix 2. Pour the mix in a hot pan 3. Cook until golden. [Diagram: “Make a Pancake” has_step “Prepare the mix”; has_step “Pour the mix in a hot pan”; has_step “Cook until golden”] “This is a simplified example (e.g. missing the relations to specify the order of the steps) of how our system generates a Linked Data representation of a Web document. This can be done in many ways, but when the original document has some degree of structure, this knowledge extraction can be done easily and accurately.”
  • 15. How to Make a Pancake. Requirements: ● Eggs ● Milk ● Flour. Steps: 1. Prepare the mix 2. Pour the mix in a hot pan 3. Cook until golden. [Diagram: “Make a Pancake” requires “Eggs”; requires “Milk”; requires “Flour”; has_step “Prepare the mix”; has_step “Pour the mix in a hot pan”; has_step “Cook until golden”] “On the Web, most of these resources have some degree of structure. This is because a well structured set of instructions is better understood by humans, even before machines. This structure usually takes the form of a simple enumeration of steps, methods and requirements.”
  • 16. > 200,000 procedures > 2,600,000 entities “WikiHow and Snapguide are two large repositories that contain well organized know-how. We have extracted the knowledge of these websites and obtained a large dataset of over 200,000 procedures decomposed into over 2,600,000 entities. This can be seen as a large-scale extraction of know-how from the Web and conversion to Linked Data.”
  • 17. [Diagram: “How to Install an Operating System” has the step “create a partition”, which links to “How to Create a Partition”] “In order to interlink the extracted entities, we have created a system to automatically discover two kinds of links. The first kind is a functional link between a step and another set of instructions that explains how this step can be done.”
  • 18. DBpedia Guacamole How to Make Guacamole How to Serve Nachos “The second kind of links we discovered is similar to an Input/Output link between two processes. Instead of representing it directly, we have this link implicitly represented by the types of the input and the output of processes. In this example, we can infer that there is an Input/Output relation between the two processes, as one requires the object ‘Guacamole’ while the other outputs it.”
  • 19. Evaluation + 16% precision + ×2 number of links + ×2 coverage + automatic + semantic links “Finally we evaluated the links extracted by our system against the links generated manually by the WikiHow community. The result was a significant improvement. Our system identified links of better quality, more in number, and better spread across all resources. All of this on top of being a completely automatic system which creates semantic Linked Data links, more expressive than simple HTML links.”
  • 20. Know-How as Linked Data? ...a dream that comes true! ● Generated a large dataset of > 200,000 human activities as Linked Data ● Integrated in the Linked Data Cloud ● Outperformed the human baseline https://w3id.org/prohow/ “In conclusion, we have seen how know-how can become a new useful resource on the Linked Data Cloud. Our system automated the extraction and the integration of this knowledge on a large scale. Please visit this website if you are interested in this dataset or information about the project. This website also contains a link to an online visualization tool to explore the dataset.”
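
The representation on slides 14 and 15 and the Input/Output inference on slide 18 can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' system: triples are plain tuples rather than real RDF, the `ex:` identifiers are made up, `has_step` and `requires` follow the slide labels, and the `has_output` and `type` predicates are assumptions introduced here to model "outputs" and the DBpedia typing.

```python
from collections import defaultdict

# DBpedia resource shown on slide 18.
DBPEDIA_GUACAMOLE = "http://dbpedia.org/resource/Guacamole"

# Each triple is (subject, predicate, object). All "ex:" names are hypothetical.
triples = [
    # Slides 14-15: one process decomposed into steps and requirements.
    ("ex:make_pancake", "has_step", "ex:prepare_mix"),
    ("ex:make_pancake", "has_step", "ex:pour_mix"),
    ("ex:make_pancake", "has_step", "ex:cook_until_golden"),
    ("ex:make_pancake", "requires", "ex:eggs"),
    ("ex:make_pancake", "requires", "ex:milk"),
    ("ex:make_pancake", "requires", "ex:flour"),
    # Slide 18: an output and a requirement typed with the same DBpedia resource.
    ("ex:make_guacamole", "has_output", "ex:guacamole_out"),  # assumed predicate
    ("ex:guacamole_out", "type", DBPEDIA_GUACAMOLE),          # assumed predicate
    ("ex:serve_nachos", "requires", "ex:guacamole_in"),
    ("ex:guacamole_in", "type", DBPEDIA_GUACAMOLE),
]


def objects(triples, subject, predicate):
    """Return the sorted objects of all triples matching subject and predicate."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)


def io_links(triples):
    """Return (producer, consumer) pairs whose output and requirement share a type."""
    types = defaultdict(set)
    for s, p, o in triples:
        if p == "type":
            types[s].add(o)
    outputs = [(s, o) for s, p, o in triples if p == "has_output"]
    inputs = [(s, o) for s, p, o in triples if p == "requires"]
    links = set()
    for producer, out_entity in outputs:
        for consumer, in_entity in inputs:
            # The link is implicit: it exists only because the types overlap.
            if producer != consumer and types[out_entity] & types[in_entity]:
                links.add((producer, consumer))
    return sorted(links)


steps = objects(triples, "ex:make_pancake", "has_step")
links = io_links(triples)
print(steps)
print(links)
```

Untyped requirements such as `ex:eggs` never match anything, so in this sketch only entities that have been linked to a shared external resource can give rise to an Input/Output relation between two processes.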