Natural Language Processing and Text Mining with Knowledge Graphs - Vaticle
Text is the medium used to store the tremendous wealth of scientific knowledge regarding the world we live in. However, with its ever-increasing magnitude and throughput, analysing this unstructured data has become a tedious task. This has led to the rise of Natural Language Processing (NLP) as the go-to discipline for examining and processing large amounts of natural language data.
This involves the automatic extraction of structured semantic information from unstructured machine-readable text. The identification of these explicit concepts and relationships helps in discovering the many insights contained in text in a scalable and effective way.
A major challenge is the mapping of unstructured information from raw texts into entities, relationships and attributes in the knowledge graph. In this talk, we demonstrate how Grakn can be used to create a text mining knowledge graph capable of modelling, storing, and exploring beneficial information extracted from medical literature.
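As a rough illustration of the mapping described above, the sketch below uses a toy rule-based extractor to pull (subject, relation, object) triples out of raw text and accumulate them into a graph structure. The pattern and example sentences are purely illustrative and are not Grakn's actual pipeline.

```python
import re

# Toy rule-based extractor: maps sentences of the form
# "<X> treats|causes|inhibits <Y>" to (subject, relation, object) triples.
PATTERN = re.compile(r"(\w+) (treats|causes|inhibits) (\w+)")

def extract_triples(text):
    """Return (subject, relation, object) triples found in raw text."""
    return [m.groups() for m in PATTERN.finditer(text)]

def to_graph(triples):
    """Accumulate triples into a simple adjacency structure."""
    graph = {}
    for subj, rel, obj in triples:
        graph.setdefault(subj, []).append((rel, obj))
    return graph

text = "Aspirin treats headache. Smoking causes cancer."
print(to_graph(extract_triples(text)))
```

In a real system the regex would be replaced by an NLP extraction model, and `to_graph` by inserts into the knowledge graph's entities and relationships.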
#### Syed Irtaza Raza, Software and Biomedical Engineer @ Grakn Labs
Syed is a Software and Biomedical Engineer at Grakn, primarily working on showing the world how to use a knowledge graph such as Grakn to build cognitive/intelligent systems in the biomedical domain. To achieve this, he implements innovative examples that serve as templates and ideas that clients and community members can apply in their own projects, in any field.
With a background in Electrical, Software and Biomedical Engineering, Syed’s mission is to discover and implement intelligent biomedical tools that are only possible with Grakn as a knowledge graph.
This is a clip from the Grakn London Meetup at the Royal Academy of Engineering (March 2019). Join the community: www.grakn.ai/community
Knowledge graph convolutional networks - Berlin 2019 - Vaticle
As humans we use our knowledge, our reasoning and our understanding of situational context to make accurate predictions about the world around us; machine learning doesn’t typically make use of any of this rich information.
The ability to leverage highly interrelated data will yield a step-change in the quality and complexity of predictions that can be made for the same volume of data.
We present Knowledge Graph Convolutional Networks: a method for performing machine learning over a Grakn Knowledge Graph, which captures micro-context and macro-context for any Concept within the graph.
This methodology demonstrates how we can practically combine knowledge, learning and reasoning to build systems that start to look truly intelligent.
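A minimal sketch of the idea of capturing micro-context, assuming a toy graph with scalar node features: one aggregation step mixes each node's feature with its neighbours', and stacking steps widens the receptive field toward macro-context. This illustrates the general principle only, not the KGCN implementation.

```python
def aggregate_step(features, edges):
    """One 'graph convolution' step: each node's new feature is the
    mean of its own feature and its neighbours' features.
    features: {node: float}; edges: {node: [neighbour, ...]}."""
    new_features = {}
    for node, value in features.items():
        neighbourhood = [value] + [features[n] for n in edges.get(node, [])]
        new_features[node] = sum(neighbourhood) / len(neighbourhood)
    return new_features

features = {"a": 1.0, "b": 0.0, "c": 0.0}
edges = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}
# After one step, node "a" mixes in its neighbours: (1.0 + 0.0 + 0.0) / 3
print(aggregate_step(features, edges))
```

A learned model would replace the plain mean with trainable weights, but the neighbourhood-mixing structure is the same.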
Associated blog post:
https://blog.grakn.ai/kgcns-machine-learning-over-knowledge-graphs-with-tensorflow-a1d3328b8f02
Associated video:
https://www.youtube.com/watch?v=3adsYypRDsQ
This is a clip from the Grakn Berlin Meetup (Berlin 2019). Join the community: grakn.ai/community
Cognitive/AI systems process knowledge that is far too complex for current databases. They require an expressive data model and an intelligent query language to perform knowledge engineering over complex datasets. In this Meetup event, we will introduce GRAKN.AI, a distributed hyper-relational database for knowledge engineering, to Amsterdam's engineering community.
Grakn provides the knowledge base foundation for intelligent systems to manage complex data. We will also introduce Graql: Grakn's reasoning (through OLTP) and analytics (through OLAP) query language. Graql provides the tools required to do knowledge engineering: an expressive schema for knowledge modelling, reasoning transactions for real-time inference, distributed algorithms for large-scale analytics, and optimisation of query execution. And finally, we will discuss how Graql serves as a unified representation of data for cognitive systems.
Knowledge graph convolutional networks - London 2018 - Vaticle
As humans we use our knowledge, our reasoning and our understanding of situational context to make accurate predictions about the world around us; machine learning doesn’t typically make use of any of this rich information.
The ability to leverage highly interrelated data will yield a step-change in the quality and complexity of predictions that can be made for the same volume of data.
We present Knowledge Graph Convolutional Networks: a method for performing machine learning over a Grakn Knowledge Graph, which captures micro-context and macro-context for any Concept within the graph.
This methodology demonstrates how we can practically combine knowledge, learning and reasoning to build systems that start to look truly intelligent.
Associated blog post:
https://blog.grakn.ai/knowledge-graph-convolutional-networks-machine-learning-over-reasoned-knowledge-9eb5ce5e0f68
Associated video:
https://youtu.be/Jx_Twc75ka0
This is a clip from the Grakn London Meetup at the Royal Academy of Engineering (November 2018). Join the community: grakn.ai/community
Deutsche Telekom Expert System - Router Troubleshooting - Vaticle
This presentation was given at a Grakn meetup held at Deutsche Telekom's hubraum in Berlin on 29 September 2018. It details one example of how to construct an expert system at the database level.
Using Grakn to Analyse Protein Sequence Alignment - Vaticle
Cognitive and AI applications consume data that is far too complex for current databases. These systems require an expressive data model and an intelligent query language to perform knowledge engineering over complex datasets. GRAKN.AI is a database to organise such complex networks of data.
Systems biology is one of the domains that produce huge amounts of data, presenting integration challenges due to the data's complex nature. As understanding the complex relationships among these biological data is one of the key goals in biology, solutions are needed that speed up the integration and querying of such data.
However, analysing large volumes of this biological data through traditional database systems is troublesome and challenging. In this talk, we will demonstrate how integrating a sequencing algorithm with a Grakn knowledge graph leads to valuable new insights into our data at scale.
Cognitive and AI systems process knowledge that is far too complex for current databases. They require an expressive data model and an intelligent query language to perform knowledge engineering over complex datasets.
Grakn provides the knowledge base foundation for intelligent systems to manage complex data. We will also introduce Graql: Grakn's reasoning (through OLTP) and analytics (through OLAP) query language. Graql provides the tools required to do knowledge engineering: an expressive schema for knowledge modelling, reasoning transactions for real-time inference, distributed algorithms for large-scale analytics, and optimisation of query execution. And finally, we will discuss how Graql serves as a unified representation of data for cognitive systems.
Precision Medicine Knowledge Graph with GRAKN.AI - Vaticle
The success or failure of any modern organisation relies greatly on how it leverages its data. However, most institutions and organisations have no way to aggregate their disparate data catalogues, given their magnitude and complexity. They require a unified representation of their data, one that represents their specific domain truthfully as well as conceptually. In this talk, we introduce how a knowledge graph addresses these problems in the field of Precision Medicine.
Precision medicine aims at establishing personalised context-centred therapies and diagnostics. This is done by integrating complex and disparate data repositories relating to environmental and molecular origins of diseases.
It has become increasingly difficult to design models for complex diseases that accommodate individual genetic variability. We need efficient and successful techniques to integrate, manage, maintain and visualise sizeable datasets. These datasets can come from a multitude of sources, in various formats and at various levels of confidentiality. Hence the need to accumulate all this knowledge in one single structured architecture - a knowledge graph.
In this talk, we present a strategy, motivated by translational bioinformatics, to demonstrate how to fulfil the promises of Precision Medicine using Grakn.
This is a clip from the Grakn London Meetup in February 2019. Join the community: www.grakn.ai/community
GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented Systems - Vaticle
AI systems process knowledge that is far too complex for current databases. They require more expressive data schemas and intelligent query languages to provide a strong abstraction over complex data and their relationships. In this talk, we will discuss how GRAKN.AI, a distributed hyper-relational database, enables knowledge-oriented systems to work with complex data that serves as a knowledge base.
We will discuss how Graql, Grakn's reasoning (through OLTP) and analytics (through OLAP) query language, provides a much higher-level abstraction over traditional query languages. And finally, we will review the challenges of data management when developing Cognitive and AI systems, and how we solve them using Grakn and Graql as the database and query language.
Grakn is a hyper-relational knowledge base designed for AI applications. Many of these applications involve complex computation on big data.
In this talk, we first explore two big data processing models: map-reduce and Pregel. Then we introduce how we make use of these models to build Grakn Analytics, our tool for big data processing. We will also discuss how we transform common algorithms into their massively parallel versions, so they can take full advantage of Grakn Analytics.
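For a feel of the map-reduce model mentioned above, here is a minimal word-count sketch in plain Python; it illustrates the processing model only, not Grakn Analytics itself.

```python
from itertools import groupby

def map_phase(docs):
    """Map: emit (word, 1) for every word in every document."""
    return [(word, 1) for doc in docs for word in doc.split()]

def reduce_phase(pairs):
    """Shuffle/sort by key, then reduce: sum the counts per word."""
    pairs.sort(key=lambda kv: kv[0])
    return {key: sum(v for _, v in group)
            for key, group in groupby(pairs, key=lambda kv: kv[0])}

counts = reduce_phase(map_phase(["to be or not to be"]))
print(counts)  # {'be': 2, 'not': 1, 'or': 1, 'to': 2}
```

In a distributed setting the map and reduce phases run in parallel across partitions of the data, which is what makes the model scale.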
Logical Inference in a Hyper-Relational Database - Vaticle
Inference is something we humans do all the time. Given a set of facts about the world, we derive new ones using some form of inference. Automated reasoning has been studied extensively but its value in providing a more powerful abstraction layer for database languages has been overlooked so far.
This talk explores deductive inference in Grakn, a hyper-relational database that has automated inference as one of its core features. Rather than defining SQL views or writing ad hoc code, in Grakn we can define logical rules that provide a more intuitive way to describe higher level domain concepts. In the talk we give a quick overview of computational logic semantics and of top-down and bottom-up inference algorithms. Then, after introducing some preliminary Grakn concepts, we show how logical rules are resolved in a query.
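A minimal sketch of the bottom-up (forward-chaining) style of evaluation mentioned above: rules are applied to a set of facts until no new facts are derived (a fixpoint). The rule and facts are illustrative, and this is not how Grakn's resolver is implemented.

```python
def forward_chain(facts, rules):
    """Bottom-up evaluation: apply rules until a fixpoint is reached.
    facts: set of tuples; rules: functions mapping facts -> new facts."""
    facts = set(facts)
    while True:
        derived = {f for rule in rules for f in rule(facts)} - facts
        if not derived:
            return facts  # fixpoint: nothing new can be derived
        facts |= derived

def ancestor_rule(facts):
    """Toy rule: ancestor(a, c) if parent/ancestor(a, b) and (b, c)."""
    pairs = {(a, b) for (rel, a, b) in facts if rel in ("parent", "ancestor")}
    return {("ancestor", a, c)
            for (a, b) in pairs for (b2, c) in pairs if b == b2}

facts = {("parent", "ann", "bob"), ("parent", "bob", "cal")}
print(forward_chain(facts, [ancestor_rule]))
```

In Grakn the equivalent rule would be declared once in the schema, and the reasoner decides when and how to apply it during querying.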
Introduction to Knowledge Graphs with Grakn and Graql - Vaticle
Cognitive/AI systems process knowledge that is far too complex for current databases. They require an expressive data model and an intelligent query language to perform knowledge engineering over complex datasets.
In this talk, we will discuss how Grakn, a database to organise complex networks of data and make it queryable, provides the knowledge graph foundation for intelligent systems to manage complex data.
We will discuss how Graql, Grakn's reasoning (through OLTP) and analytics (through OLAP) query language, provides the tools required to do the job: a knowledge schema, a logical inference language, a distributed analytics framework.
And finally, we will discuss how Graql serves as a unified representation of data for cognitive systems.
Integrating Knowledge Bases with Neural Networks - by Nick Powell:
Knowledge bases are used as the underpinning for reasoning systems. This talk will describe experiences using deep learning to facilitate knowledge base completion. With an existing knowledge base as a training set, we programmed a neural net as a binary classifier to find likely relationships and then insert them back into the graph. We'll describe lessons learned and next steps.
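A toy sketch of knowledge-base completion as binary classification, in the spirit of the talk: candidate triples are scored, and accepted links are inserted back into the graph. The embeddings and threshold below are illustrative stand-ins for a trained neural net.

```python
# Illustrative entity embeddings (a trained model would learn these).
EMBED = {"london": [1.0, 0.0], "uk": [0.9, 0.1], "tokyo": [0.0, 1.0]}

def score(head, tail):
    """Dot product of entity embeddings as a toy link score."""
    return sum(h * t for h, t in zip(EMBED[head], EMBED[tail]))

def complete(kb, candidates, threshold=0.5):
    """Accept candidate triples the 'classifier' scores highly and
    insert them back into the knowledge base."""
    accepted = [(h, r, t) for (h, r, t) in candidates
                if score(h, t) > threshold]
    return kb | set(accepted)

kb = {("london", "located-in", "uk")}
candidates = [("tokyo", "located-in", "uk"), ("london", "near", "uk")]
print(complete(kb, candidates))
```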
Artificial intelligence in real-world applications needs the notion of an open-world assumption: such systems need to be able to work in unknown situations. However, most current image processing applications cannot handle unknown situations and objects. Unknown objects are classified as background, and systems are only able to classify images into pretrained and predefined object classes.
Using Grakn's KGLIB package, we designed and trained a graph network for object classification that is able to handle unknown objects. Data-driven insights based on image properties are combined with expert knowledge about class hierarchies to classify images into multiple categories. We tested our network on a dataset of vehicles and predicted higher-level categories (for example, 'land', 'air' or 'sea' vehicle). The graph network is used to predict interesting object characteristics, which require abstract knowledge predefined in a Grakn knowledge graph.
During this talk we will present our approach and discuss the design process we followed. We will cover not only the results, but also the difficulties we encountered and the lessons we learned.
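A minimal sketch of one ingredient described above: rolling fine-grained predictions up an expert class hierarchy while leaving room for unknowns, rather than forcing every object into a pretrained class. The hierarchy is illustrative, not the talk's actual model.

```python
# Expert knowledge: fine-grained vehicle classes roll up to
# higher-level categories (illustrative hierarchy).
HIERARCHY = {"car": "land", "truck": "land", "boat": "sea", "plane": "air"}

def categorise(predicted_label):
    """Map a classifier's fine-grained label to a higher-level
    category; labels outside the hierarchy stay 'unknown' instead of
    being forced into a known class."""
    return HIERARCHY.get(predicted_label, "unknown")

print(categorise("truck"))      # land
print(categorise("submarine"))  # unknown
```

In the approach described, this hierarchy lives in the Grakn knowledge graph and the graph network learns over it jointly with the image features.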
Does Google still need links? - SearchLove San Diego 2017 - Tom Capper
Back in Google's early days, people navigated the web using links, and this made PageRank an excellent proxy for popularity and authority. The web is moving away from primarily link-based surfing, and Google no longer needs a proxy - so what, in 2017, is the point in links?
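Since PageRank anchors the argument, here is a toy power-iteration sketch of it; the graph and damping factor are illustrative.

```python
def pagerank(links, damping=0.85, iters=50):
    """Toy PageRank: each page's rank is a base share plus a damped
    sum of the ranks flowing in from pages that link to it.
    links: {page: [pages it links to]} (every page must have outlinks)."""
    nodes = list(links)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iters):
        new = {}
        for n in nodes:
            inbound = sum(rank[m] / len(links[m])
                          for m in nodes if n in links[m])
            new[n] = (1 - damping) / len(nodes) + damping * inbound
        rank = new
    return rank

# A symmetric 3-page cycle: each page converges to ~1/3.
print(pagerank({"a": ["b"], "b": ["c"], "c": ["a"]}))
```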
SearchLove London 2018 - Tom Capper - The two-tiered SERP: Ranking for the mo... - Distilled
Like it or loathe it, as SEOs we often find ourselves being asked to explain rankings, especially for highly visible head terms - but I’ve noticed in the last few years that for these most competitive terms, the normal rules don’t always apply. In this talk, I’ll dig into whether and how Google is going beyond our normal understanding of ranking factors, and how we need to react.
Building a semantic search system - one that can correctly parse and interpret end-user intent and return the ideal results for users’ queries - is not an easy task. It requires semantically parsing the terms, phrases, and structure within queries, disambiguating polysemous terms, correcting misspellings, expanding to conceptually synonymous or related concepts, and rewriting queries in a way that maps the correct interpretation of each end user’s query into the ideal representation of features and weights that will return the best results for that user. Not only that, but the above must often be done within the confines of a very specific domain - rife with its own jargon and linguistic and conceptual nuances.
This talk will walk through the anatomy of a semantic search system and how each of the pieces described above fit together to deliver a final solution. We'll leverage several recently-released capabilities in Apache Solr (the Semantic Knowledge Graph, Solr Text Tagger, Statistical Phrase Identifier) and Lucidworks Fusion (query log mining, misspelling job, word2vec job, query pipelines, relevancy experiment backtesting) to show you an end-to-end working Semantic Search system that can automatically learn the nuances of any domain and deliver a substantially more relevant search experience.
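As a rough illustration of two of the pipeline stages described (misspelling correction and synonym expansion), here is a toy query-rewriting sketch; the dictionaries stand in for the learned jobs and knowledge graph mentioned above, and are not Solr or Fusion APIs.

```python
# Illustrative stand-ins for a learned misspelling model and a
# domain synonym/concept source.
CORRECTIONS = {"iphnoe": "iphone"}
SYNONYMS = {"iphone": ["smartphone", "mobile phone"]}

def rewrite(query):
    """Correct misspellings, then expand each term with its
    conceptually related terms."""
    terms = [CORRECTIONS.get(t, t) for t in query.lower().split()]
    expanded = []
    for t in terms:
        expanded.append(t)
        expanded.extend(SYNONYMS.get(t, []))
    return expanded

print(rewrite("iphnoe case"))
# ['iphone', 'smartphone', 'mobile phone', 'case']
```

A production pipeline would also weight the expansions rather than treating them as equal to the original terms.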
Slides for the sixth meeting of the course 'Big Data and Automated Content Analysis' at the Department of Communication Science, University of Amsterdam
Crowdsourced query augmentation through the semantic discovery of domain spec... - Trey Grainger
Talk Abstract: Most work in semantic search has thus far focused upon either manually building language-specific taxonomies/ontologies or upon automatic techniques such as clustering or dimensionality reduction to discover latent semantic links within the content that is being searched. The former is very labor intensive and is hard to maintain, while the latter is prone to noise and may be hard for a human to understand or to interact with directly. We believe that the links between similar users’ queries represent a largely untapped source for discovering latent semantic relationships between search terms. The proposed system is capable of mining user search logs to discover semantic relationships between key phrases in a manner that is language agnostic, human understandable, and virtually noise-free.
K-anonymity for crowdsourcing databases
In a crowdsourcing database, human operators are embedded into the database engine and collaborate with other conventional database operators to process queries. Each human operator publishes small HITs (Human Intelligence Tasks) to the crowdsourcing platform; each HIT consists of a set of database records and corresponding questions for human workers.
Anonymization techniques are used to ensure the privacy preservation of the data owners, especially for personal and sensitive data. While in most cases, data reside inside the database management system; most of the proposed anonymization techniques operate on and anonymize isolated datasets stored outside the DBMS. Hence, most of the desired functionalities of the DBMS are lost, e.g., consistency, recoverability, and efficient querying. In this paper, we address the challenges involved in enforcing the data privacy inside the DBMS. We implement the k-anonymity algorithm as a relational operator that interacts with other query operators to apply the privacy requirements while querying the data. We study anonymizing a single table, multiple tables, and complex queries that involve multiple predicates. We propose several algorithms to implement the anonymization operator that allow efficient non-blocking and pipelined execution of the query plan. We introduce the concept of k-anonymity view as an abstraction to treat k-anonymity (possibly, with multiple k preferences) as a relational view over the base table(s). For non-static datasets, we introduce the materialized k-anonymity views to ensure preserving the privacy under incremental updates. A prototype system is realized based on PostgreSQL with extended SQL and new relational operators to support anonymity views. The prototype system demonstrates how anonymity views integrate with other privacy-preserving components, e.g., limited retention, limited disclosure, and privacy policy management. Our experiments, on both synthetic and real datasets, illustrate the performance gain from the anonymity views as well as the proposed query optimization techniques under various scenarios.
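A minimal sketch of the k-anonymity property the paper's operator enforces: a table is k-anonymous when every combination of quasi-identifier values occurs at least k times, so no individual can be singled out by those columns. Records and column names below are illustrative.

```python
from collections import Counter

def is_k_anonymous(rows, quasi_ids, k):
    """True if every quasi-identifier combination appears >= k times.
    rows: list of dicts; quasi_ids: column names treated as
    quasi-identifiers (e.g. generalized zip code and age range)."""
    groups = Counter(tuple(row[c] for c in quasi_ids) for row in rows)
    return all(count >= k for count in groups.values())

rows = [
    {"zip": "120**", "age": "20-29", "disease": "flu"},
    {"zip": "120**", "age": "20-29", "disease": "cold"},
    {"zip": "130**", "age": "30-39", "disease": "flu"},
]
print(is_k_anonymous(rows, ["zip", "age"], k=2))  # False: one group of size 1
```

The paper's contribution is doing this check-and-generalize work as a relational operator inside the query plan rather than on an exported dataset, but the property being enforced is the one above.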
Benchmarking for Neural Information Retrieval: MS MARCO, TREC, and Beyond - Bhaskar Mitra
The emergence of deep learning-based methods for information retrieval (IR) poses several challenges and opportunities for benchmarking. Some of these are new, while others have evolved from existing challenges in IR exacerbated by the scale at which deep learning models operate. In this talk, I will present a brief overview of what we have learned from our work on MS MARCO and the TREC Deep Learning track, and reflect on the road ahead.
Slides for the second meeting of the course 'Big Data and Automated Content Analysis' at the Department of Communication Science, University of Amsterdam
When to use Machine Learning Models in SEO and Which ones to use - Lazarina S... - LazarinaStoyanova
This talk is a walk-through of different ways you can incorporate machine learning into SEO tasks. It covers a speed-run of different task categories/aspects of SEO work, the models that you can use for each purpose, and the results of a comparative analysis of how they perform.
Listeners will leave with (1) an understanding of the different ML models and where to incorporate them in their day-to-day SEO work, (2) why and how to choose one solution over another, and (3) how to get started with the recommended ones (videos/templates/walk-throughs will be shared).
This session focuses on:
The process of incorporating ML models and the aspects of SEO work where they can be incorporated (e.g. image captioning, generative work on content or meta elements, content localisation, etc.)
A summary of comparative analysis work I’ve done comparing the performance of different models on specific SEO tasks, with recommendations on which one to use for which task and why
Summary of steps and costs, plus templates/code to use
Who is this talk for?
Any SEO (agency), SEO manager (In-house), or site owner
Team Leads, looking to upskill their teams and processes to rely a bit more on automation
People interested in automation and ML/AI, and interested in going beyond ChatGPT
Those interested in saving some time in their day-to-day tasks via automation
GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented SystemsVaticle
AI systems process knowledge that is far too complex for current databases. They require more expressive data schemas and intelligent query languages to provide a strong abstraction over complex data and their relationships. In this talk, we will discuss how GRAKN.AI, a distributed hyper-relational database, enables knowledge-oriented systems to work with complex data that serves as a knowledge base.
We will discuss how Graql, Grakn's reasoning (through OLTP) and analytics (through OLAP) query language, provides a much higher-level abstraction over traditional query languages. And finally, we will review the challenges of data management when developing Cognitive and AI systems, and how we solve them using Grakn and Graql as the database and query language.
Grakn is a hyper-relational knowledge base designed for AI applications. Many of these applications involve complex computation on big data.
In this talk, we first explore two big data processing models: map-reduce and Pregel. Then we introduce how we make use of these modes to build Grakn Analytics our powerful tool for big data processing. We will also discuss how we transform common algorithms to their massive parallel versions, so they can take full advantage of Grakn Analytics.
Logical Inference in a Hyper-Relational DatabaseVaticle
Inference is something we humans do all the time. Given a set of facts about the world, we derive new ones using some form of inference. Automated reasoning has been studied extensively but its value in providing a more powerful abstraction layer for database languages has been overlooked so far.
This talk explores deductive inference in Grakn, a hyper-relational database that has automated inference as one of its core features. Rather than defining SQL views or writing ad hoc code, in Grakn we can define logical rules that provide a more intuitive way to describe higher level domain concepts. In the talk we give a quick overview of computational logic semantics and of top-down and bottom-up inference algorithms. Then, after introducing some preliminary Grakn concepts, we show how logical rules are resolved in a query.
Introduction to Knowledge Graphs with Grakn and Graql - Vaticle
Cognitive/AI systems process knowledge that is far too complex for current databases. They require an expressive data model and an intelligent query language to perform knowledge engineering over complex datasets.
In this talk, we will discuss how Grakn, a database to organise complex networks of data and make it queryable, provides the knowledge graph foundation for intelligent systems to manage complex data.
We will discuss how Graql, Grakn's reasoning (through OLTP) and analytics (through OLAP) query language, provides the tools required to do the job: a knowledge schema, a logical inference language, a distributed analytics framework.
And finally, we will discuss how Graql serves as a unified representation of data for cognitive systems.
Integrating Knowledge Bases with Neural Networks - by Nick Powell:
Knowledge bases are used as the underpinning for reasoning systems. This talk will describe experiences using deep learning to facilitate knowledge base completion. With an existing knowledge base as a training set, we programmed the neural net as a binary classifier to find likely relationships and then insert them back into the graph. We'll describe lessons learned and next steps.
Artificial Intelligence in real-world applications needs the notion of an open-world assumption: systems need to be able to work in unknown situations. However, most current image processing applications cannot handle unknown situations and objects. Unknown objects are classified as background, and systems are only able to classify images into pretrained, predefined object classes.
Using the KGLIB package of Grakn, we designed and trained a graph network for object classification that is able to handle unknown objects. Data-driven insights based on image properties are combined with expert knowledge about class hierarchies to classify images into multiple categories. We tested our network on a dataset of vehicles and predicted higher-level categories (for example, 'land', 'air', or 'sea' vehicle). The graph network is used to predict interesting object characteristics that require abstract knowledge predefined in a Grakn knowledge graph.
During this talk we will present our approach and discuss our design process. We will cover not only the results, but also the difficulties we encountered and what we learned along the way.
Does Google still need links? - SearchLove San Diego 2017 - Tom Capper
Back in Google's early days, people navigated the web using links, and this made PageRank an excellent proxy for popularity and authority. The web is moving away from primarily link-based surfing, and Google no longer needs a proxy - so what, in 2017, is the point of links?
SearchLove London 2018 - Tom Capper - The two-tiered SERP: Ranking for the mo... - Distilled
Like it or loathe it, as SEOs we often find ourselves being asked to explain rankings, especially for highly visible head terms - but I’ve noticed in the last few years that for these most competitive terms, the normal rules don’t always apply. In this talk, I’ll dig into whether and how Google is going beyond our normal understanding of ranking factors, and how we need to react.
Building a semantic search system - one that can correctly parse and interpret end-user intent and return the ideal results for users’ queries - is not an easy task. It requires semantically parsing the terms, phrases, and structure within queries, disambiguating polysemous terms, correcting misspellings, expanding to conceptually synonymous or related concepts, and rewriting queries in a way that maps the correct interpretation of each end user’s query into the ideal representation of features and weights that will return the best results for that user. Not only that, but the above must often be done within the confines of a very specific domain - ripe with its own jargon and linguistic and conceptual nuances.
This talk will walk through the anatomy of a semantic search system and how each of the pieces described above fit together to deliver a final solution. We'll leverage several recently-released capabilities in Apache Solr (the Semantic Knowledge Graph, Solr Text Tagger, Statistical Phrase Identifier) and Lucidworks Fusion (query log mining, misspelling job, word2vec job, query pipelines, relevancy experiment backtesting) to show you an end-to-end working Semantic Search system that can automatically learn the nuances of any domain and deliver a substantially more relevant search experience.
Slides for the sixth meeting of the course 'Big Data and Automated Content Analysis' at the Department of Communication Science, University of Amsterdam
Crowdsourced query augmentation through the semantic discovery of domain spec... - Trey Grainger
Talk Abstract: Most work in semantic search has thus far focused upon either manually building language-specific taxonomies/ontologies or upon automatic techniques such as clustering or dimensionality reduction to discover latent semantic links within the content that is being searched. The former is very labor intensive and is hard to maintain, while the latter is prone to noise and may be hard for a human to understand or to interact with directly. We believe that the links between similar users' queries represent a largely untapped source for discovering latent semantic relationships between search terms. The proposed system is capable of mining user search logs to discover semantic relationships between key phrases in a manner that is language agnostic, human understandable, and virtually noise-free.
K-anonymity for crowdsourcing databases
In a crowdsourcing database, human operators are embedded into the database engine and collaborate with other conventional database operators to process the queries. Each human operator publishes small HITs (Human Intelligence Tasks) to the crowdsourcing platform, each consisting of a set of database records and corresponding questions for human workers.
Anonymization techniques are used to ensure the privacy preservation of the data owners, especially for personal and sensitive data. While in most cases, data reside inside the database management system; most of the proposed anonymization techniques operate on and anonymize isolated datasets stored outside the DBMS. Hence, most of the desired functionalities of the DBMS are lost, e.g., consistency, recoverability, and efficient querying.
In this paper, we address the challenges involved in enforcing the data privacy inside the DBMS. We implement the k-anonymity algorithm as a relational operator that interacts with other query operators to apply the privacy requirements while querying the data. We study anonymizing a single table, multiple tables, and complex queries that involve multiple predicates. We propose several algorithms to implement the anonymization operator that allow efficient non-blocking and pipelined execution of the query plan.
We introduce the concept of k-anonymity view as an abstraction to treat k-anonymity (possibly, with multiple k preferences) as a relational view over the base table(s). For non-static datasets, we introduce the materialized k-anonymity views to ensure preserving the privacy under incremental updates.
A prototype system is realized based on PostgreSQL with extended SQL and new relational operators to support anonymity views. The prototype system demonstrates how anonymity views integrate with other privacy-preserving components, e.g., limited retention, limited disclosure, and privacy policy management. Our experiments, on both synthetic and real datasets, illustrate the performance gain from the anonymity views as well as the proposed query optimization techniques under various scenarios.
Benchmarking for Neural Information Retrieval: MS MARCO, TREC, and Beyond - Bhaskar Mitra
The emergence of deep learning-based methods for information retrieval (IR) poses several challenges and opportunities for benchmarking. Some of these are new, while others have evolved from existing challenges in IR exacerbated by the scale at which deep learning models operate. In this talk, I will present a brief overview of what we have learned from our work on MS MARCO and the TREC Deep Learning track, and reflect on the road ahead.
Slides for the second meeting of the course 'Big Data and Automated Content Analysis' at the Department of Communication Science, University of Amsterdam
When to use Machine Learning Models in SEO and Which ones to use - Lazarina S... - LazarinaStoyanova
This talk is a walk-through of different ways you can incorporate machine learning in SEO tasks. It will involve a speed-run of different task categories/ aspects of SEO work, and the models that you can use for this purpose, and the results of a comparative analysis of how they perform.
Listeners will leave with (1) an understanding of the different ML models and where to incorporate them in their day-to-day SEO work, (2) why and how to choose one solution over another, and (3) how to get started with the recommended ones (I will be sharing videos/templates/walk-throughs).
This session focuses on:
The process of incorporating ML models and the aspects of SEO work when they can be incorporated (e.g. image captioning, generative work in content or meta elements, content localization, etc)
A summary of comparative analysis work I’ve done where I’m comparing the performance of different models for specific tasks in SEO, and providing a recommendation of which one to use for what task and why
Summary of steps and costs, plus templates/code to use
Who is this talk for?
Any SEO (agency), SEO manager (In-house), or site owner
Team Leads, looking to upskill their teams and processes to rely a bit more on automation
People interested in automation and ML/AI, and interested in going beyond ChatGPT
Those interested in saving some time in their day-to-day tasks via automation
GPT and other Text Transformers: Black Swans and Stochastic Parrots - Konstantin Savenkov
Over the last year, we have seen increasingly more performant Text Transformer models, such as GPT-3 from OpenAI, Turing from Microsoft, and T5 from Google. They are capable of transforming text in very creative and unexpected ways, like generating a summary of an article, explaining complex concepts in simple language, or synthesizing realistic datasets for AI training. Unlike more traditional Machine Learning models, they do not require vast training datasets and can start from just a few examples.
In this talk, we will make a short overview of such models, share the first experimental results and ask questions about the future of the content creation process. Are those models ready for prime time? What will happen to the professional content creators? Will they be able to compete against such powerful models? Will we see GPT post-editing similar to MT post-editing? We will share some answers we have based on the extensive experimenting and the first production projects that employ this new technology.
How the Web can change social science research (including yours) - Frank van Harmelen
A presentation for a group of PhD students from the Leibniz Institutes (section B, social sciences) to discuss how they could use the Web, and even better the Web of Data, as an instrument in their research.
Google is using Large Language Models and Machine Learning in the algorithms that rank your sites and show them to users.
This talk will help you better understand these systems - from BERT to RankBrain to Neural Matching and SGE - how they work, and what you should do about it.
Social media posts on platforms such as Twitter or Instagram use hashtags, which are author-created labels representing topics or themes, to assist in the categorization of posts and searches for posts of interest. The structural analysis of hashtags is necessary as a precursor to understanding their meanings. This paper describes our work on segmenting non-delimited strings of hashtag-type English text. We adapt and extend methods used mostly in non-Eng...
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb... - LavaConConference
Chatbots are becoming an increasingly popular delivery channel for many types of content, including customer support, marketing, and pre-sales. To make chatbots scalable, helpful, and smoothly integrated into the content ecosystem of your organization, you need to feed the chatbots with the right content prepared in the right way.
In this workshop, you’ll learn how to write content for chatbots in a way that lets the chatbots find the relevant content and precisely match it to the user’s request.
In this session, attendees will learn:
How chatbots work: approaches to recognizing user’s intent, handling incomplete requests, and finding matching content
How the content needs to be organized, structured, and written to be discoverable by the chatbot
How to avoid creating isolated, chatbot-specific content that is available only to the chatbot
What makes content undiscoverable by the chatbot and causes the chatbot to give wrong or irrelevant answers
How to handle situations when the chatbot is unsure which content should be provided to the user
How to handle content variations (for example, product- or audience-specific)
Presented Saturday - June 23rd, 2018
I presented a 45-minute version of my "TypeScript 101" talk that serves as a short introduction to TypeScript and the benefits it provides to large-scale projects.
Let's talk about GPT: A crash course in Generative AI for researchers - Steven Van Vaerenbergh
This talk delves into the extraordinary capabilities of the emerging technology of generative AI, outlining its recent history and emphasizing its growing influence on scientific endeavors. Through a series of practical examples tailored for researchers, we will explore the transformative influence of these powerful tools on scientific tasks such as writing, coding, data wrangling and literature review.
Software, like board games, needs instructions. Should I draw a card or play my card first? Will the connection be reused, or should I use a connection pool? In heroic tales, a protagonist needs directions from elder scrolls to use magical artifacts and complete their mission.
In this talk, Ignasi will explain his journey from neglecting docs to considering them an important step in the software delivery cycle. Ignasi will share tips and tricks he’s been collecting over the years and focus on the habits he adopted to make sure he doesn’t forget docs. Ignasi will also share the types of audiences and cases where documentation can save your organization time and money.
After discussing the importance of documentation within several teams, Ignasi will try to counter the usual arguments and excuses of those who don’t document. No, the code is not the documentation; the code doesn’t tell the whole story. You can have a strong type system restricting how to call an API and still end up with an unusable API: “Hmm, I need a Token here, where do I get it?”.
Join Ignasi for a talk about board games, child tales, and embarrassing PRs.
This talk targets beginners, daily users, and experts alike.
Unlocking Academic Integrity Research Using Simulations, AI Assistance and Ch... - Thomas Lancaster
Can research papers be faked using ChatGPT? These slides were presented at the Welsh Integrity and Assessment Network Symposium Event in June 2023 and considered examples of research using ChatGPT in ways that could be considered both ethical and unethical. The session included a live demo showing how ChatGPT can be used to construct a research paper. Screenshots of the live demo are included at the end of the slide set.
Distributed Natural Language Processing Systems in Python - Clare Corthell
Much of human knowledge is “locked up” in a type of data called text. Humans are great at reading, but are computers? This workshop leads you through open source data science libraries in Python that turn text into valuable data, then tours an open source system built for the Wordnik dictionary to source definitions of words from across the internet.
Thinking Machines Conference, Manila, February 2016
http://thinkingmachin.es/events/
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery - Vaticle
The rapid development and spread of analytical tools in the biomedical sciences has produced a variety of information about all sorts of biological components and their functions. Though important individually, their biological characteristics need to be understood in relation to the interactions they have with other biological components, which requires the integration of vast amounts of complex, semantically rich, heterogeneous data.
Traditional systems are inadequate at accurately modelling and handling data at this scale and complexity, making solutions that speed up the integration and querying of such data a necessity.
In this talk, we present various approaches being used in organisations to build biomedical computational pipelines to address these problems using tools such as Machine Learning and TypeDB. In particular, we discuss how to create an accurate and scalable semantic representation of molecular level biomedical data by presenting examples from drug discovery, precision medicine and competitive intelligence.
Speaker: Tomás Sabat
Tomás is the Chief Operating Officer at Vaticle, dedicated to building a strongly-typed database for intelligent systems. He works directly with TypeDB's open source and enterprise users so they can fulfil their potential with TypeDB and change the world. He focuses mainly on life sciences, cyber security, finance, and robotics.
Loading a lot of data into a graph database is not a trivial exercise. TypeDB Loader (formerly known as GraMi) was developed to allow large-scale data import into TypeDB, a strongly-typed database. Recent improvements have immensely simplified the configuration interface to allow for easier data importing, while maintaining features and the promise of loading huge amounts of data into TypeDB as fast as possible.
Natural Language Interface to Knowledge Graph - Vaticle
Natural language interfaces (NLI) offer end-users an easy and convenient way to query ontology-based knowledge graphs. They automatically generate database queries based on their natural language inputs, avoiding the need for the end user to learn different query languages. NLIs can be used with REST APIs to facilitate and enrich the interactions with knowledge graphs, in domains such as interactive root cause analysis (RCA), dynamic dashboard generation, and Online Transactional Processing (OLTP).
In this talk, you'll learn about a natural language interface built with a TypeDB server running on Raspberry Pi4. This application offers a conversational bot assistant with Cisco Webex for an efficient and flexible way to facilitate human-machine interactions. In particular, this talk will demonstrate how natural language inputs are translated into TypeQL queries using Abstract Syntax Trees that represent the syntactic structure discovered during the Named Entity Recognition (NER) analysis of the textual inputs provided by Rasa 2.X running on an Intel Celeron J3455 miniPC.
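As a hedged sketch of the kind of translation described above (the `person`, `employment`, and `name` types here are hypothetical illustrations, not the schema used in the talk), a question like "Which companies does Alice work for?" might be converted into a TypeQL query such as:

```typeql
match
$p isa person, has name "Alice";       # entity recognised by the NER step
(employee: $p, employer: $c) isa employment;
$c has name $cn;
get $cn;                               # return the company names
```

The Abstract Syntax Tree mentioned in the talk would supply the entities and relations that fill in such a query template.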
A Data Modelling Framework to Unify Cyber Security Knowledge - Vaticle
Cyber security companies collect massive amounts of heterogeneous data coming from a huge number of sources. These describe hundreds of different data types, such as vulnerabilities, observables, incidents, and malware. While this data is highly complex (with many types of relations, type hierarchies, and rules), its structure doesn't significantly change between organisations. However, without a publicly available data model, organisations end up modelling the same data in different ways: in other words, reinventing the wheel and wasting their resources. This modelling complexity makes scaling cyber security applications extremely difficult.
That's why efforts are underway to provide ready-made solutions for typical cyber security use cases, which provide the flexibility to expand for the specific requirements of individual setups. The combination of these efforts has created a lot of inter-related knowledge silos (e.g. CVE, CAPEC, CWE, CVSS, Cocoa, MITRE, VERIS, STIX, MAEC). To unify these silos, various ontologies have been proposed by researchers, with different levels of granularity - from specific use cases like defence exercises, to more comprehensive cases like the UCO project.
During this talk, you’ll learn about the OmnibusCyber Project, an open-source, ready-made solution that aggregates cyber security knowledge silos, based on TypeDB. TypeDB’s framework offers the expressivity, safety, and inference properties required to implement a knowledge graph without the complexity associated with the OWL/RDF semantic frameworks.
Unifying Space Mission Knowledge with NLP & Knowledge Graph - Vaticle
Synopsis
The number of space missions being designed and launched worldwide is growing exponentially. Information on these missions, such as their objectives, orbit, or payload, is disseminated across various documents and datasets. Facilitating access to this information is key to accelerating the design of future missions, enabling experts to link an application to a mission, and following various stakeholders' activities.
This presentation introduces recent research done at the ESA to combine the latest Language Models with Knowledge Graphs, unifying our knowledge on space missions. Language Models such as GPT-3 and BERT are trained to understand the patterns of human (natural) language. These models have revolutionised the field of NLP, the branch of AI enabling machines to understand human language in all its complexity. In this work, key information on a mission is parsed from documents with the GPT-3 model, and the parsed data is then migrated to a TypeDB Knowledge Graph to be easily queried. Although this work focuses on an application in the space sector, the method can be transferred to other engineering fields.
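A minimal sketch of the second step described above (the `mission` type and its attributes are illustrative assumptions, not ESA's actual model): once the language model has extracted a mission's name, orbit, and launch year from a document, the values can be written to the knowledge graph with a TypeQL insert:

```typeql
insert
$m isa mission,
    has name "EarthObs-1",      # values parsed from the document
    has orbit-type "LEO",       # by the language model
    has launch-year 2023;
```

Once migrated, the mission can be queried and linked to other missions, payloads, and stakeholders like any other concept in the graph.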
Presenters
Dr. Audrey Berquand is a Research Fellow at the ESA. Her research aims at enhancing space mission design and knowledge management with text mining, NLP, and Knowledge Graphs. She was awarded her PhD in 2021 from the University of Strathclyde (Scotland) for her thesis on “Text Mining and Natural Language Processing for the Early Stages of Space Mission Design”. Audrey has a background in space systems engineering: she holds an MSc in Aerospace Engineering from the Royal Institute of Technology KTH (Sweden) and a diplôme d'ingénieur from the EPF Graduate School of Engineering (France). Before diving into the world of AI, she spent 3 years at ESA involved in the early design phases of future Earth Observation missions.
Ana Victória Ladeira works with Knowledge Management at the ESA, using automated methods to exploit the information contained in the piles and piles of documents that ESA generates every day. With a Master's degree in Data Science from Maastricht University, Ana is particularly excited about how NLP methods can help large organizations connect different documents and highlight the bigger picture over a big universe of data sources, as well as using Knowledge Graphs to help connect people to the expertise and information they need.
Talk Summary:
State of the art AI approaches can struggle to create solutions which provide accurate results that stand the test of time. They are also plagued by problems such as bias and a lack of explainability. Causal AI addresses these key problems and is at the center of the Geminos Causeway platform, which is built on TypeDB.
This webinar will give you an introduction to why causal AI is so important, and how you can start to use it to drive more value for your organisation.
Speaker: Stuart Frost
Stu is the CEO and founder of Geminos. Their focus is on building AI-driven solutions for mid-sized Smart Manufacturing and Logistics companies, that are frustrated by their inability to digitalize their operations at sensible cost. Stu has 30 years’ experience in founding and leading successful data management and analytics startups, starting at 26 when he founded SELECT Software Tools, and led the company to a NASDAQ IPO in 1996. He then founded DATAllegro in 2003 which was acquired by Microsoft.
Building a Cyber Threat Intelligence Knowledge Graph - Vaticle
Knowledge of cyber threats is a key focus in cyber security. In this talk, we present TypeDB CTI, an open-source threat intelligence platform to store and manage such knowledge. It enables Cyber Threat Intelligence (CTI) professionals to bring their disparate CTI information together in one platform, enabling them to more easily manage such data and discover new insights about cyber threats.
We will describe how we use TypeDB to represent STIX 2.1, the most widely used language and serialization format for exchanging cyber threat intelligence. We cover how we leverage TypeDB's modelling constructs - such as type hierarchies, nested relations, hyper-relations, unique attributes, and logical inference - to build this threat intelligence platform.
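To give a flavour of the modelling constructs mentioned (this is a simplified, hypothetical sketch, not the actual TypeDB CTI schema), a STIX-like type hierarchy with a relation between domain objects could look like:

```typeql
define

stix-id sub attribute, value string;
name sub attribute, value string;

# All STIX domain objects share an abstract supertype.
stix-object sub entity, abstract,
    owns stix-id,
    owns name;
threat-actor sub stix-object, plays uses:user;
malware sub stix-object, plays uses:used;

# A "uses" relation, e.g. a threat actor uses a piece of malware.
uses sub relation,
    relates user,
    relates used;
```

Subtypes inherit the attributes of `stix-object`, so new STIX object types can be added without repeating shared structure.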
Speaker: Tomás Sabat
Tomás is the Chief Operating Officer at Vaticle. He works closely with TypeDB's open source and enterprise users who use TypeDB to build applications in a wide number of industries including financial services, life sciences, cyber security and supply chain management. A graduate of the University of Cambridge, Tomás has spent the last seven years founding and building businesses in the technology industry.
Knowledge Graphs for Supply Chain Operations - Vaticle
Agility in supply chain operations has never been so important, especially in today's nonlinear and complex world. That is why companies with supply chains need knowledge graphs.
So how do enterprises unleash the power of their own supply chain data to make smarter decisions? This is where bops comes into play. Bops activates supply chain data from existing operating systems (ERPs, POS, OMS, etc.), simplifying how operators optimize working capital in every decision.
In this session, bops will showcase a few use cases that portray the power of a knowledge graph to represent a supply chain network composed of an end-to-end product flow driven by actions among plants, customers, and suppliers.
Supply chain operations visibility:
- Story of a Product and an SKU: from raw material to finished goods - track & trace and bill-of-materials deviations
- Story of a Supplier – risk assessments – “the most influential supplier”
- Story of a Process – anomaly detection – “what went wrong?”
Join us for a lively discussion to learn how using knowledge graphs is already helping supply chain companies to better collect, unify, and activate their data.
Speaker: Jorge Risquez
Jorge is the Co-founder and CEO of bops, a headless supply chain intelligence platform helping manufacturers and distributors source, make, and deliver their products, and unlock working capital. Previously, Jorge spent a decade as a Supply Chain Consultant for Deloitte, where he worked with Fortune 500 companies such as Tyson and Cargill. In his spare time, he enjoys going for a run in Central Park and spending time with family and friends.
Building a Distributed Database with Raft - Vaticle
Applications running in production have much higher requirements. Not only do they need to be correct, they also need to be "always-on", handle a much bigger user load, and be secure.
Meet TypeDB Cluster, the TypeDB database for production-scale, built using the Raft replication algorithm. Join us for a walk through the underlying architecture and what value it brings to developers running an application at scale.
Speaker: Ganeshwara Henanda
Ganesh leads the development of TypeDB Cluster while also managing other aspects such as infrastructure and project management. His day-to-day work involves building concurrent and distributed algorithms such as Raft and the Actor Model.
He graduated with an MSc in Grid Computing from the University of Amsterdam, and has built several large-scale distributed and real-time systems throughout his career.
Enabling the Computational Future of Biology - Vaticle
Computational biology has revolutionised biomedicine. The volume of data it is generating is growing exponentially. This requires tools that enable computational and non-computational biologists to collaborate and derive meaningful insights. However, traditional systems are inadequate to accurately model and handle data at this scale and complexity.
In this talk, we discuss how TypeDB enables biologists to build a deeper understanding of life, and increase the probability of groundbreaking discoveries, across the life sciences.
Speaker: Tomás Sabat
Tomás is the Chief Operating Officer at Vaticle. He works closely with TypeDB's open source and enterprise users who use TypeDB to build applications in a wide number of industries including financial services, life sciences, cybersecurity and supply chain management. A graduate of the University of Cambridge, Tomás has spent the last seven years founding and building businesses in the technology industry.
Build your skills and learn how TypeDB's native inference engine works.
Good for:
- Beginners to TypeDB and TypeQL
- Those who have been using TypeDB and want a refresher on inference in TypeDB
- Experienced software engineers
- Those who want to better represent their domain in a model that allows for logical reasoning at the database level
Description:
TypeDB is capable of reasoning over data via pre-defined rules. A TypeQL rule looks for a given pattern in the database and, when the pattern is found, infers the given queryable fact. The inference provided by rules is performed at query (run) time. Rules not only allow shortening and simplifying commonly-used queries, but also enable knowledge discovery and the implementation of business logic at the database level.
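For instance (a hedged sketch using hypothetical `person`, `employment`, and `colleagueship` types), a rule can infer a colleague relation between people employed by the same company:

```typeql
define
rule infer-colleagues:
when {
    (employee: $x, employer: $c) isa employment;
    (employee: $y, employer: $c) isa employment;
    not { $x is $y; };               # a person is not their own colleague
} then {
    (colleague: $x, colleague: $y) isa colleagueship;
};
```

A query such as `match (colleague: $a, colleague: $b) isa colleagueship;` then returns the inferred pairs at query time, without any `colleagueship` instances ever being stored.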
Takeaways:
- Understanding of fundamental components of TypeDB's inference engine and how to write rules for your domain
- Write at least 1 rule for your use case
- Utilise the rule you wrote in a query
Tomás Sabat:
Tomás is the Chief Operating Officer at Vaticle, dedicated to building a strongly-typed database for intelligent systems. He works directly with TypeDB's open source and enterprise users so they can fulfil their potential with TypeDB and change the world. He focuses mainly on life sciences, cyber security, finance, and robotics.
Join the TypeDB community to learn how we think about data modelling, and how TypeDB's expressivity allows you to model your domain based on logical and object-oriented programming principles.
Good for:
- Engineers, scientists, and technical executives
- Those in a technical field working with complex datasets, and building intelligent systems
- Anyone curious to learn about the expressive power of TypeDB's data model
Description:
We open this training with an exploration into what a schema looks like in TypeDB, starting with clarifying the motivation for the conceptual model in TypeDB, and its relationship to the Enhanced Entity-Relationship model.
Then we break things down a bit more philosophically, delving into what it means to represent data in TypeDB, and how TypeDB allows you to think at a higher level, as opposed to join tables, columns, documents, vertices, edges, and properties.
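As a small, hypothetical example of what such a schema looks like in practice, entities, a relation, and an attribute are defined directly as concepts, with no join tables or foreign keys:

```typeql
define

name sub attribute, value string;

person sub entity,
    owns name,
    plays employment:employee;

company sub entity,
    owns name,
    plays employment:employer;

# Employment is a first-class relation, not a join table.
employment sub relation,
    relates employee,
    relates employer;
```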
Takeaways:
- Be able to articulate why TypeDB's data model is so beneficial for complex data, and why we use it to build intelligent systems
- Write a TypeDB schema in TypeQL
- Practice modelling one of your own domains
Tomás Sabat:
Tomás is the Chief Operating Officer at Vaticle, dedicated to building a strongly-typed database for intelligent systems. He works directly with TypeDB's open source and enterprise users so they can fulfil their potential with TypeDB and change the world. He focuses mainly on life sciences, cyber security, finance, and robotics.
Using SQL to query relational databases is easy. As a declarative language, it’s straightforward to write queries and build powerful applications. However, relational databases struggle when working with complex data: challenges arise especially in modelling and querying it.
For example, the large number of necessary JOINs forces us to write long and verbose queries, which are error-prone and hard to maintain.
Just as SQL is the standard query language for relational databases, TypeQL is TypeDB's query language. It’s a declarative language, and allows us to model, query and reason over our data.
In this talk, we will look at how TypeQL compares to SQL. Why and when should you use TypeQL over SQL? How do we do outer/inner joins in TypeQL? We'll look at the common concepts, but mostly talk about the differences between the two.
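As a flavour of the difference: where SQL would join a person table, an employment table, and a company table, TypeQL queries the relation directly (the schema names below are illustrative):

```typeql
match
$p isa person, has name $n;
$c isa company, has name "Vaticle";
(employee: $p, employer: $c) isa employment;
get $n;
```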
Speaker: Tomás Sabat
Tomás is the Chief Operating Officer at Vaticle. He works closely with TypeDB's open source and enterprise users who use TypeDB to build applications in a wide number of industries including financial services, life sciences, cybersecurity and supply chain management. A graduate of the University of Cambridge, Tomás has spent the last seven years founding and building businesses in the technology industry.
TypeDB Academy - Getting Started with Schema Design | Vaticle
In this TypeDB Academy, we start by gaining an understanding of the fundamental components of TypeDB's type system and what makes it unique. We will see how we can download, install, and run TypeDB, and learn to perform basic database operations.
We'll then explore what a schema looks like in TypeDB, starting with clarifying the motivation for schema, the conceptual schema of TypeDB, and its relationship to the Enhanced Entity-Relationship model.
Good for:
- Beginners to TypeDB and TypeQL
- Those who have been using TypeDB and want a refresher on schema and TypeQL
- Experienced database administrators and software engineers
Takeaways:
- Understanding of fundamental components of TypeDB
- How to download, install, and run TypeDB on your computer
- Be able to articulate why schema is so beneficial when using TypeDB, why we use one, and how it enables a more expressive model
- Write a TypeDB schema in TypeQL
Comparing Semantic Web Technologies to TypeDB | Vaticle
Semantic Web technologies enable us to represent and query very complex and heterogeneous datasets. We can add semantics and reason over large bodies of data on the web. However, despite the wealth of educational material available, they have failed to achieve mass adoption outside academia.
TypeDB works at a higher level of abstraction and enables developers to be more productive when working with complex data. TypeDB is easier to learn, reducing the barrier to entry and enabling more developers to access semantic technologies. Instead of using a myriad of standards and technologies, we just use one language - TypeQL.
In this talk we will:
- look at how TypeQL compares to Semantic Web standards, specifically RDF, SPARQL, RDFS, OWL and SHACL.
- cover questions such as, how do we represent hyper-relations in TypeDB? How does one use rdfs:domain and rdfs:range in TypeDB? And how do the modelling philosophies compare?
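As one concrete point of comparison: a hyper-relation such as a ternary employment — which in RDF typically requires reification or an intermediate node — is expressed natively in TypeQL (type and role names here are illustrative):

```typeql
define

employment sub relation,
    relates employee,
    relates employer,
    relates negotiated-contract;
```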
Speaker: Tomás Sabat
Tomás is the Chief Operating Officer at Vaticle. He works closely with TypeDB's open source and enterprise users who use TypeDB to build applications in a wide number of industries including financial services, life sciences, cyber security and supply chain management. A graduate of the University of Cambridge, Tomás has spent the last seven years founding and building businesses in the technology industry.
How might we utilise an actor-based execution model to build a powerful yet elegant reasoning engine?
The actor model is an asynchronous, inherently parallel framework that forms the basis of some of the most computationally heavy systems in the world. By leveraging it in an event-driven design, we can build an execution engine that makes efficient use of all available hardware resources to answer your reasoning queries.
We'll visit the key ideas behind actors, and then walk through how we break reasoning into neat, actor-sized building blocks. As we do this, it will become clear how our marriage of reasoning and actors naturally produces a scalable and elegant execution engine. By examining the problem of reasoning from an actor-based lens, we'll be able to better understand the complexities of reasoning and visualise bottlenecks and optimisations.
Intro to TypeDB and TypeQL | A strongly-typed database | Vaticle
TypeDB is a strongly-typed database. It provides a rich and logical type system that lets you break down complex problems into meaningful, logical models, using TypeQL as its query language.
TypeDB allows you to model your domain based on logical and object-oriented principles. Composed of entity, relationship, and attribute types, as well as type hierarchies, roles, and rules, TypeDB allows you to think at a higher level, as opposed to in join tables, columns, documents, vertices, and edges.
Types describe the logical structures of your data, allowing TypeDB to validate that your code inserts and queries data correctly. Query validation goes beyond static type-checking to include logical validation, rejecting queries that are semantically meaningless. With strict type-checking errors, you have a dataset that you can trust.
Finally, TypeDB encodes your data for logical interpretation by its reasoning engine. It enables type-inference and rule-inference, which create logical abstractions of data. This allows for the discovery of facts and patterns that would otherwise be too hard to find.
With these abstractions, queries in the tens to hundreds of lines in SQL or NoSQL databases can be written in just a few lines in TypeQL – collapsing code complexity by orders of magnitude.
Join Tomás from the Vaticle team where he'll discuss the origins of TypeDB, the impetus for inventing a new query language, TypeQL, and why we are so excited about the future of software and intelligent systems.
Tomás Sabat:
Tomás is the Chief Operating Officer at Vaticle, dedicated to building a strongly-typed database for intelligent systems. He works directly with TypeDB's open source and enterprise users so they can fulfil their potential with TypeDB and change the world. He focuses mainly on life sciences, cyber security, finance and robotics.
Graph Databases vs TypeDB | What you can't do with graphs | Vaticle
Developing with graph databases has a number of challenges, such as the modelling of complex schemas, and maintaining data consistency in your database.
In this talk, we discuss how TypeDB addresses these challenges, as well as how it compares to property graph databases. We’ll look at how to read and write data, how to model complex domains, and TypeDB’s ability to infer new data.
The main differences between TypeDB and graph databases can be summarised as:
1. TypeDB provides a concept-level schema with a type system that fully implements the Entity-Relationship (ER) model. Graph databases, on the other hand, use vertices and edges without integrity constraints imposed in the form of a schema
2. TypeDB contains a built-in inference engine - graph databases don’t provide native inferencing capabilities
3. TypeDB is an abstraction over a graph, and leverages a graph database under the hood to create a higher-level model, while graph databases work at different levels of abstraction
Tomás Sabat
Tomás is the Chief Operating Officer at Vaticle. He works closely with TypeDB's open source and enterprise users who use TypeDB to build applications in a wide number of industries including financial services, life sciences, cyber security and supply chain management. A graduate of the University of Cambridge, Tomás has spent the last seven years founding and building businesses in the technology industry.
In this seminar we use TypeDB to open a window on the Pandora Papers, a massive 'data tsunami' based on 11.9 million leaked source documents obtained by the International Consortium of Investigative Journalists (ICIJ).
We will use an automated query builder to get an initial set of results, and then hop from node to node, exploring neighbours and mapping out a suspicious-looking network of offshore shell companies, officers and intermediaries.
Speaker: Jon Thompson
Jon has an MSc in Applied Mathematics and has worked for several years as a Data Scientist in high-throughput biological sequencing. He is the founder of Nodelab, which is on a mission to provide a fully-featured graphical user interface experience for TypeDB.
Heterogeneous data holds significant inherent context. We would like our machine learning models to understand this context, and utilise this ancillary but critical information to improve the accuracy and versatility of our models.
How can we systematically make use of context in Machine Learning?
We delve into the knowledge modelling techniques which, applied with the right ML strategies, give us a promising approach for robustly handling heterogeneous data in large knowledge models. We aim to do this in a way that allows us to build any machine learning model, including graph learning models like our KGCN.
Speaker: James Fletcher, Vaticle
James comes from a background in Computer Vision, specialising in automated diagnostics. As Principal Scientist at Vaticle, his mission is to demonstrate to the world how traditional symbolic approaches to AI, built into TypeDB, can be combined with present-day research in machine learning.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23... | John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
As Europe's leading economic powerhouse and the fourth-largest #economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like #Russia and #China, #Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in #cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to #AdvancedPersistentThreats (#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. For more details, visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
3–7. What is Text Mining?
Text mining is the automatic extraction of structured semantic information from unstructured machine-readable text.
Twitter: @GraknLabs
8–11. What are the Challenges of Going Beyond Text Mining?
Integration: Difficult to ingest and integrate complex networks of text-mined output across bodies of text.
Normalisation: Difficult to contextualise extracted knowledge from text with existing knowledge.
Discovery: Difficult to investigate insights contained in text in a scalable and efficient way.
12–15. How Can we Solve These Challenges?
Integration: Ingest and integrate complex networks of text-mined output into one collection – a knowledge graph.
Normalisation: Impose an explicit structure on text-mined data to contextualise the relationships with existing knowledge.
Discovery: Use automated reasoning and analytics to investigate and interpret insights contained across text in a scalable and efficient way.
16. Grakn is the knowledge base foundation for intelligent systems – a.k.a. a Knowledge Graph.
Knowledge Storage System | Knowledge Inference | Knowledge Analytics
18. How Do We Build A Text Mined Knowledge Graph?
1. Model and migrate text mined output into a knowledge graph – Grakn.
2. Discover and interpret new insights.
19. How Do We Build A Text Mined Knowledge Graph?
1. Model and migrate text mined output into a knowledge graph – Grakn.
31. What Does Text Mined Output Look Like?
[Diagram: NLP output for the sentence "Syed is proud to be the brother of Zainab." – the tokens "Syed" and "Zainab" are each typed Person with confidence 1.0; a Sibling relation links them with confidence 0.9; the sentence also carries a sentiment annotation.]
37–39. How Do We Model in Graql?
Entities, Relationships, Attributes
40. How Do We Migrate Data Into Grakn?
// Node.js client
const query = await transaction.query(
  'insert $t isa token, has lemma "Syed", has type "person";'
);
// Java client
InsertQuery query = Graql.insert(
  var("t").isa("token").has("lemma", "Syed").has("type", "person")
);
# Python client
query = transaction.query(
  'insert $t isa token, has lemma "Syed", has type "person";'
)
41. How Do We Build A Text Mined Knowledge Graph?
2. Discover and interpret new insights.
42–43. How Do We Discover An Insight?
Question: What knowledge is extracted from a PubMed article? (answered with a Graql query)
44. [Diagram: an abstract excerpt with mined annotations – the phrases "BRAF inhibitor Dabrafenib", "MEK inhibitor Trametinib" and "Trametinib and Dabrafenib treat Melanoma" are highlighted, with concept labels gene, protein, drug and disease, and relation labels treatment and inhibition.]
45–46. How Do We Discover An Insight?
Question: Which PubMed articles mention the disease Melanoma and the gene BRAF? (answered with a Graql query)
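The Graql query on this slide was shown as an image; a sketch of what it might look like, using the deck's pubmed-article and token types together with a hypothetical mention relation and role names:

```graql
match
$a isa pubmed-article, has pmid $id;
$m isa token, has lemma "Melanoma";
$b isa token, has lemma "BRAF";
(mentioning: $a, mentioned: $m) isa mention;
(mentioning: $a, mentioned: $b) isa mention;
get $id;
```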
51–53. How Do We Interpret An Insight?
[Diagram: a Pubmed-article (pmid, title, abstract) contains a sentence in which a mined-relation of type treatment links the tokens "Dabrafenib" and "melanoma". When this pattern holds, then a treatment relation is inferred between the drug Dabrafenib (role: therapeutic) and the disease Melanoma (role: treated-condition).]
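The when/then pattern above corresponds to a Graql rule; a sketch of what it might look like (the therapeutic and treated-condition roles come from the slides, while relation-token and relation-type are hypothetical names for the mined-relation's role and attribute):

```graql
define

treatment-from-mined-relation sub rule,
when {
    # a mined "treatment" relation between two tokens...
    (relation-token: $t1, relation-token: $t2) isa mined-relation,
        has relation-type "treatment";
    $t1 has lemma $l1;
    $t2 has lemma $l2;
    # ...whose lemmas match a known drug and disease
    $d isa drug, has name $l1;
    $c isa disease, has name $l2;
},
then {
    (therapeutic: $d, treated-condition: $c) isa treatment;
};
```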
54–61. How Do We Do Reasoning in Graql?
[Diagram, built up step by step: starting from a Pubmed-article (pmid, title, abstract), to the sentence it contains, to the mined-relation of type treatment between the tokens "Dabrafenib" and "melanoma", to the entities Drug: Dabrafenib and Disease: Melanoma, and finally to the inferred treatment relation between them.]
63. What makes Grakn a Knowledge Base for Text Mining?
Integration: Ingest and integrate complex networks of text-mined output into one collection – a knowledge graph.
Normalisation: Impose an explicit structure on text-mined data to contextualise the relationships with existing knowledge.
Discovery: Use automated reasoning and analytics to investigate and interpret insights contained across text in a scalable and efficient way.
64. Thank you for attending this webinar!
Follow us on:
@graknlabs
Tomás: @tasabat
Daniel: @meanwhile_inn
Join our chatroom on:
https://discord.gg/graknlabs