SlideShare a Scribd company logo
1 of 28
Download to read offline
www.moving-project.eu
TraininG towards a society of data-saVvy inforMation prOfessionals
to enable open leadership INnovation
Chifumi Nishioka and Ansgar Scherp
ZBW -- Leibniz Information Centre for Economics and Kiel University, Germany
Information-theoretic Analysis of Entity
Dynamics on the Linked Open Data cloud
www.moving-project.eu
2 of 19
Motivation
• Understanding the dynamics of the LOD cloud is
important for many applications
• e.g., SPARQL query caching, crawling strategies, term
recommendations
• Related work
• Evolution of LOD documents [Käfer et al. 13]
• Dynamics of LOD sources [Dividino et al. 14]
• Entities on the LOD cloud
• Used by a lot of applications
• Knowledge graph in search engines
• Document modeling [Schuhmacher and Ponzetto 14]
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Come to the presentation of
“TermPicker” by Johann Schaible at
14:30 on 1st June (Wednesday)
We conduct an analysis focusing on entities
www.moving-project.eu
3 of 19
Research Goals
• Measure the changes in entities between two points in time
• Represent the temporal dynamics of entities as time-series
• Time-series clustering
• Periodicity detection
• Evaluate four different features of entities
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Goal 1: Represent the temporal dynamics of entities
Goal 2: Find out the representative temporal patterns of
entity dynamics
Goal 3: Find out which features of entity more likely
define temporal dynamics of entities
www.moving-project.eu
4 of 19
Formalization
• 𝑋𝑡: snapshot of LOD documents at point in time 𝑡
• Snapshot is a collection of triples 𝑥
• 𝑥: triple
• 𝑥 = 𝑠, 𝑝, 𝑜 : subject, predicate, and object
Chifumi Nishioka (chni@informatik.uni-kiel.de)
www.moving-project.eu
5 of 19
Entity and Entity Representations
• Entities are represented by a set of triples
• Entity Representation: Out
• Set of triples with common subject URI
• e.g., db:John_Brown is defined by two triples
• Entity Representation: InOut
• Set of triples with common subject URI or object URI
• e.g., db:John_Brown is defined by three triples
Chifumi Nishioka (chni@informatik.uni-kiel.de)
db:Anne_Smith db:spouseOf db:John_Brown
db:John_Brown db:birthplace db:Los_Angels
db:John_Brown db:works db:Green_University
www.moving-project.eu
6 of 19
Triple Weighting
• example: Barack Obama
• <Barack_Obama, dbp:vicePresident , Joe_Biden> is more
important than <Barack_Obama, rdf:type , foaf:Person>
• Baseline
• All triples have a same weight
• Combined Information Content (combIC)
[Schuhmacher and Ponzetto 14]
• 𝐼𝐶 𝑣 = −log⁡( 𝑃(𝑣))
• 𝑝𝑟𝑒𝑑 𝑥 , 𝑜𝑏𝑗 𝑥 returns predicate and object of a
triple 𝑥, respectively
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑤 𝑏𝑎𝑠𝑒𝑙𝑖𝑛𝑒(𝑥) = 1
Each triple in entities has different importance for entities
𝑤𝑐𝑜𝑚𝑏𝐼𝐶 𝑥 = 𝐼𝐶 𝑝𝑟𝑒𝑑 𝑥 + 𝐼𝐶(𝑜𝑏𝑗(𝑥))
www.moving-project.eu
7 of 19
Measuring Entity Dynamics
• Cosine distance
• Euclidean distance
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Goal 1: Represent the temporal dynamics of entities
𝛿 𝑐𝑜𝑠𝑑 𝐸𝑡1
, 𝐸𝑡2
= 1 −
𝐸𝑡1
∙ 𝐸𝑡2
||𝐸𝑡1
|| ∙ | 𝐸𝑡2
|
1. Measure the amount of changes in entities between two
successive snapshots by one of two distance measures
𝛿 𝑒𝑢𝑐 𝐸𝑡1
, 𝐸𝑡2
= (𝐸𝑡1,𝑖 − 𝐸𝑡2,𝑖)2
𝑖=1
www.moving-project.eu
8 of 19
Vector Representation of Entities
• Represent an entity 𝐸 by one-hot encoding
• Extract all unique triples from different snapshots
• Fix order of triples
• e.g., db:Anne_Smith at 𝑡1 is (1,1,1,0,0) and at 𝑡2 is
(1,0,1,1,1)
• Cosine distance: 𝛿 𝑐𝑜𝑠𝑑 𝐸𝑡1
, 𝐸𝑡2
= 1 −
2
3∙ 4
= 0.42
• Euclidean distance: 𝛿 𝑒𝑢𝑐 𝐸𝑡1
, 𝐸𝑡2
= 3
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑡1
1 db:Anne_Smith db:birthplace db:New_York
2 db:Anne_Smith db:works db:Green_University
3 db:Anne_Smith db:spouseOf db:John_Brown
𝑡2
1 db:Anne_Smith db:birthplace db:New_York
4 db:Anne_Smith db:works db:Royal_University
3 db:Anne_Smith db:spouseOf db:John_Brown
5 db:Anne_Smith db:degree db:Master_of_Science
www.moving-project.eu
9 of 19
Temporal Dynamics of Entities
• Temporal Dynamics of an entity 𝐸
• 𝑛: the number of snapshots
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Δ(𝐸) = (𝛿 𝐸𝑡1
, 𝐸𝑡2
, 𝛿 𝐸𝑡2
, 𝐸𝑡3
, ⋯ 𝛿(𝐸𝑡 𝑛−1
, 𝐸𝑡 𝑛
))
2. Represent temporal dynamics of entities by a time-series of the
amount of changes in an entity between two successive
snapshots
Subsequently, we mine the resulted time-series to find out
patterns of temporal dynamics of entities
www.moving-project.eu
10 of 19
Time-series Clustering
Chifumi Nishioka (chni@informatik.uni-kiel.de)
• Clustering algorithm: k-means++ [Arthur and
Vassilvitskii 07]
• Introduce an improved initial seeding into k-means
• Distance measure: Euclidean distance
• The most efficient measure for distance between
time-series with a reasonably high accuracy [Wang et
al. 13]
• Optimization of the number of clusters :
Average Silhouette
Goal 2: Find out the representative temporal patterns of
entity dynamics
www.moving-project.eu
11 of 19
Periodicity Detection
• Periodicity Detection
• A task of detecting periodicity from time-series
• Example 1: (1, 3, 2, 1, 3, 2) -> periodicity of three
• Example 2: (1, 2, 1, 2, 1, 2) -> periodicity of two
• Employ a convolution-based algorithm [Elfeky et al.
05]
Chifumi Nishioka (chni@informatik.uni-kiel.de)
We assume that the amount of changes of entities have
some periodicity
We see the centroids of the resulted clusters as patterns of
entity dynamics
www.moving-project.eu
12 of 19
Dataset
• Dynamic Linked Data Observatory (DyLDO)
dataset [Käfer et al. 12]
• Weekly snapshots of the fixed set of LOD documents
• 165 snapshots over three years (05/2012 to 07/2015)
• Entities in the DyLDO dataset
• Almost 75% of entities appear only at one snapshot
• Focus on entities that appear at >70% of snapshots
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Entity
representation
# of unique
entities in 165
snapshots
# of entities that
appear at >70% of
snapshots
Out 27,788,902 2,909,700
InOut 29,097,929 2,950,533
www.moving-project.eu
13 of 19
Patterns of Entity Dynamics (1/3)
• Analysis with respect to eight conditions
• Conditions are made by two entity representations,
two distance measures, two triple weighting methods
• Result of clustering
• # of clusters are smaller when using combIC
Chifumi Nishioka (chni@informatik.uni-kiel.de)
www.moving-project.eu
14 of 19
Patterns of Entity Dynamics (2/3)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Out Cosine Baseline Out Cosine CombIC
Out Euclidean CombIC
Out Euclidean Baseline
www.moving-project.eu
15 of 19
Patterns of Entity Dynamics (3/3)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
InOut Cosine Baseline InOut Cosine CombIC
InOut Euclidean Baseline InOut Euclidean CombIC
www.moving-project.eu
16 of 19
Periodicity of Entity Dynamics
Chifumi Nishioka (chni@informatik.uni-kiel.de)
We observe periodicities in temporal dynamics of entities
• e.g., “Periodicity of 56” indicates that the amount of
entity changes vary along with one-year cycle
• Different patterns have different periodicities
www.moving-project.eu
17 of 19
Features for Entity Dynamics (1/2)
• Four features of entities
• RDF Type (𝑓1)
• Property (𝑓2)
• Union of RDF types and properties (𝑓3)
• Pay level domain (PLD) of entity URI (𝑓4)
• e.g., http://dbpedia.org/resource/The_Beatles -> dbpedia.org
• Evaluate four features by RandIndex
• RandIndex: a metric of clustering
• Measure the difference of clustering by a feature and
by entity dynamics (i.e., time-series vectors)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Goal 3: Find out which features of entity more likely
define temporal dynamics of entities
www.moving-project.eu
18 of 19
Features for Entity Dynamics (2/2)
• Entities that share a common PLD are more likely
to have similar temporal dynamics of entities
when employing baseline for triple weighting
• When using combIC, entities that have a common
RDF type or ECS more likely to belong a same
cluster
Chifumi Nishioka (chni@informatik.uni-kiel.de)
www.moving-project.eu
19 of 19
Thank you for your
attention!
Project consortium and funding agency
Chifumi Nishioka (chni@informatik.uni-kiel.de)
MOVING is funded by the EU Horizon 2020 Programme under the project number INSO-4-2015: 693092
www.moving-project.eu
20 of 19
Conclusion
• Temporal dynamics of entities on the LOD cloud
• Represent the temporal dynamics of entities as time-
series
• Find out the representative temporal patterns of
entity dynamics
• Find out which features of entity
• Future work
• e.g., SPARQL query caching
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Goal 3: Find out which features of entity more likely
define temporal dynamics of entities
www.moving-project.eu
21 of 19
Appendix
Chifumi Nishioka (chni@informatik.uni-kiel.de)
www.moving-project.eu
22 of 19
Reference
• [Arthur and Vassilvitskii 07] D. Arthur and S. Vassilvitskii. k-means++: The advantages of
careful seeding. SODA, 2007.
• [Elfeky et al. 05] M.G. Elfeky, W.G. Aref, and A.K. Elmagarmid. Periodicity detection in
time series databases. IEEE TKDE, 2005.
• [Käfer et al. 12] T. Käfer, J. Umbrich, A. Hogan, and A. Polleres. Towards a dynamic
linked data observatory. LDOW, 2012.
• [Käfer et al. 13] T. Käfer, A. Abdelrahman, J. Umbrich, P. O’Byrne, and A. Hogan.
Observing linked data dynamics. ESWC, 2013.
• [Neumann and Moerkotte 11] T. Neumann and G. Moerkotte. Characteristic sets:
Accurate cardinality estimation for RDF queries with multiple joins. ICDE, 2011.
• [Schuhmacher and Ponzetto 14] M. Schuhmacher and S.P. Ponzetto. Knowledge-based
graph document modeling. WSDM, 2014.
• [Wang et al. 13] X. Wang, A. Mueen, H. Ding, G. Trajcevski, P. Scheuermann, and E.
Keogh. Experimental comparison of representation methods and distance measures for
time series data. Data Mining and Knowledge Discovery, 2013.
• [Yang and Leskovec 11] J. Yang and J. Leskovec. Patterns of temporal variation in online
media. WSDM, 2011.
Chifumi Nishioka (chni@informatik.uni-kiel.de)
www.moving-project.eu
23 of 19
Entities in the DyLDO dataset
• Distribution of # of times of appearances of
entities in 165 snapshots
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Entity representation Out Entity representation InOut
www.moving-project.eu
24 of 19
Resulted Temporal Patterns (1/4)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Out Cosine Baseline Out Cosine CombIC
www.moving-project.eu
25 of 19
Resulted Temporal Patterns (2/4)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Out Euclidean Baseline Out Euclidean CombIC
www.moving-project.eu
26 of 19
Resulted Temporal Patterns (3/4)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
InOut Cosine Baseline InOut Cosine CombIC
www.moving-project.eu
27 of 19
Resulted Temporal Patterns (4/4)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
InOut Euclidean Baseline InOut Euclidean CombIC
www.moving-project.eu
28 of 19
Presentation
• PROFILES 2016
Chifumi Nishioka (chni@informatik.uni-kiel.de)

More Related Content

What's hot

New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...Grupo HULAT
 
[DOLAP2019] Augmented Business Intelligence
[DOLAP2019] Augmented Business Intelligence[DOLAP2019] Augmented Business Intelligence
[DOLAP2019] Augmented Business IntelligenceUniversity of Bologna
 
[ADBIS 2021] - Optimizing Execution Plans in a Multistore
[ADBIS 2021] - Optimizing Execution Plans in a Multistore[ADBIS 2021] - Optimizing Execution Plans in a Multistore
[ADBIS 2021] - Optimizing Execution Plans in a MultistoreChiara Forresi
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Geoffrey Fox
 
Pattern Recognition in Multiple Bike sharing Systems for comparability
Pattern Recognition in Multiple Bike sharing Systems for comparabilityPattern Recognition in Multiple Bike sharing Systems for comparability
Pattern Recognition in Multiple Bike sharing Systems for comparability Athiq Ahamed
 
Topics of interest for IWPT'01.doc
Topics of interest for IWPT'01.docTopics of interest for IWPT'01.doc
Topics of interest for IWPT'01.docbutest
 
Boolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringBoolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringDmitrii Ignatov
 
Data Science and Online Education
Data Science and Online EducationData Science and Online Education
Data Science and Online EducationGeoffrey Fox
 
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...University of Bologna
 

What's hot (9)

New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
 
[DOLAP2019] Augmented Business Intelligence
[DOLAP2019] Augmented Business Intelligence[DOLAP2019] Augmented Business Intelligence
[DOLAP2019] Augmented Business Intelligence
 
[ADBIS 2021] - Optimizing Execution Plans in a Multistore
[ADBIS 2021] - Optimizing Execution Plans in a Multistore[ADBIS 2021] - Optimizing Execution Plans in a Multistore
[ADBIS 2021] - Optimizing Execution Plans in a Multistore
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...
 
Pattern Recognition in Multiple Bike sharing Systems for comparability
Pattern Recognition in Multiple Bike sharing Systems for comparabilityPattern Recognition in Multiple Bike sharing Systems for comparability
Pattern Recognition in Multiple Bike sharing Systems for comparability
 
Topics of interest for IWPT'01.doc
Topics of interest for IWPT'01.docTopics of interest for IWPT'01.doc
Topics of interest for IWPT'01.doc
 
Boolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringBoolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filtering
 
Data Science and Online Education
Data Science and Online EducationData Science and Online Education
Data Science and Online Education
 
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
 

Viewers also liked

Including financial criteria in the strategic planning of knowledge repositor...
Including financial criteria in the strategic planning of knowledge repositor...Including financial criteria in the strategic planning of knowledge repositor...
Including financial criteria in the strategic planning of knowledge repositor...MOVING Project
 
TRECVID 2016 Ad-hoc Video Search task, CERTH-ITI
TRECVID 2016 Ad-hoc Video Search task, CERTH-ITITRECVID 2016 Ad-hoc Video Search task, CERTH-ITI
TRECVID 2016 Ad-hoc Video Search task, CERTH-ITIMOVING Project
 
TRECVID 2016 POSTER CERTH-ITI
TRECVID 2016 POSTER CERTH-ITITRECVID 2016 POSTER CERTH-ITI
TRECVID 2016 POSTER CERTH-ITIMOVING Project
 
VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...
VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...
VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...MOVING Project
 
Mining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMOVING Project
 
Jak prowadzić konto firmowe na Twitterze?
Jak prowadzić konto firmowe na Twitterze?Jak prowadzić konto firmowe na Twitterze?
Jak prowadzić konto firmowe na Twitterze?Artur Jabłoński
 

Viewers also liked (6)

Including financial criteria in the strategic planning of knowledge repositor...
Including financial criteria in the strategic planning of knowledge repositor...Including financial criteria in the strategic planning of knowledge repositor...
Including financial criteria in the strategic planning of knowledge repositor...
 
TRECVID 2016 Ad-hoc Video Search task, CERTH-ITI
TRECVID 2016 Ad-hoc Video Search task, CERTH-ITITRECVID 2016 Ad-hoc Video Search task, CERTH-ITI
TRECVID 2016 Ad-hoc Video Search task, CERTH-ITI
 
TRECVID 2016 POSTER CERTH-ITI
TRECVID 2016 POSTER CERTH-ITITRECVID 2016 POSTER CERTH-ITI
TRECVID 2016 POSTER CERTH-ITI
 
VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...
VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...
VIDEO AESTHETIC QUALITY ASSESSMENT USING KERNEL SUPPORT VECTOR MACHINE WITH I...
 
Mining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open Data
 
Jak prowadzić konto firmowe na Twitterze?
Jak prowadzić konto firmowe na Twitterze?Jak prowadzić konto firmowe na Twitterze?
Jak prowadzić konto firmowe na Twitterze?
 

Similar to Information theoritic analysis of entity dynamics on the linked open data cloud

A Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationA Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationAnsgar Scherp
 
AUGUR: Forecasting the Emergence of New Research Topics
AUGUR: Forecasting the Emergence of New Research TopicsAUGUR: Forecasting the Emergence of New Research Topics
AUGUR: Forecasting the Emergence of New Research TopicsAngelo Salatino
 
Invited Talk: Early Detection of Research Topics
Invited Talk: Early Detection of Research Topics Invited Talk: Early Detection of Research Topics
Invited Talk: Early Detection of Research Topics Angelo Salatino
 
Temporal models for mining, ranking and recommendation in the Web
Temporal models for mining, ranking and recommendation in the WebTemporal models for mining, ranking and recommendation in the Web
Temporal models for mining, ranking and recommendation in the WebTu Nguyen
 
STRIP: stream learning of influence probabilities.
STRIP: stream learning of influence probabilities.STRIP: stream learning of influence probabilities.
STRIP: stream learning of influence probabilities.Albert Bifet
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasAngelo Salatino
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasAngelo Salatino
 
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly FiguresA Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly FiguresAnsgar Scherp
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverviewMotaz El-Saban
 
Accelerating aggregation using intra-cycle parallelism
Accelerating aggregation using intra-cycle parallelismAccelerating aggregation using intra-cycle parallelism
Accelerating aggregation using intra-cycle parallelismZiqiang Feng
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Dr. Aparna Varde
 
Mining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataAnsgar Scherp
 
The FLuID Meta Model: Incrementally Compute Schema-level Indices for the Web...
The FLuID Meta Model: Incrementally Compute  Schema-level Indices for the Web...The FLuID Meta Model: Incrementally Compute  Schema-level Indices for the Web...
The FLuID Meta Model: Incrementally Compute Schema-level Indices for the Web...Till Blume
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overviewLEE HOSEONG
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong LeeMoazzem Hossain
 
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)Thilo Stadelmann
 
Platforms and the Semantic Web
Platforms and the Semantic WebPlatforms and the Semantic Web
Platforms and the Semantic WebDanny Ayers
 

Similar to Information theoritic analysis of entity dynamics on the linked open data cloud (20)

A Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationA Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document Annotation
 
AUGUR: Forecasting the Emergence of New Research Topics
AUGUR: Forecasting the Emergence of New Research TopicsAUGUR: Forecasting the Emergence of New Research Topics
AUGUR: Forecasting the Emergence of New Research Topics
 
Invited Talk: Early Detection of Research Topics
Invited Talk: Early Detection of Research Topics Invited Talk: Early Detection of Research Topics
Invited Talk: Early Detection of Research Topics
 
Temporal models for mining, ranking and recommendation in the Web
Temporal models for mining, ranking and recommendation in the WebTemporal models for mining, ranking and recommendation in the Web
Temporal models for mining, ranking and recommendation in the Web
 
STRIP: stream learning of influence probabilities.
STRIP: stream learning of influence probabilities.STRIP: stream learning of influence probabilities.
STRIP: stream learning of influence probabilities.
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly FiguresA Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverview
 
Accelerating aggregation using intra-cycle parallelism
Accelerating aggregation using intra-cycle parallelismAccelerating aggregation using intra-cycle parallelism
Accelerating aggregation using intra-cycle parallelism
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
 
Scientific Publication Retrieval in Linked Data
Scientific Publication Retrieval in Linked DataScientific Publication Retrieval in Linked Data
Scientific Publication Retrieval in Linked Data
 
Mining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open Data
 
Multimedia Mining
Multimedia Mining Multimedia Mining
Multimedia Mining
 
The FLuID Meta Model: Incrementally Compute Schema-level Indices for the Web...
The FLuID Meta Model: Incrementally Compute  Schema-level Indices for the Web...The FLuID Meta Model: Incrementally Compute  Schema-level Indices for the Web...
The FLuID Meta Model: Incrementally Compute Schema-level Indices for the Web...
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
 
Brochure-INFIERI-english_0
Brochure-INFIERI-english_0Brochure-INFIERI-english_0
Brochure-INFIERI-english_0
 
Platforms and the Semantic Web
Platforms and the Semantic WebPlatforms and the Semantic Web
Platforms and the Semantic Web
 

More from MOVING Project

Opening up education through digitization. Remarks on recent developments in ...
Opening up education through digitization. Remarks on recent developments in ...Opening up education through digitization. Remarks on recent developments in ...
Opening up education through digitization. Remarks on recent developments in ...MOVING Project
 
MOVING: Applying digital science methodology for TVET
MOVING: Applying digital science methodology for TVETMOVING: Applying digital science methodology for TVET
MOVING: Applying digital science methodology for TVETMOVING Project
 
Learning analytics for reflective learning
Learning analytics for reflective learningLearning analytics for reflective learning
Learning analytics for reflective learningMOVING Project
 
Challenges in Developing Automatic Learning Guidance in Relation to an Inform...
Challenges in Developing Automatic Learning Guidance in Relation to an Inform...Challenges in Developing Automatic Learning Guidance in Relation to an Inform...
Challenges in Developing Automatic Learning Guidance in Relation to an Inform...MOVING Project
 
Unesco mobileweek 2019_frontier_tech_oer-final
Unesco mobileweek 2019_frontier_tech_oer-finalUnesco mobileweek 2019_frontier_tech_oer-final
Unesco mobileweek 2019_frontier_tech_oer-finalMOVING Project
 
Inferring knowledge acquisition through Web navigation behaviour
Inferring knowledge acquisition through Web navigation behaviourInferring knowledge acquisition through Web navigation behaviour
Inferring knowledge acquisition through Web navigation behaviourMOVING Project
 
ITI-CERTH participation in TRECVID 2018
ITI-CERTH participation in TRECVID 2018ITI-CERTH participation in TRECVID 2018
ITI-CERTH participation in TRECVID 2018MOVING Project
 
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...MOVING Project
 
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...MOVING Project
 
VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
VERGE: A Multimodal Interactive Search Engine for Video Browsing and RetrievalVERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
VERGE: A Multimodal Interactive Search Engine for Video Browsing and RetrievalMOVING Project
 
Temporal Lecture Video Fragmentation using Word Embeddings
Temporal Lecture Video Fragmentation using Word EmbeddingsTemporal Lecture Video Fragmentation using Word Embeddings
Temporal Lecture Video Fragmentation using Word EmbeddingsMOVING Project
 
The Impact of Blocking and Name-Matching on Author Disambiguation.
The Impact of Blocking and Name-Matching on Author Disambiguation.The Impact of Blocking and Name-Matching on Author Disambiguation.
The Impact of Blocking and Name-Matching on Author Disambiguation.MOVING Project
 
Effective Unsupervised Author Disambiguation with Relative Frequencies
Effective Unsupervised Author Disambiguation with Relative FrequenciesEffective Unsupervised Author Disambiguation with Relative Frequencies
Effective Unsupervised Author Disambiguation with Relative FrequenciesMOVING Project
 
What to read next? Challenges and Preliminary Results in Selecting Represen...
What to read next? Challenges and  Preliminary Results in Selecting  Represen...What to read next? Challenges and  Preliminary Results in Selecting  Represen...
What to read next? Challenges and Preliminary Results in Selecting Represen...MOVING Project
 
Qualitative Analysis of Vocabulary Evolution on the Linked Open Data Cloud
Qualitative Analysis of Vocabulary Evolution on the Linked Open Data CloudQualitative Analysis of Vocabulary Evolution on the Linked Open Data Cloud
Qualitative Analysis of Vocabulary Evolution on the Linked Open Data CloudMOVING Project
 
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD CloudAnalyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD CloudMOVING Project
 
Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...
Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...
Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...MOVING Project
 
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...MOVING Project
 
Generic to Specific Recognition Models for Membership Analysis in Group Videos
Generic to Specific Recognition Models for Membership Analysis in Group VideosGeneric to Specific Recognition Models for Membership Analysis in Group Videos
Generic to Specific Recognition Models for Membership Analysis in Group VideosMOVING Project
 
MOVING the Industry 4.0
MOVING the Industry 4.0MOVING the Industry 4.0
MOVING the Industry 4.0MOVING Project
 

More from MOVING Project (20)

Opening up education through digitization. Remarks on recent developments in ...
Opening up education through digitization. Remarks on recent developments in ...Opening up education through digitization. Remarks on recent developments in ...
Opening up education through digitization. Remarks on recent developments in ...
 
MOVING: Applying digital science methodology for TVET
MOVING: Applying digital science methodology for TVETMOVING: Applying digital science methodology for TVET
MOVING: Applying digital science methodology for TVET
 
Learning analytics for reflective learning
Learning analytics for reflective learningLearning analytics for reflective learning
Learning analytics for reflective learning
 
Challenges in Developing Automatic Learning Guidance in Relation to an Inform...
Challenges in Developing Automatic Learning Guidance in Relation to an Inform...Challenges in Developing Automatic Learning Guidance in Relation to an Inform...
Challenges in Developing Automatic Learning Guidance in Relation to an Inform...
 
Unesco mobileweek 2019_frontier_tech_oer-final
Unesco mobileweek 2019_frontier_tech_oer-finalUnesco mobileweek 2019_frontier_tech_oer-final
Unesco mobileweek 2019_frontier_tech_oer-final
 
Inferring knowledge acquisition through Web navigation behaviour
Inferring knowledge acquisition through Web navigation behaviourInferring knowledge acquisition through Web navigation behaviour
Inferring knowledge acquisition through Web navigation behaviour
 
ITI-CERTH participation in TRECVID 2018
ITI-CERTH participation in TRECVID 2018ITI-CERTH participation in TRECVID 2018
ITI-CERTH participation in TRECVID 2018
 
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln– Der MOOC "Science ...
 
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...
Wissenschaft 2.0 und offene Forschungsmethoden vermitteln: Der MOOC Science 2...
 
VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
VERGE: A Multimodal Interactive Search Engine for Video Browsing and RetrievalVERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
 
Temporal Lecture Video Fragmentation using Word Embeddings
Temporal Lecture Video Fragmentation using Word EmbeddingsTemporal Lecture Video Fragmentation using Word Embeddings
Temporal Lecture Video Fragmentation using Word Embeddings
 
The Impact of Blocking and Name-Matching on Author Disambiguation.
The Impact of Blocking and Name-Matching on Author Disambiguation.The Impact of Blocking and Name-Matching on Author Disambiguation.
The Impact of Blocking and Name-Matching on Author Disambiguation.
 
Effective Unsupervised Author Disambiguation with Relative Frequencies
Effective Unsupervised Author Disambiguation with Relative FrequenciesEffective Unsupervised Author Disambiguation with Relative Frequencies
Effective Unsupervised Author Disambiguation with Relative Frequencies
 
What to read next? Challenges and Preliminary Results in Selecting Represen...
What to read next? Challenges and  Preliminary Results in Selecting  Represen...What to read next? Challenges and  Preliminary Results in Selecting  Represen...
What to read next? Challenges and Preliminary Results in Selecting Represen...
 
Qualitative Analysis of Vocabulary Evolution on the Linked Open Data Cloud
Qualitative Analysis of Vocabulary Evolution on the Linked Open Data CloudQualitative Analysis of Vocabulary Evolution on the Linked Open Data Cloud
Qualitative Analysis of Vocabulary Evolution on the Linked Open Data Cloud
 
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD CloudAnalyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
 
Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...
Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...
Keeping Linked Open Data Caches Up-to-date by Predicting the Life-time of RDF...
 
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
 
Generic to Specific Recognition Models for Membership Analysis in Group Videos
Generic to Specific Recognition Models for Membership Analysis in Group VideosGeneric to Specific Recognition Models for Membership Analysis in Group Videos
Generic to Specific Recognition Models for Membership Analysis in Group Videos
 
MOVING the Industry 4.0
MOVING the Industry 4.0MOVING the Industry 4.0
MOVING the Industry 4.0
 

Recently uploaded

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Information theoritic analysis of entity dynamics on the linked open data cloud

  • 1. www.moving-project.eu TraininG towards a society of data-saVvy inforMation prOfessionals to enable open leadership INnovation Chifumi Nishioka and Ansgar Scherp ZBW -- Leibniz Information Centre for Economics and Kiel University, Germany Information-theoretic Analysis of Entity Dynamics on the Linked Open Data cloud
  • 2. www.moving-project.eu 2 of 19 Motivation • Understanding the dynamics of the LOD cloud is important for many applications • e.g., SPARQL query caching, crawling strategies, term recommendations • Related work • Evolution of LOD documents [Käfer et al. 13] • Dynamics of LOD sources [Dividino et al. 14] • Entities on the LOD cloud • Used by a lot of applications • Knowledge graph in search engines • Document modeling [Schuhmacher and Ponzetto 14] Chifumi Nishioka (chni@informatik.uni-kiel.de) Come to the presentation of “TermPicker” by Johann Schaible at 14:30 on 1st June (Wednesday) We conduct an analysis focusing on entities
  • 3. www.moving-project.eu 3 of 19 Research Goals • Measure the changes in entities between two points in time • Represent the temporal dynamics of entities as time-series • Time-series clustering • Periodicity detection • Evaluate four different features of entities Chifumi Nishioka (chni@informatik.uni-kiel.de) Goal 1: Represent the temporal dynamics of entities Goal 2: Find out the representative temporal patterns of entity dynamics Goal 3: Find out which features of entity more likely define temporal dynamics of entities
  • 4. www.moving-project.eu 4 of 19 Formalization • 𝑋𝑡: snapshot of LOD documents at point in time 𝑡 • Snapshot is a collection of triples 𝑥 • 𝑥: triple • 𝑥 = 𝑠, 𝑝, 𝑜 : subject, predicate, and object Chifumi Nishioka (chni@informatik.uni-kiel.de)
  • 5. www.moving-project.eu 5 of 19 Entity and Entity Representations • Entities are represented by a set of triples • Entity Representation: Out • Set of triples with common subject URI • e.g., db:John_Brown is defined by two triples • Entity Representation: InOut • Set of triples with common subject URI or object URI • e.g., db:John_Brown is defined by three triples Chifumi Nishioka (chni@informatik.uni-kiel.de) db:Anne_Smith db:spouseOf db:John_Brown db:John_Brown db:birthplace db:Los_Angels db:John_Brown db:works db:Green_University
  • 6. www.moving-project.eu 6 of 19 Triple Weighting • example: Barack Obama • <Barack_Obama, dbp:vicePresident , Joe_Biden> is more important than <Barack_Obama, rdf:type , foaf:Person> • Baseline • All triples have a same weight • Combined Information Content (combIC) [Schuhmacher and Ponzetto 14] • 𝐼𝐶 𝑣 = −log⁡( 𝑃(𝑣)) • 𝑝𝑟𝑒𝑑 𝑥 , 𝑜𝑏𝑗 𝑥 returns predicate and object of a triple 𝑥, respectively Chifumi Nishioka (chni@informatik.uni-kiel.de) 𝑤 𝑏𝑎𝑠𝑒𝑙𝑖𝑛𝑒(𝑥) = 1 Each triple in entities has different importance for entities 𝑤𝑐𝑜𝑚𝑏𝐼𝐶 𝑥 = 𝐼𝐶 𝑝𝑟𝑒𝑑 𝑥 + 𝐼𝐶(𝑜𝑏𝑗(𝑥))
  • 7. www.moving-project.eu 7 of 19 Measuring Entity Dynamics • Cosine distance • Euclidean distance Chifumi Nishioka (chni@informatik.uni-kiel.de) Goal 1: Represent the temporal dynamics of entities 𝛿 𝑐𝑜𝑠𝑑 𝐸𝑡1 , 𝐸𝑡2 = 1 − 𝐸𝑡1 ∙ 𝐸𝑡2 ||𝐸𝑡1 || ∙ | 𝐸𝑡2 | 1. Measure the amount of changes in entities between two successive snapshots by one of two distance measures 𝛿 𝑒𝑢𝑐 𝐸𝑡1 , 𝐸𝑡2 = (𝐸𝑡1,𝑖 − 𝐸𝑡2,𝑖)2 𝑖=1
  • 8. www.moving-project.eu 8 of 19 Vector Representation of Entities • Represent an entity 𝐸 by one-hot encoding • Extract all unique triples from different snapshots • Fix order of triples • e.g., db:Anne_Smith at 𝑡1 is (1,1,1,0,0) and at 𝑡2 is (1,0,1,1,1) • Cosine distance: 𝛿 𝑐𝑜𝑠𝑑 𝐸𝑡1 , 𝐸𝑡2 = 1 − 2 3∙ 4 = 0.42 • Euclidean distance: 𝛿 𝑒𝑢𝑐 𝐸𝑡1 , 𝐸𝑡2 = 3 Chifumi Nishioka (chni@informatik.uni-kiel.de) 𝑡1 1 db:Anne_Smith db:birthplace db:New_York 2 db:Anne_Smith db:works db:Green_University 3 db:Anne_Smith db:spouseOf db:John_Brown 𝑡2 1 db:Anne_Smith db:birthplace db:New_York 4 db:Anne_Smith db:works db:Royal_University 3 db:Anne_Smith db:spouseOf db:John_Brown 5 db:Anne_Smith db:degree db:Master_of_Science
  • 9. www.moving-project.eu 9 of 19 Temporal Dynamics of Entities • Temporal Dynamics of an entity 𝐸 • 𝑛: the number of snapshots Chifumi Nishioka (chni@informatik.uni-kiel.de) Δ(𝐸) = (𝛿 𝐸𝑡1 , 𝐸𝑡2 , 𝛿 𝐸𝑡2 , 𝐸𝑡3 , ⋯ 𝛿(𝐸𝑡 𝑛−1 , 𝐸𝑡 𝑛 )) 2. Represent temporal dynamics of entities by a time-series of the amount of changes in an entity between two successive snapshots Subsequently, we mine the resulted time-series to find out patterns of temporal dynamics of entities
  • 10. www.moving-project.eu 10 of 19 Time-series Clustering Chifumi Nishioka (chni@informatik.uni-kiel.de) • Clustering algorithm: k-means++ [Arthur and Vassilvitskii 07] • Introduce an improved initial seeding into k-means • Distance measure: Euclidean distance • The most efficient measure for distance between time-series with a reasonably high accuracy [Wang et al. 13] • Optimization of the number of clusters : Average Silhouette Goal 2: Find out the representative temporal patterns of entity dynamics
  • 11. www.moving-project.eu 11 of 19 Periodicity Detection • Periodicity Detection • A task of detecting periodicity from time-series • Example 1: (1, 3, 2, 1, 3, 2) -> periodicity of three • Example 2: (1, 2, 1, 2, 1, 2) -> periodicity of two • Employ a convolution-based algorithm [Elfeky et al. 05] Chifumi Nishioka (chni@informatik.uni-kiel.de) We assume that the amount of changes of entities have some periodicity We see the centroids of the resulted clusters as patterns of entity dynamics
  • 12. www.moving-project.eu 12 of 19 Dataset • Dynamic Linked Data Observatory (DyLDO) dataset [Käfer et al. 12] • Weekly snapshots of the fixed set of LOD documents • 165 snapshots over three years (05/2012 to 07/2015) • Entities in the DyLDO dataset • Almost 75% of entities appear only at one snapshot • Focus on entities that appear at >70% of snapshots Chifumi Nishioka (chni@informatik.uni-kiel.de) Entity representation # of unique entities in 165 snapshots # of entities that appear at >70% of snapshots Out 27,788,902 2,909,700 InOut 29,097,929 2,950,533
  • 13. www.moving-project.eu 13 of 19 Patterns of Entity Dynamics (1/3) • Analysis with respect to eight conditions • Conditions are made by two entity representations, two distance measures, two triple weighting methods • Result of clustering • # of clusters are smaller when using combIC Chifumi Nishioka (chni@informatik.uni-kiel.de)
  • 14. www.moving-project.eu 14 of 19 Patterns of Entity Dynamics (2/3) Chifumi Nishioka (chni@informatik.uni-kiel.de) Out Cosine Baseline Out Cosine CombIC Out Euclidean CombIC Out Euclidean Baseline
  • 15. www.moving-project.eu 15 of 19 Patterns of Entity Dynamics (3/3) Chifumi Nishioka (chni@informatik.uni-kiel.de) InOut Cosine Baseline InOut Cosine CombIC InOut Euclidean Baseline InOut Euclidean CombIC
  • 16. www.moving-project.eu 16 of 19 Periodicity of Entity Dynamics Chifumi Nishioka (chni@informatik.uni-kiel.de) We observe periodicities in temporal dynamics of entities • e.g., “Periodicity of 56” indicates that the amount of entity changes vary along with one-year cycle • Different patterns have different periodicities
  • 17. www.moving-project.eu 17 of 19 Features for Entity Dynamics (1/2) • Four features of entities • RDF Type (𝑓1) • Property (𝑓2) • Union of RDF types and properties (𝑓3) • Pay level domain (PLD) of entity URI (𝑓4) • e.g., http://dbpedia.org/resource/The_Beatles -> dbpedia.org • Evaluate four features by RandIndex • RandIndex: a metric of clustering • Measure the difference of clustering by a feature and by entity dynamics (i.e., time-series vectors) Chifumi Nishioka (chni@informatik.uni-kiel.de) Goal 3: Find out which features of entity more likely define temporal dynamics of entities
  • 18. www.moving-project.eu 18 of 19 Features for Entity Dynamics (2/2) • Entities that share a common PLD are more likely to have similar temporal dynamics of entities when employing baseline for triple weighting • When using combIC, entities that have a common RDF type or ECS more likely to belong a same cluster Chifumi Nishioka (chni@informatik.uni-kiel.de)
  • 19. www.moving-project.eu 19 of 19 Thank you for your attention! Project consortium and funding agency Chifumi Nishioka (chni@informatik.uni-kiel.de) MOVING is funded by the EU Horizon 2020 Programme under the project number INSO-4-2015: 693092
  • 20. www.moving-project.eu 20 of 19 Conclusion • Temporal dynamics of entities on the LOD cloud • Represent the temporal dynamics of entities as time- series • Find out the representative temporal patterns of entity dynamics • Find out which features of entity • Future work • e.g., SPARQL query caching Chifumi Nishioka (chni@informatik.uni-kiel.de) Goal 3: Find out which features of entity more likely define temporal dynamics of entities
  • 21. www.moving-project.eu 21 of 19 Appendix Chifumi Nishioka (chni@informatik.uni-kiel.de)
  • 22. www.moving-project.eu 22 of 19 Reference • [Arthur and Vassilvitskii 07] D. Arthur and S. Vassilvitskii. k-means++: The advantages of careful seeding. SODA, 2007. • [Elfeky et al. 05] M.G. Elfeky, W.G. Aref, and A.K. Elmagarmid. Periodicity detection in time series databases. IEEE TKDE, 2005. • [Käfer et al. 12] T. Käfer, J. Umbrich, A. Hogan, and A. Polleres. Towards a dynamic linked data observatory. LDOW, 2012. • [Käfer et al. 13] T. Käfer, A. Abdelrahman, J. Umbrich, P. O’Byrne, and A. Hogan. Observing linked data dynamics. ESWC, 2013. • [Neumann and Moerkotte 11] T. Neumann and G. Moerkotte. Characteristic sets: Accurate cardinality estimation for RDF queries with multiple joins. ICDE, 2011. • [Schuhmacher and Ponzetto 14] M. Schuhmacher and S.P. Ponzetto. Knowledge-based graph document modeling. WSDM, 2014. • [Wang et al. 13] X. Wang, A. Mueen, H. Ding, G. Trajcevski, P. Scheuermann, and E. Keogh. Experimental comparison of representation methods and distance measures for time series data. Data Mining and Knowledge Discovery, 2013. • [Yang and Leskovec 11] J. Yang and J. Leskovec. Patterns of temporal variation in online media. WSDM, 2011. Chifumi Nishioka (chni@informatik.uni-kiel.de)
  • 23. www.moving-project.eu 23 of 19 Entities in the DyLDO dataset • Distribution of # of times of appearances of entities in 165 snapshots Chifumi Nishioka (chni@informatik.uni-kiel.de) Entity representation Out Entity representation InOut
  • 24. www.moving-project.eu 24 of 19 Resulted Temporal Patterns (1/4) Chifumi Nishioka (chni@informatik.uni-kiel.de) Out Cosine Baseline Out Cosine CombIC
  • 25. www.moving-project.eu 25 of 19 Resulted Temporal Patterns (2/4) Chifumi Nishioka (chni@informatik.uni-kiel.de) Out Euclidean Baseline Out Euclidean CombIC
  • 26. www.moving-project.eu 26 of 19 Resulted Temporal Patterns (3/4) Chifumi Nishioka (chni@informatik.uni-kiel.de) InOut Cosine Baseline InOut Cosine CombIC
  • 27. www.moving-project.eu 27 of 19 Resulted Temporal Patterns (4/4) Chifumi Nishioka (chni@informatik.uni-kiel.de) InOut Euclidean Baseline InOut Euclidean CombIC
  • 28. www.moving-project.eu 28 of 19 Presentation • PROFILES 2016 Chifumi Nishioka (chni@informatik.uni-kiel.de)