1. What is the problem? How can we deal with concept drift? Open questions
Studying Concept Drift in Political Ontologies
Shenghui Wang (1,2), Stefan Schlobach (1), Janet Takens (2), Wouter van Atteveldt (2)
(1) Department of Computer Science
(2) Department of Communication Science
Vrije Universiteit Amsterdam
Workshop on Matching and Meaning 2010
Content analysis in Communication Science
Communication scientists study all sorts of media content
related to human communication
Content analysis based on the NET method
concepts: political actors and issues
relations: associations, opinions, or actions.
Example
Het Openbaar Ministerie (OM) wil de komende vier jaar mensenhandel uitroeien.
(The Public Prosecution Service (OM) wants to eradicate human trafficking in the next four years.)
Coded as: om --(-1)--> human trafficking
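The coded example above can be held in a small data structure. The sketch below is only an illustration: the class name `NetStatement`, its field names, and the assumption that the relation value lies in [-1, 1] are not taken from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NetStatement:
    """One coded statement: subject, object, and a signed relation value.

    Hypothetical structure for illustration; not the authors' format.
    """
    subject: str    # concept from the coding book, e.g. a political actor
    obj: str        # concept from the coding book, e.g. an issue
    quality: float  # signed relation value, assumed here to lie in [-1, 1]

    def __post_init__(self):
        # Reject values outside the assumed range.
        if not -1.0 <= self.quality <= 1.0:
            raise ValueError("quality must lie in [-1, 1]")


# The example sentence: the OM wants to eradicate human trafficking.
statement = NetStatement(subject="om", obj="human trafficking", quality=-1.0)
print(statement)
```

A collection of such statements, one per coded sentence, is the raw material from which a semantic network can be built.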
Network-based communication science study
Politicians are networking
Politics is perceived by citizens via media
Media study by semantic network analysis
Who is determining the subjects?
Who is teaming up?
Who is more credible?
Who owns which topic?
Before network analysis
We first need to extract networks!
How do we extract such networks?
Requires: large corpora with annotated textual content
Manual coding against coding books (ontologies)
Automated content analysis in progress
What is the problem?
Problems with constructing annotated content
Data from different time periods or genres
Coded by different teams at different moments
Manifesto Research Group: 25 countries, from 1945 to 2006
Comparative Policy Agendas project: media content,
manifestos, legislative texts, government press statements, etc.
Election campaign coverage from 1994 to 2006
What is the problem?
Problem 1: Interoperability when sharing information
Different coding books should be merged or at least connected
Example concepts: illegal immigration, labour migrants
What is the problem, again?
Everything changes, quickly or slowly ...
Follow the Fashion?
Women’s role?
Suffragettes said that women’s role in society is unacceptable
Pope says that women’s role in society is unacceptable
Concept drift is a problem
Problem 2: Concept drift
Meaning of concepts changes over time
Analysis based on evolving concepts must consider temporal
locality
Studying concept drift is itself useful
Datasets
Five political ontologies used to annotate newspaper articles
23,639 manually annotated newspaper articles from five recent Dutch national election campaigns
Manual mappings between the ontologies exist, but most mapped concepts are lexically very similar
What are the main issues?
What is concept drift?
How do we detect the concept drift?
How do we represent the concept drift?
How do we evaluate the concept drift?
How do we use the concept drift?
What is concept drift?
Definition
The meaning of a concept
Label
Intension
Extension
Questions:
Can all three change at the same time?
Should a concept have a rigid part that stays constant all the time?
Detecting concept drift
Detecting concept drift in terms of
its label
its extension: instance-based mapping between different time periods
its intension: using its hierarchical information and the co-occurrence links to other concepts of the same period
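One way to picture the instance-based (extension) comparison is sketched below. It assumes that each concept's extension is the set of articles annotated with it in one period, that an extension can be summarised as a bag of words, and that cosine similarity is an acceptable measure. These choices, the function names, and the toy English texts are illustrative assumptions, not the method used in the study.

```python
import math
from collections import Counter


def extension_profile(articles):
    """Aggregate word counts over all articles annotated with one concept."""
    profile = Counter()
    for text in articles:
        profile.update(text.lower().split())
    return profile


def cosine(p, q):
    """Cosine similarity between two word-count profiles."""
    dot = sum(p[w] * q[w] for w in p.keys() & q.keys())
    norm = (math.sqrt(sum(v * v for v in p.values()))
            * math.sqrt(sum(v * v for v in q.values())))
    return dot / norm if norm else 0.0


def map_concepts(old, new):
    """Rank later-period concepts by similarity for each earlier concept."""
    new_profiles = {c: extension_profile(a) for c, a in new.items()}
    result = {}
    for concept, articles in old.items():
        p = extension_profile(articles)
        scores = [(cosine(p, q), c) for c, q in new_profiles.items()]
        result[concept] = sorted(scores, reverse=True)
    return result


# Toy data (invented for illustration): concept -> annotated article texts.
old_period = {
    "politie": ["police arrest suspects", "police patrol streets"],
    "bouwfraude": ["builders fraud inquiry"],
}
new_period = {
    "politie": ["police patrol city streets"],
    "corruptie": ["fraud inquiry into builders and officials"],
}

mapping = map_concepts(old_period, new_period)
for concept, ranked in mapping.items():
    print(concept, "->", ranked[0][1])
```

In this toy run the earlier concept bouwfraude is matched to the later concept corruptie, which is the kind of cross-period link that a label comparison alone would miss.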
Representing concept drift: Builder's fraud
[Figure: drift graph for the concept bouwfraude (builder's fraud) across the 2002, 2003 and 2006 ontologies, with weighted links to related concepts such as criminaliteit, fraude_en_corruptie, corruptie, klokkenluider, belangenverstrengeling, justitie, bestuurlijke_vernieuwing and parlementaire_enquete_algemeen.]
Representing concept drift: Police
[Figure: drift graph for the concept politie (police) across the 1994, 1998, 2002, 2003 and 2006 ontologies, with weighted links to related concepts such as criminaliteit (crime), justitie (justice), criminaliteitsbestrijding (crime fighting) and criminelen (criminals).]
What kinds of concept drift can we detect?
Association shifting
[Figure: the 1994 concept best_vernieuwin related to 1998 concepts including hervorming_politiebestel, gekozen_burgemeester, soc_vernieuwing and bestuurlijke_vernieuwin; link weights range from 0.018 to 0.082.]
What kind of concept drift can we detect?
Generalising or specialising
[Figure: the 2003 concept builder's fraud related to the 2006 concepts criminality, fraud and corruption, corruption and builder's fraud; link weights 0.038 and 0.043.]
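A minimal sketch of how a generalising or specialising drift might be labelled once an earlier concept has been matched to a later one. It assumes the later ontology is available as a child-to-parent map and that the earlier label still exists in it. The hierarchy below is a guess at the 2006 structure in the builder's fraud example, and the function names are hypothetical.

```python
def ancestors(concept, parent):
    """Return the ancestors of a concept, nearest first."""
    chain = []
    seen = {concept}
    while concept in parent:
        concept = parent[concept]
        if concept in seen:  # guard against accidental cycles
            break
        seen.add(concept)
        chain.append(concept)
    return chain


def drift_relation(old_concept, matched, parent):
    """Label the relation between an earlier concept and its later match."""
    if matched == old_concept:
        return "unchanged"
    if matched in ancestors(old_concept, parent):
        return "generalised"   # the match is broader than the old concept
    if old_concept in ancestors(matched, parent):
        return "specialised"   # the match is narrower than the old concept
    return "other"


# Assumed child -> parent links for the later ontology (illustrative only).
parent_2006 = {
    "builder's fraud": "fraud and corruption",
    "corruption": "fraud and corruption",
    "fraud and corruption": "criminality",
}

print(drift_relation("builder's fraud", "fraud and corruption", parent_2006))
print(drift_relation("fraud and corruption", "corruption", parent_2006))
```

Matches that are neither ancestors nor descendants, such as sibling concepts, fall into "other" and would need a separate treatment.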
What kind of concept drift can we detect?
Overlapping
[Figure: the 1994 concept vno_ncw related to 1998 concepts including werkgevers, wgv_werkgevers, vno and koppeling; link weights range from 0.013 to 0.077.]
Open questions
What other types of concept drift can we identify and
automatically detect?
What is an appropriate (formal) representation for the
detected drift?
How can we evaluate the detected concept drift, both
qualitatively and quantitatively?
Thank you