Presentation of the paper 'Bringing parliamentary debates to the Semantic Web' by Damir Juric, Laura Hollink and Geert-Jan Houben at the workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE2012) in conjunction with the 11th International Semantic Web Conference 2012 in Boston, USA.
See also the homepage of the PoliMedia project: http://polimedia.nl/
ICWE2013 - Discovering links between political debates and mediagjhouben
Discovering links between political debates and media
by Damir Juric, Laura Hollink, Geert-Jan Houben
TU Delft - WIS
at ICWE 2013, Aalborg, Denmark, July 2013
Using Topic Modeling to Study Everyday "Civic Talk" and Proto-political Engag...Tuukka Ylä-Anttila
We present a two-step topic modeling method of analysing political articulations in everyday proto-political "civic talk" on online social media and interpreting them in terms of cultural and political sociology.
Introduction to Research project PoliMediaMartijn Kleppe
Presentation about our research project 'PoliMedia - Interlinking multimedia for the analysis of media coverage of political debates'. Presented at the PoliMedia symposium, 23 January 2013, Amsterdam, the Netherlands
ICWE2013 - Discovering links between political debates and mediagjhouben
Discovering links between political debates and media
by Damir Juric, Laura Hollink, Geert-Jan Houben
TU Delft - WIS
at ICWE 2013, Aalborg, Denmark, July 2013
Using Topic Modeling to Study Everyday "Civic Talk" and Proto-political Engag...Tuukka Ylä-Anttila
We present a two-step topic modeling method of analysing political articulations in everyday proto-political "civic talk" on online social media and interpreting them in terms of cultural and political sociology.
Introduction to Research project PoliMediaMartijn Kleppe
Presentation about our research project 'PoliMedia - Interlinking multimedia for the analysis of media coverage of political debates'. Presented at the PoliMedia symposium, 23 January 2013, Amsterdam, the Netherlands
Building the PoliMedia search system; data- and user-drivenMaxKemman
Presentation at eHumanities group at Meerten's Institute (Amsterdam) on Thursday 18 April 2013.
Analysing media coverage across several types of media-outlets is a challenging task for (media) historians. A specific example of media coverage research investigates the coverage of political debates and how the representation of topics and people change over time. The PoliMedia project (http://www.polimedia.nl) aims to showcase the potential of cross-media analysis for research in the humanities, by 1) curating automatically detected semantic links between four data sets of different media types, and 2) developing a demonstrator application that allows researchers to deploy such an interlinked collection for quantitative and qualitative analysis of media coverage of debates in the Dutch parliament.
These two goals reflect the two perspectives on the development of a search system such as PoliMedia; data- and user-driven. In this presentation, Laura Hollink (VU) will present the data-driven perspective of linking between different datasets and the research questions that arise in achieving this linkage: how to combine different types of datasets and what kind of research questions are made possible by the data? Max Kemman (EUR) will present the user-driven perspective: which benefits can scholars have from linking of these datasets? What are the user requirements for the PoliMedia search system and how was the system evaluated with scholars in an eye tracking study?
Presentation of the Sense4us project at the 2nd European TA Conference - Berlin, 26 February 2015
"Policy Making in a Complex World:
The Opportunities and Risks Presented
by New Technologies"
Series of Leading Change slides illustrate an aspect of my resume, namely a range of early professional experiments related to advancing--in small ways--sources of government innovation: transparency, collaboration, public participation and organization design.
A multifaceted study of online news diversity: issues and methodssmyrnaios
Emmanuel Marty, Nikos Smyrnaios and Franck Rebillard
In Ramón Salaverría (ed.), Diversity of Journalisms, Proceedings of the ECREA Journalism Studies Section and 26th International Conference of Communication (CICOM) at University of Navarra, Pamplona, 4-5 July 2011, p. 228-242
Enriching Linked Open Data with distributional semantics to study concept driftLaura Hollink
Presentation at the "Proximity in Information Retrieval" symposium on the occasion of the PhD thesis defense of Jeroen Vuurens
April 26, 2017, Delft University of Technology
More Related Content
Similar to Bringing parliamentary debates to the Semantic Web
Building the PoliMedia search system; data- and user-drivenMaxKemman
Presentation at eHumanities group at Meerten's Institute (Amsterdam) on Thursday 18 April 2013.
Analysing media coverage across several types of media-outlets is a challenging task for (media) historians. A specific example of media coverage research investigates the coverage of political debates and how the representation of topics and people change over time. The PoliMedia project (http://www.polimedia.nl) aims to showcase the potential of cross-media analysis for research in the humanities, by 1) curating automatically detected semantic links between four data sets of different media types, and 2) developing a demonstrator application that allows researchers to deploy such an interlinked collection for quantitative and qualitative analysis of media coverage of debates in the Dutch parliament.
These two goals reflect the two perspectives on the development of a search system such as PoliMedia; data- and user-driven. In this presentation, Laura Hollink (VU) will present the data-driven perspective of linking between different datasets and the research questions that arise in achieving this linkage: how to combine different types of datasets and what kind of research questions are made possible by the data? Max Kemman (EUR) will present the user-driven perspective: which benefits can scholars have from linking of these datasets? What are the user requirements for the PoliMedia search system and how was the system evaluated with scholars in an eye tracking study?
Presentation of the Sense4us project at the 2nd European TA Conference - Berlin, 26 February 2015
"Policy Making in a Complex World:
The Opportunities and Risks Presented
by New Technologies"
Series of Leading Change slides illustrate an aspect of my resume, namely a range of early professional experiments related to advancing--in small ways--sources of government innovation: transparency, collaboration, public participation and organization design.
A multifaceted study of online news diversity: issues and methodssmyrnaios
Emmanuel Marty, Nikos Smyrnaios and Franck Rebillard
In Ramón Salaverría (ed.), Diversity of Journalisms, Proceedings of the ECREA Journalism Studies Section and 26th International Conference of Communication (CICOM) at University of Navarra, Pamplona, 4-5 July 2011, p. 228-242
Enriching Linked Open Data with distributional semantics to study concept driftLaura Hollink
Presentation at the "Proximity in Information Retrieval" symposium on the occasion of the PhD thesis defense of Jeroen Vuurens
April 26, 2017, Delft University of Technology
Lecture at the advanced course on Data Science of the SIKS research school, May 20, 2016, Vught, The Netherlands.
Contents
-Why do we create Linked Open Data? Example questions from the Humanities and Social Sciences
-Introduction into Linked Open Data
-Lessons learned about the creation of Linked Open Data (link discovery, knowledge representation, evaluation).
-Accessing Linked Open Data
Presentation at Digital Humanities Benelux 2015, Antwerp, Belgium: The possibilities and challenges of using linked data for academic research: the case of the Talk of Europe project. linked data for academic research: the case of the Talk of Europe project. Laura Hollink, Martijn Kleppe, Max Kemman, Astrid van Aggelen, Willem Robert Van Hage.
WWW2013: Web Usage Mining with Semantic AnalysisLaura Hollink
Laura Hollink, Peter Mika and Roi Blanco. Web Usage Mining with Semantic Analysis. In proceedings of the International World Wide Web Conference, Rio de Janeiro, Brazil, May 2013.
Bringing parliamentary debates to the Semantic Web
1. Bringing parliamentary debates to the Semantic Web
Damir Juric1,3, Laura Hollink2, Geert-Jan Houben1
1 Delft University of Technology, 2 VU University Amsterdam, 3 FER University of Zagreb
DERIVE 2012
Boston, 12.11.2012.
2. Motivation
Cross-media comparison:
• What choices do different media make in the coverage of people and topics while
reporting on political events?
• Does the representation of topics and people change over time and how do the
various media types differ?
3. Motivation
Political events
Media
Cross-media comparison:
• What choices do different media make in the coverage of people and topics while
reporting on political events?
• Does the representation of topics and people change over time and how do the
various media types differ?
4. Background: the
PoliMedia project
• Funded by CLARIN-NL
• May 2012 - May 2013
• 3 phases :
I. modeling phase: creating
a semantic model (this
presentation)
II. data production phase:
creating links between
political events and media
III.application phase:
searching and navigating
linked datasets
• www.polimedia.nl
5. Research questions
• How to represent political events on the Semantic Web?
• How to represent links between media and political events on
the Semantic Web?
6. Research questions
• How to represent political events on the Semantic Web?
• How to represent links between media and political events on
the Semantic Web?
7. Political events data set
• Events: Dutch parliamentary debates
Handelingen der Staten-General or Dutch Hansard
• Some provenance:
1. Transcripts are made of the complete
debates of the Dutch parliament.
2. Published online by the government on
http://www.statengeneraaldigitaal.nl/ (1818
1995) and http://
officielebekendmakingen.nl/ (from 1995)
3. PoliticalMashup project has translated
government pdf and txt files into XML, incl
URI’s as identifiers, see http://
politicalmashup.nl/
4. We build on that.
8. Media data sets
• newspaper articles and radio bulletins
• at the National Library of the Netherlands
• Many, mostly regional news papers 1950-
1995
• Text + images of newspaper layout
• newscasts
• at the Netherlands institute for Sound and
Vision
• evening news and current affairs
programs
• metadata in Dublin Core and CDMI format
• enriched with thesaurus terms from the
Gemeenschappelijke Thesaurus
Audiovisuele Archieven (GTAA)
9. Semantic model: what do we need to represent? 1/2
• Important information for every parliamentary debate is: Debate
• When the debate was held Metadata
• What is being said in the debate (topics)
Topic 1
• Who is giving the speeches in the debate and in which
role (persons)
Speaker 1 / Content
• Additional information about actors involved in the
event (names of the politicians, their party, age, etc.)
Speaker 2 / Content
• Structure: Subparts of the debate have their own
identifiers (part of the debate where only one speaker
can be identified as actor) Speaker 3 / Content
• chronological order (the order in which the subparts
where occurring inside the parliament debate,
• Named entities apart from politicians (persons, Topic 2
locations, etc.)
Speaker 1 / Content
10. Semantic model: what do we need to represent? 2/2
• Various information about media
items linked to the debate
• Links between subparts of the
debate and news articles, radio
bulletins and television newscasts
11. URI’s
• PoliMedia vocabulary: http://purl.org/linkedpolitics/nl/polivoc#Speech
• Politicians, parties: http://purl.org/linkedpolitics/nl/poli#Beel
• debates and part of debates: http://purl.org/linkedpolitics/nl/nl.proc.sgd.d.
198219830000846.2.11.12
• Media articles, bulletins and news casts: http://resolver.kb.nl/resolve?urn=ddd:
010069811:mpeg21:pdf
17. Semantic model W.R. van Hage, V. Malaisé, R.
Segers, L. Hollink and A.Th.
Schreiber. Design and use of
the Simple Event Model
(SEM)
18. Semantic model W.R. van Hage, V. Malaisé, R.
Segers, L. Hollink and A.Th.
Schreiber. Design and use of
the Simple Event Model
(SEM)
19. Current work: finding links
• Queries: speaker name + named entities + topics (created using
topic modeling methods) extracted from political events dataset
• used for retrieval of media articles
TopicList =
NamedEntitiesVector TopicWordSetVector NamedEntitiesVector TopicWordSetVector
Speech Speech PartOfDebate PartOfDebate
+
Speaker X =
ActorFromSpeech TimeFrame
20. Finally
• SPARQL endpoint with the PoliMedia vocabulary + RDF of Dutch Hansard
data will be available soon.
• Feel free to use it!
• Links to media + search/browse app are expected early next year.
21. Thank you for your
attention!
Henri Beunders (EUR) Damir Juric (TU Delft)
Jaap Blom (NISV) Max Kemman (EUR)
Laura Hollink (VU) Martijn Kleppe (EUR)
Geert-Jan Houben (TU Delft) Johan Oomen (NISV)