SlideShare a Scribd company logo
1 of 18
Download to read offline
Political Discourse in the
News 

(and other studies)
Antske Fokkens, VU University Amsterdam

Political discourse in the News is joint work with: Ellis Aizenberg, Wouter van Atteveldt, Carlotta
Cassimassima, Franz-Xaver Geiger, Laura Hollink, Annick van der Peet, Chantal van Son
Overview
• Introduction
• Interdisciplinary research & research questions
• Text analysis
• From basic to complex: possibilities and challenges
• Methodological issues
• Conclusion
Introduction
• Interdisciplinary research:
• Social Science: manual annotation, research
questions
• Humanities: research questions
• Computer Science: modeling, visualization
• Computational Linguistics: text analysis
Introduction
• Research questions:
• Has personalization increased in political news?
• What trends do we see in reported political
conflicts?
• How does news reporting relate to the
parliamentary debates?
• What perspectives are expressed by news
(explicitly and implicitly)?
Approaches
• Manual annotations:
• Expert (communication science researchers
and Master students)
• Crowd (crowdsourcing)
• Automatic annotation:
• Basic as well as advanced NLP approaches
Text analysis
• AmCAT (Wouter van Atteveldt):
• Open source infrastructure that facilitates large-scale
analysis and manual content analysis of text
• BiographyNet/NewsReader pipeline (Piek Vossen’s cltl group):
• NLP modules for event (and event relation) extraction &
named entity recognition and disambiguation
• OpeNER tools (Piek Vossen’s cltl group):
• Sentiment analysis and opinion mining
Basic methods
• Counting:
• occurrences of names in text
• identifying words from word lists (e.g.
sentiment words)
• Topic modeling (e.g. LDA)
Basic methods
• Can easily be run on large datasets
• Can address research questions (e.g. Aizenberg
(2014) shows increase of personalization)
• Limited to overall trends and tendencies
• For some tasks, high risk of unreliable results:
• e.g. erg is listed with ‘negative sentiment’
More advanced analyses
More advanced analyses
• Can provide more detailed insight into the content of the text
• Scalability becomes an issue (several complex language
models)
• to illustrate:
• +/- 5 minutes per article (regular university cluster)
• 11 days for 1.3 million articles on Hadoop cluster at
SURFsara
• Accuracy can be low for difficult tasks and because errors ‘pile
up’
Methodological issues
• Data interpretation
• Biases
• Example: OCR
Data interpretation
• Basic methods:
• results from counts are clear, but what do they
say?
• More advanced methods:
• attempt to provide semantic interpretations,
but what is the accuracy of the tools?
Biases
• One way to deal with errors is to assume that it is just noise in
a large pile of data
• This assumption works, if errors are equally distributed across
classes/information that matter for the research question
• For instance, counting sentiment related terms:
• are the lists for negative and positive terms of comparable
quality?
• does one of the list contain more ambiguous terms than the
other?
Bias example OCR
• Data from the KB still have some issues with OCR
• There tend to be more issues with older data
• Imagine we investigate whether emotional
expressions in text increased over time:



Does worse OCR lead to a lower percentage of
identification in older text?
Dealing with biases
• We cannot exclude the risk of biases completely
• We can:
• try to make sure researchers using output are
aware of the details of the method (raise
awareness of possible biases)
• carry out both intrinsic and extrinsic evaluation,
i.e. explicitly investigate the influence of a bias
on overall results
Conclusion
• Several research directions where technology (including NLP, linked
data, visualizations) is used to support research in Humanities and
Social Sciences
• NLP approaches vary from basic to complex pipelines carrying out
several steps
• Basic approaches can easily be applied to large datasets are
transparent, but do not say much
• More advanced approaches provide detailed information, but cannot
easily be applied to large sets and are less transparent
• Insight into how data was processed and both intrinsic and extrinsic
evaluation is needed to raise awareness about (or even avoid?) biases
Thank you!
References
• AmCAT: http://vanatteveldt.com/amcat/
• BiographyNet/NewsReader pipeline:
• Rodrigo Agerri et al. (2015). Event Detection version 2.2. NewsReader
Deliverable 4.2.2. http://www.newsreader-project.eu/files/2012/12/NWR-D4-2-2.pdf
• Methodological issues:
• Antske Fokkens, Serge ter Braake, Niels Ockeloen, Piek Vossen, Susan Legêne
and Guus Schreiber. 2014. BiographyNet: Methodological issues when NLP
supports historical research. Proceedings of LREC 2014.

http://www.lrec-conf.org/proceedings/lrec2014/pdf/1103_Paper.pdf
• Niels Ockeloen, Antske Fokkens, Serge ter Braake, Piek Vossen, Victor de
Boer, Guus Schreiber, and Susan Legêne. 2013. BiographyNet: Managing
Provenance at multiple levels and from different perspectives. Proceedings of
Linked Science 2013. http://linkedscience.org/wp-content/uploads/2013/04/paper7.pdf

More Related Content

What's hot

Automated arXiv feeds on Twitter: On the role of bots in scholarly communication
Automated arXiv feeds on Twitter:On the role of bots in scholarly communicationAutomated arXiv feeds on Twitter:On the role of bots in scholarly communication
Automated arXiv feeds on Twitter: On the role of bots in scholarly communicationStefanie Haustein
 
Data Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & NormalityData Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & NormalityIkbal Ahmed
 
Emil Pulido on Qualitative Research: Analyzing Qualitative Data
Emil Pulido on Qualitative Research: Analyzing Qualitative DataEmil Pulido on Qualitative Research: Analyzing Qualitative Data
Emil Pulido on Qualitative Research: Analyzing Qualitative DataEmilEJP
 
Qualitative Data Analysis
Qualitative Data AnalysisQualitative Data Analysis
Qualitative Data Analysisguest7f1ad678
 
Quantitative data analysis - Attitudes Towards Research
Quantitative data analysis - Attitudes Towards ResearchQuantitative data analysis - Attitudes Towards Research
Quantitative data analysis - Attitudes Towards ResearchLee Cox
 
EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)
EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)
EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)Videoconferencias UTPL
 
Communities of attention' around journal papers: Who is tweeting about scient...
Communities of attention' around journal papers: Who is tweeting about scient...Communities of attention' around journal papers: Who is tweeting about scient...
Communities of attention' around journal papers: Who is tweeting about scient...Stefanie Haustein
 
Qualitative data analysis: many approaches to understand user insights
Qualitative data analysis: many approaches to understand user insightsQualitative data analysis: many approaches to understand user insights
Qualitative data analysis: many approaches to understand user insightsAgnieszka Szóstek
 
Qualitative data analysis - Student L
Qualitative data analysis - Student LQualitative data analysis - Student L
Qualitative data analysis - Student LLee Cox
 
NLP applicata a LIS
NLP applicata a LISNLP applicata a LIS
NLP applicata a LISnoemiricci2
 
Social Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentSocial Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentVaticle
 
Research Data Explored: Citations versus Altmetrics
Research Data Explored: Citations versus AltmetricsResearch Data Explored: Citations versus Altmetrics
Research Data Explored: Citations versus AltmetricsElisabeth Lex
 
Lesson 4 secondary research 2
Lesson 4   secondary research 2Lesson 4   secondary research 2
Lesson 4 secondary research 2Kavita Parwani
 
Semantometrics: Text Analysis in Research Evaluation
Semantometrics: Text Analysis in Research Evaluation Semantometrics: Text Analysis in Research Evaluation
Semantometrics: Text Analysis in Research Evaluation Dasha Herrmannova
 
Task of Correlation Research Questions
Task of Correlation Research QuestionsTask of Correlation Research Questions
Task of Correlation Research QuestionsHATS
 
Qualitative Data Analysis (Steps)
Qualitative Data Analysis (Steps)Qualitative Data Analysis (Steps)
Qualitative Data Analysis (Steps)guest7f1ad678
 

What's hot (20)

Automated arXiv feeds on Twitter: On the role of bots in scholarly communication
Automated arXiv feeds on Twitter:On the role of bots in scholarly communicationAutomated arXiv feeds on Twitter:On the role of bots in scholarly communication
Automated arXiv feeds on Twitter: On the role of bots in scholarly communication
 
Presentation1
Presentation1Presentation1
Presentation1
 
Data Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & NormalityData Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & Normality
 
Emil Pulido on Qualitative Research: Analyzing Qualitative Data
Emil Pulido on Qualitative Research: Analyzing Qualitative DataEmil Pulido on Qualitative Research: Analyzing Qualitative Data
Emil Pulido on Qualitative Research: Analyzing Qualitative Data
 
Qualitative Data Analysis
Qualitative Data AnalysisQualitative Data Analysis
Qualitative Data Analysis
 
Literature review
Literature reviewLiterature review
Literature review
 
Quantitative data analysis - Attitudes Towards Research
Quantitative data analysis - Attitudes Towards ResearchQuantitative data analysis - Attitudes Towards Research
Quantitative data analysis - Attitudes Towards Research
 
Qualitative data analysis
Qualitative data analysisQualitative data analysis
Qualitative data analysis
 
EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)
EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)
EDUCATIONAL RESEARCH II (II Bimetsre Abril Agosto 2011)
 
Communities of attention' around journal papers: Who is tweeting about scient...
Communities of attention' around journal papers: Who is tweeting about scient...Communities of attention' around journal papers: Who is tweeting about scient...
Communities of attention' around journal papers: Who is tweeting about scient...
 
Qualitative data analysis: many approaches to understand user insights
Qualitative data analysis: many approaches to understand user insightsQualitative data analysis: many approaches to understand user insights
Qualitative data analysis: many approaches to understand user insights
 
Qualitative data analysis - Student L
Qualitative data analysis - Student LQualitative data analysis - Student L
Qualitative data analysis - Student L
 
NLP applicata a LIS
NLP applicata a LISNLP applicata a LIS
NLP applicata a LIS
 
Social Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentSocial Graphs for Better Drug Development
Social Graphs for Better Drug Development
 
Research Data Explored: Citations versus Altmetrics
Research Data Explored: Citations versus AltmetricsResearch Data Explored: Citations versus Altmetrics
Research Data Explored: Citations versus Altmetrics
 
Lesson 4 secondary research 2
Lesson 4   secondary research 2Lesson 4   secondary research 2
Lesson 4 secondary research 2
 
Semantometrics: Text Analysis in Research Evaluation
Semantometrics: Text Analysis in Research Evaluation Semantometrics: Text Analysis in Research Evaluation
Semantometrics: Text Analysis in Research Evaluation
 
Task of Correlation Research Questions
Task of Correlation Research QuestionsTask of Correlation Research Questions
Task of Correlation Research Questions
 
Bauer, H., Schui, G. & Krampen, G. (2012, August).From analogue to digital ps...
Bauer, H., Schui, G. & Krampen, G. (2012, August).From analogue to digital ps...Bauer, H., Schui, G. & Krampen, G. (2012, August).From analogue to digital ps...
Bauer, H., Schui, G. & Krampen, G. (2012, August).From analogue to digital ps...
 
Qualitative Data Analysis (Steps)
Qualitative Data Analysis (Steps)Qualitative Data Analysis (Steps)
Qualitative Data Analysis (Steps)
 

Viewers also liked

News Search Using Discourse Analytics
News Search Using Discourse AnalyticsNews Search Using Discourse Analytics
News Search Using Discourse AnalyticsClaudiu Mihăilă
 
Media discourse
Media discourseMedia discourse
Media discourse1101989
 
Critical discourse analysis of the ideology of media presented through news
Critical discourse analysis of the ideology of media presented through newsCritical discourse analysis of the ideology of media presented through news
Critical discourse analysis of the ideology of media presented through newsMazhar Ranjha
 
Eight Elements of News
Eight Elements of NewsEight Elements of News
Eight Elements of NewsLogan Aimone
 

Viewers also liked (6)

News Search Using Discourse Analytics
News Search Using Discourse AnalyticsNews Search Using Discourse Analytics
News Search Using Discourse Analytics
 
Discourse & newspaper
Discourse &  newspaperDiscourse &  newspaper
Discourse & newspaper
 
Media discourse
Media discourseMedia discourse
Media discourse
 
Media discourse
Media discourseMedia discourse
Media discourse
 
Critical discourse analysis of the ideology of media presented through news
Critical discourse analysis of the ideology of media presented through newsCritical discourse analysis of the ideology of media presented through news
Critical discourse analysis of the ideology of media presented through news
 
Eight Elements of News
Eight Elements of NewsEight Elements of News
Eight Elements of News
 

Similar to 15. political discourseinthenewskb

Not just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciencesNot just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciencesUoLResearchSupport
 
Twitter data analysis using R
Twitter data analysis using RTwitter data analysis using R
Twitter data analysis using Rsantoshi mangalgi
 
Altmetrics for kla final
Altmetrics for kla finalAltmetrics for kla final
Altmetrics for kla finalSarah Sutton
 
A Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis IA Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis IUNCResearchHub
 
A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...
A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...
A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...Lauri Eloranta
 
A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)UNCResearchHub
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master SpecialisationArjen de Vries
 
Perceptual Data_04182016
Perceptual Data_04182016Perceptual Data_04182016
Perceptual Data_04182016Kunal Dash
 
ch 2 Tools of Research.docx
ch 2 Tools of Research.docxch 2 Tools of Research.docx
ch 2 Tools of Research.docxssuserf200491
 
BiographyNet: Linking the world of History
BiographyNet: Linking the world of HistoryBiographyNet: Linking the world of History
BiographyNet: Linking the world of HistoryBiographyNet
 
Social Media Analytics
Social Media AnalyticsSocial Media Analytics
Social Media AnalyticsMuhammad Rifqi
 
Sparc-Japan-Slow-revolution-in-scholarly-communication
Sparc-Japan-Slow-revolution-in-scholarly-communicationSparc-Japan-Slow-revolution-in-scholarly-communication
Sparc-Japan-Slow-revolution-in-scholarly-communicationhierohiero
 
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Timo Wandhoefer
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Paul Groth
 
Week 10 writing research proposal
Week 10  writing research proposalWeek 10  writing research proposal
Week 10 writing research proposalwawaaa789
 
Cite track presentation
Cite track presentationCite track presentation
Cite track presentationAmir Razmjou
 
Writing a Scientific Article
Writing a Scientific ArticleWriting a Scientific Article
Writing a Scientific ArticleHythm Shibl
 
Evolving and emerging scholarly communication services in libraries: public a...
Evolving and emerging scholarly communication services in libraries: public a...Evolving and emerging scholarly communication services in libraries: public a...
Evolving and emerging scholarly communication services in libraries: public a...Claire Stewart
 

Similar to 15. political discourseinthenewskb (20)

Not just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciencesNot just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciences
 
Twitter data analysis using R
Twitter data analysis using RTwitter data analysis using R
Twitter data analysis using R
 
Altmetrics for kla final
Altmetrics for kla finalAltmetrics for kla final
Altmetrics for kla final
 
A Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis IA Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis I
 
A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...
A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...
A Summary of Computational Social Science - Lecture 8 in Introduction to Comp...
 
A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master Specialisation
 
Perceptual Data_04182016
Perceptual Data_04182016Perceptual Data_04182016
Perceptual Data_04182016
 
ch 2 Tools of Research.docx
ch 2 Tools of Research.docxch 2 Tools of Research.docx
ch 2 Tools of Research.docx
 
BiographyNet: Linking the world of History
BiographyNet: Linking the world of HistoryBiographyNet: Linking the world of History
BiographyNet: Linking the world of History
 
Social Media Analytics
Social Media AnalyticsSocial Media Analytics
Social Media Analytics
 
Sparc-Japan-Slow-revolution-in-scholarly-communication
Sparc-Japan-Slow-revolution-in-scholarly-communicationSparc-Japan-Slow-revolution-in-scholarly-communication
Sparc-Japan-Slow-revolution-in-scholarly-communication
 
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Week 10 writing research proposal
Week 10  writing research proposalWeek 10  writing research proposal
Week 10 writing research proposal
 
Qualitative research methods kanhaiya sapkota
Qualitative research methods kanhaiya sapkotaQualitative research methods kanhaiya sapkota
Qualitative research methods kanhaiya sapkota
 
Cite track presentation
Cite track presentationCite track presentation
Cite track presentation
 
00952
0095200952
00952
 
Writing a Scientific Article
Writing a Scientific ArticleWriting a Scientific Article
Writing a Scientific Article
 
Evolving and emerging scholarly communication services in libraries: public a...
Evolving and emerging scholarly communication services in libraries: public a...Evolving and emerging scholarly communication services in libraries: public a...
Evolving and emerging scholarly communication services in libraries: public a...
 

More from ingeangevaare

17. kb.nederlab.20150324
17. kb.nederlab.2015032417. kb.nederlab.20150324
17. kb.nederlab.20150324ingeangevaare
 
14 hollink poli_media
14 hollink poli_media14 hollink poli_media
14 hollink poli_mediaingeangevaare
 
13. dunning europeana newspapers-kb big data workshop
13. dunning europeana newspapers-kb big data workshop13. dunning europeana newspapers-kb big data workshop
13. dunning europeana newspapers-kb big data workshopingeangevaare
 
12 janssen wikiproject_verzetskranten
12 janssen wikiproject_verzetskranten12 janssen wikiproject_verzetskranten
12 janssen wikiproject_verzetskranteningeangevaare
 
10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus
10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus
10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpusingeangevaare
 
09. crijns delpher nieuweaanwas
09. crijns delpher nieuweaanwas09. crijns delpher nieuweaanwas
09. crijns delpher nieuweaanwasingeangevaare
 
08. ham auteursrecht kb-kranten
08. ham auteursrecht kb-kranten08. ham auteursrecht kb-kranten
08. ham auteursrecht kb-kranteningeangevaare
 
07 verheul texcavator
07 verheul texcavator07 verheul texcavator
07 verheul texcavatoringeangevaare
 
05. de boer dutchships
05. de boer dutchships05. de boer dutchships
05. de boer dutchshipsingeangevaare
 
04 wijfjes pillarisationin_historicalresources
04 wijfjes pillarisationin_historicalresources04 wijfjes pillarisationin_historicalresources
04 wijfjes pillarisationin_historicalresourcesingeangevaare
 
02 claeyssens het_kb-krantencorpus
02 claeyssens het_kb-krantencorpus02 claeyssens het_kb-krantencorpus
02 claeyssens het_kb-krantencorpusingeangevaare
 
20111117 pdfa angevaare
20111117 pdfa angevaare20111117 pdfa angevaare
20111117 pdfa angevaareingeangevaare
 
20100906 kenniskoppelen
20100906 kenniskoppelen20100906 kenniskoppelen
20100906 kenniskoppeleningeangevaare
 

More from ingeangevaare (17)

17. kb.nederlab.20150324
17. kb.nederlab.2015032417. kb.nederlab.20150324
17. kb.nederlab.20150324
 
16 depuydt inl
16 depuydt inl16 depuydt inl
16 depuydt inl
 
14 hollink poli_media
14 hollink poli_media14 hollink poli_media
14 hollink poli_media
 
13. dunning europeana newspapers-kb big data workshop
13. dunning europeana newspapers-kb big data workshop13. dunning europeana newspapers-kb big data workshop
13. dunning europeana newspapers-kb big data workshop
 
12 janssen wikiproject_verzetskranten
12 janssen wikiproject_verzetskranten12 janssen wikiproject_verzetskranten
12 janssen wikiproject_verzetskranten
 
11 wilms kb_lab
11 wilms kb_lab11 wilms kb_lab
11 wilms kb_lab
 
10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus
10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus
10 hellema tekstkwaliteit_en_tekstverbetering_kb_corpus
 
09. crijns delpher nieuweaanwas
09. crijns delpher nieuweaanwas09. crijns delpher nieuweaanwas
09. crijns delpher nieuweaanwas
 
08. ham auteursrecht kb-kranten
08. ham auteursrecht kb-kranten08. ham auteursrecht kb-kranten
08. ham auteursrecht kb-kranten
 
07 verheul texcavator
07 verheul texcavator07 verheul texcavator
07 verheul texcavator
 
06 traub
06 traub06 traub
06 traub
 
05. de boer dutchships
05. de boer dutchships05. de boer dutchships
05. de boer dutchships
 
04 wijfjes pillarisationin_historicalresources
04 wijfjes pillarisationin_historicalresources04 wijfjes pillarisationin_historicalresources
04 wijfjes pillarisationin_historicalresources
 
02 claeyssens het_kb-krantencorpus
02 claeyssens het_kb-krantencorpus02 claeyssens het_kb-krantencorpus
02 claeyssens het_kb-krantencorpus
 
20111117 pdfa angevaare
20111117 pdfa angevaare20111117 pdfa angevaare
20111117 pdfa angevaare
 
20100923i pres
20100923i pres20100923i pres
20100923i pres
 
20100906 kenniskoppelen
20100906 kenniskoppelen20100906 kenniskoppelen
20100906 kenniskoppelen
 

Recently uploaded

GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...
GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...
GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...Delhi Call Girls
 
##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas Whats Up Number
##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas  Whats Up Number##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas  Whats Up Number
##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas Whats Up NumberMs Riya
 
Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…
Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…
Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…nishakur201
 
VIP Call Girls Pune Vani 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Vani 8617697112 Independent Escort Service PuneVIP Call Girls Pune Vani 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Vani 8617697112 Independent Escort Service PuneCall girls in Ahmedabad High profile
 
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Roomishabajaj13
 
(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service
(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service
(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation -  Humble BeginningsZechariah Boodey Farmstead Collaborative presentation -  Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginningsinfo695895
 
CBO’s Recent Appeals for New Research on Health-Related Topics
CBO’s Recent Appeals for New Research on Health-Related TopicsCBO’s Recent Appeals for New Research on Health-Related Topics
CBO’s Recent Appeals for New Research on Health-Related TopicsCongressional Budget Office
 
Cunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile Service
Cunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile ServiceCunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile Service
Cunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile ServiceHigh Profile Call Girls
 
Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...ResolutionFoundation
 
VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...
VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...
VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...Garima Khatri
 
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service
(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service
(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...narwatsonia7
 
EDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxEDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxaaryamanorathofficia
 
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...anilsa9823
 

Recently uploaded (20)

Call Girls In Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In  Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCeCall Girls In  Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
 
GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...
GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...
GFE Call Girls Service Indira Nagar Lucknow \ 9548273370 Indian Call Girls Se...
 
##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas Whats Up Number
##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas  Whats Up Number##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas  Whats Up Number
##9711199012 Call Girls Delhi Rs-5000 UpTo 10 K Hauz Khas Whats Up Number
 
Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…
Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…
Goa Escorts WhatsApp Number South Goa Call Girl … 8588052666…
 
VIP Call Girls Pune Vani 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Vani 8617697112 Independent Escort Service PuneVIP Call Girls Pune Vani 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Vani 8617697112 Independent Escort Service Pune
 
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
 
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
 
(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service
(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service
(SHINA) Call Girls Khed ( 7001035870 ) HI-Fi Pune Escorts Service
 
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation -  Humble BeginningsZechariah Boodey Farmstead Collaborative presentation -  Humble Beginnings
Zechariah Boodey Farmstead Collaborative presentation - Humble Beginnings
 
CBO’s Recent Appeals for New Research on Health-Related Topics
CBO’s Recent Appeals for New Research on Health-Related TopicsCBO’s Recent Appeals for New Research on Health-Related Topics
CBO’s Recent Appeals for New Research on Health-Related Topics
 
Cunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile Service
Cunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile ServiceCunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile Service
Cunningham Road Call Girls Bangalore WhatsApp 8250192130 High Profile Service
 
Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...
 
Rohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...
VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...
VIP Mumbai Call Girls Andheri West Just Call 9920874524 with A/C Room Cash on...
 
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
 
(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service
(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service
(ANIKA) Call Girls Wadki ( 7001035870 ) HI-Fi Pune Escorts Service
 
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
 
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
 
EDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxEDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptx
 
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
Lucknow 💋 Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8...
 

15. political discourseinthenewskb

  • 1. Political Discourse in the News 
 (and other studies) Antske Fokkens, VU University Amsterdam
 Political discourse in the News is joint work with: Ellis Aizenberg, Wouter van Atteveldt, Carlotta Cassimassima, Franz-Xaver Geiger, Laura Hollink, Annick van der Peet, Chantal van Son
  • 2. Overview • Introduction • Interdisciplinary research & research questions • Text analysis • From basic to complex: possibilities and challenges • Methodological issues • Conclusion
  • 3. Introduction • Interdisciplinary research: • Social Science: manual annotation, research questions • Humanities: research questions • Computer Science: modeling, visualization • Computational Linguistics: text analysis
  • 4. Introduction • Research questions: • Has personalization increased in political news? • What trends do we see in reported political conflicts? • How does news reporting relate to the parliamentary debates? • What perspectives are expressed by news (explicitly and implicitly)?
  • 5. Approaches • Manual annotations: • Expert (communication science researchers and Master students) • Crowd (crowdsourcing) • Automatic annotation: • Basic as well as advanced NLP approaches
  • 6. Text analysis • AmCAT (Wouter van Atteveldt): • Open source infrastructure that facilitates large-scale analysis and manual content analysis of text • BiographyNet/NewsReader pipeline (Piek Vossen’s cltl group): • NLP modules for event (and event relation) extraction & named entity recognition and disambiguation • OpeNER tools (Piek Vossen’s cltl group): • Sentiment analysis and opinion mining
  • 7. Basic methods • Counting: • occurrences of names in text • identifying words from word lists (e.g. sentiment words) • Topic modeling (e.g. LDA)
  • 8. Basic methods • Can easily be run on large datasets • Can address research questions (e.g. Aizenberg (2014) shows increase of personalization) • Limited to overall trends and tendencies • For some tasks, high risk of unreliable results: • e.g. erg is listed with ‘negative sentiment’
  • 10. More advanced analyses • Can provide more detailed insight into the content of the text • Scalability becomes an issue (several complex language models) • to illustrate: • +/- 5 minutes per article (regular university cluster) • 11 days for 1.3 million articles on Hadoop cluster at SURFsara • Accuracy can be low for difficult tasks and because errors ‘pile up’
  • 11. Methodological issues • Data interpretation • Biases • Example: OCR
  • 12. Data interpretation • Basic methods: • results from counts are clear, but what do they say? • More advanced methods: • attempt to provide semantic interpretations, but what is the accuracy of the tools?
  • 13. Biases • One way to deal with errors is to assume that it is just noise in a large pile of data • This assumption works, if errors are equally distributed across classes/information that matter for the research question • For instance, counting sentiment related terms: • are the lists for negative and positive terms of comparable quality? • does one of the list contain more ambiguous terms than the other?
  • 14. Bias example OCR • Data from the KB still have some issues with OCR • There tend to be more issues with older data • Imagine we investigate whether emotional expressions in text increased over time:
 
 Does worse OCR lead to a lower percentage of identification in older text?
  • 15. Dealing with biases • We cannot exclude the risk of biases completely • We can: • try to make sure researchers using output are aware of the details of the method (raise awareness of possible biases) • carry out both intrinsic and extrinsic evaluation, i.e. explicitly investigate the influence of a bias on overall results
  • 16. Conclusion • Several research directions where technology (including NLP, linked data, visualizations) is used to support research in Humanities and Social Sciences • NLP approaches vary from basic to complex pipelines carrying out several steps • Basic approaches can easily be applied to large datasets are transparent, but do not say much • More advanced approaches provide detailed information, but cannot easily be applied to large sets and are less transparent • Insight into how data was processed and both intrinsic and extrinsic evaluation is needed to raise awareness about (or even avoid?) biases
  • 18. References • AmCAT: http://vanatteveldt.com/amcat/ • BiographyNet/NewsReader pipeline: • Rodrigo Agerri et al. (2015). Event Detection version 2.2. NewsReader Deliverable 4.2.2. http://www.newsreader-project.eu/files/2012/12/NWR-D4-2-2.pdf • Methodological issues: • Antske Fokkens, Serge ter Braake, Niels Ockeloen, Piek Vossen, Susan Legêne and Guus Schreiber. 2014. BiographyNet: Methodological issues when NLP supports historical research. Proceedings of LREC 2014.
 http://www.lrec-conf.org/proceedings/lrec2014/pdf/1103_Paper.pdf • Niels Ockeloen, Antske Fokkens, Serge ter Braake, Piek Vossen, Victor de Boer, Guus Schreiber, and Susan Legêne. 2013. BiographyNet: Managing Provenance at multiple levels and from different perspectives. Proceedings of Linked Science 2013. http://linkedscience.org/wp-content/uploads/2013/04/paper7.pdf