Analyzing Published and Consumed
Digital & Digitized News
Martijn Kleppe
Vrije Universiteit Amsterdam
m.kleppe@vu.nl
www.martijnkleppe.nl
@martijnkleppe
Slides on Slideshare:
bit.ly/LeuvenKleppe
Social Media: Incubators of a renewed news media landscape
27 November 2015
Leuven
Analyzing Published and Consumed
Digital & Digitized News
Newstracker
www.polimedia.nl
How do media cover
debates in Dutch Parliament?
http://www.tweedekamer.nl/hoe_werkt_het/de_tweede_kamer_in_beeld/handelingenkamer
https://commons.wikimedia.org/wiki/File:Hilversum-Nieuwjaarsborrel_WMNL_2015_bij_Beeld_en_Geluid_(9).JPG
PoliMedia approach
PoliMedia: search through parliamentary debates (Dutch Digital Parliament, KB), linked to:
• Newspapers (KB)
• Television (Sound and Vision)
• Radio (KB)
Link debates to news items
Intuition 1: The news item contains a topic and/or the name of a
politician and is published within a week after a debate.
Intuition 2: The more overlap in topics and named entities, the
more probable it is that there is a link.
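The two intuitions can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class names, the pre-extracted `terms` sets, and the use of Jaccard overlap as the score are all hypothetical choices, not the scoring that PoliMedia itself uses.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class Debate:
    debate_id: str
    held_on: date
    terms: frozenset  # topics and named entities, assumed to be extracted already


@dataclass(frozen=True)
class NewsItem:
    item_id: str
    published_on: date
    terms: frozenset


def link_score(debate, item, window_days=7):
    """Intuition 1: the item must appear within the window and share a term.
    Intuition 2: more overlap gives a higher score (Jaccard, an assumption)."""
    delta = item.published_on - debate.held_on
    if not (timedelta(0) <= delta <= timedelta(days=window_days)):
        return 0.0
    shared = debate.terms & item.terms
    if not shared:
        return 0.0
    return len(shared) / len(debate.terms | item.terms)


def rank_links(debate, items):
    """Return (score, item_id) pairs for candidate links, best first."""
    scored = [(link_score(debate, item), item.item_id) for item in items]
    return sorted([pair for pair in scored if pair[0] > 0], reverse=True)


# Made-up example data
debate = Debate("d1", date(2014, 4, 10), frozenset({"zorg", "mantelzorg", "Van Rijn"}))
items = [
    NewsItem("a", date(2014, 4, 14), frozenset({"mantelzorg", "Van Rijn", "gemeente"})),
    NewsItem("b", date(2014, 4, 25), frozenset({"mantelzorg", "Van Rijn", "gemeente"})),
    NewsItem("c", date(2014, 4, 11), frozenset({"zorg"})),
    NewsItem("d", date(2014, 4, 9), frozenset({"zorg", "mantelzorg"})),
]
ranked = rank_links(debate, items)
```

Item `b` is dropped because it falls outside the one-week window and item `d` because it predates the debate; `a` ranks above `c` because it shares more terms with the debate.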
www.polimedia.nl
Not only USE data
Also GIVE data
“Give me all fragments of
debates with over 60
related news items”
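# Speeches (debate fragments) that are linked to more than 60 news items
SELECT ?speech ?no_news_items
WHERE {
  {
    # Inner query: count the linked news items per speech
    SELECT ?speech (COUNT(?news) AS ?no_news_items)
    WHERE {
      ?speech <http://purl.org/linkedpolitics/nl/polivoc#coveredAt> ?news .
    }
    GROUP BY ?speech
  }
  FILTER (?no_news_items > 60)
}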
SPARQL Endpoint
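To make the logic of the query concrete, here is a minimal Python sketch that performs the same aggregation on an in-memory list of triples. The speech and news identifiers are made up for illustration; only the `coveredAt` predicate comes from the slide, and no call to the real endpoint is made.

```python
from collections import Counter

COVERED_AT = "http://purl.org/linkedpolitics/nl/polivoc#coveredAt"


def speeches_with_many_news_items(triples, threshold=60):
    """Count coveredAt links per speech (the GROUP BY / COUNT step) and
    keep the speeches whose count exceeds the threshold (the FILTER step)."""
    counts = Counter(s for s, p, _ in triples if p == COVERED_AT)
    return {speech: n for speech, n in counts.items() if n > threshold}


# Hypothetical data: speech A has 61 linked news items, speech B exactly 60
triples = [("speech:A", COVERED_AT, f"news:{i}") for i in range(61)]
triples += [("speech:B", COVERED_AT, f"news:{i}") for i in range(60)]
triples.append(("speech:A", "http://example.org/otherPredicate", "x"))

result = speeches_with_many_news_items(triples)
```

Speech B is excluded because the filter asks for strictly more than 60 items.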
• Yeah! It works (but no television)
• Not perfect
• But still ok (recall: 62%; precision: 80%)
• It is open for everyone: www.polimedia.nl
• + via a Sparql Endpoint
• People actually use it
Results
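For readers unfamiliar with the two evaluation measures, the sketch below computes them in the standard way: precision is the share of returned links that are correct, recall is the share of correct links that were returned. The item identifiers and the tiny gold standard are invented for illustration and are unrelated to the actual PoliMedia evaluation data.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a set of retrieved items
    against a gold-standard set of relevant items."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical gold standard and system output
gold = {"n1", "n2", "n3", "n4", "n5"}
found = {"n1", "n2", "n3", "n9"}
precision, recall = precision_recall(found, gold)
```

In this toy case three of the four returned items are correct (precision 0.75) and three of the five relevant items are found (recall 0.6).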
NRC Handelsblad, Ewoud Sanders, Voor al haar mantelzorgen, 14 April 2014
“Another digital source
I often use is PoliMedia.nl”
Yeah! An article in
NRC HANDELSBLAD!
• We want more: social media, television, recent data
Results
Credits
Martijn Kleppe
Max Kemman
Henri Beunders
Laura Hollink
Damir Juric
Geert Jan Houben
Jaap Blom
Johan Oomen
Financed by Data files
Newstracker
The New News Consumer
www.news-use.com
Newstracker
http://www.nrc.nl/apps/bigboard/
THAT, HOW, WHAT?
Focus on most
read articles
24/7 News
consumption?
THAT, HOW, WHAT?
What genres of news websites
do news users consume 24/7?
www.metricsfornews.com
For what do news users
consume these websites 24/7?
How does the consumption of news websites
fit in their everyday surfing behavior?
Reading
Watching
Viewing
Listening
Checking
Snacking
Monitoring
Searching
Clicking
The Newstracker
• Collects web activities
• Of specified & authenticated users
• Via a custom built system
• That collects & cleans web activities
• Extracts textual & visual content of news websites
• And stores this as one dataset
The Newstracker
Web activity is a lot…
And monitoring everything is quite privacy intrusive…
So selection and structure are needed, via:
• Whitelist of 4,000 websites
• Labels indicating genre of website
• Subgenres of News and Information websites
The Newstracker
Internet
Cleaning & Processing:
Deduplicate, Only websites on whitelist, Add labels
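The three cleaning steps can be sketched as follows. This is a minimal illustration, not the Newstracker code: the record layout, the `WHITELIST` entries and genre labels, and the rule of matching on the host name without a leading `www.` are all assumptions.

```python
from urllib.parse import urlparse

# Hypothetical whitelist: domain -> genre label
WHITELIST = {
    "vi.nl": "News and Information",
    "shop.example": "Shopping",
}


def domain_of(url):
    """Return the host name of a URL without a leading 'www.'."""
    host = urlparse(url).netloc.lower()
    return host[4:] if host.startswith("www.") else host


def clean(records, whitelist):
    """Deduplicate, keep only whitelisted websites, and add a genre label."""
    seen = set()
    cleaned = []
    for user, timestamp, url in records:
        key = (user, timestamp, url)
        if key in seen:  # step 1: deduplicate
            continue
        seen.add(key)
        genre = whitelist.get(domain_of(url))
        if genre is None:  # step 2: only websites on the whitelist
            continue
        cleaned.append(  # step 3: add labels
            {"user": user, "time": timestamp, "url": url, "genre": genre}
        )
    return cleaned


# Made-up input records: (user, timestamp, url)
records = [
    ("user28", "26-4-2015 16:53:05", "http://www.vi.nl/home.htm"),
    ("user28", "26-4-2015 16:53:05", "http://www.vi.nl/home.htm"),
    ("user28", "26-4-2015 16:55:00", "http://not-on-whitelist.example/page"),
    ("user28", "26-4-2015 16:56:00", "http://shop.example/cart"),
]
cleaned = clean(records, WHITELIST)
```

Filtering on the whitelist before storing anything also serves the privacy concern raised on the previous slide: visits to sites outside the whitelist never reach the dataset.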
The Newstracker
• April – July 2015
• 42 participants: students
• Laptop main device
• 16,162 registered, relevant & labelled URLs
• 20 in-depth interviews
Results
News and Information
Shopping
Education
Search Engines
Video, Music & Radio
Visited websites per genre during the day
Website selection depends on
genre & time of the day
Visited News and Information websites
during the day
[Chart: share (0%-100%) of visited News and Information websites per subgenre (Generic, Lifestyle, Remarkable) by time of day: Night (00.00-06.00), Waking up (06.00-09.00), Morning (09.00-12.00), Lunch (12.00-13.00), Afternoon (13.00-17.00), Eve (17.00-19.00), Evening (19.00-00.00)]
Subgenre of News and Information
website is a determining factor
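A breakdown like the one in the chart can be derived from labelled visits by bucketing each visit into a time slot and computing the share of each subgenre per slot. The sketch below uses the slot boundaries from the chart; treating each slot as starting at its lower bound (inclusive) and the sample visits are assumptions made for illustration.

```python
from collections import Counter, defaultdict

# Slot start hours taken from the chart axis
SLOTS = [
    (0, "Night"),
    (6, "Waking up"),
    (9, "Morning"),
    (12, "Lunch"),
    (13, "Afternoon"),
    (17, "Eve"),
    (19, "Evening"),
]


def slot_for(hour):
    """Map an hour (0-23) to the name of its time slot."""
    name = SLOTS[0][1]
    for start, label in SLOTS:
        if hour >= start:
            name = label
    return name


def subgenre_shares(visits):
    """Return, per time slot, the fraction of visits for each subgenre."""
    counts = defaultdict(Counter)
    for hour, subgenre in visits:
        counts[slot_for(hour)][subgenre] += 1
    return {
        slot: {genre: n / sum(c.values()) for genre, n in c.items()}
        for slot, c in counts.items()
    }


# Made-up visits: (hour of day, subgenre)
visits = [
    (8, "Generic"), (8, "Generic"), (8, "Lifestyle"), (8, "Remarkable"),
    (12, "Generic"),
    (23, "Lifestyle"),
]
shares = subgenre_shares(visits)
```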
Which, When, How?
How?
                Via Homepage    Via Referral
TOTAL           59%             41%
General News    64%             36%
Lifestyle       49%             51%
Remarkable      48%             52%
“Lindanieuws.nl is more
entertainment. Sometimes I
really think ‘this makes no
sense’, but it is fun to read. It’s
more entertainment than true
news, the way I consume it”.
Lean-back
Snacking
“Fashion is my hobby”.
• Visits same websites everyday
• In same order
• Starts at homepage
Lean-forward
monitoring
Lean-forward
monitoring
Lean-back
Snacking
Date Time URL
26-4-2015 16:53:05 http://www.vi.nl/home.htm
26-4-2015 16:54:02 http://www.vi.nl/nieuws/promes-maakt-opnieuw-het-verschil-voor-spartak.htm
26-4-2015 17:00:20 http://www.soccernews.nl/news/313971/Kramer_wil_PSV-aanvallers_verslaan:_Ik_sta_er_dichtbij
26-4-2015 17:01:51 http://www.google.nl/
26-4-2015 17:02:01 http://en.wikipedia.org/wiki/Michiel_Kramer
26-4-2015 17:02:15 http://en.wikipedia.org/wiki/Mike_van_Duinen
26-4-2015 17:02:23 http://en.wikipedia.org/wiki/Gervane_Kastaneer
26-4-2015 17:03:00 http://nl.wikipedia.org/wiki/Wilmer_Kousemaker
26-4-2015 17:03:09 http://en.wikipedia.org/wiki/Wilmer_Kousemaker
26-4-2015 17:04:15 http://nl.wikipedia.org/wiki/Benny_Kerstens
26-4-2015 17:04:39 http://nl.wikipedia.org/wiki/Aykut_Demir
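A log in this Date / Time / URL layout is straightforward to process. The sketch below parses a few of the rows shown above and estimates how long the user stayed on each page as the gap until the next logged URL; that definition of time on a page is an assumption made here, not a measure taken from the study.

```python
from datetime import datetime
from urllib.parse import urlparse

# A few rows copied from the log above
LOG = """\
26-4-2015 16:53:05 http://www.vi.nl/home.htm
26-4-2015 16:54:02 http://www.vi.nl/nieuws/promes-maakt-opnieuw-het-verschil-voor-spartak.htm
26-4-2015 17:01:51 http://www.google.nl/
26-4-2015 17:02:01 http://en.wikipedia.org/wiki/Michiel_Kramer
"""


def parse_line(line):
    """Split a 'date time url' row into a datetime and the URL."""
    day, clock, url = line.split(maxsplit=2)
    when = datetime.strptime(f"{day} {clock}", "%d-%m-%Y %H:%M:%S")
    return when, url


def seconds_until_next(log_text):
    """Return (host, seconds until the next logged URL) for each visit
    except the last, which has no successor."""
    visits = [parse_line(row) for row in log_text.splitlines() if row.strip()]
    gaps = []
    for (start, url), (end, _) in zip(visits, visits[1:]):
        gaps.append((urlparse(url).netloc, int((end - start).total_seconds())))
    return gaps


gaps = seconds_until_next(LOG)
```

Sequences of short gaps on Wikipedia pages, as in the rows above, are the kind of pattern that the quote on this slide describes.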
“Via VI.nl, I get to the Wikipedia
entry of e.g. Wesley Sneijder,
and then I look at a teammate
of Sneijder and think ‘Hey! You’ve
been playing at that club for years’,
and then I click further.”
Lean-forward
monitoring
Serendipitous
consumption
Conclusion
News consumption 24/7
BUT….
Which website
When
In which order
≠ Subgenre News Website
=
Personal interest plays
essential role in what
they consider to be
news,
and determines the
pattern of everyday
news consumption
What’s next-1?
What’s next-2?
Different user groups:
• Different age groups
• Regional News
• Tech news
• Other countries
Requires:
• Updated website whitelist
• Updated scraping templates
What’s next-3?
What role do form and content play?
26-4-2015 16:52:59 user28 http://www.bbc.co.uk/sport/0/football/32470569
26-4-2015 16:53:02 user28 http://www.bbc.com/sport/0/football/32470569
26-4-2015 16:53:05 user28 http://www.vi.nl/home.htm
26-4-2015 16:54:02 user28 http://www.vi.nl/nieuws/promes-maakt-opnieuw-het-verschil-voor-spartak.htm
26-4-2015 17:00:20 user28 http://www.soccernews.nl/news/313971/Kramer_wil_PSV-aanvallers_verslaan:_Ik_sta_er_dichtbij
News + Sport
Next step:
• Automated Content Analysis of text on topic + style
• Visual Content Analysis
Acknowledgements
The New News Consumer
www.news-use.com
Marco Otte, Hildebrand Bijleveld, Leonie Durlinger, Stefan Heijdra
Irene Costera Meijer, Marcel Broersma, Tim Groot Kormelink, Chris Peters, Joelle Swart, Anna van Cauwenberge
Questions?
Martijn Kleppe
Vrije Universiteit Amsterdam
m.kleppe@vu.nl
@martijnkleppe
www.martijnkleppe.nl
www.polimedia.nl
www.news-use.com


Editor's Notes

  1. I am a media scholar/historian, and a typical research question of mine is this: how do media cover debates in the Dutch Parliament?
  2. Back in the old days (let’s say five years ago) I had to go to several places to find my sources. For example, I went to the National Library of the Netherlands (KB) in The Hague, where I could read the analog minutes of the Dutch Parliament.
  3. There I also had to find the old newspapers and go through them manually.
  4. The same goes for the radio bulletins, which are great sources because, as you can see, they contain handwriting. But the horrible thing was that everything had to be done manually.
  5. That changed when this great material got digitized. I can now look up recent newspapers in the Lexis Nexis database, for example. It is a great database, but for me there is one big downside: it contains neither images (in which I am particularly interested) nor whole pages, only single articles.
  6. Another great database I can now use is the Academia database of Sound and Vision. I can search through the metadata of programs from home or the office and watch the broadcasts. The downside is that I do not know which programs are in there and, even more importantly, that its system and search engine are completely different from those of Lexis Nexis. So I do have digitised materials, but there is still a lot of work for me, since I need to understand how these different databases work.
  7. And this is where PoliMedia comes in. With PoliMedia we have built a portal in which you can search through the digital minutes of the Dutch Parliament on any keyword or person. But PoliMedia is linked to media databases such as the digitised newspaper collection of the KB, Television broadcasts of Sound and Vision and Radio Bulletins at the KB.
  8. What PoliMedia does is basically the following. After you performed a query it searches for topics a/o names in all the newspapers within a week after a debate. And then calculates the most probable link by looking at the overlap in topics and named entities.
  9. And that looks like this. We made an open website which you can all visit via www.polimedia.nl. You can type your query in the Google-like search box.
  10. And on the results page you will see all debates in which the query is found. On the left you can filter your results (which is something we built as well) and on the right you see the magic of PoliMedia. Here it automatically shows how many and which media items were retrieved that cover this particular debate.
  11. After you click on a result, you will see the whole debate with your query highlighted, and on the right side you will see the links to the relevant media items.
  12. After clicking on one of those, you will get to the newspaper item as it is stored at the National Library, so in their own interface, like this one for the newspapers.
  13. Behind PoliMedia there is a database in which all our links are available in RDF. You can also search through this data without using PoliMedia.nl, via a SPARQL Endpoint. You are then more flexible and can ask more complex questions, such as: “Give me all fragments of debates with over 60 related news items”.
  14. Recall. Recall is the ratio between the number of relevant documents found and the total number of relevant documents that could have been found. The latter is a 'wish list' drawn up in advance, often called the 'ground truth' or 'gold standard'. Precision is the ratio between the number of relevant results (documents, hits) and the total number of results returned by the system.
  15. Now, there are already quite a few tools to monitor what people do on your website. Everyone who owns a website probably knows Google Analytics, which gives very good insights into the clicks on your website. A similar tool that a lot of publishers are using is Chartbeat.
  16. And with Chartbeat you can actually make this kind of dashboard. This is the so-called Big Board of NRC Handelsblad, a leading Dutch newspaper that made this dashboard open to the public. In the newsroom these dashboards are constantly shown on screens, giving the editors real-time information on what their website visitors are currently reading.
  18. We thus see a difference between the type of news website and how people end up at it. But: it is tempting to draw bold conclusions, yet this does not mean everyone does it like that. This is where our qualitative analyses come in.
  19. Arrives via Facebook!
  20. Arrives via the homepage!
  21. In short, we see a 24/7 pattern, but which sites are visited, when, and how is not determined by the type of website but by the individual users, who are all different.
  22. What we already have: the URLs. We know this is a news website, and we made subcategories for the News category, so that is already added in the file. We have scraped the textual and visual content of the websites. But the difficult part comes now: what do the text and image say? That is what we are currently working on, by deploying an automated content analysis of the text on both the topic (soccer, ADO Den Haag) and the style: how is the news item written? In a factual manner, in a loose manner, etc. Plus we want to analyse the image: what does it tell us? Who is in it? What is the topic?
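
The linking step from note 8 and the recall and precision measures from note 14 can be sketched together as below. This is only a minimal illustration, not PoliMedia's actual implementation: the Jaccard overlap score, the 0.15 threshold, every class and function name, and the toy debate and news items are assumptions made up for this example. Only the one-week window and the idea of scoring by overlap in topics and named entities come from the notes.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class Debate:
    debate_id: str
    held_on: date
    terms: frozenset  # topics and politician names, assumed already extracted


@dataclass(frozen=True)
class NewsItem:
    item_id: str
    published_on: date
    terms: frozenset


def link_score(debate, item, window_days=7):
    """Overlap of terms (Jaccard, an assumed measure); 0.0 outside the window."""
    delta = item.published_on - debate.held_on
    if not timedelta(0) <= delta <= timedelta(days=window_days):
        return 0.0  # intuition 1: only items within a week after the debate
    union = debate.terms | item.terms
    # intuition 2: more shared topics and names means a more probable link
    return len(debate.terms & item.terms) / len(union) if union else 0.0


def find_links(debates, items, threshold=0.15):
    """Return (debate_id, item_id) pairs whose score reaches the threshold."""
    return {
        (d.debate_id, i.item_id)
        for d in debates
        for i in items
        if link_score(d, i) >= threshold
    }


def precision_recall(found, ground_truth):
    """Precision: correct / returned. Recall: correct / all relevant."""
    correct = len(found & ground_truth)
    precision = correct / len(found) if found else 0.0
    recall = correct / len(ground_truth) if ground_truth else 0.0
    return precision, recall


# Hypothetical toy data, not taken from PoliMedia.
debate = Debate("d1", date(2014, 4, 8), frozenset({"care", "rutte", "budget"}))
items = [
    NewsItem("n1", date(2014, 4, 10), frozenset({"care", "rutte", "cities"})),
    NewsItem("n2", date(2014, 4, 20), frozenset({"care", "rutte", "budget"})),
    NewsItem("n3", date(2014, 4, 9), frozenset({"soccer", "ado den haag"})),
    NewsItem("n4", date(2014, 4, 11), frozenset({"budget", "tax", "pensions"})),
]
ground_truth = {("d1", "n1"), ("d1", "n2")}  # hand-made 'gold standard'

found = find_links([debate], items)
precision, recall = precision_recall(found, ground_truth)
print(f"found={sorted(found)} precision={precision:.2f} recall={recall:.2f}")
```

In this toy run, n2 is relevant but published twelve days after the debate, so the one-week window misses it and recall drops. n4 shares only one generic term yet still passes the threshold, which lowers precision. A real system would have to tune the window and threshold against a much larger ground truth.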