Useful.Beautiful.Data: social media
− To produce official statistics you need DATA
▪ It’s getting more and more difficult to collect data from respondents
• Response burden
• Decreasing response rates
• Mode effects (CAPI/PAPI/CATI/CAWI)
− What are the alternatives?
▪ Admin data sources (since the 1980s)
▪ BIG DATA (NOW), such as social media
The glass is half full
Potential of social media
− 3 million public messages produced every day in the Netherlands
▪ Mainly on Twitter and Facebook (~60%)
▪ Available in nearly ‘real time’
− Content: topics discussed
▪ 50% is ‘pointless babble’ (noise), but some messages are relevant for official statistics
▪ Selecting the relevant part (removing noise) is important
− Producers: not much information is directly available
▪ But much can be derived
Social media in the Netherlands
Map by Eric Fischer (via Fast Company)
Examples of social media studies at CBS/CBDS
− Content
1. Sentiment in social media
▪ How does the average sentiment in social media develop over time?
2. Feelings of social tension
▪ Can social media be used to measure specific feelings in (the online) society?
3. Propensity to move (‘Wish to move’)
▪ Can we identify messages from people who wish to move to another house?
− Population
4. Characterizing users
▪ Derive characteristics / discern subpopulations
1. Social media sentiment
1. Social media sentiment (2)
− Facebook and Twitter messages both contribute
− Daily data is highly volatile
− Monthly aggregates correlate well with consumer confidence (correlation > 0.9)
− Including the sentiment series improves the accuracy of the consumer confidence series (survey data)
− Product:
▪ Averaged monthly or smoothed weekly online Dutch sentiment could be a new indicator
▪ Can also be produced for large Dutch cities
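The aggregation and correlation step described above can be sketched as follows. This is a minimal illustration, not the CBS implementation: all numbers are made up, and the function names `monthly_means` and `pearson` are assumptions introduced here.

```python
from collections import defaultdict
from datetime import date
from math import sqrt
from statistics import mean


def monthly_means(daily):
    """Average daily values per (year, month); `daily` maps date -> value."""
    buckets = defaultdict(list)
    for day, value in daily.items():
        buckets[(day.year, day.month)].append(value)
    return {month: mean(values) for month, values in sorted(buckets.items())}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Made-up daily sentiment scores (illustration only, not CBS data).
daily_sentiment = {
    date(2014, 1, 5): 0.10, date(2014, 1, 20): 0.30,
    date(2014, 2, 3): 0.25, date(2014, 2, 17): 0.35,
    date(2014, 3, 9): 0.05, date(2014, 3, 23): 0.15,
}
# Made-up monthly consumer confidence figures.
consumer_confidence = {(2014, 1): -10.0, (2014, 2): -5.0, (2014, 3): -15.0}

monthly = monthly_means(daily_sentiment)
months = sorted(consumer_confidence)
r = pearson([monthly[m] for m in months],
            [consumer_confidence[m] for m in months])
print(f"correlation between monthly sentiment and confidence: {r:.2f}")
```

Averaging per month first is what tames the daily volatility; the correlation is then computed on the monthly series only.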
2. Social tension indicator
Available at: http://research.cbs.nl/socialtension/en/
[Figure: percentage of messages indicating social tension]
2. Social tension indicator (2)
− Currently based on Twitter messages alone
▪ Other platforms can be added
− Messages are selected if they contain specific keywords
▪ The keywords were originally derived from the safety monitor questionnaire
▪ Detected events were used as feedback
− Peaks indicate points in time at which an increasing number of social-tension-related messages is produced
▪ Peaks usually don’t last long
▪ Sometimes a shift in the baseline is observed (e.g. MH17)
− Product: can be produced on a daily basis
▪ This is what ‘real-time’ statistics will look like
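A keyword-based daily indicator of this kind could look roughly like the sketch below. The keyword list is hypothetical (the slides do not give the actual list), and `tension_share` is a name introduced here for illustration.

```python
import re
from collections import Counter
from datetime import date

# Hypothetical keywords; the real list was derived from the safety monitor
# questionnaire and is not given in the slides.
KEYWORDS = {"onrust", "rellen", "bedreiging"}


def tension_share(messages):
    """Return {day: percentage of that day's messages containing a keyword}.

    `messages` is an iterable of (day, text) pairs.
    """
    totals, hits = Counter(), Counter()
    for day, text in messages:
        totals[day] += 1
        words = set(re.findall(r"\w+", text.lower()))
        if words & KEYWORDS:
            hits[day] += 1
    return {day: 100.0 * hits[day] / totals[day] for day in sorted(totals)}


# Made-up messages for two days.
sample = [
    (date(2014, 7, 16), "Lekker weer vandaag"),
    (date(2014, 7, 16), "Rellen in de stad!"),
    (date(2014, 7, 16), "Koffie tijd"),
    (date(2014, 7, 16), "Op naar het strand"),
    (date(2014, 7, 17), "Veel onrust na het nieuws"),
    (date(2014, 7, 17), "Trein heeft vertraging"),
]
shares = tension_share(sample)
```

Expressing the result as a share of each day's messages, rather than a raw count, keeps the series comparable when the total message volume changes.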
3. ‘Wish to move’
− Current topic of research
▪ Social media contains messages that indicate people’s ‘wish’ to move to another house (on all platforms)
▪ Select messages containing ‘verhuiz*’ or ‘verhuis*’
▪ Created a model to identify messages from people who wish to move (accuracy 0.85 ± 0.02)
▪ Relate social media findings to findings derived from survey/admin data
▪ Study time series to determine at what frequency such an indicator could best be produced
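The wildcard selection on ‘verhuiz*’ or ‘verhuis*’ can be expressed as a regular expression. The sketch below covers only this preselection step, under the assumption that the wildcard means "any word starting with that stem"; the classification model mentioned above is not reproduced, and `mentions_moving` is a name introduced here.

```python
import re

# 'verhuiz*' or 'verhuis*': any word starting with "verhuiz" or "verhuis".
MOVE_PATTERN = re.compile(r"\bverhui[zs]\w*", re.IGNORECASE)


def mentions_moving(text):
    """True if the text contains a word starting with 'verhuiz' or 'verhuis'."""
    return MOVE_PATTERN.search(text) is not None


# Made-up example messages.
examples = [
    "Ik wil verhuizen naar Utrecht",
    "Eindelijk verhuisd!",
    "Mooi weer vandaag",
]
selected = [text for text in examples if mentions_moving(text)]
```

A match only says that moving is mentioned; whether the author actually wishes to move is what the subsequent model has to decide.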
4. Characterizing social media users
− Social media contains multiple populations
− Identifying Dutch users
• 3 approaches:
▪ From the metadata/paradata available (language setting, location)
▪ From network structure (following); essential when hardly any user info is present
▪ From texts: discern Dutch and Flemish ‘tweets’
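As an illustration of the first approach only, a metadata rule might combine the language setting with the stated location. Everything in this sketch is an assumption: the profile fields `lang` and `location`, the list of place hints, and the function name `likely_dutch` are hypothetical and not taken from the slides.

```python
# Hypothetical place hints; a real rule would need a proper gazetteer.
DUTCH_PLACE_HINTS = ("nederland", "netherlands", "amsterdam", "rotterdam", "utrecht")


def likely_dutch(profile):
    """Rough metadata rule: Dutch language setting plus a Dutch-looking location.

    `profile` is a dict with optional (hypothetical) keys "lang" and "location".
    """
    lang = (profile.get("lang") or "").lower()
    location = (profile.get("location") or "").lower()
    return lang == "nl" and any(hint in location for hint in DUTCH_PLACE_HINTS)


# Made-up profiles.
profiles = [
    {"lang": "nl", "location": "Utrecht, Nederland"},
    {"lang": "nl", "location": "Antwerpen"},
    {"lang": "en", "location": "Amsterdam"},
    {"lang": "nl"},
]
flags = [likely_dutch(p) for p in profiles]
```

The language setting alone cannot separate Dutch from Flemish users, and profiles without a location fall through, which is where the network and text approaches come in.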
4. Characterizing social media users (2)
− Social media contains multiple populations
− Discerning between accounts of people and companies
▪ 2-step approach
Human (private users) | Non-human (corporate users)
Private persons (77%) | Self-employed (9%) | ‘Non-profit groups’ (11%) | Companies (3%)
4. Characterizing social media users (3)
− Social media contains multiple populations
− Identifying background characteristics
▪ Challenging topic:
• Gender (M/F) could be identified with 96% accuracy
- Combining the user’s short bio, first name, tweet content and picture
- 50% male, 33% female, 17% ‘others’
• Other characteristics are possible (future research)
Conclusion
− Social media is an interesting data source for official statistics
− To enable this, two steps are essential:
– Noise reduction
• By aggregating lots of data
• By removing ‘off-topic’ messages
– Correcting for differences between ‘on-line’ and ‘real-world’ populations
• By removing non-target population users
• By applying a model (work in progress)
Useful by Piet Daas
