SlideShare a Scribd company logo
2
Sheng-Wei (Kuan-Ta) Chen
Institute of Information Science
Academia Sinica
Computational Social Science
The Collaborative Futures of Big Data, Computer
Science, and Social Sciences
3
PEOPLE
4
The Favorite Major for US
College Athletes
(Source: USAToday, http://usatoday30.usatoday.com/sports/college/2008-11-18-majors-graphic_N.htm)
5
Social Science
6
Social Life is Hard to See
We can interview friends, but we cannot
interview a friendship
Fleeting interaction
In private
Tedious to record over time,
especially in large groups
7
Bigger Problems
Social phenomena involve many individuals
interacting to produce collective entities
firms, markets, cultures, political parties, social movements,
audiences
“Micro-Macro” problem (aka “Emergence”)
Micro-macro problems are hard to study
empirically
Difficult to collect observational data about individuals,
networks, and populations at same time
Even more difficult to do “macro” scale experiments
8
1890 US Census
1st time Hollerith machines were used to
tabulate US Census data
(population: 62,947,714)
9
The Era of Big Data
Past: Government data, national survey data
Today: A variety of new data sources
Economic data: trade, finance, e-cash / e-wallet, ...
GIS data: satellite, GPS loggers, laser scanning cars, …
Sensor data: video surveillance, smart phones, wearable
devices, mobile apps, beacons, …
10
New kinds of data
11
12
陳昇瑋 / 資料科學的第一堂課
Computer vision for healthcare
Video magnification
(Slide Credit: Jia-Bin Huang)
14
15
New kinds of data
16
Engagement and Exploration
Standing face-to-face?
Physical distance
Hand gesture, posture
Conversation patterns
Frequency of interruptions
18
Web as a Record of Social Interaction
Public web pages / discussions
Twitter, Facebook, blogs, news groups, wikis, MMOGs,
Instagram, LastFM, Flickr, Spotify
Private email, Whatsapp, LINE, Slack
Text, images, sounds: speeches, commercials
19
New kinds of data
20
Computational
Social Science
The science that investigates social phenomena
through the medium of computing and
statistical data processing.
23
Computational Social Science
An instrument-enabled scientific discipline
microbiology
microscope
radio astronomy
radar
nanoscience
electron microscope
24
25
26
27
Technical Challenges
Computational infrastructures for dealing with
More data: analyzing large amounts of data
Fuzzy data: cleaning up inprecise and noisy data
New kinds of data: processing real-time sensor streams and
web data
Need for new substantive ideas
Need for new statistical methods (WHY in
addition to WHAT and HOW)
28
3 Common Approaches
Macroscope Virtual Lab
Empirical
Modeling
29
APPROACHES
#1 MACROSCOPE
#2 VIRTUAL LAB
#3 EMPIRICAL MODELING
31
WE ARE WHAT WE SAY
Linguistics
Schwartz, H. Andrew, et al. "Personality, gender, and age in the language of social media:The
open-vocabulary approach." PloS one 8.9 (2013): e73791.
Macroscope
32
Dataset
700 million words, phrases, and topic instances
collected from 75,000 volunteers’ FB posts
Record users’ personality (5-factor), gender and
age
33
What Words Do You Use?
male
female
34
How Old Are You? (#1)
13 - 18
19 - 22
35
How Old Are You? (#2)
23 - 29
30 - 65
36
Personality Traits
Extraversion
Introversion
38
Topics Across 4 Age-groups
39
Warm and Negative Words
40
Usage of “I” & “We”
Huge-volume data + simple analysis 
crystal clear language use patterns
42
APPROACHES
#1 MACROSCOPE
#2 VIRTUAL LAB
#3 EMPIRICAL MODELING
43
Scaling up the Lab
Social science experimental heavily constrained
by scale and speed
Unit of analysis was individuals or small groups
Experiments took months to design and run
Potentially “virtual labs” lift both constraints
State of the art ~ 5000 workers, but in principle
could construct subject panel ~ 100K – 1M
Could shrink hypothesis-testing cycle to days or
hours
44
MOOD CONTAGION (& MANIPULATION)
ON FACEBOOK
Social Psychology
Kramer, Adam DI, Jamie E. Guillory, and Jeffrey T. Hancock. "Experimental evidence of massive-scale
emotional contagion through social networks.” Proceedings of the National Academy of Sciences111.24 (2014):
8788-8790.
Virtual Lab
45
Facebook Mood Contagion
0.7 million (~ 0.04%) users on Facebook
3 million posts manipulated in one week
Hide some “positive” or “negative” emotional
posts from users (in the experimental group)
46
Observations
Negative posts hidden
People who see more positive posts, tend
to post more positively, and vice versa.
Facebook users’ emotion can be easily
manipulated by changing ALGORITHMS
Positive posts hidden
47
Ethical Issues (!)
Unethical experiment because it’s conducted
without users’ consent
Serious invasion of users’ perceptions about
their friend circles (and the society)
Well, Facebook's data use policy states that users'
information will be used "for internal operations, including
troubleshooting, data analysis, testing, research and
service improvement," meaning that any user can become
a lab rat.
48
FACEBOOK “I VOTED” BUTTON
Social Psychology & Politics
Bond, Robert M., et al. "A 61-million-person experiment in social influence and political
mobilization." Nature 489.7415 (2012): 295-298.
Virtual Lab
49
“I Voted” Button
Direct messages to 61 million users on FB
Informational: 1% users received
Social: 98% users received
Control group: 1% (no message received)
Informational
Social
51
Effect of Manipulation
Ratio of friends voted
Prob. of oneself claimed voted
53
2% more likely to click “I voted” button and 0.3%
more likely to seek information about a polling place,
and 0.4% more likely to head to the polls.
54
Real-world Consequence (!)
In total there were about 60,000 votes of
turnout, and estimated 280,000 indirect
turnout (out of 61 million users)
What if Facebook did not randomize the
control/experimental groups?
62
APPROACHES
#1 MACROSCOPE
#2 VIRTUAL LAB
#3 EMPIRICAL MODELING
63
Empirical Modeling
Traditional mathematical or computational
modeling
Tends to rely on many, often unrealistic, assumptions
Not generally tested in detail against data
Result is proliferation of models that exist in parallel and are
often incompatible with each other
New sources/scales of data allow both to
learn/test models and also calibrate them
Observations  Models  Lab  Field  Observations
65
Google Flu Trends
Nature 457, 1012-1014 (2009)
66
PREDICTION OF COUNTY-LEVEL
HEART DISEASE MORTALITY
Medicine and Linguistics
Empirical Modeling
Eichstaedt, Johannes C., et al. "Psychological language on twitter predicts county-level heart disease
mortality." Psychological science 26.2 (2015): 159-169.
68
Datsets
Heart disease
Arteriosclerotic heart disease
mortality rates during 2009 -- 2010
Predictors
826 million tweets collected between June 2009 and March
2010
Socioeconomic (income and education)
Demographic (percentages of Black, Hispanic, married, and
female residents)
Health status (diabetes, obesity, smoking, and hypertension)
69
Prediction Accuracy
70
71
74
Language Use in Tweets
75
Social media opens up a new window of
what humans actually feel and think
76
YOU ARE WHAT YOU LIKE
Social Psychology
Empirical Modeling
Kosinski, Michal, David Stillwell, andThore Graepel. "Private traits and attributes are predictable from digital
records of human behavior." Proceedings of the National Academy of Sciences 110.15 (2013): 5802-5805.
78
Personality Prediction
Personality traits
Gender, age, relationship status, # friends
Sexual orientation, ethnicity, religion, political
inclination
Addictive substances (alcohol, drugs, cigarette),
parental separation
IQ, 5-Factor model, satisfaction with Life
79
Data Collection
9,939,220 Likes (55,814 unique ones) from
58,466 Facebook volunteers
Sports
Music
Books
Restaurants
Popular websites
80
Ground truth
Political Inclination
Sexual Orientation
Democrat Republican
Democratic GOP (Grand Old Party)
Democratic Party Republican Party
Homosexual Heterosexual
1 / 0 1 / 0
82
Ground truth
5-Factor Model
Openness
Conscientiousness
Extraversion
Agreeableness
Stability
83
Ground truth
Satisfaction with Life (SWL)
85
Methodology
User-Like matrix dimension reduction: Singular Value
Decomposition (SVD)
Prediction models: Logistic Regression & Linear Regression
87
Prediction Results
Solid: Pearson corr. coef. between pred. & actual values
Transparent: baseline acc. of the questionnaire, in terms of test-
retest reliability
89
Discriminative Likes (#1)
90
Discriminative Likes (#2)
91
Discriminative Likes (#3)
176
CONCLUSION & OUTLOOK
177
WE ARE STILL AT
THE VERY START
178
LOTS of Big Questions
The polarization of global economic inequality
What explains the success of social movements?
The emergence of pro-sociality behavior
The causality of video gaming and propensity of
violence?
The politics of censorship
The causality of social selection and social
influence?
…
179
The Data Divide
Social scientists have good questions but…
IT tools are not part of their toolkits
Not clear that we will/should make the investment
Computer scientists have powerful methods
but…
Trained to resolve technical problems
It seems there are less “methodological”
contributions
180
The Challenges
Education and habits of social and computer
scientists
Different ways of thinking
Different methodologies
Differences in framing questions and defining contributions
Data access and fragmentation issue
Data privacy issue
Ethics issue
Organizational issue
182
Institutional Innovations
New platforms and protocols for data
management
Better coordination of data collection, storage, sharing
Recruitment and management of subject pools, field panels
Integrated research designs
Coordination across theoretical, experimental and
observational studies
Collaborative interdisciplinary teams
For a given data set, often unclear what the most interesting
question is
For a given question, often unclear how to collect the right
data
184
Techniques to collect,
manage, and process large
datasets
Knowledge about social
theories, methods, and
issues
Computational
Social Science
185
Sheng-Wei Chen
Academia Sinica

More Related Content

What's hot

Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Lauri Eloranta
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies
dnac
 
Opinion Dynamics on Networks
Opinion Dynamics on NetworksOpinion Dynamics on Networks
Opinion Dynamics on Networks
Mason Porter
 
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
Micah Altman
 
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
dnac
 
Social Network Analysis: applications for education research
Social Network Analysis: applications for education researchSocial Network Analysis: applications for education research
Social Network Analysis: applications for education research
Christian Bokhove
 
Scientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics PerspectiveScientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics Perspective
Micah Altman
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
dnac
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data Analysis
Mason Porter
 
05 Communities in Network
05 Communities in Network05 Communities in Network
05 Communities in Network
dnac
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
The Higher Education Academy
 
Using Data Integration Models for Understanding Complex Social Systems
Using Data Integration Modelsfor Understanding Complex Social SystemsUsing Data Integration Modelsfor Understanding Complex Social Systems
Using Data Integration Models for Understanding Complex Social Systems
Bruce Edmonds
 
Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)
Mason Porter
 
04 Network Data Collection
04 Network Data Collection04 Network Data Collection
04 Network Data Collection
Duke Network Analysis Center
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...
Jeromy Anglim
 
Who creates trends in online social media
Who creates trends in online social mediaWho creates trends in online social media
Who creates trends in online social media
Amir Razmjou
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)
Vala Ali Rohani
 
Data Ethics for Mathematicians
Data Ethics for MathematiciansData Ethics for Mathematicians
Data Ethics for Mathematicians
Mason Porter
 
04 Diffusion and Peer Influence
04 Diffusion and Peer Influence04 Diffusion and Peer Influence
04 Diffusion and Peer Influence
dnac
 
Mathematics and Social Networks
Mathematics and Social NetworksMathematics and Social Networks
Mathematics and Social Networks
Mason Porter
 

What's hot (20)

Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies
 
Opinion Dynamics on Networks
Opinion Dynamics on NetworksOpinion Dynamics on Networks
Opinion Dynamics on Networks
 
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
 
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
 
Social Network Analysis: applications for education research
Social Network Analysis: applications for education researchSocial Network Analysis: applications for education research
Social Network Analysis: applications for education research
 
Scientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics PerspectiveScientific Reproducibility from an Informatics Perspective
Scientific Reproducibility from an Informatics Perspective
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data Analysis
 
05 Communities in Network
05 Communities in Network05 Communities in Network
05 Communities in Network
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
Using Data Integration Models for Understanding Complex Social Systems
Using Data Integration Modelsfor Understanding Complex Social SystemsUsing Data Integration Modelsfor Understanding Complex Social Systems
Using Data Integration Models for Understanding Complex Social Systems
 
Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)
 
04 Network Data Collection
04 Network Data Collection04 Network Data Collection
04 Network Data Collection
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...
 
Who creates trends in online social media
Who creates trends in online social mediaWho creates trends in online social media
Who creates trends in online social media
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)
 
Data Ethics for Mathematicians
Data Ethics for MathematiciansData Ethics for Mathematicians
Data Ethics for Mathematicians
 
04 Diffusion and Peer Influence
04 Diffusion and Peer Influence04 Diffusion and Peer Influence
04 Diffusion and Peer Influence
 
Mathematics and Social Networks
Mathematics and Social NetworksMathematics and Social Networks
Mathematics and Social Networks
 

Viewers also liked

An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...
Academia Sinica
 
Quantifying User Satisfaction in Mobile Cloud Games
Quantifying User Satisfaction in Mobile Cloud GamesQuantifying User Satisfaction in Mobile Cloud Games
Quantifying User Satisfaction in Mobile Cloud Games
Academia Sinica
 
Games on Demand: Are We There Yet?
Games on Demand: Are We There Yet?Games on Demand: Are We There Yet?
Games on Demand: Are We There Yet?
Academia Sinica
 
Cloud Gaming Onward: Research Opportunities and Outlook
Cloud Gaming Onward: Research Opportunities and OutlookCloud Gaming Onward: Research Opportunities and Outlook
Cloud Gaming Onward: Research Opportunities and Outlook
Academia Sinica
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Lauri Eloranta
 
Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...
Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...
Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...
Sheng-Wei (Kuan-Ta) Chen
 
網路購書大數據– 給出版者的洞察分析
網路購書大數據– 給出版者的洞察分析網路購書大數據– 給出版者的洞察分析
網路購書大數據– 給出版者的洞察分析
Sheng-Wei (Kuan-Ta) Chen
 

Viewers also liked (7)

An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google ...
 
Quantifying User Satisfaction in Mobile Cloud Games
Quantifying User Satisfaction in Mobile Cloud GamesQuantifying User Satisfaction in Mobile Cloud Games
Quantifying User Satisfaction in Mobile Cloud Games
 
Games on Demand: Are We There Yet?
Games on Demand: Are We There Yet?Games on Demand: Are We There Yet?
Games on Demand: Are We There Yet?
 
Cloud Gaming Onward: Research Opportunities and Outlook
Cloud Gaming Onward: Research Opportunities and OutlookCloud Gaming Onward: Research Opportunities and Outlook
Cloud Gaming Onward: Research Opportunities and Outlook
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 
Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...
Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...
Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your ...
 
網路購書大數據– 給出版者的洞察分析
網路購書大數據– 給出版者的洞察分析網路購書大數據– 給出版者的洞察分析
網路購書大數據– 給出版者的洞察分析
 

Similar to Computational Social Science:The Collaborative Futures of Big Data, Computer Science, and Social Sciences

計算社會科學初探- 當電腦科學家遇上社會科學
計算社會科學初探-當電腦科學家遇上社會科學計算社會科學初探-當電腦科學家遇上社會科學
計算社會科學初探- 當電腦科學家遇上社會科學
Sheng-Wei (Kuan-Ta) Chen
 
1112 social media and public health
1112 social media and public health1112 social media and public health
1112 social media and public health
Mélodie YunJu Song
 
Data Science definition
Data Science definitionData Science definition
Data Science definition
CarloLauro1
 
Let's talk about Data Science
Let's talk about Data ScienceLet's talk about Data Science
Let's talk about Data Science
Carlo Lauro
 
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & SchroederOII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
Eric Meyer
 
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
CASI, Arizona State University
 
Data Responsibly: The next decade of data science
Data Responsibly: The next decade of data scienceData Responsibly: The next decade of data science
Data Responsibly: The next decade of data science
University of Washington
 
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Artificial Intelligence Institute at UofSC
 
Weller pleasures+perils social media
Weller pleasures+perils social mediaWeller pleasures+perils social media
Weller pleasures+perils social media
Katrin Weller
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
Symeon Papadopoulos
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
Eleftherios Spyromitros-Xioufis
 
Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression ModelDepression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Model
ijtsrd
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Darlene Cavalier
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
suresh sood
 
NYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social ScienceNYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social Science
jakehofman
 
An introduction to machine learning in biomedical research: Key concepts, pr...
An introduction to machine learning in biomedical research:  Key concepts, pr...An introduction to machine learning in biomedical research:  Key concepts, pr...
An introduction to machine learning in biomedical research: Key concepts, pr...
FranciscoJAzuajeG
 
Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017
Jesus Enrique Aldana S.
 
Smart Data - How you and I will exploit Big Data for personalized digital hea...
Smart Data - How you and I will exploit Big Data for personalized digital hea...Smart Data - How you and I will exploit Big Data for personalized digital hea...
Smart Data - How you and I will exploit Big Data for personalized digital hea...
Amit Sheth
 
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi VatrapuSocial Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
CBS Competitiveness Platform
 
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
Amit Sheth
 

Similar to Computational Social Science:The Collaborative Futures of Big Data, Computer Science, and Social Sciences (20)

計算社會科學初探- 當電腦科學家遇上社會科學
計算社會科學初探-當電腦科學家遇上社會科學計算社會科學初探-當電腦科學家遇上社會科學
計算社會科學初探- 當電腦科學家遇上社會科學
 
1112 social media and public health
1112 social media and public health1112 social media and public health
1112 social media and public health
 
Data Science definition
Data Science definitionData Science definition
Data Science definition
 
Let's talk about Data Science
Let's talk about Data ScienceLet's talk about Data Science
Let's talk about Data Science
 
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & SchroederOII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
 
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
 
Data Responsibly: The next decade of data science
Data Responsibly: The next decade of data scienceData Responsibly: The next decade of data science
Data Responsibly: The next decade of data science
 
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
 
Weller pleasures+perils social media
Weller pleasures+perils social mediaWeller pleasures+perils social media
Weller pleasures+perils social media
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression ModelDepression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Model
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
NYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social ScienceNYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social Science
 
An introduction to machine learning in biomedical research: Key concepts, pr...
An introduction to machine learning in biomedical research:  Key concepts, pr...An introduction to machine learning in biomedical research:  Key concepts, pr...
An introduction to machine learning in biomedical research: Key concepts, pr...
 
Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017
 
Smart Data - How you and I will exploit Big Data for personalized digital hea...
Smart Data - How you and I will exploit Big Data for personalized digital hea...Smart Data - How you and I will exploit Big Data for personalized digital hea...
Smart Data - How you and I will exploit Big Data for personalized digital hea...
 
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi VatrapuSocial Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
 
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
 

More from Academia Sinica

Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...
Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...
Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...
Academia Sinica
 
量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值
量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值
量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值Academia Sinica
 
On The Battle between Online Gamers and Lags
On The Battle between Online Gamers and LagsOn The Battle between Online Gamers and Lags
On The Battle between Online Gamers and Lags
Academia Sinica
 
Understanding The Performance of Thin-Client Gaming
Understanding The Performance of Thin-Client GamingUnderstanding The Performance of Thin-Client Gaming
Understanding The Performance of Thin-Client Gaming
Academia Sinica
 
Quantifying QoS Requirements of Network Services: A Cheat-Proof Framework
Quantifying QoS Requirements of Network Services: A Cheat-Proof FrameworkQuantifying QoS Requirements of Network Services: A Cheat-Proof Framework
Quantifying QoS Requirements of Network Services: A Cheat-Proof Framework
Academia Sinica
 
Online Game QoE Evaluation using Paired Comparisons
Online Game QoE Evaluation using Paired ComparisonsOnline Game QoE Evaluation using Paired Comparisons
Online Game QoE Evaluation using Paired Comparisons
Academia Sinica
 
GamingAnywhere: An Open Cloud Gaming System
GamingAnywhere: An Open Cloud Gaming SystemGamingAnywhere: An Open Cloud Gaming System
GamingAnywhere: An Open Cloud Gaming System
Academia Sinica
 
Are All Games Equally Cloud-Gaming-Friendly? An Electromyographic Approach
Are All Games Equally Cloud-Gaming-Friendly? An Electromyographic ApproachAre All Games Equally Cloud-Gaming-Friendly? An Electromyographic Approach
Are All Games Equally Cloud-Gaming-Friendly? An Electromyographic Approach
Academia Sinica
 
Forecasting Online Game Addictiveness
Forecasting Online Game AddictivenessForecasting Online Game Addictiveness
Forecasting Online Game Addictiveness
Academia Sinica
 
Identifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis ApproachIdentifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis Approach
Academia Sinica
 
Toward an Understanding of the Processing Delay of Peer-to-Peer Relay Nodes
Toward an Understanding of the Processing Delay of Peer-to-Peer Relay NodesToward an Understanding of the Processing Delay of Peer-to-Peer Relay Nodes
Toward an Understanding of the Processing Delay of Peer-to-Peer Relay Nodes
Academia Sinica
 
Inferring Speech Activity from Encrypted Skype Traffic
Inferring Speech Activity from Encrypted Skype TrafficInferring Speech Activity from Encrypted Skype Traffic
Inferring Speech Activity from Encrypted Skype Traffic
Academia Sinica
 
Game Bot Detection Based on Avatar Trajectory
Game Bot Detection Based on Avatar TrajectoryGame Bot Detection Based on Avatar Trajectory
Game Bot Detection Based on Avatar Trajectory
Academia Sinica
 
Improving Reliability of Web 2.0-based Rating Systems Using Per-user Trustiness
Improving Reliability of Web 2.0-based Rating Systems Using Per-user TrustinessImproving Reliability of Web 2.0-based Rating Systems Using Per-user Trustiness
Improving Reliability of Web 2.0-based Rating Systems Using Per-user TrustinessAcademia Sinica
 
A Collusion-Resistant Automation Scheme for Social Moderation Systems
A Collusion-Resistant Automation Scheme for Social Moderation SystemsA Collusion-Resistant Automation Scheme for Social Moderation Systems
A Collusion-Resistant Automation Scheme for Social Moderation Systems
Academia Sinica
 
Tuning Skype’s Redundancy Control Algorithm for User Satisfaction
Tuning Skype’s Redundancy Control Algorithm for User SatisfactionTuning Skype’s Redundancy Control Algorithm for User Satisfaction
Tuning Skype’s Redundancy Control Algorithm for User Satisfaction
Academia Sinica
 
Network Game Design: Hints and Implications of Player Interaction
Network Game Design: Hints and Implications of Player InteractionNetwork Game Design: Hints and Implications of Player Interaction
Network Game Design: Hints and Implications of Player Interaction
Academia Sinica
 
Mitigating Active Attacks Towards Client Networks Using the Bitmap Filter
Mitigating Active Attacks Towards Client Networks Using the Bitmap FilterMitigating Active Attacks Towards Client Networks Using the Bitmap Filter
Mitigating Active Attacks Towards Client Networks Using the Bitmap Filter
Academia Sinica
 
An Analysis of WoW Players’ Game Hours
An Analysis of WoW Players’ Game HoursAn Analysis of WoW Players’ Game Hours
An Analysis of WoW Players’ Game Hours
Academia Sinica
 
Game Traffic Analysis: An MMORPG Perspective
Game Traffic Analysis: An MMORPG PerspectiveGame Traffic Analysis: An MMORPG Perspective
Game Traffic Analysis: An MMORPG Perspective
Academia Sinica
 

More from Academia Sinica (20)

Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...
Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...
Detecting In-Situ Identity Fraud on Social Network Services: A Case Study on ...
 
量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值
量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值
量化「樂趣」-以心理生理量測探究數位娛樂商品之市場價值
 
On The Battle between Online Gamers and Lags
On The Battle between Online Gamers and LagsOn The Battle between Online Gamers and Lags
On The Battle between Online Gamers and Lags
 
Understanding The Performance of Thin-Client Gaming
Understanding The Performance of Thin-Client GamingUnderstanding The Performance of Thin-Client Gaming
Understanding The Performance of Thin-Client Gaming
 
Quantifying QoS Requirements of Network Services: A Cheat-Proof Framework
Quantifying QoS Requirements of Network Services: A Cheat-Proof FrameworkQuantifying QoS Requirements of Network Services: A Cheat-Proof Framework
Quantifying QoS Requirements of Network Services: A Cheat-Proof Framework
 
Online Game QoE Evaluation using Paired Comparisons
Online Game QoE Evaluation using Paired ComparisonsOnline Game QoE Evaluation using Paired Comparisons
Online Game QoE Evaluation using Paired Comparisons
 
GamingAnywhere: An Open Cloud Gaming System
GamingAnywhere: An Open Cloud Gaming SystemGamingAnywhere: An Open Cloud Gaming System
GamingAnywhere: An Open Cloud Gaming System
 
Are All Games Equally Cloud-Gaming-Friendly? An Electromyographic Approach
Are All Games Equally Cloud-Gaming-Friendly? An Electromyographic ApproachAre All Games Equally Cloud-Gaming-Friendly? An Electromyographic Approach
Are All Games Equally Cloud-Gaming-Friendly? An Electromyographic Approach
 
Forecasting Online Game Addictiveness
Forecasting Online Game AddictivenessForecasting Online Game Addictiveness
Forecasting Online Game Addictiveness
 
Identifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis ApproachIdentifying MMORPG Bots: A Traffic Analysis Approach
Identifying MMORPG Bots: A Traffic Analysis Approach
 
Toward an Understanding of the Processing Delay of Peer-to-Peer Relay Nodes
Toward an Understanding of the Processing Delay of Peer-to-Peer Relay NodesToward an Understanding of the Processing Delay of Peer-to-Peer Relay Nodes
Toward an Understanding of the Processing Delay of Peer-to-Peer Relay Nodes
 
Inferring Speech Activity from Encrypted Skype Traffic
Inferring Speech Activity from Encrypted Skype TrafficInferring Speech Activity from Encrypted Skype Traffic
Inferring Speech Activity from Encrypted Skype Traffic
 
Game Bot Detection Based on Avatar Trajectory
Game Bot Detection Based on Avatar TrajectoryGame Bot Detection Based on Avatar Trajectory
Game Bot Detection Based on Avatar Trajectory
 
Improving Reliability of Web 2.0-based Rating Systems Using Per-user Trustiness
Improving Reliability of Web 2.0-based Rating Systems Using Per-user TrustinessImproving Reliability of Web 2.0-based Rating Systems Using Per-user Trustiness
Improving Reliability of Web 2.0-based Rating Systems Using Per-user Trustiness
 
A Collusion-Resistant Automation Scheme for Social Moderation Systems
A Collusion-Resistant Automation Scheme for Social Moderation SystemsA Collusion-Resistant Automation Scheme for Social Moderation Systems
A Collusion-Resistant Automation Scheme for Social Moderation Systems
 
Tuning Skype’s Redundancy Control Algorithm for User Satisfaction
Tuning Skype’s Redundancy Control Algorithm for User SatisfactionTuning Skype’s Redundancy Control Algorithm for User Satisfaction
Tuning Skype’s Redundancy Control Algorithm for User Satisfaction
 
Network Game Design: Hints and Implications of Player Interaction
Network Game Design: Hints and Implications of Player InteractionNetwork Game Design: Hints and Implications of Player Interaction
Network Game Design: Hints and Implications of Player Interaction
 
Mitigating Active Attacks Towards Client Networks Using the Bitmap Filter
Mitigating Active Attacks Towards Client Networks Using the Bitmap FilterMitigating Active Attacks Towards Client Networks Using the Bitmap Filter
Mitigating Active Attacks Towards Client Networks Using the Bitmap Filter
 
An Analysis of WoW Players’ Game Hours
An Analysis of WoW Players’ Game HoursAn Analysis of WoW Players’ Game Hours
An Analysis of WoW Players’ Game Hours
 
Game Traffic Analysis: An MMORPG Perspective
Game Traffic Analysis: An MMORPG PerspectiveGame Traffic Analysis: An MMORPG Perspective
Game Traffic Analysis: An MMORPG Perspective
 

Recently uploaded

Cell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docxCell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docx
vasanthatpuram
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
Alireza Kamrani
 
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
1tyxnjpia
 
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
z6osjkqvd
 
Data Scientist Machine Learning Profiles .pdf
Data Scientist Machine Learning  Profiles .pdfData Scientist Machine Learning  Profiles .pdf
Data Scientist Machine Learning Profiles .pdf
Vineet
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
asyed10
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
actyx
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
blueshagoo1
 
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
osoyvvf
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
Drownings spike from May to August in children
Drownings spike from May to August in childrenDrownings spike from May to August in children
Drownings spike from May to August in children
Bisnar Chase Personal Injury Attorneys
 
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptxREUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
KiriakiENikolaidou
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理
keesa2
 
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative ClassifiersML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
MastanaihnaiduYasam
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
bmucuha
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
Vietnam Cotton & Spinning Association
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 

Recently uploaded (20)

Cell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docxCell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docx
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
 
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
 
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
 
Data Scientist Machine Learning Profiles .pdf
Data Scientist Machine Learning  Profiles .pdfData Scientist Machine Learning  Profiles .pdf
Data Scientist Machine Learning Profiles .pdf
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
 
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
Drownings spike from May to August in children
Drownings spike from May to August in childrenDrownings spike from May to August in children
Drownings spike from May to August in children
 
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptxREUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理
 
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative ClassifiersML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 

Computational Social Science:The Collaborative Futures of Big Data, Computer Science, and Social Sciences

  • 1. 2 Sheng-Wei (Kuan-Ta) Chen Institute of Information Science Academia Sinica Computational Social Science The Collaborative Futures of Big Data, Computer Science, and Social Sciences
  • 3. 4 The Favorite Major for US College Athletes (Source: USAToday, http://usatoday30.usatoday.com/sports/college/2008-11-18-majors-graphic_N.htm)
  • 5. 6 Social Life is Hard to See We can interview friends, but we cannot interview a friendship Fleeting interaction In private Tedious to record over time, especially in large groups
  • 6. 7 Bigger Problems Social phenomena involve many individuals interacting to produce collective entities firms, markets, cultures, political parties, social movements, audiences “Micro-Macro” problem (aka “Emergence”) Micro-macro problems are hard to study empirically Difficult to collect observational data about individuals, networks, and populations at same time Even more difficult to do “macro” scale experiments
  • 7. 8 1890 US Census 1st time Hollerith machines were used to tabulate US Census data (population: 62,947,714)
  • 8. 9 The Era of Big Data Past: Government data, national survey data Today: A variety of new data sources Economic data: trade, finance, e-cash / e-wallet, ... GIS data: satellite, GPS loggers, laser scanning cars, … Sensor data: video surveillance, smart phones, wearable devices, mobile apps, beacons, …
  • 10. 11
  • 11. 12
  • 12. 陳昇瑋 / 資料科學的第一堂課 Computer vision for healthcare Video magnification (Slide Credit: Jia-Bin Huang)
  • 13. 14
  • 15. 16 Engagement and Exploration Standing face-to-face? Physical distance Hand gesture, posture Conversation patterns Frequency of interruptions
  • 16. 18 Web as a Record of Social Interaction Public web pages / discussions Twitter, Facebook, blogs, news groups, wikis, MMOGs, Instagram, LastFM, Flickr, Spotify Private email, Whatsapp, LINE, Slack Text, images, sounds: speeches, commercials
  • 18. 20 Computational Social Science The science that investigates social phenomena through the medium of computing and statistical data processing.
  • 19. 23 Computational Social Science An instrument-enabled scientific discipline microbiology microscope radio astronomy radar nanoscience electron microscope
  • 20. 24
  • 21. 25
  • 22. 26
  • 23. 27 Technical Challenges Computational infrastructures for dealing with More data: analyzing large amounts of data Fuzzy data: cleaning up inprecise and noisy data New kinds of data: processing real-time sensor streams and web data Need for new substantive ideas Need for new statistical methods (WHY in addition to WHAT and HOW)
  • 24. 28 3 Common Approaches Macroscope Virtual Lab Empirical Modeling
  • 25. 29 APPROACHES #1 MACROSCOPE #2 VIRTUAL LAB #3 EMPIRICAL MODELING
  • 26. 31 WE ARE WHAT WE SAY Linguistics Schwartz, H. Andrew, et al. "Personality, gender, and age in the language of social media:The open-vocabulary approach." PloS one 8.9 (2013): e73791. Macroscope
  • 27. 32 Dataset 700 million words, phrases, and topic instances collected from 75,000 volunteers’ FB posts Record users’ personality (5-factor), gender and age
  • 28. 33 What Words Do You Use? male female
  • 29. 34 How Old Are You? (#1) 13 - 18 19 - 22
  • 30. 35 How Old Are You? (#2) 23 - 29 30 - 65
  • 32. 38 Topics Across 4 Age-groups
  • 34. 40 Usage of “I” & “We” Huge-volume data + simple analysis  crystal clear language use patterns
  • 35. 42 APPROACHES #1 MACROSCOPE #2 VIRTUAL LAB #3 EMPIRICAL MODELING
  • 36. 43 Scaling up the Lab Social science experimental heavily constrained by scale and speed Unit of analysis was individuals or small groups Experiments took months to design and run Potentially “virtual labs” lift both constraints State of the art ~ 5000 workers, but in principle could construct subject panel ~ 100K – 1M Could shrink hypothesis-testing cycle to days or hours
  • 37. 44 MOOD CONTAGION (& MANIPULATION) ON FACEBOOK Social Psychology Kramer, Adam DI, Jamie E. Guillory, and Jeffrey T. Hancock. "Experimental evidence of massive-scale emotional contagion through social networks.” Proceedings of the National Academy of Sciences111.24 (2014): 8788-8790. Virtual Lab
  • 38. 45 Facebook Mood Contagion 0.7 million (~ 0.04%) users on Facebook 3 million posts manipulated in one week Hide some “positive” or “negative” emotional posts from users (in the experimental group)
  • 39. 46 Observations Negative posts hidden People who see more positive posts, tend to post more positively, and vice versa. Facebook users’ emotion can be easily manipulated by changing ALGORITHMS Positive posts hidden
  • 40. 47 Ethical Issues (!) Unethical experiment because it’s conducted without users’ consent Serious invasion of users’ perceptions about their friend circles (and the society) Well, Facebook's data use policy states that users' information will be used "for internal operations, including troubleshooting, data analysis, testing, research and service improvement," meaning that any user can become a lab rat.
  • 41. 48 FACEBOOK “I VOTED” BUTTON Social Psychology & Politics Bond, Robert M., et al. "A 61-million-person experiment in social influence and political mobilization." Nature 489.7415 (2012): 295-298. Virtual Lab
  • 42. 49 “I Voted” Button Direct messages to 61 million users on FB Informational: 1% users received Social: 98% users received Control group: 1% (no message received) Informational Social
  • 43. 51 Effect of Manipulation Ratio of friends voted Prob. of oneself claimed voted
  • 44. 53 2% more likely to click “I voted” button and 0.3% more likely to seek information about a polling place, and 0.4% more likely to head to the polls.
  • 45. 54 Real-world Consequence (!) In total there were about 60,000 votes of turnout, and estimated 280,000 indirect turnout (out of 61 million users) What if Facebook did not randomize the control/experimental groups?
  • 46. 62 APPROACHES #1 MACROSCOPE #2 VIRTUAL LAB #3 EMPIRICAL MODELING
  • 47. 63 Empirical Modeling Traditional mathematical or computational modeling Tends to rely on many, often unrealistic, assumptions Not generally tested in detail against data Result is proliferation of models that exist in parallel and are often incompatible with each other New sources/scales of data allow both to learn/test models and also calibrate them Observations  Models  Lab  Field  Observations
  • 48.
  • 49. 65 Google Flu Trends Nature 457, 1012-1014 (2009)
  • 50. 66 PREDICTION OF COUNTY-LEVEL HEART DISEASE MORTALITY Medicine and Linguistics Empirical Modeling Eichstaedt, Johannes C., et al. "Psychological language on twitter predicts county-level heart disease mortality." Psychological science 26.2 (2015): 159-169.
  • 51. 68 Datsets Heart disease Arteriosclerotic heart disease mortality rates during 2009 -- 2010 Predictors 826 million tweets collected between June 2009 and March 2010 Socioeconomic (income and education) Demographic (percentages of Black, Hispanic, married, and female residents) Health status (diabetes, obesity, smoking, and hypertension)
  • 53. 70
  • 54. 71
  • 56. 75 Social media opens up a new window of what humans actually feel and think
  • 57. 76 YOU ARE WHAT YOU LIKE Social Psychology Empirical Modeling Kosinski, Michal, David Stillwell, andThore Graepel. "Private traits and attributes are predictable from digital records of human behavior." Proceedings of the National Academy of Sciences 110.15 (2013): 5802-5805.
  • 58. 78 Personality Prediction Personality traits Gender, age, relationship status, # friends Sexual orientation, ethnicity, religion, political inclination Addictive substances (alcohol, drugs, cigarette), parental separation IQ, 5-Factor model, satisfaction with Life
  • 59. 79 Data Collection 9,939,220 Likes (55,814 unique ones) from 58,466 Facebook volunteers Sports Music Books Restaurants Popular websites
  • 60. 80 Ground truth Political Inclination Sexual Orientation Democrat Republican Democratic GOP (Grand Old Party) Democratic Party Republican Party Homosexual Heterosexual 1 / 0 1 / 0
  • 63. 85 Methodology User-Like matrix dimension reduction: Singular Value Decomposition (SVD) Prediction models: Logistic Regression & Linear Regression
  • 64. 87 Prediction Results Solid: Pearson corr. coef. between pred. & actual values Transparent: baseline acc. of the questionnaire, in terms of test- retest reliability
  • 69. 177 WE ARE STILL AT THE VERY START
  • 70. 178 LOTS of Big Questions The polarization of global economic inequality What explains the success of social movements? The emergence of pro-sociality behavior The causality of video gaming and propensity of violence? The politics of censorship The causality of social selection and social influence? …
  • 71. 179 The Data Divide Social scientists have good questions but… IT tools are not part of their toolkits Not clear that we will/should make the investment Computer scientists have powerful methods but… Trained to resolve technical problems It seems there are less “methodological” contributions
  • 72. 180 The Challenges Education and habits of social and computer scientists Different ways of thinking Different methodologies Differences in framing questions and defining contributions Data access and fragmentation issue Data privacy issue Ethics issue Organizational issue
  • 73. 182 Institutional Innovations New platforms and protocols for data management Better coordination of data collection, storage, sharing Recruitment and management of subject pools, field panels Integrated research designs Coordination across theoretical, experimental and observational studies Collaborative interdisciplinary teams For a given data set, often unclear what the most interesting question is For a given question, often unclear how to collect the right data
  • 74. 184 Techniques to collect, manage, and process large datasets Knowledge about social theories, methods, and issues Computational Social Science