SlideShare a Scribd company logo
Enhancing a Social Science
Model-building Workflow with
InteractiveVisualisation
CagatayTurkay, Aidan Slingsby,
Kaisa Lahtinen, Sarah Butt and Jason Dykes
giCentre & Centre for Comparative Social Surveys at City University London
ESANN 2016, 29 April 2016
“We (social scientists) need (data-based)
models that we can understand and
explain so that we can defend them to
our peers in full confidence.”
A quote that motivates this work (from collaborators within our AddResponse project)
Image from: Lahtinen, K. et al. (2015). Informing
Non-Response Bias Model Creation in Social
Surveys with Visualisation. Poster VIS 2015
Numerical models to predict phenomena or, act as a
simulation of the phenomena being investigated
Good predictive power is often desired in models, BUT, (in
some fields) explanatory power is also crucial (Shmueli, 2010 for a detailed
[*] Shmueli, Galit. "To explain or to predict?." Statistical science (2010): 289-310.
discussion)
AddResponse Project -- https://blogs.city.ac.uk/addresponse/
… utilise organically generated auxiliary data (from commercial
transactions, public administration and other sources) to understand propensity
to respond and eventually tackle nonresponse bias (i.e.,
respondents differ from nonrespondents ).
AddResponse - Details
• European Social Survey (ESS) UK 2012 - 13
• 4,520 households
• linked to auxiliary data from:
• administrative sources
• commercial consumer profiling
• open-source data
• 401 auxiliary variables
• 32 survey response variables
(only for the respondents)
e.g., Proportion
of house
sharing adults
e.g., Sports
facilities
within walking
distance
Existing workflow
• Iteratively add and/or removing variables from a
logistic regression model
• Assess the changes through model fitness metrics
(e.g.,AIC, McFadden)
• Put up a sticker !
• Highly manual but involved!
Key roles for interactive visualisation
• Incorporating Theory
• Exploring variables
• Interactively building models
• Considering Geography
• Recording the model-building process, i.e., provenance
VarXplorer ModelBuilder
Prototype-1:VarXplorer
Co-variation plot
Correlations with
indicators
Theory-related
meta-data
Interactive
modelling
Link to the Video: http://goo.gl/XNiOIX
Exploring variables – 1: Investigate Covariation
- Compute pairwise correlation within all
401 variables
- Use this as a distance matrix and
project to 2D (using MDS)
- Visualise on a scatterplot where each
point is a variable
Exploring variables – 2: Correlation with indicators
- Compute correlations within all 32
response variables + response rate
- Use this as meta-data on variables to
check whether they relate to indicators
Incorporating Theory-related data
- Associate variables to social-science
concepts and theory
- Concepts relate to theories
- Variables act as proxies for concepts
- Use these as meta-data on variables
and visualise through histograms
Concepts, e.g.,
deprivation or quality
of life
Theories, e.g., social
isolation or social
disorganisation
Prototype-2: ModelBuilder
Variable selection
Model provenance
Interactive modelling
(through R)
Model quality
metrics
Prototype-2: ModelBuilder
Link to the Video: http://goo.gl/itUlm2
Interactively building models & evaluating them
- R scripts are called with the variable
selections and the variable to predict
(response or ESS variable)
- Quality metrics (AIC, McFadden) &
variables weights visualised
Interactive model building
also in VarXplorer
with variable weights
Considering Geography
- Facet data (geographically) into 12 regions
- Build local models
- Evaluate locally
Model provenance & annotations
- Save and analyse the model-building
trail
- Mark dead-ends and good models
- Attach notes to models
A brief example of the modelling process
1. Select two
concepts ,
economic
circumstances and
quality of life
A brief example of the modelling process
2. Select variables
that are distinct
and relevant
A brief example of the modelling process
3. Select variables
that correlate
with an ESS
indicator
(happiness)
3.1 Observe that
they relate to
“Social Isolation”
A brief example of the modelling process
4. Use these variables as a
starting point, check local
variations and plug into
existing scripts
4.1 Model performs
“better” in South-East UK
and in Greater London
Lessons learned
• Enhanced analysis through informed use of computation
• Interactive visual methods improve reliability and
interpretability
• Improved trust in models
• Tight integration enables quick hypothesis prototyping
• Important to communicate the certainty of the findings
Looking into the future
• Explanatory models not only predictive models
• Incorporating more complex methods (already
incorporated random forests)
• Other ways to make models more accessible?
• Use models & findings as scientific evidence ?
Acknowledgments
• giCentre team @ City
• ADDResponse project funded by the UK Economic
and Social Research Council (grant ES/L013118/1)
Thank you !
Cagatay.Turkay.1@city.ac.uk
@cagatay_turkay
http://staff.city.ac.uk/cagatay.turkay.1/
https://blogs.city.ac.uk/addresponse/
http://www.gicentre.net/
!!We are hiring !!
* Researcher in visualisation of cyber-security data
(H2020 funded RIA)
* PhD studentships
Deadlines in late May and June
check giCentre.net

More Related Content

What's hot

QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...
QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...
QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...
NewUOPCourse
 
Visualization of Publication Impact
Visualization of Publication ImpactVisualization of Publication Impact
Visualization of Publication Impact
Eamonn Maguire
 
QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...
QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...
QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...
UOPCourseHelp
 
Common Method Variance
Common Method Variance Common Method Variance
Common Method Variance
Hiệp Phạm
 
Similarity learning
  Similarity learning  Similarity learning
Similarity learning
Learnbay Datascience
 
Collaborative Metric Learning (WWW'17)
Collaborative Metric Learning (WWW'17)Collaborative Metric Learning (WWW'17)
Collaborative Metric Learning (WWW'17)
承剛 謝
 
Business Basic Statistics
Business Basic StatisticsBusiness Basic Statistics
Business Basic Statistics
Carmeline Coronado
 
GeneticProgramming
GeneticProgrammingGeneticProgramming
GeneticProgrammingDave Coulter
 
Quantitative Methods for Management_MBA_Bharathiar University
Quantitative Methods for Management_MBA_Bharathiar UniversityQuantitative Methods for Management_MBA_Bharathiar University
Quantitative Methods for Management_MBA_Bharathiar University
Victor Seelan
 
Multi variate presentation
Multi variate presentationMulti variate presentation
Multi variate presentationArun Kumar
 

What's hot (12)

QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...
QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...
QNT 275 Week 5 Apply Connect Week 5 Case Qnt 275 qnt275 https://uopcourses.co...
 
Visualization of Publication Impact
Visualization of Publication ImpactVisualization of Publication Impact
Visualization of Publication Impact
 
QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...
QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...
QNT 275 qnt275 QNT275 Qnt 275 qnt275 QNT/275 STATISTICS FOR DECISION MAKING h...
 
Common Method Variance
Common Method Variance Common Method Variance
Common Method Variance
 
Ari resume 2015
Ari resume 2015Ari resume 2015
Ari resume 2015
 
Similarity learning
  Similarity learning  Similarity learning
Similarity learning
 
Presentation of Findings Quiz
Presentation of Findings QuizPresentation of Findings Quiz
Presentation of Findings Quiz
 
Collaborative Metric Learning (WWW'17)
Collaborative Metric Learning (WWW'17)Collaborative Metric Learning (WWW'17)
Collaborative Metric Learning (WWW'17)
 
Business Basic Statistics
Business Basic StatisticsBusiness Basic Statistics
Business Basic Statistics
 
GeneticProgramming
GeneticProgrammingGeneticProgramming
GeneticProgramming
 
Quantitative Methods for Management_MBA_Bharathiar University
Quantitative Methods for Management_MBA_Bharathiar UniversityQuantitative Methods for Management_MBA_Bharathiar University
Quantitative Methods for Management_MBA_Bharathiar University
 
Multi variate presentation
Multi variate presentationMulti variate presentation
Multi variate presentation
 

Viewers also liked

Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?
Cagatay Turkay
 
Visualization, A Primer - Basics, Techniques and Guidelines
Visualization, A Primer - Basics, Techniques and GuidelinesVisualization, A Primer - Basics, Techniques and Guidelines
Visualization, A Primer - Basics, Techniques and Guidelines
Cagatay Turkay
 
Designing Progressive and Interactive Analytics Processes for High-Dimensiona...
Designing Progressive and Interactive Analytics Processes for High-Dimensiona...Designing Progressive and Interactive Analytics Processes for High-Dimensiona...
Designing Progressive and Interactive Analytics Processes for High-Dimensiona...
Cagatay Turkay
 
Drawing Euler diagrams and graphs in combination
Drawing Euler diagrams and graphs in combinationDrawing Euler diagrams and graphs in combination
Drawing Euler diagrams and graphs in combination
Mithileysh Sathiyanarayanan
 
TGE Project Controls
TGE Project ControlsTGE Project Controls
TGE Project ControlsDavid Mathews
 
Proyecto mejorando laseguridad de control de procesos
Proyecto   mejorando laseguridad de control de procesosProyecto   mejorando laseguridad de control de procesos
Proyecto mejorando laseguridad de control de procesos
Robyns Torres
 
Designing with People: Balancing Data & Rational Thinking with Human Emotions
Designing with People: Balancing Data & Rational Thinking with Human EmotionsDesigning with People: Balancing Data & Rational Thinking with Human Emotions
Designing with People: Balancing Data & Rational Thinking with Human Emotions
Brad Gutting
 
ReScience
ReScienceReScience
ReScience
Nicolas Rougier
 
Examples for leverage points
Examples for leverage pointsExamples for leverage points
Examples for leverage pointsGeorges Grinstein
 
Employee Engagement - More than just saying Thanks!
Employee Engagement - More than just saying Thanks! Employee Engagement - More than just saying Thanks!
Employee Engagement - More than just saying Thanks!
Sat Sindhar
 
Winning Together - How Great Leaders Inspire Winning Teams
Winning Together - How Great Leaders Inspire Winning TeamsWinning Together - How Great Leaders Inspire Winning Teams
Winning Together - How Great Leaders Inspire Winning Teams
Drew Bedard
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...
Amit Sharma
 
Management Fundamentals: The Iceberg Model
Management Fundamentals: The Iceberg ModelManagement Fundamentals: The Iceberg Model
Management Fundamentals: The Iceberg Model
Bar-Ezer Yossi
 
High Load Strategy 2016 - Project Management: from Stone Age to DevOps
High Load Strategy 2016 - Project Management: from Stone Age to DevOps High Load Strategy 2016 - Project Management: from Stone Age to DevOps
High Load Strategy 2016 - Project Management: from Stone Age to DevOps
OpenCredo
 
2017 Design Innovation Project Management
2017 Design Innovation Project Management2017 Design Innovation Project Management
2017 Design Innovation Project Management
erikbohemia
 
Cause2Create -- Updated to Project Challenge
Cause2Create -- Updated to Project ChallengeCause2Create -- Updated to Project Challenge
Cause2Create -- Updated to Project Challenge
erikbohemia
 
Centric
CentricCentric
Centric
BigDataExpo
 
Project management presentation (power point)
Project management presentation (power point)Project management presentation (power point)
Project management presentation (power point)
jafrin akter
 
Delivering Modern Operations on AWS
Delivering Modern Operations on AWSDelivering Modern Operations on AWS
Delivering Modern Operations on AWS
Amazon Web Services
 

Viewers also liked (20)

Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?
 
Visualization, A Primer - Basics, Techniques and Guidelines
Visualization, A Primer - Basics, Techniques and GuidelinesVisualization, A Primer - Basics, Techniques and Guidelines
Visualization, A Primer - Basics, Techniques and Guidelines
 
Designing Progressive and Interactive Analytics Processes for High-Dimensiona...
Designing Progressive and Interactive Analytics Processes for High-Dimensiona...Designing Progressive and Interactive Analytics Processes for High-Dimensiona...
Designing Progressive and Interactive Analytics Processes for High-Dimensiona...
 
Drawing Euler diagrams and graphs in combination
Drawing Euler diagrams and graphs in combinationDrawing Euler diagrams and graphs in combination
Drawing Euler diagrams and graphs in combination
 
Dynamic Project Management Model
Dynamic Project Management ModelDynamic Project Management Model
Dynamic Project Management Model
 
TGE Project Controls
TGE Project ControlsTGE Project Controls
TGE Project Controls
 
Proyecto mejorando laseguridad de control de procesos
Proyecto   mejorando laseguridad de control de procesosProyecto   mejorando laseguridad de control de procesos
Proyecto mejorando laseguridad de control de procesos
 
Designing with People: Balancing Data & Rational Thinking with Human Emotions
Designing with People: Balancing Data & Rational Thinking with Human EmotionsDesigning with People: Balancing Data & Rational Thinking with Human Emotions
Designing with People: Balancing Data & Rational Thinking with Human Emotions
 
ReScience
ReScienceReScience
ReScience
 
Examples for leverage points
Examples for leverage pointsExamples for leverage points
Examples for leverage points
 
Employee Engagement - More than just saying Thanks!
Employee Engagement - More than just saying Thanks! Employee Engagement - More than just saying Thanks!
Employee Engagement - More than just saying Thanks!
 
Winning Together - How Great Leaders Inspire Winning Teams
Winning Together - How Great Leaders Inspire Winning TeamsWinning Together - How Great Leaders Inspire Winning Teams
Winning Together - How Great Leaders Inspire Winning Teams
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...
 
Management Fundamentals: The Iceberg Model
Management Fundamentals: The Iceberg ModelManagement Fundamentals: The Iceberg Model
Management Fundamentals: The Iceberg Model
 
High Load Strategy 2016 - Project Management: from Stone Age to DevOps
High Load Strategy 2016 - Project Management: from Stone Age to DevOps High Load Strategy 2016 - Project Management: from Stone Age to DevOps
High Load Strategy 2016 - Project Management: from Stone Age to DevOps
 
2017 Design Innovation Project Management
2017 Design Innovation Project Management2017 Design Innovation Project Management
2017 Design Innovation Project Management
 
Cause2Create -- Updated to Project Challenge
Cause2Create -- Updated to Project ChallengeCause2Create -- Updated to Project Challenge
Cause2Create -- Updated to Project Challenge
 
Centric
CentricCentric
Centric
 
Project management presentation (power point)
Project management presentation (power point)Project management presentation (power point)
Project management presentation (power point)
 
Delivering Modern Operations on AWS
Delivering Modern Operations on AWSDelivering Modern Operations on AWS
Delivering Modern Operations on AWS
 

Similar to Enhancing a Social Science Model-building Workflow with Interactive Visualisation

Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docxChapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
tiffanyd4
 
Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docxChapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
mccormicknadine86
 
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Platforma Otwartej Nauki
 
Goal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to ImplementationGoal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to ImplementationAmjad Adib
 
Shibani Antonette_Augmenting pedagogic writing practice with CLAD.pdf
Shibani Antonette_Augmenting pedagogic writing practice with CLAD.pdfShibani Antonette_Augmenting pedagogic writing practice with CLAD.pdf
Shibani Antonette_Augmenting pedagogic writing practice with CLAD.pdf
Shibani22
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
University Medicine Greifswald
 
Modeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based DecisionsModeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based Decisions
Albert Simard
 
Introduction to Logic Models
Introduction to Logic ModelsIntroduction to Logic Models
Introduction to Logic Models
DiscoveryCenterMU
 
2015-11-11 research seminar
2015-11-11 research seminar2015-11-11 research seminar
2015-11-11 research seminar
ifi8106tlu
 
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
The Statistical and Applied Mathematical Sciences Institute
 
Boru Douthwaite: Theory of Change to lever change
Boru Douthwaite: Theory of Change to lever changeBoru Douthwaite: Theory of Change to lever change
Boru Douthwaite: Theory of Change to lever change
STEPS Centre
 
Using Theory of Change to Lever Change: Experience from the CGIAR
Using Theory of Change to Lever Change: Experience from the CGIARUsing Theory of Change to Lever Change: Experience from the CGIAR
Using Theory of Change to Lever Change: Experience from the CGIAR
WorldFish
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
The University of Edinburgh
 
The Early Stage Analysis of a Systemic Innovation Lab
The Early Stage Analysis of a Systemic Innovation LabThe Early Stage Analysis of a Systemic Innovation Lab
The Early Stage Analysis of a Systemic Innovation Lab
RSD7 Symposium
 
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Lauri Eloranta
 
Aligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & NeedsAligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & Needs
Simon Knight
 
3 D Project Based Learning Basics for the New Generation Science Standards
3 D Project Based  Learning Basics for the New Generation Science Standards3 D Project Based  Learning Basics for the New Generation Science Standards
3 D Project Based Learning Basics for the New Generation Science Standards
rekharajaseran
 
Theories in Empirical Software Engineering
Theories in Empirical Software EngineeringTheories in Empirical Software Engineering
Theories in Empirical Software Engineering
Daniel Mendez
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...
ResearchSpace
 
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Symeon Papadopoulos
 

Similar to Enhancing a Social Science Model-building Workflow with Interactive Visualisation (20)

Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docxChapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
 
Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docxChapters 4,5 and 6Into policymaking and modeling in a comple.docx
Chapters 4,5 and 6Into policymaking and modeling in a comple.docx
 
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
 
Goal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to ImplementationGoal Dynamics_From System Dynamics to Implementation
Goal Dynamics_From System Dynamics to Implementation
 
Shibani Antonette_Augmenting pedagogic writing practice with CLAD.pdf
Shibani Antonette_Augmenting pedagogic writing practice with CLAD.pdfShibani Antonette_Augmenting pedagogic writing practice with CLAD.pdf
Shibani Antonette_Augmenting pedagogic writing practice with CLAD.pdf
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
Modeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based DecisionsModeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based Decisions
 
Introduction to Logic Models
Introduction to Logic ModelsIntroduction to Logic Models
Introduction to Logic Models
 
2015-11-11 research seminar
2015-11-11 research seminar2015-11-11 research seminar
2015-11-11 research seminar
 
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
MUMS Opening Workshop - The Isaac Newton Institute Uncertainty Quantification...
 
Boru Douthwaite: Theory of Change to lever change
Boru Douthwaite: Theory of Change to lever changeBoru Douthwaite: Theory of Change to lever change
Boru Douthwaite: Theory of Change to lever change
 
Using Theory of Change to Lever Change: Experience from the CGIAR
Using Theory of Change to Lever Change: Experience from the CGIARUsing Theory of Change to Lever Change: Experience from the CGIAR
Using Theory of Change to Lever Change: Experience from the CGIAR
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
 
The Early Stage Analysis of a Systemic Innovation Lab
The Early Stage Analysis of a Systemic Innovation LabThe Early Stage Analysis of a Systemic Innovation Lab
The Early Stage Analysis of a Systemic Innovation Lab
 
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
 
Aligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & NeedsAligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & Needs
 
3 D Project Based Learning Basics for the New Generation Science Standards
3 D Project Based  Learning Basics for the New Generation Science Standards3 D Project Based  Learning Basics for the New Generation Science Standards
3 D Project Based Learning Basics for the New Generation Science Standards
 
Theories in Empirical Software Engineering
Theories in Empirical Software EngineeringTheories in Empirical Software Engineering
Theories in Empirical Software Engineering
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...
 
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
 

Recently uploaded

Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
RASHMI M G
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
sanjana502982
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
RASHMI M G
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 

Recently uploaded (20)

Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 

Enhancing a Social Science Model-building Workflow with Interactive Visualisation

  • 1. Enhancing a Social Science Model-building Workflow with InteractiveVisualisation CagatayTurkay, Aidan Slingsby, Kaisa Lahtinen, Sarah Butt and Jason Dykes giCentre & Centre for Comparative Social Surveys at City University London ESANN 2016, 29 April 2016
  • 2. “We (social scientists) need (data-based) models that we can understand and explain so that we can defend them to our peers in full confidence.” A quote that motivates this work (from collaborators within our AddResponse project) Image from: Lahtinen, K. et al. (2015). Informing Non-Response Bias Model Creation in Social Surveys with Visualisation. Poster VIS 2015
  • 3. Numerical models to predict phenomena or, act as a simulation of the phenomena being investigated Good predictive power is often desired in models, BUT, (in some fields) explanatory power is also crucial (Shmueli, 2010 for a detailed [*] Shmueli, Galit. "To explain or to predict?." Statistical science (2010): 289-310. discussion)
  • 4.
  • 5. AddResponse Project -- https://blogs.city.ac.uk/addresponse/ … utilise organically generated auxiliary data (from commercial transactions, public administration and other sources) to understand propensity to respond and eventually tackle nonresponse bias (i.e., respondents differ from nonrespondents ).
  • 6. AddResponse - Details • European Social Survey (ESS) UK 2012 - 13 • 4,520 households • linked to auxiliary data from: • administrative sources • commercial consumer profiling • open-source data • 401 auxiliary variables • 32 survey response variables (only for the respondents) e.g., Proportion of house sharing adults e.g., Sports facilities within walking distance
  • 7.
  • 8. Existing workflow • Iteratively add and/or removing variables from a logistic regression model • Assess the changes through model fitness metrics (e.g.,AIC, McFadden) • Put up a sticker ! • Highly manual but involved!
  • 9. Key roles for interactive visualisation • Incorporating Theory • Exploring variables • Interactively building models • Considering Geography • Recording the model-building process, i.e., provenance VarXplorer ModelBuilder
  • 11. Link to the Video: http://goo.gl/XNiOIX
  • 12. Exploring variables – 1: Investigate Covariation - Compute pairwise correlation within all 401 variables - Use this as a distance matrix and project to 2D (using MDS) - Visualise on a scatterplot where each point is a variable
  • 13. Exploring variables – 2: Correlation with indicators - Compute correlations within all 32 response variables + response rate - Use this as meta-data on variables to check whether they relate to indicators
  • 14. Incorporating Theory-related data - Associate variables to social-science concepts and theory - Concepts relate to theories - Variables act as proxies for concepts - Use these as meta-data on variables and visualise through histograms Concepts, e.g., deprivation or quality of life Theories, e.g., social isolation or social disorganisation
  • 15. Prototype-2: ModelBuilder Variable selection Model provenance Interactive modelling (through R) Model quality metrics
  • 16. Prototype-2: ModelBuilder Link to the Video: http://goo.gl/itUlm2
  • 17. Interactively building models & evaluating them - R scripts are called with the variable selections and the variable to predict (response or ESS variable) - Quality metrics (AIC, McFadden) & variables weights visualised Interactive model building also in VarXplorer with variable weights
  • 18. Considering Geography - Facet data (geographically) into 12 regions - Build local models - Evaluate locally
  • 19. Model provenance & annotations - Save and analyse the model-building trail - Mark dead-ends and good models - Attach notes to models
  • 20. A brief example of the modelling process 1. Select two concepts , economic circumstances and quality of life
  • 21. A brief example of the modelling process 2. Select variables that are distinct and relevant
  • 22. A brief example of the modelling process 3. Select variables that correlate with an ESS indicator (happiness) 3.1 Observe that they relate to “Social Isolation”
  • 23. A brief example of the modelling process 4. Use these variables as a starting point, check local variations and plug into existing scripts 4.1 Model performs “better” in South-East UK and in Greater London
  • 24. Lessons learned • Enhanced analysis through informed use of computation • Interactive visual methods improve reliability and interpretability • Improved trust in models • Tight integration enables quick hypothesis prototyping • Important to communicate the certainty of the findings
  • 25. Looking into the future • Explanatory models not only predictive models • Incorporating more complex methods (already incorporated random forests) • Other ways to make models more accessible? • Use models & findings as scientific evidence ?
  • 26. Acknowledgments • giCentre team @ City • ADDResponse project funded by the UK Economic and Social Research Council (grant ES/L013118/1)
  • 27. Thank you ! Cagatay.Turkay.1@city.ac.uk @cagatay_turkay http://staff.city.ac.uk/cagatay.turkay.1/ https://blogs.city.ac.uk/addresponse/ http://www.gicentre.net/ !!We are hiring !! * Researcher in visualisation of cyber-security data (H2020 funded RIA) * PhD studentships Deadlines in late May and June check giCentre.net