SlideShare a Scribd company logo
1 of 25
Using Clustering as a Tool:
Mixed Methods in Qualitative Data Analysis
Laura Macia, PhD
Behavioral and Community Health
Sciences
Graduate School of Public Health
University of Pittsburgh
Types of data
Mixed Methods
• Type of Data / Data Collection
• Data Analysis
Mixed Methods in Data Analysis
Cluster Analysis
• Method for grouping data by their similarity
– Appropriate data
– Defining similarity
– Clustering
Data Preparation
• Types of data:
– Nominal
– Ordinal
– Interval / Ratio
Qualitative Data
(an example)
Latino Grievances Project
Summary Table: Nodes and Attributes (after thematic analysis using Nvivo)
Select Variables Values [description]
Part 1: Gender
Strata
Legal status
Income
Education
(0) Male; (1) Female
(0) Blue-collar; (1) Spouse of American citizen; (2) White-collar
(0) US citizen; (1) Legal permanent resident; (2) Immigrant visa; (3) Non-immigrant visa;
(4) Visa overstay; (5) Undocumented
(0) Under $20k; (1) $20k to $40k; (2) $40k to $60k; (1) $60k to $80k; (1) $80k to $100k;
(5) Over $100k
(0) Primary; (1) Some secondary; (2) High-school diploma; (3) College degree; (4) Graduate
degree; (5) Other degree
Part 2: Type
Nationality
(0) Male; (1) Female; (2) Individual [when gender unknown]; (3) Institution; (4)
Government; (5) Other
(0) American; (1) Latino; (2) Other; (3) Unknown
Grievance (0) Debt; (1) Discrimination; (2) Domestic; (3) With the law
Procedural
mode
(1) None
(2) Adjudication [third party with authority to intervene, i.e. courts]
(3) Arbitration [third party agreed to by principals]
(4) Mediation [third party aiding principals reach an agreement]
(5) Negotiation [two principals decide on settlement]
(6) Coercion [imposition of outcome by unilateral threat or use of force]
(7) Avoidance [terminate relationship / withdraw from situation]
(8) Lumping it [“letting go” as of grievance]
(9) Assumed fault* [structure grievance as occurring due to own situation/fault]
(10)Talk back* [letting know of grievance without expecting further action]
(11)Other
* Data-driven codes, not included in predefined coding scheme
Data Preparation
• Types of data:
– Nominal
– Ordinal
– Interval / Ratio
Qualitative Data
Gender: (0) Male, (1) Female, …
Type of Grievance:
(0) Debt, (1) Discrimination, …
Chosen Procedure:
(2) Adjudication, …(6) Coercion, …
Income: (0) <$20k, (1) $20k-$40k, …
Education:
(0) primary , … (2) high school diploma, …
Units of analysis: Cases
ID Strata Part2 Part2Natlity Type ProcMode1ProcMode2ProcMode3Support1 Support2
1 WC Individual Unknown Debt Other None None None None
2 WC Institution American Debt NegotiationAvoidanceNone None None
3 WC Female American DiscriminationAssumed faultLumping itTalk back None None
4 WC Individual American DiscriminationOther None None None None
5 WC Male Latino Domestic Other NegotiationNone Other None
6 WC Female Latino Domestic NegotiationOther None Family None
7 WC Male Latino Domestic NegotiationNone None Family None
8 WC Government American Law NegotiationAssumed faultOther Family None
9 WC Male Latino Debt NegotiationLumping itOther Family Friend
10 WC Female Other Debt Talk back AvoidanceOther Family None
11 WC Institution American Debt AvoidanceOther None Friend None
12 WC Institution American Debt Other None None None None
13 WC Male Unknown Debt Assumed faultNegotiationNone Friend None
14 WC Male American DiscriminationLumping itNone None None None
15 WC Institution American DiscriminationOther None None Church None
16 WC Male Other DiscriminationLumping itOther None Family None
17 WC Other Latino Domestic NegotiationNone None None None
18 WC Female Other Domestic NegotiationNone None Other None
19 WC Female Other Domestic NegotiationOther None None None
20 WC Government American Law Assumed faultNone None None None
12 variables
Cluster Analysis: Data Reduction
• Transform qualitative data into binary data
ID 1-Fem 1-Male 2-Fem 2-Male 2-Indiv 2-Govmnt 2-Instit 2-Other 2N-American
WC-F-De-11-1 1 0 0 0 1 0 0 0 0
WC-F-De-11-2 1 0 0 0 0 0 1 0 1
WC-F-Di-11-3 1 0 1 0 1 0 0 0 1
WC-F-Di-11-4 1 0 0 0 1 0 0 0 1
WC-F-Do-11-6 1 0 1 0 1 0 0 0 0
WC-F-L-11-8 1 0 0 0 0 1 0 0 1
WC-M-De-45-9 0 1 0 1 1 0 0 0 0
WC-M-De-45-10 0 1 1 0 1 0 0 0 0
WC-M-De-45-11 0 1 0 0 0 0 1 0 1
WC-M-De-45-12 0 1 0 0 0 0 1 0 1
WC-M-De-45-13 0 1 0 1 1 0 0 0 0
WC-M-Di-45-14 0 1 0 1 1 0 0 0 1
WC-M-Di-45-15 0 1 0 0 0 0 1 0 1
WC-M-Do-45-18 0 1 1 0 1 0 0 0 0
WC-M-Do-45-19 0 1 1 0 1 0 0 0 0
WC-M-L-45-20 0 1 0 0 0 1 0 0 1
WC-M-O-45-21 0 1 0 0 0 0 1 0 1
BC-M-Do-29-22 0 1 0 0 1 0 0 0 0
BC-M-De-32-23 0 1 0 0 0 0 1 0 1
BC-M-De-32-24 0 1 0 1 1 0 0 0 0
59 binary
variables
Clustering decisions: variables
• Variables to include
– All relevant variables
what is your question?
• Variables to exclude
– irrelevant variables that bias towards certain
cluster solutions
Clustering decisions: similarity
• For binary data: Contingency Tables
• Pay attention to the a, b, c and ds in your data:
– Which are more common?
– More meaningful?
Example similarity measures
aa+b+c+d=ap.
𝑅𝑅 𝑥, 𝑦 =
𝑎
𝑎+𝑏+𝑐+𝑑
[Russel and Rao]
𝑆𝑀 𝑥, 𝑦 =
𝑎+𝑑
𝑎+𝑏+𝑐+𝑑
[Simple Matching]
𝐽𝐴𝐶𝐶𝐴𝑅𝐷 𝑥, 𝑦 =
𝑎
𝑎+𝑏+𝑐
[Jaccard]
𝐷𝐼𝐶𝐸 𝑥, 𝑦 =
2𝑎
2𝑎+𝑏+𝑐
[Dice]
𝑆𝑆1 𝑥, 𝑦 =
2 𝑎+𝑑
2 𝑎+𝑑 +𝑏+𝑐
[Sokal and Sneath 1]
Clustering decisions: linkage
• Classification strategy
– Hierarchical clustering
• Good for “smaller” sizes (in the hundreds)
• Allows choosing from many similarity measures
• Randomize order, repeat, compare
agglomerative
divisive
Clustering decisions: method
• Linkage method:
• NOT: centroid, median, or Ward
• Between-groups linkage:
d = smallest resulting avg cross-linkage distance
• Within-groups:
d = smallest resulting avg within linkage distance
• Nearest neighbor(single linkage):
d = smallest between two points
• Furthest neighbor (complete linkage):
d = largest between two points
How This Looks in SPSS
Select “Hierarchical Cluster…”
Select variables to
include
Methods Menu: Measure (BINARY), Cluster Method
Statistics Menu: Cluster Membership (CHOOSE)
Plots Menu: Select Dendogram / Icicle Plots [Optional]
Results -
Output:
Agglomeration
Schedule
Results -
Output:
Dendogram
Results: Cluster Membership (as new variables)
Laura Macia: lam60@pitt.edu
THANK YOU!

More Related Content

Viewers also liked

Tamela M. McGhee--PSYC4900--Unit 10 Portfolio Presentation
Tamela M. McGhee--PSYC4900--Unit 10 Portfolio PresentationTamela M. McGhee--PSYC4900--Unit 10 Portfolio Presentation
Tamela M. McGhee--PSYC4900--Unit 10 Portfolio PresentationTamela McGhee
 
Teofanía Aquímica Rosacruz Develado por VM Principe Gurdjieff
Teofanía Aquímica Rosacruz Develado por VM Principe GurdjieffTeofanía Aquímica Rosacruz Develado por VM Principe Gurdjieff
Teofanía Aquímica Rosacruz Develado por VM Principe GurdjieffNiika111
 
Soteriología Alquímica Rosacruz Develado por VM Principe Gurdjieff
Soteriología Alquímica Rosacruz Develado por VM Principe GurdjieffSoteriología Alquímica Rosacruz Develado por VM Principe Gurdjieff
Soteriología Alquímica Rosacruz Develado por VM Principe GurdjieffNiika111
 
Anderson c pcp_final_ppp
Anderson c pcp_final_pppAnderson c pcp_final_ppp
Anderson c pcp_final_pppConnor Anderson
 
España mistica camino iniciático de santiago develado por vm principe gurdjieff
España mistica camino iniciático de santiago develado por vm principe gurdjieffEspaña mistica camino iniciático de santiago develado por vm principe gurdjieff
España mistica camino iniciático de santiago develado por vm principe gurdjieffNiika111
 
latest gen gaming motherboards for every one
latest gen gaming motherboards for every one latest gen gaming motherboards for every one
latest gen gaming motherboards for every one Blazing List
 
Dr. Wail Alzebdah CV 2016
Dr. Wail Alzebdah CV 2016Dr. Wail Alzebdah CV 2016
Dr. Wail Alzebdah CV 2016Wail Alzebdah
 
Stimulus and Exposure Therapy--Final Project
Stimulus and Exposure Therapy--Final ProjectStimulus and Exposure Therapy--Final Project
Stimulus and Exposure Therapy--Final ProjectTamela McGhee
 

Viewers also liked (12)

Tamela M. McGhee--PSYC4900--Unit 10 Portfolio Presentation
Tamela M. McGhee--PSYC4900--Unit 10 Portfolio PresentationTamela M. McGhee--PSYC4900--Unit 10 Portfolio Presentation
Tamela M. McGhee--PSYC4900--Unit 10 Portfolio Presentation
 
Teofanía Aquímica Rosacruz Develado por VM Principe Gurdjieff
Teofanía Aquímica Rosacruz Develado por VM Principe GurdjieffTeofanía Aquímica Rosacruz Develado por VM Principe Gurdjieff
Teofanía Aquímica Rosacruz Develado por VM Principe Gurdjieff
 
Soteriología Alquímica Rosacruz Develado por VM Principe Gurdjieff
Soteriología Alquímica Rosacruz Develado por VM Principe GurdjieffSoteriología Alquímica Rosacruz Develado por VM Principe Gurdjieff
Soteriología Alquímica Rosacruz Develado por VM Principe Gurdjieff
 
Anderson c pcp_final_ppp
Anderson c pcp_final_pppAnderson c pcp_final_ppp
Anderson c pcp_final_ppp
 
España mistica camino iniciático de santiago develado por vm principe gurdjieff
España mistica camino iniciático de santiago develado por vm principe gurdjieffEspaña mistica camino iniciático de santiago develado por vm principe gurdjieff
España mistica camino iniciático de santiago develado por vm principe gurdjieff
 
latest gen gaming motherboards for every one
latest gen gaming motherboards for every one latest gen gaming motherboards for every one
latest gen gaming motherboards for every one
 
gedragsprofiel Ellen
gedragsprofiel Ellen gedragsprofiel Ellen
gedragsprofiel Ellen
 
Dr. Wail Alzebdah CV 2016
Dr. Wail Alzebdah CV 2016Dr. Wail Alzebdah CV 2016
Dr. Wail Alzebdah CV 2016
 
Stimulus and Exposure Therapy--Final Project
Stimulus and Exposure Therapy--Final ProjectStimulus and Exposure Therapy--Final Project
Stimulus and Exposure Therapy--Final Project
 
Taking an order
Taking an order Taking an order
Taking an order
 
January 25th
January 25thJanuary 25th
January 25th
 
Le Talo Cloud
Le Talo CloudLe Talo Cloud
Le Talo Cloud
 

Similar to Using Clustering as a Tool: Mixed Methods in Qualitative Data Analysis

An Inside Look at the Elusive Planned Giving Donor
An Inside Look at the Elusive Planned Giving DonorAn Inside Look at the Elusive Planned Giving Donor
An Inside Look at the Elusive Planned Giving DonorKatherine Swank
 
You Don't Have a Data Management Plan?
You Don't Have a Data Management Plan?You Don't Have a Data Management Plan?
You Don't Have a Data Management Plan?adcieo
 
Why don't you have a data management plan final
Why don't you have a data management plan finalWhy don't you have a data management plan final
Why don't you have a data management plan finalBrandon Fix
 
Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...
Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...
Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...CJKoenig
 
1-Data Understanding.pdf
1-Data Understanding.pdf1-Data Understanding.pdf
1-Data Understanding.pdfgopikahari7
 
Pres fcsm2012 jan10_judson
Pres fcsm2012 jan10_judsonPres fcsm2012 jan10_judson
Pres fcsm2012 jan10_judsonsoder145
 
ICT security and Open Data
ICT security and Open DataICT security and Open Data
ICT security and Open DataSecuRing
 
A Coalition Approach to Data
A Coalition Approach to DataA Coalition Approach to Data
A Coalition Approach to DataVarMedPR
 
PERC_031215_UI Briefing_Final_1(1)
PERC_031215_UI Briefing_Final_1(1)PERC_031215_UI Briefing_Final_1(1)
PERC_031215_UI Briefing_Final_1(1)Michael Turner
 
Entity Resolution Using Patient Records at CMMI
Entity Resolution Using Patient Records at CMMIEntity Resolution Using Patient Records at CMMI
Entity Resolution Using Patient Records at CMMIDatabricks
 
EXPLORATORY DATA ANALYSIS
EXPLORATORY DATA ANALYSISEXPLORATORY DATA ANALYSIS
EXPLORATORY DATA ANALYSISBabasID2
 
Default of Credit Card Payments
Default of Credit Card PaymentsDefault of Credit Card Payments
Default of Credit Card PaymentsVikas Virani
 
Data explosion
Data explosionData explosion
Data explosionG Prachi
 

Similar to Using Clustering as a Tool: Mixed Methods in Qualitative Data Analysis (16)

An Inside Look at the Elusive Planned Giving Donor
An Inside Look at the Elusive Planned Giving DonorAn Inside Look at the Elusive Planned Giving Donor
An Inside Look at the Elusive Planned Giving Donor
 
02Data-osu-0829.pdf
02Data-osu-0829.pdf02Data-osu-0829.pdf
02Data-osu-0829.pdf
 
You Don't Have a Data Management Plan?
You Don't Have a Data Management Plan?You Don't Have a Data Management Plan?
You Don't Have a Data Management Plan?
 
Why don't you have a data management plan final
Why don't you have a data management plan finalWhy don't you have a data management plan final
Why don't you have a data management plan final
 
Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...
Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...
Comparing and contrasting Veterans’ experiences of access with the SOTA Acces...
 
1-Data Understanding.pdf
1-Data Understanding.pdf1-Data Understanding.pdf
1-Data Understanding.pdf
 
Pres fcsm2012 jan10_judson
Pres fcsm2012 jan10_judsonPres fcsm2012 jan10_judson
Pres fcsm2012 jan10_judson
 
ICT security and Open Data
ICT security and Open DataICT security and Open Data
ICT security and Open Data
 
A Coalition Approach to Data
A Coalition Approach to DataA Coalition Approach to Data
A Coalition Approach to Data
 
PERC_031215_UI Briefing_Final_1(1)
PERC_031215_UI Briefing_Final_1(1)PERC_031215_UI Briefing_Final_1(1)
PERC_031215_UI Briefing_Final_1(1)
 
Entity Resolution Using Patient Records at CMMI
Entity Resolution Using Patient Records at CMMIEntity Resolution Using Patient Records at CMMI
Entity Resolution Using Patient Records at CMMI
 
Business analysis of pandemic v3
Business analysis of pandemic v3Business analysis of pandemic v3
Business analysis of pandemic v3
 
Data Journalism 101 - Day 1 by Michael J. Berens
Data Journalism 101 - Day 1 by Michael J. BerensData Journalism 101 - Day 1 by Michael J. Berens
Data Journalism 101 - Day 1 by Michael J. Berens
 
EXPLORATORY DATA ANALYSIS
EXPLORATORY DATA ANALYSISEXPLORATORY DATA ANALYSIS
EXPLORATORY DATA ANALYSIS
 
Default of Credit Card Payments
Default of Credit Card PaymentsDefault of Credit Card Payments
Default of Credit Card Payments
 
Data explosion
Data explosionData explosion
Data explosion
 

Recently uploaded

10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfVishalKumarJha10
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...masabamasaba
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburgmasabamasaba
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastPapp Krisztián
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfayushiqss
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Hararemasabamasaba
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 

Recently uploaded (20)

10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 

Using Clustering as a Tool: Mixed Methods in Qualitative Data Analysis

  • 1. Using Clustering as a Tool: Mixed Methods in Qualitative Data Analysis Laura Macia, PhD Behavioral and Community Health Sciences Graduate School of Public Health University of Pittsburgh
  • 3. Mixed Methods • Type of Data / Data Collection • Data Analysis
  • 4. Mixed Methods in Data Analysis
  • 5. Cluster Analysis • Method for grouping data by their similarity – Appropriate data – Defining similarity – Clustering
  • 6. Data Preparation • Types of data: – Nominal – Ordinal – Interval / Ratio Qualitative Data (an example) Latino Grievances Project
  • 7. Summary Table: Nodes and Attributes (after thematic analysis using Nvivo) Select Variables Values [description] Part 1: Gender Strata Legal status Income Education (0) Male; (1) Female (0) Blue-collar; (1) Spouse of American citizen; (2) White-collar (0) US citizen; (1) Legal permanent resident; (2) Immigrant visa; (3) Non-immigrant visa; (4) Visa overstay; (5) Undocumented (0) Under $20k; (1) $20k to $40k; (2) $40k to $60k; (1) $60k to $80k; (1) $80k to $100k; (5) Over $100k (0) Primary; (1) Some secondary; (2) High-school diploma; (3) College degree; (4) Graduate degree; (5) Other degree Part 2: Type Nationality (0) Male; (1) Female; (2) Individual [when gender unknown]; (3) Institution; (4) Government; (5) Other (0) American; (1) Latino; (2) Other; (3) Unknown Grievance (0) Debt; (1) Discrimination; (2) Domestic; (3) With the law Procedural mode (1) None (2) Adjudication [third party with authority to intervene, i.e. courts] (3) Arbitration [third party agreed to by principals] (4) Mediation [third party aiding principals reach an agreement] (5) Negotiation [two principals decide on settlement] (6) Coercion [imposition of outcome by unilateral threat or use of force] (7) Avoidance [terminate relationship / withdraw from situation] (8) Lumping it [“letting go” as of grievance] (9) Assumed fault* [structure grievance as occurring due to own situation/fault] (10)Talk back* [letting know of grievance without expecting further action] (11)Other * Data-driven codes, not included in predefined coding scheme
  • 8. Data Preparation • Types of data: – Nominal – Ordinal – Interval / Ratio Qualitative Data Gender: (0) Male, (1) Female, … Type of Grievance: (0) Debt, (1) Discrimination, … Chosen Procedure: (2) Adjudication, …(6) Coercion, … Income: (0) <$20k, (1) $20k-$40k, … Education: (0) primary , … (2) high school diploma, …
  • 9. Units of analysis: Cases ID Strata Part2 Part2Natlity Type ProcMode1ProcMode2ProcMode3Support1 Support2 1 WC Individual Unknown Debt Other None None None None 2 WC Institution American Debt NegotiationAvoidanceNone None None 3 WC Female American DiscriminationAssumed faultLumping itTalk back None None 4 WC Individual American DiscriminationOther None None None None 5 WC Male Latino Domestic Other NegotiationNone Other None 6 WC Female Latino Domestic NegotiationOther None Family None 7 WC Male Latino Domestic NegotiationNone None Family None 8 WC Government American Law NegotiationAssumed faultOther Family None 9 WC Male Latino Debt NegotiationLumping itOther Family Friend 10 WC Female Other Debt Talk back AvoidanceOther Family None 11 WC Institution American Debt AvoidanceOther None Friend None 12 WC Institution American Debt Other None None None None 13 WC Male Unknown Debt Assumed faultNegotiationNone Friend None 14 WC Male American DiscriminationLumping itNone None None None 15 WC Institution American DiscriminationOther None None Church None 16 WC Male Other DiscriminationLumping itOther None Family None 17 WC Other Latino Domestic NegotiationNone None None None 18 WC Female Other Domestic NegotiationNone None Other None 19 WC Female Other Domestic NegotiationOther None None None 20 WC Government American Law Assumed faultNone None None None 12 variables
  • 10. Cluster Analysis: Data Reduction • Transform qualitative data into binary data ID 1-Fem 1-Male 2-Fem 2-Male 2-Indiv 2-Govmnt 2-Instit 2-Other 2N-American WC-F-De-11-1 1 0 0 0 1 0 0 0 0 WC-F-De-11-2 1 0 0 0 0 0 1 0 1 WC-F-Di-11-3 1 0 1 0 1 0 0 0 1 WC-F-Di-11-4 1 0 0 0 1 0 0 0 1 WC-F-Do-11-6 1 0 1 0 1 0 0 0 0 WC-F-L-11-8 1 0 0 0 0 1 0 0 1 WC-M-De-45-9 0 1 0 1 1 0 0 0 0 WC-M-De-45-10 0 1 1 0 1 0 0 0 0 WC-M-De-45-11 0 1 0 0 0 0 1 0 1 WC-M-De-45-12 0 1 0 0 0 0 1 0 1 WC-M-De-45-13 0 1 0 1 1 0 0 0 0 WC-M-Di-45-14 0 1 0 1 1 0 0 0 1 WC-M-Di-45-15 0 1 0 0 0 0 1 0 1 WC-M-Do-45-18 0 1 1 0 1 0 0 0 0 WC-M-Do-45-19 0 1 1 0 1 0 0 0 0 WC-M-L-45-20 0 1 0 0 0 1 0 0 1 WC-M-O-45-21 0 1 0 0 0 0 1 0 1 BC-M-Do-29-22 0 1 0 0 1 0 0 0 0 BC-M-De-32-23 0 1 0 0 0 0 1 0 1 BC-M-De-32-24 0 1 0 1 1 0 0 0 0 59 binary variables
  • 11. Clustering decisions: variables • Variables to include – All relevant variables what is your question? • Variables to exclude – irrelevant variables that bias towards certain cluster solutions
  • 12. Clustering decisions: similarity • For binary data: Contingency Tables • Pay attention to the a, b, c and ds in your data: – Which are more common? – More meaningful?
  • 13. Example similarity measures aa+b+c+d=ap. 𝑅𝑅 𝑥, 𝑦 = 𝑎 𝑎+𝑏+𝑐+𝑑 [Russel and Rao] 𝑆𝑀 𝑥, 𝑦 = 𝑎+𝑑 𝑎+𝑏+𝑐+𝑑 [Simple Matching] 𝐽𝐴𝐶𝐶𝐴𝑅𝐷 𝑥, 𝑦 = 𝑎 𝑎+𝑏+𝑐 [Jaccard] 𝐷𝐼𝐶𝐸 𝑥, 𝑦 = 2𝑎 2𝑎+𝑏+𝑐 [Dice] 𝑆𝑆1 𝑥, 𝑦 = 2 𝑎+𝑑 2 𝑎+𝑑 +𝑏+𝑐 [Sokal and Sneath 1]
  • 14. Clustering decisions: linkage • Classification strategy – Hierarchical clustering • Good for “smaller” sizes (in the hundreds) • Allows choosing from many similarity measures • Randomize order, repeat, compare agglomerative divisive
  • 15. Clustering decisions: method • Linkage method: • NOT: centroid, median, or Ward • Between-groups linkage: d = smallest resulting avg cross-linkage distance • Within-groups: d = smallest resulting avg within linkage distance • Nearest neighbor(single linkage): d = smallest between two points • Furthest neighbor (complete linkage): d = largest between two points
  • 16. How This Looks in SPSS
  • 19. Methods Menu: Measure (BINARY), Cluster Method
  • 20. Statistics Menu: Cluster Membership (CHOOSE)
  • 21. Plots Menu: Select Dendogram / Icicle Plots [Optional]
  • 24. Results: Cluster Membership (as new variables)

Editor's Notes

  1. Clarify that some discussion can be found. However, it allows using distance measures that are non-euclidean.