AmCAT workshop part 1

Transcript

  • 1. Text Analysis using AmCAT
      • Wouter van Atteveldt - Nel Ruigrok - Carina Jacobi
      • 8 July 2013
      • Sections: Introduction, I: AmCAT, II: Automatic Analysis, III: Manual Analysis, Wrap-up
  • 2. Workshop outline
      • Three sessions
          • 14:00 - 15:00 I: AmCAT: Introduction
          • 15:00 - 16:00 II: Automatic Analysis
          • 16:00 - 17:00 III: Manual Analysis
      • Session format
          • Presentation + hands-on
      • Material
          • Outline
          • Workbook
          • AmCAT Manual
  • 3. AmCAT: Introduction
      • The role of quantitative content analysis
      • What is AmCAT?
      • Getting started with AmCAT
  • 4. AmCAT: Introduction - Quantitative Content Analysis
      • Repeatable and valid
      • Latent or manifest, 'objective' or 'subjective'
  • 5. AmCAT: Introduction - Quantitative Content Analysis
      • Repeatable and valid
      • Latent or manifest, 'objective' or 'subjective'
      • Fixed units of measurement
      • Fixed values for variables
  • 6. AmCAT: Introduction - Manual and Automatic Analysis
      • Content analysis as measurement
      • Coding of symbolic content to numeric values
      • Interpretation / assignment of meaning
      • Automatic analysis: instruction to computer
      • Manual analysis: instruction and interpretation
  • 7. AmCAT: Introduction - Combination of analysis techniques
      1. Qualitative analysis (inductive)
      2. Automatic quantitative analysis
          • Measuring concepts under study
          • Finding interesting articles
      3. Manual quantitative analysis
          • Measuring concepts
      4. Qualitative analysis (interpretative)
  • 8. Getting started with AmCAT (ch. 1 and 2.2) - What is AmCAT?
      • Framework for text analysis
      • Developed at VU University
      • Open source, open data, open platform
      • Main functions:
          • Text storage
          • Automatic analysis
          • Manual analysis
  • 9. Getting started with AmCAT (ch. 1 and 2.2) - Key terminology
      • Project
      • Article
      • Article Set
      • Index
      • Coding Job, Coding Schema, Code Book
      • Navigator, Annotator, API
  • 10. Getting started with AmCAT (ch. 1 and 2.2) - Who can use it? What does it cost?
      • AmCAT source code is free and open source
      • VU University hosts amcat.vu.nl
          • Free to use for small projects (no support)
          • Contribution requested for larger projects (or just set up your own server)
      • Development is open but centrally controlled
  • 11. Getting started with AmCAT (ch. 1 and 2.2)
      • Creating an account
      • Creating a project
      • Uploading data
  • 12. Getting started with AmCAT (ch. 1 and 2.2) - Hands-on session 1
      • Create an account and project
      • Upload data
      • Until 15:00
  • 13. Finding Concepts in Texts using Keywords (ch. 4.1) - Automatic Analysis
      • Keywords as concept indicators
      • Making good keyword queries
      • Analysing and exporting results
  • 14. Finding Concepts in Texts using Keywords (ch. 4.1) - Concepts, words, and indicators
      • A dictionary is a list of:
          • Theoretical concepts
          • Operationalized using keywords
          • Which are indicators for the concept
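
The dictionary idea on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `DICTIONARY` contents, the function names, and the regex-based matching are all hypothetical and are not taken from AmCAT itself.

```python
import re

# Hypothetical dictionary: concept -> indicator keywords.
# A trailing * is treated as a wildcard, as in the workshop's query examples.
DICTIONARY = {
    "conflict": ["war*", "conflict"],
    "economy": ["unemploy*", "econom*"],
}


def keyword_to_regex(keyword):
    """Translate a keyword into a whole-word, case-insensitive regex."""
    if keyword.endswith("*"):
        pattern = r"\b" + re.escape(keyword[:-1]) + r"\w*"
    else:
        pattern = r"\b" + re.escape(keyword) + r"\b"
    return re.compile(pattern, re.IGNORECASE)


def concepts_in(text, dictionary=DICTIONARY):
    """Return the set of concepts with at least one keyword hit in the text."""
    return {
        concept
        for concept, keywords in dictionary.items()
        if any(keyword_to_regex(k).search(text) for k in keywords)
    }
```

Note that `war*` would also hit words such as "warm", which is exactly the kind of precision problem that has to be checked by reading the matched documents.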
  • 15. Finding Concepts in Texts using Keywords (ch. 4.1) - Example searches (interactive demo)
  • 16. Finding Concepts in Texts using Keywords (ch. 4.1) - AmCAT Query Syntax (see manual §4.2 and 'search syntax help' in AmCAT)
      • Connectors
          • war OR conflict
          • gaza AND war
          • gaza AND (war OR conflict)
          • (war OR conflict) NOT drugs
      • Wildcards
          • war*
      • Phrases
          • "war on drugs"
          • "war drugs"~5
          • "palest* (terror* OR suicide)"~10
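
To get a feel for what a proximity query such as "war drugs"~5 asks for, here is a toy Python sketch. It assumes a simplified reading (two terms at most a given number of token positions apart, in either order); the search engine behind AmCAT may count distance differently, and the function names here are hypothetical.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def matches(term, token):
    """True if the token matches the term; a trailing * matches any suffix."""
    term = term.lower()
    if term.endswith("*"):
        return token.startswith(term[:-1])
    return token == term


def near(text, term_a, term_b, distance):
    """True if a token matching term_a and a token matching term_b lie
    at most `distance` token positions apart (assumed semantics)."""
    tokens = tokenize(text)
    pos_a = [i for i, t in enumerate(tokens) if matches(term_a, t)]
    pos_b = [i for i, t in enumerate(tokens) if matches(term_b, t)]
    return any(abs(i - j) <= distance for i in pos_a for j in pos_b)
```

For example, `near("The war on illegal drugs continues", "war", "drugs", 5)` holds under this reading, while a text that mentions the two words many sentences apart does not.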
  • 17. Finding Concepts in Texts using Keywords (ch. 4.1) - Precision and recall
      • Query too broad:
          • Articles are found that don't contain the concept
          • Low precision
          • Narrow query using phrases, AND/NOT conditions
      • Query too narrow:
          • Articles are missed that do contain the concept
          • Low recall
          • Widen query using wildcards, OR synonyms
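
Precision and recall can be computed directly once you know which articles a query found and which articles really contain the concept (for instance from a manually coded sample). The sketch below uses the standard definitions; the function name and the article ids are made up for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a set of retrieved article ids,
    given the set of article ids that truly contain the concept."""
    retrieved, relevant = set(retrieved), set(relevant)
    true_positives = len(retrieved & relevant)
    precision = true_positives / len(retrieved) if retrieved else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: articles 1-4 really contain the concept.
relevant = {1, 2, 3, 4}
too_broad = {1, 2, 3, 4, 5, 6, 7, 8}   # finds everything, plus noise
too_narrow = {1, 2}                    # clean, but misses half

broad_scores = precision_recall(too_broad, relevant)    # (0.5, 1.0)
narrow_scores = precision_recall(too_narrow, relevant)  # (1.0, 0.5)
```

The broad query has low precision but full recall; the narrow query shows the opposite pattern.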
  • 18. Finding Concepts in Texts using Keywords (ch. 4.1) - Testing the effect of widening/narrowing a search
      • Search for narrow and for wide NOT narrow
      • After widening a search:
          • e.g. war → war*
          • Search for war* NOT war, and check resulting documents
      • After narrowing a search:
          • e.g. suicide → "suicide (bomb* OR attack*)"~10
          • Search for suicide NOT ("suicide (bomb* OR attack*)"~10)
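
The "wide NOT narrow" check amounts to a set difference between the two result sets. A minimal Python sketch, assuming a made-up four-article corpus and plain regular expressions standing in for the queries war and war*:

```python
import re

# Hypothetical mini-corpus: article id -> text.
ARTICLES = {
    1: "The war in Gaza escalated",
    2: "A warning was issued about warm weather",
    3: "Warlords met to discuss the wars",
    4: "Elections were held on Sunday",
}


def hits(pattern, articles=ARTICLES):
    """Return the ids of all articles whose text matches the regex."""
    regex = re.compile(pattern, re.IGNORECASE)
    return {doc_id for doc_id, text in articles.items() if regex.search(text)}


narrow = hits(r"\bwar\b")   # stands in for the query: war
wide = hits(r"\bwar\w*")    # stands in for the query: war*

# Stands in for the query: war* NOT war
added_by_widening = wide - narrow
```

Here `added_by_widening` contains articles 2 and 3. Reading them shows that article 3 is a useful addition, while article 2 is noise introduced by the wildcard.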
  • 19. Analysing and Exporting Results (ch. 4.3) - Analysing and exporting data in AmCAT (interactive demo)
      • Trends and patterns: tabulating data
      • You can export raw data, hits, and aggregate results
  • 20. Analysing and Exporting Results (ch. 4.3) - Hands-on session 2
      • Create search terms for actors and issues
      • Export results
      • Until 16:00
  • 21. Quantitative Content Analysis (ch. 5-7) - Quantitative manual content analysis
      • Measure specific, known concepts in specific textual units
      • Fixed unit of measurement
          • Sentence or article
      • Fixed sample of articles
      • Fixed variables and values
  • 22. Quantitative Content Analysis (ch. 5-7) - AmCAT jargon
      • Codebook: list of values for a single variable
      • Coding schema: list of variables and their values
      • Coding job: article set assigned to someone for coding using a specific schema
  • 23. Quantitative Content Analysis (ch. 5-7) - Codebooks
      • List of values (codes) for a single variable
      • Possible hierarchical organization
      • Comparable to SPSS value labels
      • Issues
          • Economy
              • Unemployment
              • Economic growth
              • ...
          • ...
  • 24. Quantitative Content Analysis (ch. 5-7) - Coding schemas
      • List of variables and their type
      • Field types in AmCAT:
          • Text
          • Number
          • Yes/No
          • Quality (-1 ... +1)
          • Codebook
      • Comparable to SPSS variables view
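
The relation between a codebook and a coding schema can be sketched as a small data structure. Everything below is hypothetical (class, variable, and function names are invented for illustration and do not reflect AmCAT's internal data model); only the five field types and the example issue hierarchy come from the slides.

```python
from dataclasses import dataclass, field


@dataclass
class Code:
    """One value in a codebook, optionally with more specific child codes."""
    label: str
    children: list = field(default_factory=list)

    def labels(self):
        """Yield this code's label and all labels below it."""
        yield self.label
        for child in self.children:
            yield from child.labels()


# Hierarchical codebook, following the example on the Codebooks slide.
ISSUES = Code("Issues", [
    Code("Economy", [Code("Unemployment"), Code("Economic growth")]),
])

# Hypothetical coding schema: variable name -> field type.
SCHEMA = {
    "summary": "text",
    "n_actors": "number",
    "mentions_eu": "yes/no",
    "tone": "quality",      # a value between -1 and +1
    "issue": ISSUES,        # values restricted to the codebook
}


def is_number(value):
    """True for ints and floats, but not for booleans."""
    return isinstance(value, (int, float)) and not isinstance(value, bool)


def is_valid(variable, value, schema=SCHEMA):
    """Check whether a coded value fits the field type of its variable."""
    field_type = schema[variable]
    if isinstance(field_type, Code):
        return value in set(field_type.labels())
    if field_type == "text":
        return isinstance(value, str)
    if field_type == "number":
        return is_number(value)
    if field_type == "yes/no":
        return isinstance(value, bool)
    if field_type == "quality":
        return is_number(value) and -1 <= value <= 1
    raise ValueError(f"Unknown field type: {field_type!r}")
```

Fixing the variables and their allowed values up front, as this structure does, is what makes manual codings comparable across coders and articles.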
  • 25. Manual analysis in AmCAT (ch. 10) - Manual content analysis in AmCAT (interactive demo)
      • Creating a codebook and coding schema
      • Selecting and assigning a sample to code
      • Coding
      • Exporting results
  • 26. Manual analysis in AmCAT (ch. 10) - Hands-on session 3
      • Create codebook, coding schema, coding job
      • Code articles and export results
      • Until 16:55
  • 27. Wrap-up: conclusions and road ahead? - What can AmCAT do for me? / What can I do for AmCAT?
      • Use AmCAT for your coding or storage needs
      • Install your own AmCAT server
      • Improve AmCAT by reporting bugs, providing plugins, adding features
      • Write and share (R) scripts/libraries for text-based analyses
  • 28. Wrap-up: conclusions and road ahead? - Programme for Thursday
      • Proposal: advanced automatic analysis:
          • Tools for improving queries: cluster maps, keyword in context
          • Vocabulary analysis and LDA using the R API
          • Semantic network analysis: keyword association, grammatical analysis