Introduction to Text Analysis

2,226 views

Published on

Slides from my text analysis workshop at the MLA, a part of "Getting Started in the Digital Humanities with Help from DHCommons."

Published in: Education, Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,226
On SlideShare
0
From Embeds
0
Number of Embeds
70
Actions
Shares
0
Downloads
106
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Introduction to Text Analysis

  1. 1. Introduction to Text Analysis MLA Annual Convention Getting Started in the Digital Humanities January 9, 2014 Lauren F. Klein Georgia Institute of Technology lauren.klein@lmc.gatech.edu @laurenfklein
  2. 2. Introduction to Text Analysis • What is text analysis?
  3. 3. Introduction to Text Analysis • What is text analysis? • Why should you use it?
  4. 4. Introduction to Text Analysis • What is text analysis? • Why should you use it? • How do you use it? – Examples – Tools
  5. 5. What is Text Analysis?
  6. 6. What is Text Analysis? According to Geoffrey Rockwell: • “Text analysis systems can search large texts quickly. They do this by preparing electronic indexes to the text so that the computer does not have to read through the entire text. When finding words can be done so quickly that it is "interactive", it changes how you can work with the text - you can serendipitously explore without being frustrated by the slowness of the search process. • “Text analysis systems can conduct complex searches. Text analysis systems will often allow you to search for lists of words or for complex patterns of words. For example you can search for the co-occurrence of two words. • “Text analysis systems can present the results in ways that suit the study of texts. Text analysis systems can display the results in a number of ways; for example, a Keyword In Context display shows you all the occurrences of the found word with one line of context.” http://tada.mcmaster.ca/Main/WhatTA
  7. 7. http://www.wordle.net
  8. 8. http://www.wordle.net
  9. 9. Why Use Text Analysis?
  10. 10. Why Use Text Analysis? Geoff Rockwell, again: • • • “Text analysis tools aide the interpreter asking questions of electronic texts.” “Text analysis practices encourage reflection on the questions asked and formalization of queries.” “Text analysis is a way of targeting rereading that tests intuitions.”
  11. 11. Why Use Text Analysis? Geoff Rockwell, again: • • • “Text analysis tools aide the interpreter asking questions of electronic texts.” “Text analysis practices encourage reflection on the questions asked and formalization of queries.” “Text analysis is a way of targeting rereading that tests intuitions.”
  12. 12. Why Use Text Analysis? Geoff Rockwell, again: • • • “Text analysis tools aide the interpreter asking questions of electronic texts.” “Text analysis practices encourage reflection on the questions asked and formalization of queries.” “Text analysis is a way of targeting rereading that tests intuitions.” Ted Underwood: • “Proving a literary thesis with statistical analysis is often like cracking a nut with a jackhammer. You can do it: but the results are not necessarily better than you would get by hand.”
  13. 13. Why Use Text Analysis? Geoff Rockwell, again: • • • “Text analysis tools aide the interpreter asking questions of electronic texts.” “Text analysis practices encourage reflection on the questions asked and formalization of queries.” “Text analysis is a way of targeting rereading that tests intuitions.” Ted Underwood: • “Proving a literary thesis with statistical analysis is often like cracking a nut with a jackhammer. You can do it: but the results are not necessarily better than you would get by hand.” What I think (in the spirit of Movable Type): • Text analysis as “a way to tell a new story.”
  14. 14. How to Use Text Analysis?
  15. 15. Ben Blatt, http://www.slate.com/articles/arts/culturebox/2013/11/hunger_games_catching_fire_a_text
  16. 16. Sarah Lohman, http://www.fourpoundsflour.com/the-gallery-data-visualization-of-a-timeline-of-taste/
  17. 17. Daniel, http://lkleincourses.lmc.gatech.edu/dh12/2012/02/22/the-role-of-senses-in-a-studyin-scarlet/
  18. 18. Ted Underwood and Jordan Sellers, http://journalofdigitalhumanities.org/1-2/theemergence-of-literary-diction-by-ted-underwood-and-jordan-sellers/
  19. 19. Rob Nelson, http://dsl.richmond.edu/dispatch/
  20. 20. Matt Jockers, http://www.nbcnews.com/technology/data-mining-classics-makes-beautifulscience-954577
  21. 21. Matt Jockers, from Macroanalysis (Univ. of Illinois Press, 2013)
  22. 22. Lauren Klein, from “The Image of Absence” (American Literature 85.4)
  23. 23. Tools for Text Analysis • • • • • • • • Wordle Google Ngram Viewer IBM Many Eyes Voyant MONK (requires institutional access) MALLET Stanford’s Natural Language Processing Toolkit R
  24. 24. Google Ngram Viewer Google Ngram Viewer https://books.google.com/ngrams
  25. 25. IBM Many Eyes Many Eyes http://www-958.ibm.com/software/analytics/manyeyes/
  26. 26. Voyant Tools Voyant Tools http://voyant-tools.org/
  27. 27. MALLET MALLET http://mallet.cs.umass.edu/
  28. 28. Stanford NLP Toolkit Stanford NLP Toolkit http://nlp.stanford.edu/downloads/
  29. 29. R Programming Language R (programming language) http://www.r-project.org/
  30. 30. TAPoR TAPoR (Text Analysis PoRtal) http://tapor.ca/
  31. 31. More Lists of Tools • http://toolingup.stanford.edu/?page_id=367 • http://guides.library.upenn.edu/dhtextanalysi s • http://dirt.projectbamboo.org/categories/text -mining
  32. 32. Many Eyes Demo http://lkle.in/1bTr2eT
  33. 33. Voyant Tools Demo http://lkle.in/1e186zN

×