SlideShare a Scribd company logo
1 of 16
The New Past, and a Speculative Future, of Literature:
A Brief Discussion of Two Text Analysis Tools
Nat Gustafson-Sundell
Minnesota State University, Mankato
OpenResearch.weebly.com
1
Franco Moretti
“Writing about comparative social history, Marc Bloch once coined a lovely ‘slogan,’ as he himself
called it: ‘years of analysis for a day of synthesis’; and if you read Braudel or Wallerstein you
immediately see what Bloch had in mind. The text which is strictly Wallerstein’s, his ‘day of
synthesis’, occupies one-third of a page … the rest are quotations … Years of analysis; other
people’s analysis, which Wallerstein’s page synthesizes into a system.
Note, if we take this model seriously, the study of world literature will somehow have to
reproduce this ‘page’ – which is to say: this relationship between analysis and synthesis – for the
literary field. But in that case, literary history will quickly become very different from what it is
now: it will become ‘second hand’: a patchwork of other people’s research, without a single
direct textual reading. Still ambitious, and actually even more so than before (world literature!);
but the ambition is now directly proportional to the distance from the text: the more ambitious
the project, the greater must the distance be.” (Moretti 47-8, 2000)
“Distant reading: where distance … is a condition of knowledge: it
allows you to focus on units that are much smaller or much larger
than the text: devices, themes, tropes – or genres and systems.
And if, between the very small and the very large, the text itself
disappears, well, it is one of those cases when one can justifiably
say, Less is more. It we want to understand the system in its
entirety, we must accept losing something…” (Moretti 48-9, 2000)
2
Matthew Jockers
“The literary scholar of the twenty-first century can no longer be content with
anecdotal evidence, with random ‘things’ generated from a few , even
‘representative’ texts. We must strive to understand these things in the
context of everything else, including a mass of possibly ‘uninteresting’ texts.”
(Jockers 8)
“At the macro scale , we see evidence of time and gender influences on theme and
style. By superimposing these two network snapshots in our minds, we can begin
to imagine a larger context in which to read and study nineteenth-century
literature. What is clear is that the books we have traditionally studied are not
isolated books. The canonical greats are not even outliers: they are books that are
similar to other books…” (Jockers 168)
“It is the exact interplay between the macro and micro scale that promises a new, enhanced, and perhaps even
better understanding of the literary record. The two approaches work in tandem and inform each other.
Human interpretation of the ‘data,’ whether it be mined at the macro or micro level, remains essential … The
most fundamental and important difference in the two approaches is that the macroanalytic approach reveals
details about texts that are for all intents and purposes unavailable to close-readers of the texts.” (Jockers
online)
3
“The value of the computer-mediated exercises is that they enable readers to
readily perceive and appreciate features that are not obvious in a conventional
reading of a printed text.” (Irizarry 155, 1996)
“The computer is, among other things, an instrument uniquely suited to play
activities ...” (Irizarry 156, 1996)
“Assembling and disassembling a text, like playing with blocks
of Lego, may not necessarily contribute immediately to its
understanding, but it is likely to contribute to the aggregate
experience of the text in valuable ways. … I am suggesting
that play is an integral part of a humanist’s interpretive
activities…” (Sinclair 181, 2003)
“Playful experimentation is a pragmatic approach of trying something, seeing if you
obtain interesting results, and if you do, then trying to theorize why those results are
interesting rather than starting from articulated principles.” (Rockwell 214, 2003)
Play
4
http://voyant-tools.org/ 5
Word Trends 6
Johnny
Johnny, Dave, Doris
Johnny, Dave, Doris, Mildred, Arrow
Collocate Clusters 7
Johnny, Dave, Doris, Thought
Johnny, Dave, Doris, Thought, Strange
Collocate Clusters 8
Topic Modeling
(Blei 78)
9
http://code.google.com/p/topic-modeling-tool/ 10
Topics in Documents: % of Topics in All 390 Documents
11
For Topic 1: Top 25 Documents in Topic 1
In the arrangement of poems, what is the topic trend? What can we learn about arrangement in this book?
How often is this topic the “dominant” topic? What topics are most common across documents, or most rare?
What topics tend to dominate? What topics tend to be subordinate?
Does this topic relate to certain topics more than others?
12
Imagine
Texts
Constructed
Only
To
Be
Read
At
A
Distance
Imagine
Texts
Topic In Doc 1
Reading 55%
Distance 38%
Imagine texts constructed only to be read at a distance.
Read
13
14
http://www.saic.edu/webspaces/portal/degrees_resources/departments/writing/DN
SP11_SeaandSparBetween/index.html
Read
Read
15
16
Works Cited
Blei, David. "Probabilistic Topic Models." Communications of the ACM 55.4 (2012): 77-84. Web.
Brett, Megan. "Topic Modeling: A Basic Introduction." Journal of Digital Humanities 2.1 (2012): 12-16. Web.
Irizarry, Estelle. "Tampering with the Text to Increase Awareness of poetry’s Art: Theory and Practice with a
Hispanic Perspective." Literary and Linguistic Computing 11 (1996): 155-162. Print.
Jockers, Matthew Lee. Macroanalysis: Digital Methods and Literary History. University of Illinois Press, 2013.
Print.
Moretti, Franco. Distant Reading. London: Verso, 2013. Print.
Rockwell, Geoffrey. "What is Text Analysis, really?" Literary and Linguistic Computing 18.2 (2003): 209-19.
Web.
Samuels, Lisa, and Jerome J. McGann. "Deformance and Interpretation." New Literary History 30.1 (1999):
25-56. Web.
Sinclair, Stefan. "Computer-Assisted Reading: Reconceiving Text Analysis." Literary and Linguistic Computing
18.2 (2003): 175-84. Web.

More Related Content

Viewers also liked

The impact of innovation on travel and tourism industries (World Travel Marke...
The impact of innovation on travel and tourism industries (World Travel Marke...The impact of innovation on travel and tourism industries (World Travel Marke...
The impact of innovation on travel and tourism industries (World Travel Marke...Brian Solis
 
Open Source Creativity
Open Source CreativityOpen Source Creativity
Open Source CreativitySara Cannon
 
Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)maditabalnco
 
The Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post FormatsThe Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post FormatsBarry Feldman
 
The Outcome Economy
The Outcome EconomyThe Outcome Economy
The Outcome EconomyHelge Tennø
 

Viewers also liked (6)

Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job? Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
 
The impact of innovation on travel and tourism industries (World Travel Marke...
The impact of innovation on travel and tourism industries (World Travel Marke...The impact of innovation on travel and tourism industries (World Travel Marke...
The impact of innovation on travel and tourism industries (World Travel Marke...
 
Open Source Creativity
Open Source CreativityOpen Source Creativity
Open Source Creativity
 
Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)
 
The Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post FormatsThe Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post Formats
 
The Outcome Economy
The Outcome EconomyThe Outcome Economy
The Outcome Economy
 

Similar to The New Past, and a Speculative Future, of Literature: A Brief Discussion of Two Text Analysis Tools

Digital Humanities and Computer Assisted Literary Criticism
Digital Humanities and Computer Assisted Literary CriticismDigital Humanities and Computer Assisted Literary Criticism
Digital Humanities and Computer Assisted Literary CriticismDilip Barad
 
Med 505 Seminar on Online Technologies
Med 505 Seminar on Online TechnologiesMed 505 Seminar on Online Technologies
Med 505 Seminar on Online TechnologiesErkan Saka
 
Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener...
 Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener... Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener...
Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener...English Literature and Language Review ELLR
 
MacroMicroZoom.pdf
MacroMicroZoom.pdfMacroMicroZoom.pdf
MacroMicroZoom.pdfMartin Wynne
 
Digital, Humanities, Latour and Networks. By Moses A. Boudourides
Digital, Humanities, Latour and Networks. By Moses A. BoudouridesDigital, Humanities, Latour and Networks. By Moses A. Boudourides
Digital, Humanities, Latour and Networks. By Moses A. BoudouridesMoses Boudourides
 
Introduction to Electracy
Introduction to ElectracyIntroduction to Electracy
Introduction to ElectracyRichard Smyth
 
Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)
Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)
Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)Jonathan Underwood
 
Sample chicago paper
Sample chicago paperSample chicago paper
Sample chicago papergilcreastj
 
Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...
Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...
Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...Jonathan Underwood
 
MOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docx
MOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docxMOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docx
MOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docxrosemarybdodson23141
 
Pinterest and the Crisis of Paratext - Handout
Pinterest and the Crisis of Paratext - HandoutPinterest and the Crisis of Paratext - Handout
Pinterest and the Crisis of Paratext - HandoutItalo Marconi
 
Notational systems and cognitive evolution
Notational systems and cognitive evolutionNotational systems and cognitive evolution
Notational systems and cognitive evolutionJeff Long
 
Foundations camb july 2012
Foundations camb july 2012Foundations camb july 2012
Foundations camb july 2012Brendan Larvor
 
evted Perspectives on Ergodic Literature Espen J. Aarset
evted Perspectives on Ergodic Literature Espen J. Aarsetevted Perspectives on Ergodic Literature Espen J. Aarset
evted Perspectives on Ergodic Literature Espen J. AarsetBetseyCalderon89
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Toby Burrows
 
Literary criticism
Literary criticism Literary criticism
Literary criticism marcialzsara
 

Similar to The New Past, and a Speculative Future, of Literature: A Brief Discussion of Two Text Analysis Tools (20)

Digital Humanities and Computer Assisted Literary Criticism
Digital Humanities and Computer Assisted Literary CriticismDigital Humanities and Computer Assisted Literary Criticism
Digital Humanities and Computer Assisted Literary Criticism
 
Med 505 Seminar on Online Technologies
Med 505 Seminar on Online TechnologiesMed 505 Seminar on Online Technologies
Med 505 Seminar on Online Technologies
 
Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener...
 Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener... Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener...
Elements of Dystopian Science Fiction in David Mitchell's Cloud Atlas: Gener...
 
MacroMicroZoom.pdf
MacroMicroZoom.pdfMacroMicroZoom.pdf
MacroMicroZoom.pdf
 
MDST 3703 F10 Seminar 2
MDST 3703 F10 Seminar 2MDST 3703 F10 Seminar 2
MDST 3703 F10 Seminar 2
 
Digital, Humanities, Latour and Networks. By Moses A. Boudourides
Digital, Humanities, Latour and Networks. By Moses A. BoudouridesDigital, Humanities, Latour and Networks. By Moses A. Boudourides
Digital, Humanities, Latour and Networks. By Moses A. Boudourides
 
Introduction to Electracy
Introduction to ElectracyIntroduction to Electracy
Introduction to Electracy
 
Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)
Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)
Chicago Manual of Style Sample Paper - Online Writing Lab (OWL)
 
Sample chicago paper
Sample chicago paperSample chicago paper
Sample chicago paper
 
Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...
Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...
Chicago Manual of Style 17th Edition Notes and Bibliography Sample Paper - Pr...
 
MOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docx
MOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docxMOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docx
MOVING NETWORKS” INTO THE COMPOSITION CLASSROOM.docx
 
Pinterest and the Crisis of Paratext - Handout
Pinterest and the Crisis of Paratext - HandoutPinterest and the Crisis of Paratext - Handout
Pinterest and the Crisis of Paratext - Handout
 
Notational systems and cognitive evolution
Notational systems and cognitive evolutionNotational systems and cognitive evolution
Notational systems and cognitive evolution
 
Foundations camb july 2012
Foundations camb july 2012Foundations camb july 2012
Foundations camb july 2012
 
evted Perspectives on Ergodic Literature Espen J. Aarset
evted Perspectives on Ergodic Literature Espen J. Aarsetevted Perspectives on Ergodic Literature Espen J. Aarset
evted Perspectives on Ergodic Literature Espen J. Aarset
 
Miki
MikiMiki
Miki
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...
 
Literary criticism
Literary criticism Literary criticism
Literary criticism
 
Opening the book
Opening the bookOpening the book
Opening the book
 
BA Thesis
BA ThesisBA Thesis
BA Thesis
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

The New Past, and a Speculative Future, of Literature: A Brief Discussion of Two Text Analysis Tools

  • 1. The New Past, and a Speculative Future, of Literature: A Brief Discussion of Two Text Analysis Tools Nat Gustafson-Sundell Minnesota State University, Mankato OpenResearch.weebly.com 1
  • 2. Franco Moretti “Writing about comparative social history, Marc Bloch once coined a lovely ‘slogan,’ as he himself called it: ‘years of analysis for a day of synthesis’; and if you read Braudel or Wallerstein you immediately see what Bloch had in mind. The text which is strictly Wallerstein’s, his ‘day of synthesis’, occupies one-third of a page … the rest are quotations … Years of analysis; other people’s analysis, which Wallerstein’s page synthesizes into a system. Note, if we take this model seriously, the study of world literature will somehow have to reproduce this ‘page’ – which is to say: this relationship between analysis and synthesis – for the literary field. But in that case, literary history will quickly become very different from what it is now: it will become ‘second hand’: a patchwork of other people’s research, without a single direct textual reading. Still ambitious, and actually even more so than before (world literature!); but the ambition is now directly proportional to the distance from the text: the more ambitious the project, the greater must the distance be.” (Moretti 47-8, 2000) “Distant reading: where distance … is a condition of knowledge: it allows you to focus on units that are much smaller or much larger than the text: devices, themes, tropes – or genres and systems. And if, between the very small and the very large, the text itself disappears, well, it is one of those cases when one can justifiably say, Less is more. It we want to understand the system in its entirety, we must accept losing something…” (Moretti 48-9, 2000) 2
  • 3. Matthew Jockers “The literary scholar of the twenty-first century can no longer be content with anecdotal evidence, with random ‘things’ generated from a few , even ‘representative’ texts. We must strive to understand these things in the context of everything else, including a mass of possibly ‘uninteresting’ texts.” (Jockers 8) “At the macro scale , we see evidence of time and gender influences on theme and style. By superimposing these two network snapshots in our minds, we can begin to imagine a larger context in which to read and study nineteenth-century literature. What is clear is that the books we have traditionally studied are not isolated books. The canonical greats are not even outliers: they are books that are similar to other books…” (Jockers 168) “It is the exact interplay between the macro and micro scale that promises a new, enhanced, and perhaps even better understanding of the literary record. The two approaches work in tandem and inform each other. Human interpretation of the ‘data,’ whether it be mined at the macro or micro level, remains essential … The most fundamental and important difference in the two approaches is that the macroanalytic approach reveals details about texts that are for all intents and purposes unavailable to close-readers of the texts.” (Jockers online) 3
  • 4. “The value of the computer-mediated exercises is that they enable readers to readily perceive and appreciate features that are not obvious in a conventional reading of a printed text.” (Irizarry 155, 1996) “The computer is, among other things, an instrument uniquely suited to play activities ...” (Irizarry 156, 1996) “Assembling and disassembling a text, like playing with blocks of Lego, may not necessarily contribute immediately to its understanding, but it is likely to contribute to the aggregate experience of the text in valuable ways. … I am suggesting that play is an integral part of a humanist’s interpretive activities…” (Sinclair 181, 2003) “Playful experimentation is a pragmatic approach of trying something, seeing if you obtain interesting results, and if you do, then trying to theorize why those results are interesting rather than starting from articulated principles.” (Rockwell 214, 2003) Play 4
  • 7. Johnny Johnny, Dave, Doris Johnny, Dave, Doris, Mildred, Arrow Collocate Clusters 7
  • 8. Johnny, Dave, Doris, Thought Johnny, Dave, Doris, Thought, Strange Collocate Clusters 8
  • 11. Topics in Documents: % of Topics in All 390 Documents 11
  • 12. For Topic 1: Top 25 Documents in Topic 1 In the arrangement of poems, what is the topic trend? What can we learn about arrangement in this book? How often is this topic the “dominant” topic? What topics are most common across documents, or most rare? What topics tend to dominate? What topics tend to be subordinate? Does this topic relate to certain topics more than others? 12
  • 13. Imagine Texts Constructed Only To Be Read At A Distance Imagine Texts Topic In Doc 1 Reading 55% Distance 38% Imagine texts constructed only to be read at a distance. Read 13
  • 14. 14
  • 16. 16 Works Cited Blei, David. "Probabilistic Topic Models." Communications of the ACM 55.4 (2012): 77-84. Web. Brett, Megan. "Topic Modeling: A Basic Introduction." Journal of Digital Humanities 2.1 (2012): 12-16. Web. Irizarry, Estelle. "Tampering with the Text to Increase Awareness of poetry’s Art: Theory and Practice with a Hispanic Perspective." Literary and Linguistic Computing 11 (1996): 155-162. Print. Jockers, Matthew Lee. Macroanalysis: Digital Methods and Literary History. University of Illinois Press, 2013. Print. Moretti, Franco. Distant Reading. London: Verso, 2013. Print. Rockwell, Geoffrey. "What is Text Analysis, really?" Literary and Linguistic Computing 18.2 (2003): 209-19. Web. Samuels, Lisa, and Jerome J. McGann. "Deformance and Interpretation." New Literary History 30.1 (1999): 25-56. Web. Sinclair, Stefan. "Computer-Assisted Reading: Reconceiving Text Analysis." Literary and Linguistic Computing 18.2 (2003): 175-84. Web.

Editor's Notes

  1. From a topic modeling (LDA) perspective, a text consists of some number of topics, each of which makes up some percent of the text. A topic can be thought of as a “bag of words.” We can think of a text as resulting from a number of random drawings from those bags of words based on the percentage allocation of topics (and the numbers of various words in those bags will dependon the percentage allocation of words within those topics).“One way to think about how the process of topic modeling works is to imagine working though an article with a set of highlighters. As you read through the article, you use a different color for the key words of themes within the paper as you come across them. When you were done, you could copy out the words as grouped by the color you assigned them. That list of words is a topic, and each color represents a different topic. Note: this description is inspired by the following illustration from David Blei’saricle, which is one of the best visual representation of a topic I’ve found.” (Brett 12)My caveat: the computer does not know the meanings of the words. The algorithm finds topics based on the co-occurrence of the words: “They look like ‘topics’ because terms that frequently occur together tend to be about the same subject” (Blei 9)