From Literary and Linguistic Computing to Digital Humanities: a look back at 40 years of relations between the humanities and computing

By Lou Burnard. All rights reserved

Published in: Education

  1. 1. Beyond the Document Lou Burnard
  2. 2. The message <ul><li>The metaphor of the digital book is so pervasive that we can barely see it. </li></ul><ul><li>But going digital is not only about producing cheaper and more accessible simulations of printed or painted pages. </li></ul><ul><li>Digital applications should enable us to do more with a text than simply read it from beginning to end, or attach annotations to it for others to read, or link it to other digital texts </li></ul><ul><li>We are at last moving beyond the document, towards a distributed world, in which “the books in the library can talk to each other” </li></ul>
  3. 3. Plan <ul><li>What's that noise in the digital library? </li></ul><ul><li>From Literary and Linguistic Computing to Humanities Computing to Digital Humanities </li></ul><ul><li>A classical case study </li></ul><ul><li>What should we be proud of? </li></ul>
  4. 4. Three simple truths <ul><li>There is no going back : the knowledge infrastructure is now irrevocably digital </li></ul><ul><li>The business models of the knowledge infrastructure have changed irrevocably </li></ul><ul><li>The quantitative changes facilitated by digital technologies approximate qualitative change </li></ul>
  5. 5. Irrevocable digitality <ul><li>The objects of Humanities scholarship are now digital, even if its methods are not </li></ul><ul><li>And our methods are changing all around us... </li></ul><ul><li>We are moving from hyper text to hyper data </li></ul><ul><li>From a web of documents to a web of data </li></ul><ul><ul><li>The technology is here (more or less) </li></ul></ul><ul><ul><li>The problems are mostly socio-politico-cultural </li></ul></ul><ul><li>But first, a little history lesson </li></ul>
  6. 6. Literary & Linguistic Computing
  7. 7. 1960-1980 <ul><li>The Heroic age... </li></ul><ul><ul><li>Father Busa and the Index Thomisticus </li></ul></ul><ul><ul><li>The Brown Corpus </li></ul></ul><ul><ul><li>Thesaurus Linguae Graecae </li></ul></ul><ul><li>concordances, stylistic analysis, authorship studies, language corpora </li></ul><ul><li>technical barriers, impenetrable for all but the determined (or mad) </li></ul>
  8. 8. LLC is also a journal, and an annual conference <ul><li>http://llc.oxfordjournals.org/ </li></ul>
  9. 9. LLC is alive and well and living in France <ul><li>Text as a statistical phenomenon </li></ul><ul><li>Factor analysis and data mining </li></ul><ul><li>Textométrie </li></ul>
  10. 10. Humanities Computing
  11. 11. 1980-1994 <ul><li>Institutionalization </li></ul><ul><li>Is Humanities Computing an Academic Discipline? </li></ul><ul><li>The “text encoding” project </li></ul>
  12. 12. Institutionalization <ul><li>http://www.allc.org/imhc </li></ul>
  13. 13. The rise of the HC centre <ul><li>In the home, the eighties was a decade of technology that nearly worked </li></ul><ul><li>In academia, digital methods and resources, though perceived as alien and difficult, were also finding their place </li></ul><ul><li>In the UK </li></ul><ul><ul><li>Computers in Teaching Initiative </li></ul></ul><ul><ul><li>Arts and Humanities Data Service </li></ul></ul><ul><li>Something new, or something old done better? </li></ul>
  14. 14. Communities <ul><li>E-mail and e-mail lists: Humanist </li></ul><ul><li>Electronic Text paradigms </li></ul><ul><ul><li>Oxford Text Archive </li></ul></ul><ul><ul><li>Project Gutenberg </li></ul></ul><ul><li>NLP (TALN) </li></ul><ul><li>Public funding becomes important </li></ul><ul><ul><li>Computers in Teaching Initiative (CTI) </li></ul></ul><ul><li>And private enterprise is curious </li></ul><ul><ul><li>Electronic Publishing SIG </li></ul></ul>
  15. 15. The challenge for HC <ul><li>Once we have made our digital surrogates, what then? </li></ul><ul><li>Traditions (“scholarly primitives”) </li></ul><ul><ul><li>finding by means of external characteristics </li></ul></ul><ul><ul><li>analysing by means of internal features </li></ul></ul><ul><ul><li>associating by means of shared perceptions </li></ul></ul><ul><li>What tools and methods will help combine these approaches? </li></ul><ul><li>What theory will inform their application? </li></ul>
  16. 16. Resources digital resources encoding analysis abstract model
  17. 17. Transmitting our interpretations <ul><li>scholarship depends on continuity </li></ul><ul><li>it is not enough to preserve the bytes of an encoding </li></ul><ul><li>there must also be a continuity of comprehension: the encoding must be self-descriptive </li></ul>
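One way to picture a "self-descriptive" encoding is a file that carries its own account of how it was encoded, readable by the same tools that read the text. The following is only a minimal sketch under stated assumptions: the sample document and its wording are invented for illustration, and the choice of the TEI header elements `titleStmt` and `encodingDesc` is one possible reading of the slide, not something the talk specifies.

```python
import xml.etree.ElementTree as ET

# The TEI namespace; the prefix "tei" is just a local shorthand.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical sample document, invented for this illustration.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>A hypothetical sample letter</title></titleStmt>
      <publicationStmt><p>Unpublished illustration.</p></publicationStmt>
      <sourceDesc><p>Invented for this example.</p></sourceDesc>
    </fileDesc>
    <encodingDesc>
      <p>Place names are tagged with placeName; line breaks are not recorded.</p>
    </encodingDesc>
  </teiHeader>
  <text><body><p>Sent from <placeName>Delos</placeName>.</p></body></text>
</TEI>"""


def describe(xml_text):
    """Return the title and the encoding notes stored in the document itself."""
    root = ET.fromstring(xml_text)
    title = root.findtext(".//tei:titleStmt/tei:title", namespaces=TEI_NS)
    notes = [p.text for p in root.findall(".//tei:encodingDesc/tei:p", TEI_NS)]
    return {"title": title, "encoding_notes": notes}


info = describe(SAMPLE)
print(info["title"])
for note in info["encoding_notes"]:
    print("-", note)
```

The point of the sketch is that a later reader needs nothing beyond the file to learn which distinctions the encoder chose to record and which were deliberately left out.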
  18. 18. TEI: the main achievement of HC? <ul><li>Originally a response to the multiplicity of formats and lack of standards </li></ul><ul><li>The TEI emerged as a single, encyclopaedic model of the “significant particularities” of textual resources </li></ul><ul><li>And also an adaptable architecture able to respond to changing needs and priorities </li></ul>
  19. 19. Digital Humanities
  20. 20. 1995 - ? <ul><li>While we were talking about the theory.... </li></ul><ul><ul><li>digital libraries </li></ul></ul><ul><ul><li>mass digitization </li></ul></ul><ul><ul><li>commodity computing, folksonomies, cloud computing... </li></ul></ul><ul><li>Convergence and collaboration </li></ul><ul><ul><li>rethinking scholarly editing </li></ul></ul><ul><ul><li>redefining the discipline </li></ul></ul><ul><li>New infrastructures? </li></ul>
  21. 21. The rise of the digital library <ul><li>“Public good” digitization efforts </li></ul><ul><ul><li>From Gallica to JISC Digitisation Programme </li></ul></ul><ul><li>The metadata challenge </li></ul><ul><ul><li>Authority and link-rot: Resource Discovery Network to Intute </li></ul></ul><ul><ul><li>From Dublin Core to OAI-PMH </li></ul></ul><ul><ul><li>Can systems be self-organizing? </li></ul></ul><ul><li>What is the right business model? </li></ul>
  22. 22. An alternative model <ul><li>What works for software could work equally well for digital resources </li></ul>When programmers can read, redistribute, and modify the source code for a piece of software, the software evolves. People improve it, people adapt it, people fix bugs. When developers can access, redistribute, and enhance the digital resources underlying a digital application, new applications can evolve. People can add value, people can adapt it, people can fix bugs.
  23. 23. Open up the data warehouse!
  24. 24. Digital humanities manifesto 2.0 Digital Humanities is not a unified field but an array of convergent practices that explore a universe in which: a) print is no longer the exclusive or the normative medium in which knowledge is produced and/or disseminated; instead, print finds itself absorbed into new, multimedia configurations; and b) digital tools, techniques, and media have altered the production and dissemination of knowledge in the arts, human and social sciences. http://dev.cdh.ucla.edu/digitalhumanities/2009/05/29/the-digital-humanities-manifesto-20/#0
  25. 25. ibid... Digital Humanities implies the multi-purposing and multiple channeling of humanistic knowledge : no channel excludes the other. Its economy is abundance based, not one based upon scarcity .... though notions of humanistic research are everywhere under institutional pressure, there is (potentially) plenty for all. And, indeed, there is plenty to do.
  26. 26. The importance of not reading <ul><li>“What can you do with a million books?” (Greg Crane) </li></ul><ul><li>“Although there is still a need for close-reading... we never don't not read” (John Unsworth) </li></ul><ul><li>A new synergy of methods: </li></ul><ul><ul><li>Corpus linguistics </li></ul></ul><ul><ul><li>Pattern recognition </li></ul></ul><ul><ul><li>Data mining </li></ul></ul>http://www3.isrl.illinois.edu/~unsworth/hownot2read.html
  27. 27. How to not read <ul><li>We need to find ways of cross-searching, decomposing, and re-composing </li></ul><ul><ul><li>rich XML documents </li></ul></ul><ul><ul><li>complex relational database structures </li></ul></ul><ul><ul><li>simple presentation-focussed websites </li></ul></ul><ul><ul><li>sound, image, video... </li></ul></ul><ul><li>The challenge is to do this in an open and standards-compliant manner </li></ul><ul><li>And on a massive scale </li></ul>
  28. 28. Escaping from the text <ul><li>From footnote to hypertext </li></ul>
  29. 30. A classical case study
  30. 31. CLAROS, for example <ul><li>(Current) Partners </li></ul><ul><ul><li>University of Oxford: Faculty of Classics </li></ul></ul><ul><ul><ul><li>Beazley Archive: documentation of pottery, jewels, etc. </li></ul></ul></ul><ul><ul><ul><li>Lexicon of Greek Personal Names: attested names </li></ul></ul></ul><ul><ul><li>University of Cologne </li></ul></ul><ul><ul><ul><li>Arachne Archive: data about sculpture </li></ul></ul></ul><ul><ul><li>German Archaeological Institute, Berlin </li></ul></ul><ul><ul><ul><li>Images from archaeological sites </li></ul></ul></ul><ul><ul><li>University of Paris X </li></ul></ul><ul><ul><ul><li>Lexicon Iconographicum Mythologiae Classicae: </li></ul></ul></ul><ul><li>Over 2 million records and images </li></ul><ul><li>Four different database systems </li></ul>
  31. 32. A mix of technologies... (diagram) Sources: Beazley Archive; DAI Arachne; LGPN (Oxford); LIMC (Paris). Software: .NET / ASP; XSLT, PHP; Java; XSLT. Storage: relational database (MS SQL Server); XML database; relational database (MySQL); relational database (MySQL). Delivery: browser, in each case.
  32. 33. ...but a common conceptual model
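The idea of several differently built databases sharing one conceptual model can be sketched in a few lines. This is an illustration only, not a description of how CLAROS is implemented: the `CommonRecord` type, the field names, and the sample rows are all hypothetical, invented to show the pattern of mapping each source into one shared shape and then searching that shape once.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CommonRecord:
    """Hypothetical shared model: the minimum every source can supply."""
    source: str
    identifier: str
    label: str


# Invented rows standing in for two sources with different native schemas.
pottery_rows = [{"VaseNumber": "B-001", "Description": "Athenian red-figure cup"}]
sculpture_rows = [{"id": 17, "short_description": "Marble statue of Athena"}]


def from_pottery(row):
    # Map the first source's field names onto the shared model.
    return CommonRecord("pottery database", str(row["VaseNumber"]), row["Description"])


def from_sculpture(row):
    # Map the second source's field names onto the same shared model.
    return CommonRecord("sculpture database", str(row["id"]), row["short_description"])


def search(records, term):
    """Case-insensitive substring search over the shared label field."""
    term = term.lower()
    return [r for r in records if term in r.label.lower()]


records = [from_pottery(r) for r in pottery_rows]
records += [from_sculpture(r) for r in sculpture_rows]

hits = search(records, "athen")
for hit in hits:
    print(hit.source, hit.identifier, hit.label)
```

Each source keeps its own storage and software; only the small mapping functions need to know about the differences, which is what lets one query run across all of them.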
  33. 34. How does it actually work?
  34. 35. What makes this possible? <ul><li>It's not rocket science! </li></ul><ul><li>XML markup with a shared semantics (TEI) </li></ul><ul><li>Appropriate use of new technologies (e.g. Unicode, JavaScript) </li></ul><ul><li>A willingness to open up our data </li></ul>
  35. 36. Rethinking the digital edition <ul><li>The insights of critical editing/edition philology need to be re-discovered and re-applied in the new context </li></ul><ul><li>We need a new synergy of semiotics and hermeneutics </li></ul><ul><li>Combined with the traditional virtues of skepticism and empiricism </li></ul>
  36. 37. Components of the digital edition <ul><li>Manuscript page images </li></ul><ul><li>Annotated transcriptions </li></ul><ul><li>Critical (synthetic) edition </li></ul><ul><li>Modern translation and summary </li></ul><ul><li>Notes, glossary, foreword, bibliography, etc. </li></ul><ul><li>Manuscript descriptions and metadata </li></ul><ul><li>“Factoids” about the real world </li></ul>
  37. 38. The textual trinity <ul><li>Textual descriptions tend to focus on one of: </li></ul><ul><ul><li>its linguistic nature (because texts are made of words used in particular ways) </li></ul></ul><ul><ul><li>its physical state (because texts are made up of glyphs arranged in particular ways) </li></ul></ul><ul><ul><li>its intentions (because texts are supposed to tell us something about the world) </li></ul></ul><ul><li>Likewise, software tends to distinguish </li></ul><ul><ul><li>document management and production systems </li></ul></ul><ul><ul><li>image management and production systems </li></ul></ul><ul><ul><li>database systems </li></ul></ul>
  38. 39. Convergence <ul><li>But the digital agenda requires us to mash these things up: for example to combine </li></ul><ul><ul><li>a GIS database about places in the Aegean sea </li></ul></ul><ul><ul><li>a historical gazetteer of placenames in the same area </li></ul></ul><ul><ul><li>a corpus of texts mentioning those placenames </li></ul></ul><ul><li>TEI has recently expanded its scope to support this kind of convergence </li></ul>
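The mash-up described on this slide can be sketched as a join between tagged place names in a corpus and a gazetteer that supplies coordinates. Everything concrete here is an assumption made for illustration: the two text fragments are invented, the gazetteer is a plain dictionary, and the coordinates are approximate values included only to make the join visible. The only part taken from the TEI itself is the `placeName` element and the TEI namespace.

```python
import xml.etree.ElementTree as ET

# Clark-notation prefix for elements in the TEI namespace.
TEI = "{http://www.tei-c.org/ns/1.0}"

# Hypothetical gazetteer: place name -> approximate (latitude, longitude).
GAZETTEER = {
    "Delos": (37.39, 25.27),
    "Naxos": (37.10, 25.38),
}

# Hypothetical corpus: two invented fragments with TEI placeName markup.
CORPUS = {
    "text-a": '<p xmlns="http://www.tei-c.org/ns/1.0">They sailed from '
              '<placeName>Naxos</placeName> to <placeName>Delos</placeName>.</p>',
    "text-b": '<p xmlns="http://www.tei-c.org/ns/1.0">The festival at '
              '<placeName>Delos</placeName> drew visitors from '
              '<placeName>Ios</placeName>.</p>',
}


def mentions_with_coordinates(corpus, gazetteer):
    """List (text id, place name, coordinates or None) for every tagged mention."""
    rows = []
    for text_id, xml_text in sorted(corpus.items()):
        root = ET.fromstring(xml_text)
        for element in root.iter(TEI + "placeName"):
            name = (element.text or "").strip()
            # Places missing from the gazetteer are kept, with no coordinates.
            rows.append((text_id, name, gazetteer.get(name)))
    return rows


rows = mentions_with_coordinates(CORPUS, GAZETTEER)
for row in rows:
    print(row)
```

Mentions that the gazetteer does not cover are kept rather than dropped, since gaps of that kind are often exactly what a scholar would want to see.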
  39. 40. Conclusions
  40. 41. A key role for the Humanities <ul><li>We know about textual objects </li></ul><ul><ul><li>how is this discourse represented? </li></ul></ul><ul><ul><li>what stories does it tell? </li></ul></ul><ul><li>We know about hermeneutics </li></ul><ul><ul><li>what does this discourse mean? </li></ul></ul><ul><ul><li>what does it say aside from its denotational content? </li></ul></ul><ul><li>This is our contribution to the semantic web </li></ul>
