Treebanking example in Russian: http://www.aclweb.org/anthology/C08-1081 TreeTagger and TnT both provide good part-of-speech tagging; TreeTagger is the easiest to use and TnT is more flexible. MST, Bona Parser and TigerSearch provide treebanking, but all are very specialized.
Computer vision image from https://dh101.ch/2014/10/22/images-digital-humanities-and-challenges/ TinEye is a good off-the-shelf option for one-offs, and ImageJ
“Digital” in Arts & Humanities
Computational tools in humanities tasks
Why is it useful?
Ask new questions
Ask bigger questions
Interact with the objects of your research in previously impossible ways
Put students and readers in the driver’s seat
This will be
Digitization (aka “Scanning”)
• Convert to digital
• Optical Character Recognition (OCR)
• Create new artifacts
• Which version?
• In what format?
• To whom?
• With markup?
• In digital form
• Is this creation?
• In analog form
Digital Humanities In Practice, “Image processing in the digital humanities”
Data Mining (aka “Distant Reading”)
• Word clouds
• Topic Modeling
• Corpus linguistics
• Treebanking & part-of-speech tagging
• Latent Semantic Analysis
• Named Entity Recognition
• Sentiment Analysis
Image analysis &
• Image similarity
Word Clouds & n-grams
It's valuable to lemmatize (or stem) text before text mining because
doing so gives you a more consistent set of words to work with.
This can be particularly important for languages that are
under-supported in one analytical tool but well supported in another.
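To make the point about consistency concrete, here is a minimal Python sketch of counting words and n-grams with and without stemming. Everything in it is an illustrative assumption rather than a recommended tool: the names `tokenize`, `toy_stem`, `ngrams`, and `frequencies` are hypothetical, and the suffix list is a deliberately naive stand-in for a real stemmer or lemmatizer for your language.

```python
from collections import Counter
import re

# Hypothetical, deliberately naive English suffix list (an assumption for
# illustration); a real project would use a proper stemmer or lemmatizer.
SUFFIXES = ("ing", "ed", "es", "s")


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three letters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def ngrams(tokens, n):
    """Return the n-token tuples of a token list, in reading order."""
    return [tuple(tokens[i : i + n]) for i in range(len(tokens) - n + 1)]


def frequencies(text, n=1, stem=True):
    """Count n-grams in the text, optionally stemming each token first."""
    tokens = tokenize(text)
    if stem:
        tokens = [toy_stem(t) for t in tokens]
    return Counter(ngrams(tokens, n))


if __name__ == "__main__":
    sample = "The monks copied books. A monk copies a book."
    # Without stemming, "monks" and "monk" are counted as different words.
    print(frequencies(sample, stem=False).most_common(5))
    # With stemming, the variants collapse into one count per stem.
    print(frequencies(sample, stem=True).most_common(5))
    # The same function counts bigrams when n=2.
    print(frequencies(sample, n=2).most_common(3))
```

Counts like these are the raw material for a word cloud or an n-gram chart; note that the toy stemmer produces stems such as "copi" rather than dictionary words, which is typical of stemming and one reason to prefer lemmatizing where a lemmatizer exists for the language.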
Topic Modeling & Corpus Linguistics
Hammered metal with niello
Laser-cut acrylic with paint
The Fuller Brooch (9th c Anglo-Saxon)
The British Museum, 1952,0404.1
Local community
Virtual community
Collaboration and Community
Trees and Distant
and Yoh Kawano,
(we’ll get to that)
“Digital” in your Arts & Humanities
How might your research change?
Are you struggling to find a question in a flood of documents?
Do you have a question that can’t be answered using traditional means?
Do you have an answer that needs verification on a larger scale?
Who is your public? How do they fit into this?
Can you use digital methodology to present more succinctly?
From a different, more engaging perspective?
Given your new questions and your consideration of audience, will you
create new things?
analyze existing research objects?