Interesting digital humanities projects
Course: Computational methods in the
humanities (Ger 1550 , Ling 1050 , Slav 1050 )
Prepared by: David J. Birnbaum (email@example.com)
Last modified: 2011-06-05
Main course page:
- Figura TG.
- Tool for exploring Chrétien de Troyes’s
Le chevalier de la Charrette.
- For better for verse.
- Interactive tool for teaching prosody.
- TokenX. http://tokenx.unl.edu
- A text visualization, analysis, and play tool.
- The Archimedes palimpsest project.
- A thirteenth-century prayer book sold at auction to a private collector in 1998
contains erased texts that were written several centuries earlier still, including
two treatises by Archimedes that can be found nowhere else, The Method
and Stomachion. The Project provides digital images and
- The classroom electric.
- The Classroom Electric is a constellation of web sites on Emily Dickinson, Walt
Whitman, and nineteenth-century American culture. Here users can explore images of
original manuscripts, rare photographs, notebooks, scrapbooks, letters, and maps in
sites informed by cutting-edge scholarship. While each site works as a stand-alone
case study useful to students and teachers, the sites also link to each other, to
other resources, and to the Dickinson Electronic Archives and the Walt Whitman
- The first voyage of Othere.
- Applies dense linguistic XML markup of linguistic data to Old English prose. The
text is the first five paragraphs of King Alfred’s record of Ohthere’s tale of his
voyage to the north along the Norwegian coast to the White Sea region. The voyage
took place around 890, and is the first known voyage around the North Cape. Ohthere
(Ottar) was a viking trader. Alfred included the story in his world history, The Old
English Orosius, which mainly consists of a translation of the writings of the
Spanish cleric and historian Paulus Orosius, dating from around 400 AD.
- The map of early modern London.
- The Map of Early Modern London maps the streets, sites, and significant
boundaries of late sixteenth-century and early seventeenth-century London
(1560–1640). Taking the "Agas" map as its platform, the project links
encyclopedia-style articles, scholarly work, student work, editions, and literary
texts to the places mentioned therein. Students will view the landmarks of
Shakespeare's London, and learn about the history and culture of the city in which
he lived and worked. Teachers will find the map and index useful in teaching
Renaissance plays and other texts set in London. Scholars are welcome to contribute
articles, links, or compilations of data.
- Monk (metadata offer new knowledge).
- MONK is a digital environment designed to help humanities scholars discover and
analyze patterns in the texts they study. It supports both micro analyses of the
verbal texture of an individual text and macro analyses that let you locate texts in
the context of a large document space consisting of hundreds or thousands of other
texts. Shuttling between the
micro and the
macro is a distinctive
feature of the MONK environment, where you may read as closely as you wish but can
also practice many forms of what Franco Moretti has provocatively called
Student projects from Matthew Jockers’s “Digital humanities: beyond the book” (English
153H) course at Stanford University
- The heart of Rachael.
- Our website enables readers to read the text of The heart of Rachael
and to use our search functions as research tools. We have presented the text as it
appears in the book, with reader-friendly pages that can be turned and skipped. The
tools we provide users include a word concordance, a word search function, and a
quote search by speaker.
- Kindred of the dust.
- Our project was to encode the novel given us in TEI-compliant XML, display it using
either PHP or an XSLT transformation, and write three analysis tools in PHP for use
with the text. The tools enable the user to view the complete text in XML or HTML,
to search the text and generate KWIC reports, to
examine a frequency table of words in the text, and to display a table of chapter
- Martie the unconquered.
- Our intention for this project is to provide a reader-friendly display of the novel
Martie the unconquered by Kathleen Norris and to develop tools to
assist in the accumulation of textual data for literary analysis. Our site also
provides tools for literary analysis, which include a concordance tool and a keyword
in context tool. The tools allow the user to access the full text, and they also
provide the attractive option of searching only within the spoken words of a
- The sound and the fury.
- This project aims to determine what elements contribute to an author’s style, how
that style and changes in that style influence the reading of the novel, and how or
if we can quantify these elements of style. We use The sound and the
fury as a test bed for stylistic analysis of narrative and authorial
voice because it contains these multiple voices, which perhaps result in multiple
styles. To examine these different voices, we use humanities computing techniques,
including authorship attribution, to investigate narrative voice and to explore the
extent to which Faulkner is able to
create distinct narrative styles within
The sound and the fury.
- Valley of the giants.
- We used PHP primarily for writing scripts to display specific text analysis
functions, such as a script for the frequency of every single word in the text,
sorted by both frequency and alphabetical order. We extended this feature to compare
frequency within this novel with data from the Brown corpus. We also wrote, per
class assignment instructions, a keyword search function that would allow users to
search for the frequency and relative frequency for a specific word in the text.
This tool allowed users to see the context of their keyword as well, with the number
of surrounding context words as specified by the user. Both of these features took
into account case sensitivity for the word (The vs. the). Our original special
feature is a php script which allows us to extract all quotes from the novel and
then either search for a character’s quotes or view them by chapter. We also added a
feature that charts most frequently used phrases with n-gram search.
- The wilderness (volume 1).
- Our project was to encode a piece of text entitled The wilderness; or,
Braddock’s times. A tale of the west, Volume 1. The text, written by
James McHenry in 1823, is ultimately to be published for the Irish-American West web
site, and as such was prepared to be fully searchable for research purposes
including concordance, keyword, quote, and
hypothetical searches. A
concordance search would be used to help a researcher view the number of times a
certain word appears in our text, or to show how often one word may show up in
comparison to another. Since concordance deals with pure numbers, researchers could
use the data for various projects. A keyword search is different as it will show a
researcher where in the text the word appears, given a few words of context before
and after it. Searching by quote would be helpful to researchers interested in
specific character dialogue, or the type of dialogue, whether it be spoken or
thought within. The hypothetical searches were created by the group to include
aspects of the novel such as songs, poems, thoughts, and all quote attribution.
- The wilderness (volume 2): an online, searchable database. http://www.stanford.edu/dept/english/cgi-bin/humComp/2004Gp1/
- This final project consists of an encoded version of James McHenry’s The
wilderness, volume 2. The features of this site allow you to:
The analysis that this site performs will allow you to determine how often
specific words occur, search for keywords in the text, and see the amount of speech
attributed to each character, especially in relation to that of others.
- view a list of words contained within the text and their frequencies, both
by occurrence and alphabetical order,
- search for occurrences of specific words within the context of a chosen
number of surrounding words,
- find all instances of dialog by a character, searchable by name.