Installing Medieval Unicode

Our word alphabet comes from the names of the first two letters of the Greek alphabet (alpha + beta). In Latin, the alphabet was called an abecedarium (A, Be, Ce, D -arium) from the names of the first four letters in the Latin alphabet. Transcribing medieval documents however requires us to recognize many more characters than the twenty-six letters used in modern English.

Transcribing by hand means that you can approximate unfamiliar characters while you are working through a draft transcription. A digital transcription that retains the characters/ glyphs of the original text, however, is more difficult, as you will need to use a special characters atypical in modern usage/ typical keyboard set up. Two decades ago, there was not an option to use non-standard Latin characters on your computer, but in the past decade we have seen considerable advances in establishing standards for a wider range of characters (in the Latin alphabet, but also in Runic and non-Western alphabets). We will access these characters in unicode. We can thank a small group of dedicated scholars who founded the Medieval Unicode Font Initiative (MUFI), which has set out to make a diverse range of manuscript characters reproducible digitally.

To get a sense of what unicode is, take a look at this resource, Decode Unicode.

Setting up your computer to use unicode and have access to historical fonts takes a few steps, but is straightforward. To use unicode for medieval scripts, however, is a bit more arcane; you might feel like you're a codebreaker trying to crack the Enigma device. As part of this process, we'll put together a list of the most typical characters you'll use in transcribing your folios.

For this exercise, it pays to have a standard word processor that easily allows you to choose what font you want to use. While students at Carleton, you all can use Microsoft Word at no charge, so in the interest of standardization, I encourage you all to download a free copy if it is not already set up on your computer.

To use medieval characters in your transcriptions, download the two MUFI codebooks organized by letters and other symbols and characters. Like Capelli's guide to medieval abbreviations (which you will be introduced to next week), these allow you to choose specific characters to match what you see on the page. Before you can use them, however, you will need to get set up to use unicode.

Setting up Unicode to run on your machine.

  1. Set up your device to use unicode. On my Mac, I first enabled Unicode Hex Input in System Preferences -> Keyboard -> Input Sources (more detailed instructions here). The wikipedia article on unicode outlines how to enable unicode on different platforms.

  2. Download a font that will allow you to use medieval unicode features (a much wider base of characters than one normally uses) from options suggested by MUFI. I would suggest you download and install the Andron font, but Junicode and Titus also work well.

  3. It should be as easy as double clicking on the file(s) to add them to a Mac or Windows computer. If you are using Linux, in takes a few more steps.

  4. Open up a word processor and after selecting the font you have installed, attempt to type some medieval characters, many of which can't be reproduced in markdown (and thus can't be shown here).

  5. With the option key held down, type EEC5. This should input a small ligature ct (i.e. a c and t together linked by a little curvy pen flourish).

  6. You should see something that looks like "đ" (a small letter d with cross stroke), if you hold down the option key and type "0111".

  7. Looking through the two MUFI code books you have downloaded, look for and try out other characters. Not all characters will be the same as indicated in the codebook (some fonts like Junicode, were completed before the most recent standards were set).

Working with MUFI Unicode

  1. By this class, you should already have started a handwritten transcription of your folio. Your goal now is to write out at least 10 lines as an abbreviated text using the now available (to you) unicode characters. Go!

  2. Assign a number to each line of your text and transcribe the words/ characters line by line. It will take some time to find matching character in the unicode codebooks, but it will help you discern what is going on.

  3. Post the completed file in your personal Github repository as a Word document (.docx) or .pdf, named "LastName-MsX-UnicodeTranscription" (where LastName is replaced by your last name, and MsX is replaced the name and number of your folio – e.g. Ottawa CU ARC ms. f. x).

Last updated