Digitizing Medieval Archives
  • Introduction
  • Overview of the Course
  • Assignments
  • Lead a seminar
  • Readings
  • Week 1 & 2 - Setting Up
    • Signing Up
    • Using Twitter
    • Twitter Strategies
    • Twitter Abbreviations
  • Week 3 - Omeka
    • Dublin Core
  • Week 4 - Github
    • Markdown
    • Github
    • Turning Github into a website
    • Markdown II (Optional)
  • Week 5 - Writing Supports
    • Your first transcription
    • Getting prepared for Transkribus
  • Week 6 - Palæography
    • Distinguishing Late Medieval Scripts
    • Handwriting Analysis Tools
    • Installing Medieval Unicode
    • IRL Abbreviations
  • Week 7 - Abbreviations
    • Transcribing with Transkribus
    • Transcribing
    • Java 8
  • Week 8 - Codicology
    • Codicological Spreadsheet
  • Week 9 - Liturgical Genres
    • Medieval Liturgy - Basic Bibliography
  • Week 10 - Cataloguing
    • Template for Folio Cataloguing
  • Week 11 - Workday
  • Week 12 - Whetting your Digital Appetite
  • Week 13 - The Theory of the Digital
    • Github Project Boards
  • Week 14 - The Promise of DH
    • Criteria for Evaluating DH Projects
  • Week 15 - Capturing Manuscripts
    • How to take photos of documents
    • Image File Formats
  • Week 16 - IIIF
    • Our IIIF Images
    • Understanding IIIF Image Presentation
    • Using IIIF Manifests
  • Week 17 - IIIF Annotations
    • Annotations in Mirador
    • Annotations with Transkribus
  • Week 18 - Online Exhibits
  • Week 19 - TEI
    • Look at a TEI folio description
    • Digital Latin Library
    • TEI export from Transkribus
  • Week 20 - Accessibility and Longevity
  • Week 21 - Work
  • Week 22 - Work
  • Week 23 - Presentations I
  • Week 24 - Presentations II
  • Week 25 - End of the Line
  • About
Powered by GitBook
On this page
  1. Week 19 - TEI

TEI export from Transkribus

PreviousDigital Latin LibraryNextWeek 20 - Accessibility and Longevity

Last updated 6 years ago

You can take a look at an exported transcription of CU, ARC, scans 27-30 as a TEI export on . Much of the initial section of the TEI xml file is devoted to identifying parts of the images where text is located (i.e. the "zone points' in the <facsimile> section). From lines 250 until the end, the <text> block records the transcription of two columns of 24 lines per page (multiplied by four pages). Letters in [square] brackets are expanded abbreviations.

Exporting to TEI from Transkribus

  1. Open and login to Transkribus

  2. Ensure the "Ottawa, ARC, Carleton University ..." Collection is selected in the drop down menu entitled "Collections" on the "Server" tab.

  3. Double-click on the item with the ID 36712 (current title, "Bifolium (f. 105, 112) from an Iberian Carthusian...").

  4. Look to the icons along the top left corner and locate the icon of the folder with a green arrow pointing right.

  5. An export window will pop up, which should look like the following:

  1. In the top left corner, click on the "Client Export" tab.

  2. For an easy export, you can save the image as a PDF with a text layer.

  3. But for our purposes, it is more useful to export it as TEI, so click on the TEI box in the left column (and unclick others). Leave export options as is.

  4. It will ask you where to save the file and will take a few moments to export after you have confirmed the export.

  5. Locate the file and open it with Atom. It will have the TEI Header that you are now familiar with.

  6. This file, however, will also have a reference to a "facsimile", which are the digital images of the bifolium.

<?xml version='1.0' encoding='UTF-8'?>
<TEI xmlns='http://www.tei-c.org/ns/1.0'>
	<teiHeader>

...

<facsimile xml:id='facs_1'>
		<surface ulx='0' uly='0' lrx='5090' lry='7010'>
			<graphic url='manuscript027.tif' width='5090px' height='7010px'/>
			<zone points='1238,843 1238,6732 2774,6742 2774,853' rendition='TextRegion' xml:id='facs_1_r1'>

Scroll down to line 250 and you will see the beginning of the transcription, referencing the earlier cited fascimile images:

	<text>
		<body>
			<pb facs='#facs_1' n='1'/>
			<p facs='#facs_1_r1'>
				<lb facs='#facs_1_r1l1' n='N001'/>Et misit Petru[m] et Ioh[ann]em
				<lb facs='#facs_1_r1l2' n='N002'/>dice[n]s: eu[n]tes parate [no]b[is]
				<lb facs='#facs_1_r1l3' n='N003'/>pascha. ut ma[n]ducem[us].
				<lb facs='#facs_1_r1l4' n='N004'/>At illli dixerunt. Ubi uis.
				<lb facs='#facs_1_r1l5' n='N005'/>pare[mus]? Et dix[it] ad eos. Ec-
				<lb facs='#facs_1_r1l6' n='N006'/>ce i[n]troeu[n]tibus uob[is] i[n] ciui-
				<lb facs='#facs_1_r1l7' n='N007'/>tate[m]: occurret uob[is] ho[mo] a[m]pho-
				
				...
				

And you're done!

Gitbook
Export Screen on Transkribus