TEI export from Transkribus
Last updated
Last updated
You can take a look at an exported transcription of CU, ARC, scans 27-30 as a TEI export on Gitbook. Much of the initial section of the TEI xml file is devoted to identifying parts of the images where text is located (i.e. the "zone points' in the <facsimile> section). From lines 250 until the end, the <text> block records the transcription of two columns of 24 lines per page (multiplied by four pages). Letters in [square] brackets are expanded abbreviations.
Open and login to Transkribus
Ensure the "Ottawa, ARC, Carleton University ..." Collection is selected in the drop down menu entitled "Collections" on the "Server" tab.
Double-click on the item with the ID 36712 (current title, "Bifolium (f. 105, 112) from an Iberian Carthusian...").
Look to the icons along the top left corner and locate the icon of the folder with a green arrow pointing right.
An export window will pop up, which should look like the following:
In the top left corner, click on the "Client Export" tab.
For an easy export, you can save the image as a PDF with a text layer.
But for our purposes, it is more useful to export it as TEI, so click on the TEI box in the left column (and unclick others). Leave export options as is.
It will ask you where to save the file and will take a few moments to export after you have confirmed the export.
Locate the file and open it with Atom. It will have the TEI Header that you are now familiar with.
This file, however, will also have a reference to a "facsimile", which are the digital images of the bifolium.
Scroll down to line 250 and you will see the beginning of the transcription, referencing the earlier cited fascimile images:
And you're done!