Codex palatinus graecus 23

Ground truth dataset for the Codex Palatinus graecus 23 (Palatine Anthology), Byzantine handwriting from the 10th century.

License

This work is licensed under CC BY 4.0. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Description

The model was trained on ground truth produced by the Canada Research Chair on Digital Textualities as part of the Anthologia graeca project. We built the initial ground truth from 50 pages (143-195), then fine-tuned the model on 20 additional pages (196-215).

Transcription guidelines

To come.

Sources

| Place | Library | Signature | Date | Pages transcribed | IIIF manifest URL |
|---|---|---|---|---|---|
| Universitätsbibliothek Heidelberg | Bibliotheca Palatina | Cod. Pal. gr. 23 | Xth century | p. 143-215 | https://digi.ub.uni-heidelberg.de/diglit/iiif3/cpgraec23/manifest |

Training used images of the Codex Palatinus graecus 23 digitized by the Universitätsbibliothek Heidelberg, which holds the first part of the manuscript (the second part is kept at the BnF as Supplementum graecum 384). The images were imported into eScriptorium using IIIF. The digitized manuscript is available through the IIIF manifest URL given above.
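As a rough illustration of the IIIF route described above, the following Python sketch lists the page images declared in a manifest. It is not part of this dataset's tooling: the function names `canvas_label`, `list_images` and `fetch_manifest` are hypothetical, and the code assumes the manifest follows the IIIF Presentation API 3 layout (canvases, annotation pages and annotations nested under `items`), which the `iiif3` segment of the URL suggests but which has not been verified here.

```python
import json
from urllib.request import urlopen

# Manifest URL taken from the Sources table of this README.
MANIFEST_URL = "https://digi.ub.uni-heidelberg.de/diglit/iiif3/cpgraec23/manifest"


def canvas_label(canvas):
    """Return the first label string of a canvas, or "" if none is present."""
    label = canvas.get("label", "")
    if isinstance(label, str):
        return label
    # Presentation 3 labels are language maps, e.g. {"none": ["143"]}.
    for values in label.values():
        if values:
            return values[0]
    return ""


def list_images(manifest):
    """Return (canvas label, image URL) pairs, assuming a Presentation 3 layout."""
    pairs = []
    for canvas in manifest.get("items", []):
        for page in canvas.get("items", []):
            for annotation in page.get("items", []):
                body = annotation.get("body", {})
                if isinstance(body, dict) and body.get("type") == "Image":
                    pairs.append((canvas_label(canvas), body.get("id")))
    return pairs


def fetch_manifest(url=MANIFEST_URL):
    """Download and parse a manifest (requires network access)."""
    with urlopen(url) as response:
        return json.load(response)
```

Calling `list_images(fetch_manifest())` should then give one pair per digitized page image, which can be filtered down to the transcribed range before import. eScriptorium itself only needs the manifest URL, so this step is useful mainly for checking which images will be pulled in.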

How to cite

This dataset was built and is maintained by Maxime Guénette (@mguenette), Mathilde Verstraete (@mverstraete), Alix Chagué (@achague), and Marcello Vitali-Rosati (@marviro). The digitization is not copyright-free, but the transcription is. However, properly annotating a corpus takes time and is a task that should be recognized. If you use any item from this corpus of ground truth, cite the dataset using the following information:

  • Zenodo reference to be added.