Skip to content
Snippets Groups Projects
Mathilde Verstraete's avatar
Mathilde Verstraete authored
#   README.md
19760067
History

Codex palatinus graecus 23 - Ground Truth Dataset Medieval Greek Manuscripts

characters badge regions badge lines badge files badge

Dataset of HTR ground truth for the Codex palatinus graecus 23 (Palatine Anthology), byzantine writing from the X^th^ century.

License

This work is licensed under CC BY 4.0. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Dataset description

This dataset was produced by the Canada Research Chair on Digital Textualities, as part of the Anthologia graeca project.

A first batch of 50 pages (143-195) were initially transcribed to train a transcription model prototype. We then added 20 pages (196-215) to produce the first version of a transcription model for Greek manuscripts. The transcription of these 70 pages can be found in data/CPgr23.

Transcription guidelines

To come.

Model description

A transcription model for Greek manuscripts was trained using this dataset. It can be found here: {placeholder}.

Images

This ground truth is based on images of the codex palatinus graecus 23 digitized by the Universitätsbibliothek Heidelberg (where the first part of the manuscript is kept -- the second one being in the BNF, as Supplementum graecum 384), and then uploaded to eScriptorium using IIIF. Find the manuscript here.

Sources

Place Library Signature Date Pages transcribed IIIF Manifest URL
Universitätsbibliothek Heidelberg Bibliotheca Palatina Cod. Pal. gr. 23 Xth century p. 143-215 https://digi.ub.uni-heidelberg.de/diglit/iiif3/cpgraec23/manifest

The training has been done with images of the codex palatinus graecus 23 digitized by the Universitätsbibliothek Heidelberg (where the first part of the manuscript is kept -- the second one being in the BNF, as Supplementum graecum 384), and then uploaded to eScriptorium using IIIF. Find the manuscript here.

Cite the Model

Cite the Dataset

Guénette, M., Verstraete, M., Chagué, A., & Vitali-Rosati, M. Codex palatinus graecus 23 - Ground Truth Dataset Medieval Greek Manuscripts [Data set]. https://gitlab.huma-num.fr/ecrinum/anthologia/htr_cpgr23

@misc{Guenette_Codex_palatinus_graecus,
author = {Guénette, Maxime and Verstraete, Mathilde and Chagué, Alix and Vitali-Rosati, Marcello},
title = {{Codex palatinus graecus 23 - Ground Truth Dataset Medieval Greek Manuscripts}},
url = {https://gitlab.huma-num.fr/ecrinum/anthologia/htr_cpgr23}
}

Cite the Project

Funding

Infrastructure

This dataset project relied on the CREMMA infrastructure.