Skip to main content

Research Repository

Advanced Search

Text Recognition for Nepalese Manuscripts in Pracalit Script

O’Neill, Alexander James; Hill, Nathan W.

Text Recognition for Nepalese Manuscripts in Pracalit Script Thumbnail


Authors

Alexander James O’Neill



Abstract

This dataset is a model for handwritten text recognition (HTR) of Sanskrit and Newar Nepalese manuscripts in Pracalit script. This paper introduces the state of the field in Newar literature, Newar manuscripts, and HTR engines. It explains our methodology for developing the requisite ground truth consisting of manuscript images and corresponding transcriptions, training our model with a PyLAia engine, and this model’s limitations. This dataset shared on Zenodo can be used by anyone working with manuscripts in Pracalit script, which will benefit the fields of Indology and Newar studies, as well as historical and linguistic analysis.

Citation

O’Neill, A. J., & Hill, N. W. (2022). Text Recognition for Nepalese Manuscripts in Pracalit Script. Journal of Open Humanities Data, 8(26), https://doi.org/10.5334/johd.90

Journal Article Type Article
Acceptance Date Nov 10, 2022
Publication Date Nov 30, 2022
Deposit Date Aug 7, 2023
Publicly Available Date Aug 7, 2023
Journal Journal of Open Humanities Data
Electronic ISSN 2059-481X
Publisher Ubiquity Press
Peer Reviewed Peer Reviewed
Volume 8
Issue 26
DOI https://doi.org/10.5334/johd.90
Keywords handwritten text recognition; PyLAia; Transkribus; Sanskrit; Newar; Manuscripts
Publisher URL https://doi.org/10.5334/johd.90

Files





You might also like



Downloadable Citations