Alexander James O’Neill
Text Recognition for Nepalese Manuscripts in Pracalit Script
O’Neill, Alexander James; Hill, Nathan W.
Abstract
This dataset is a model for handwritten text recognition (HTR) of Sanskrit and Newar Nepalese manuscripts in Pracalit script. This paper introduces the state of the field in Newar literature, Newar manuscripts, and HTR engines. It explains our methodology for developing the requisite ground truth, which consists of manuscript images and corresponding transcriptions; the training of our model with a PyLAia engine; and the model’s limitations. The dataset, shared on Zenodo, can be used by anyone working with manuscripts in Pracalit script, which will benefit the fields of Indology and Newar studies, as well as historical and linguistic analysis.
Citation
O’Neill, A. J., & Hill, N. W. (2022). Text Recognition for Nepalese Manuscripts in Pracalit Script. Journal of Open Humanities Data, 8(26). https://doi.org/10.5334/johd.90
Journal Article Type | Article |
---|---|
Acceptance Date | Nov 10, 2022 |
Publication Date | Nov 30, 2022 |
Deposit Date | Aug 7, 2023 |
Publicly Available Date | Aug 7, 2023 |
Journal | Journal of Open Humanities Data |
Electronic ISSN | 2059-481X |
Publisher | Ubiquity Press |
Peer Reviewed | Peer Reviewed |
Volume | 8 |
Issue | 26 |
DOI | https://doi.org/10.5334/johd.90 |
Keywords | handwritten text recognition; PyLAia; Transkribus; Sanskrit; Newar; Manuscripts |
Publisher URL | https://doi.org/10.5334/johd.90 |
Files
Prachalit.pdf (1.2 MB, PDF)
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/