Skip to main content

Research Repository

Advanced Search

A part-of-speech (POS) lexicon of Classical Tibetan for NLP

Hill, Nathan W.; Garrett, Edward

Authors



Abstract

This part-of-speech (POS) lexicon of Classical Tibetan was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015) hosted at SOAS, University of London and funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1). The data for verbs comes from a digitized version of A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010) by Nathan W. Hill. Otherwise data comes from the manually part-of-speech tagged training data produced by the corpus and a few lexical items specifically added by hand to improve rule based tagging

Citation

Hill, N. W., & Garrett, E. A part-of-speech (POS) lexicon of Classical Tibetan for NLP. [Data]

Online Publication Date May 11, 2017
Deposit Date Jun 16, 2017
Publicly Available Date Jun 16, 2017
Publisher URL https://doi.org/10.5281/zenodo.574876
Type of Data lexical data
Additional Information References : Hill, Nathan W. A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010)