A part-of-speech (POS) lexicon of Classical Tibetan for NLP
Hill, Nathan W.; Garrett, Edward
Authors
PROF Nathan Hill nh36@soas.ac.uk
Professor of Tibetan & Historical Linguistics
DR Edward Garrett eg15@soas.ac.uk
Research Assistant
Abstract
This part-of-speech (POS) lexicon of Classical Tibetan was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015), hosted at SOAS, University of London, and funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1). The data for verbs come from a digitized version of A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010) by Nathan W. Hill. The remaining data come from the manually part-of-speech tagged training data produced by the corpus, together with a few lexical items added by hand specifically to improve rule-based tagging.
Citation
Hill, N. W., & Garrett, E. (2017). A part-of-speech (POS) lexicon of Classical Tibetan for NLP. [Data]
Online Publication Date | May 11, 2017 |
---|---|
Deposit Date | Jun 16, 2017 |
Publicly Available Date | Jun 16, 2017 |
Publisher URL | https://doi.org/10.5281/zenodo.574876 |
Type of Data | lexical data |
Additional Information | References: Hill, Nathan W. A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010) |
Files
Lexicons.zip
(88 Kb)
Archive