Skip to main content

Research Repository

Advanced Search

A rule based Tibetan part-of-speech (POS) tagger for the creation of gold standard training data (2017)
Data
Garrett, E., & Hill, N. W. A rule based Tibetan part-of-speech (POS) tagger for the creation of gold standard training data. [Data]

This rule based Tibetan part-of-speech (POS) tagger was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015) hosted at SOAS, University of London and funded by the UK's Arts and Humanities Research Council (gr... Read More about A rule based Tibetan part-of-speech (POS) tagger for the creation of gold standard training data.

Constituent order in the Tibetan noun phrase (2015)
Journal Article
Garrett, E., & Hill, N. W. (2015). Constituent order in the Tibetan noun phrase. SOAS working papers in linguistics, 17, 35-48

This paper gives a schematic presentation of the order of constituents in the Tibetan noun phrase as revealed by an investigation of a corpus of Classical Tibetan texts.

A Constraint Grammar POS-Tagger for Tibetan (2015)
Book Chapter
Garrett, E., & Hill, N. W. (2015). A Constraint Grammar POS-Tagger for Tibetan. In E. Bick, & K. Hagen (Eds.), Proceedings of the Workshop on “Constraint Grammar - methods, tools and applications” at NODALIDA 2015, May 11-13, 2015 (19-22). Institute of the Lithuanian Language

This paper describes a rule-based part-of speech tagger for Tibetan, implemented in Constraint Grammar and with rules operating over sequences of syllables rather than words.

The contribution of corpus linguistics to lexicography and the future of Tibetan dictionaries (2015)
Journal Article
Garrett, E., Hill, N. W., Kilgarriff, A., Vadlapudi, R., & Zadoks, A. (2015). The contribution of corpus linguistics to lexicography and the future of Tibetan dictionaries. Revue d'études tibétaines, 32, 51-86

The first alphabetized dictionary of Tibetan appeared in 1829 (cf. Bray 2008) and the intervening 184 years have witnessed the publication of scores of other Tibetan dictionaries (cf. Simon 1964). Hundreds of Tibetan dictionaries are now available; t... Read More about The contribution of corpus linguistics to lexicography and the future of Tibetan dictionaries.