Research Repository

All Outputs (3)

Evaluating Rhyme Annotations for Large Corpora: Metrics and Data (2023)
Journal Article
Baley, J. (2023). Evaluating Rhyme Annotations for Large Corpora: Metrics and Data. Cahiers de Linguistique Asie Orientale, 52(2), 137-162. https://doi.org/10.1163/19606028-bja10032

Recent methods have been proposed to produce automatic rhyme annotators for large rhymed corpora. These methods, such as Baley (2022b), greatly reduce the cost of annotating rhymed material, allowing historical linguists to focus on the analysis of th...

Chinese Transcription of Buddhist Terms in the Late Hàn Dynasty (2023)
Journal Article
Baley, J., Hill, N. W., & Caldwell, E. (2023). Chinese Transcription of Buddhist Terms in the Late Hàn Dynasty. Journal of Open Humanities Data, 9(10). https://doi.org/10.5334/johd.110

This dataset is a compilation of Chinese transcriptions of Buddhist terms produced by translators from the late Hàn period. It brings together the previous works of Coblin (1983), Karashima (2010), Vetter (2012), Hill, Nattier, Granger, and Kollm...

Leveraging graph algorithms to speed up the annotation of large rhymed corpora (2022)
Journal Article
Baley, J. (2022). Leveraging graph algorithms to speed up the annotation of large rhymed corpora. Cahiers de Linguistique Asie Orientale, 51(1), 46-80. https://doi.org/10.1163/19606028-bja10019

Rhyming patterns play a crucial role in the phonological reconstruction of earlier stages of Chinese. The past few years have seen graphs emerge as a way to model rhyming patterns, notably with List’s (2016) proposal to use graph...