Mei-Shin Wu
Computer-Assisted Language Comparison: State of the ArtW
Wu, Mei-Shin; Schweikhard, Nathanael E.; Bodt, Tim; Hill, Nathan W.; List, Johann-Mattis
Authors
Nathanael E. Schweikhard
Tim Bodt
PROF Nathan Hill nh36@soas.ac.uk
Professor Tibetan&Historical Linguistics
Johann-Mattis List
Abstract
Historical language comparison opens windows onto a human past, long before the availability of written records. Since traditional language comparison within the framework of the comparative method is largely based on manual data comparison, requiring the meticulous sifting through dictionaries, word lists, and grammars, the framework is difficult to apply, especially in times where more and more data have become available in digital form. Unfortunately, it is not possible to simply automate the process of historical language comparison, not only because computational solutions lag behind human judgments in historical linguistics, but also because they lack the flexibility that would allow them to integrate various types of information from various kinds of sources. A more promising approach is to integrate computational and classical approaches within a computer-assisted framework, “neither completely computer-driven nor ignorant of the assistance computers afford” [1, p. 4]. In this paper, we will illustrate what we consider the current state of the art of computer-assisted language comparison by presenting a workflow that starts with raw data and leads up to a stage where sound correspondence patterns across multiple languages have been identified and can be readily presented, inspected, and discussed. We illustrate this workflow with the help of a newly prepared dataset on Hmong-Mien languages. Our illustration is accompanied by Python code and instructions on how to use additional web-based tools we developed so that users can apply our workflow for their own purposes.
Citation
Wu, M.-S., Schweikhard, N. E., Bodt, T., Hill, N. W., & List, J.-M. (2020). Computer-Assisted Language Comparison: State of the ArtW. Journal of Open Humanities Data, 6(2), 1-14. https://doi.org/10.5334/johd.12
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 3, 2020 |
Publication Date | May 22, 2020 |
Deposit Date | May 26, 2020 |
Publicly Available Date | May 26, 2020 |
Journal | Journal of Open Humanities Data |
Electronic ISSN | 2059-481X |
Publisher | Ubiquity Press |
Peer Reviewed | Peer Reviewed |
Volume | 6 |
Issue | 2 |
Pages | 1-14 |
DOI | https://doi.org/10.5334/johd.12 |
Keywords | computer-assisted; language comparison; historical linguistics; Hmong-Mien language family |
Publisher URL | https://openhumanitiesdata.metajnl.com/article/10.5334/johd.12/ |
Files
Wu 2020 CALC state of the art.pdf
(2 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
© 2020 The Author(s). This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 Unported License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See http://creativecommons.org/licenses/by/4.0/
You might also like
A Tibetan Passive Construction in the Old Tibetan Rāmāyaṇa
(2023)
Journal Article
Chinese Transcription of Buddhist Terms in the Late Hàn Dynasty
(2023)
Journal Article
Origin of the r- allomorph of the Tibetan causative s-
(2023)
Book Chapter
Downloadable Citations
About SOAS Research Online
Administrator e-mail: outputs@soas.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search