NorthEuraLex: a wide-coverage lexical database of Northern Eurasia
- PMID: 32214931
- PMCID: PMC7067722
- DOI: 10.1007/s10579-019-09480-6
NorthEuraLex: a wide-coverage lexical database of Northern Eurasia
Abstract
This article describes the first release version of a new lexicostatistical database of Northern Eurasia, which includes Europe as the most well-researched linguistic area. Unlike in other areas of the world, where databases are restricted to covering a small number of concepts as far as possible based on often sparse documentation, good lexical resources providing wide coverage of the lexicon are available even for many smaller languages in our target area. This makes it possible to attain near-completeness for a substantial number of concepts. The resulting database provides a basis for rich benchmarks that can be used to test automated methods which aim to derive new knowledge about language history in underresearched areas.
Keywords: Caucasian languages; Indo-European languages; Lexical database; Northern Eurasia; Siberian languages; Turkic languages; Uralic languages.
© The Author(s) 2019.
Figures



Similar articles
-
Origins of Indo-Europeans and the spread of agriculture in Europe: comparison of lexicostatistical and genetic evidence.Hum Biol. 1995 Aug;67(4):577-94. Hum Biol. 1995. PMID: 7649532
-
LexiRumah: An online lexical database of the Lesser Sunda Islands.PLoS One. 2018 Oct 17;13(10):e0205250. doi: 10.1371/journal.pone.0205250. eCollection 2018. PLoS One. 2018. PMID: 30332446 Free PMC article.
-
Using hybridization networks to retrace the evolution of Indo-European languages.BMC Evol Biol. 2016 Sep 6;16(1):180. doi: 10.1186/s12862-016-0745-6. BMC Evol Biol. 2016. PMID: 27600442 Free PMC article.
-
The causality of borrowing: Lexical loans in Eurasian languages.PLoS One. 2019 Oct 30;14(10):e0223588. doi: 10.1371/journal.pone.0223588. eCollection 2019. PLoS One. 2019. PMID: 31665148 Free PMC article.
-
A New View of Language Development: The Acquisition of Lexical Tone.Child Dev. 2016 May;87(3):834-54. doi: 10.1111/cdev.12512. Epub 2016 Mar 23. Child Dev. 2016. PMID: 27007329 Review.
Cited by
-
Cultural influences on word meanings revealed through large-scale semantic alignment.Nat Hum Behav. 2020 Oct;4(10):1029-1038. doi: 10.1038/s41562-020-0924-8. Epub 2020 Aug 10. Nat Hum Behav. 2020. PMID: 32778801
-
Inference of partial colexifications from multilingual wordlists.Front Psychol. 2023 Jun 16;14:1156540. doi: 10.3389/fpsyg.2023.1156540. eCollection 2023. Front Psychol. 2023. PMID: 37397315 Free PMC article.
-
A Body Map Beyond Perceptual Experience.J Cogn. 2024 Feb 1;7(1):22. doi: 10.5334/joc.347. eCollection 2024. J Cogn. 2024. PMID: 38312940 Free PMC article.
-
Gaussian process models for geographic controls in phylogenetic trees.Open Res Eur. 2024 Jan 22;3:57. doi: 10.12688/openreseurope.15490.2. eCollection 2023. Open Res Eur. 2024. PMID: 38778905 Free PMC article.
-
From Text to Thought: How Analyzing Language Can Advance Psychological Science.Perspect Psychol Sci. 2022 May;17(3):805-826. doi: 10.1177/17456916211004899. Epub 2021 Oct 4. Perspect Psychol Sci. 2022. PMID: 34606730 Free PMC article.
References
-
- Bowern, C. (2016). Chirila: contemporary and historical resources for the indigenous languages of Australia. Language Documentation and Conservation (Vol. 10). http://nflrc.hawaii.edu/ldc/.
-
- Buck CD. A dictionary of selected synonyms in the principal Indo-European languages: A contribution to the history of ideas. Chicago: University of Chicago Press; 1949.
-
- Dellert, J. (2017). Information-theoretic causal inference of lexical flow. PhD thesis, Eberhard Karls Universität Tübingen.
-
- Dellert, J. (2018). Combining information-weighted sequence alignment and sound correspondence models for improved cognate detection. In 27th International Conference on Computational Linguistics (COLING 2018).
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials