Swadesh list

format_list_bulleted Contenido keyboard_arrow_down
ImprimirCitar

A Swadesh List is a highly loan-resistant basic vocabulary list, made up of common words existing in any human language. The original list proposed by Morris Swadesh included about 200 terms - a shortened list of words most resistant to change, consisting of exactly 100 terms, was later used. This list was compiled by Swadesh in the 1940s and 1950s with the aim of using it in the lexicostatistical comparison of languages.

It is used in historical linguistics as a means of establishing the relationship of poorly or poorly documented languages, and also in glottochronology to estimate a quantitative measure of the divergence time of two or more related languages —which also makes it possible to establish the time of evolution of a family from its common ancestor or common protolanguage.

Introduction

The Swadesh list allows both establishing the relationship of languages, as well as establishing the degree of divergence between two languages of a linguistic family. The list is based on a list of lexicons or basic vocabulary of two languages between which they are supposed to be related, so that their degree of divergence over time can be found. In this way, this list is a fundamental instrument in comparative historical linguistics, glottochronology and lexicostatistics.

Notion of basic vocabulary

The main application that the Swadesh list has had is to homogenize the vocabulary base on which to make reliable inferences. It happens that most of the vocabulary of a language depends on cultural factors and changes, not due to historical evolution, but for reasons related to social changes that are extralinguistic. Mainly the uncontrolled changes are due to cultural reasons, reasons of technological diffusion (the word television is found in almost all current languages), political, economic, etc. In short, there are too many cultural factors of unpredictable consequences that affect the vocabulary of a language. For that reason, it is unreliable to make comparisons of languages on the basis of words that designate concepts dependent on contact between civilizations such as 'potato'; (of American origin), which is currently found in all Romance languages. The existence of this word in these languages and its comparative form could lead us to the erroneous conclusion that it is a word from the Latin **potatus, something impossible given that the potato was discovered by the Europeans in the 16th century. Other cultural reasons brought the word potato into the vocabulary of the Romance languages, but not the evolution of a Latin word that never existed.

Aware of this problem, Morris Swadesh assumed that there was, however, what he called a "basic vocabulary", that is, a more stable vocabulary and less subject to cultural changes, but to much more evolution. slow and less influenced by extralinguistic factors. This basic vocabulary had to lack cultural concepts, and therefore universal to all human cultures, that is, it had to be common and likely to be found in any language and at any time of its development. For example, the word snow is not used to compare tropical languages of Central Africa, nor 'television' to compare classical Greek with modern Greek.

This basic vocabulary consisted of words like 'water', 'hand' or 'mujer', known in any culture and so fundamental and simple that they would hardly be replaced by borrowings from other languages. The Swadesh list is intended to be the embodiment of this stable vocabulary and provides linguists with a reliable standard for comparison between languages.

Use in glottochronology

Glottochronology assumes that there is an approximately constant rate of change in linguistic evolution. That rate has been calculated at 86% of basic vocabulary words maintained every thousand years. Statistical consistency has been highly criticized because it is based on results obtained for a very limited number of languages. Some linguists have pointed out that it is unreasonable to think that even the same language evolves at a constant rate at two different historical moments, although Swadesh's claim is that over long periods the periods of accelerated change and slowed change roughly offset each other. Another criticism is that it could be that not all languages would necessarily share an identical exchange rate, and there may be different exchange rates in different regions of the planet, that is, it has been criticized that the rate obtained for Indo-European languages does not have to be applied. to all the languages of the planet. In general, it is considered that, while the Swadesh list has stabilized the lexical basis of glottochronological comparison, the same has not happened with the rate of language change, which remains subject to a multitude of cultural and historical factors, all of them uncontrollable..

Even with the above criticisms, the statistical assumptions of stability and universality of the exchange rate over long periods of time, lead to the following glottochronological estimate for part of the divergence time:

t=12log (c)log (r){displaystyle t={frac {1}{2}}{frac {log(c)}{log(r)}}}}}

being

t = time of separation between languages
c = lexic similarity coefficient (using Swadesh lists)
r = Glotocronological constant (established at 86%)

Swadesh List Lexicon

There are two versions of the Swadesh list: one with 207 terms and one with exactly 100 terms. Both lists and some shorter alternative lists are reproduced below.

List of 100 words

(originally in English)

English Spanish
I, me(me, me)
you(you)
we(we)
this(this)
that(That)
who(who)
what(what)
not(no)
all(all/s)
many(many)
One(one)
two(two)
big(large)
long(long)
small(small)
woman(woman)
man(man)
person(person)
fish(pez)
bird(bird)
dog(dog)
I used it.(beginning)
tree(tree)
seed(smile)
leaf(daughter)
English Spanish
root(raice)
bark(courtesy)
skin(piel)
flesh(meat)
blood(blood)
bone(bones)
grease(laughs)
egg(bones)
bacon(bell)
Thai(rabo)
feather(pluma)
hair(sighs)
head(head)
ear(laughs)
eye(yet)
Nose(nariz)
mouth(mouth)
tongue(language)
tooth(dient)
claw(screams)
foot(pie)
knee(rodilla)
hand(mano)
Belly.(pance)
neck(neck)
English Spanish
breasts(whispers)
heart(heart)
liver(daughter)
to eat(comer)
to drink(beber)
to bite(morder)
to see(see)
to hear(hearing)
to know(knowing)
to sleep(dormir)
to die(die)
to kill(killing)
to swim(nadar)
to fly(flying)
to walk(caminar)
to lie(laughs)
to eat(to come)
to sit(sing)
to stand(Standing)
To say(saying)
Sun(sol)
Moon(moon)
star(star)
water(water)
rain(laughs)
English Spanish
stone(stones)
sand(arena)
earth(land)
cloud(Number)
smoke(humo)
fire(fire)
ash(Czeiza)
burn(burning)
path(camino)
mountain(mountain)
network(red)
green(green)
yellow(yellow)
white(white)
black(black)
night(night)
hot(hot)
cold(cold)
full(filled)
new(new)
good(good)
Round(chuckles)
dry(seco)
name(name)
to give(dar)

Swadesh–Yakhontov List

The Swadesh–Yakhontov list is a set of 35 particularly stable terms culled from the original list by the Russian linguist Sergei Yakhontov or, in English transcription, Sergei Yakhontov (Starostin, 1991). Linguists such as Sergei Stárostin have used this shorter list in lexicostatistics for far-reaching comparative work. Below is the subset of the Swadesh-Yakhontov list keeping the numbers of the original Swadesh list:

1. I
2. you (singular)
7. this
11. who
12. what
22. one
23. two
45. Fish
47. dog
48. louse
64. blood
65. bone
67. egg
68. bacon
69. Thai
73. ear
74. eye
75. nose
77. tooth
78. tongue
83. hand
103. know
109. die
128. give
147. Sun
148. moon
150. water
155. jump
156. stone
163. wind
167. fire
179. year
182. full
183. new
207. name

Holman et al. (2008) found that the Swadesh-Yakhontov list was less accurate in identifying known relationships between Chinese variants. These authors saw that a set of 40 words from the Swadesh list could give results as good as the original list, so that 40-word list does achieve what the Swadesh-Yakhontov list does not seem to.

Swadesh List Stability

Holman et al. (2008) investigated the relative stability of words in the Swadesh list of 100 terms by comparing the retention rates of terms in well-established language families. Thanks to that, they were able to reorder the Swadesh list from the most stable to the least stable terms:

  1. 22 *louse (42.8)
  2. 12 *two (39.8)
  3. 75 *water (37.4)
  4. 39 *ear (37.2)
  5. 61 *die (36.3)
  6. 1 *I (35.9)
  7. 53 *liver (35.7)
  8. 40 *eye (35.4)
  9. 48 *hand (34.9)
  10. 58 *hear (33.8)
  11. 23 *tree (33.6)
  12. 19 *fish (33.4)
  13. 100 *name (32.4)
  14. 77 *stone (32.1)
  15. 43 *tooth (30.7)
  16. 51 *breasts (30.7)
  17. 2 *you (30.6)
  18. 85 *path (30.2)
  19. 31 *bone (30.1)
  20. 44 *tongue (30.1)
  21. 28 *skin (29.6)
  22. 92 *night (29.6)
  23. 25 *leaf (29.4)
  24. 76 rain (29.3)
  25. 62 kill (29.2)
  26. 30 *blood (29.0)
  27. 34 *horn (28.8)
  28. 18 *person (28.7)
  29. 47 *knee (28.0)
  30. 11 *one (27.4)
  31. 41 *nose (27.3)
  32. 95 *full (26.9)
  33. 66 *come (26.8)
  34. 74 *star (26.6)
  35. 86 *mountain (26.2)
  36. 82 *fire (25.7)
  37. 3 *we (25.4)
  38. 54 *drink (25.0)
  39. 57 *see (24.7)
  40. 27 bark (24.5)
  41. 96 *new (24.3)
  42. 21 *dog (24.2)
  43. 72 *sun (24.2)
  44. 64 fly (24.1)
  45. 32 grease (23.4)
  46. 73 moon (23.4)
  47. 70 give (23.3)
  48. 52 heart (23.2)
  49. 36 feather (23.1)
  50. 90 white (22.7)
  51. 89 yellow (22.5)
  52. 20 bird (21.8)
  53. 38 head (21.7)
  54. 79 earth (21.7)
  55. 46 foot (21.6)
  56. 91 black (21.6)
  57. 42 mouth (21.5)
  58. 88 green (21.1)
  59. 60 sleep (21.0)
  60. 7 what (20.7)
  61. 26 root (20.5)
  62. 45 claw (20.5)
  63. 56 bite (20.5)
  64. 83 ash (20.3)
  65. 87 network (20.2)
  66. 55 eat (20.0)
  67. 33 egg (19.8)
  68. 6 who (19.0)
  69. 99 dry (18.9)
  70. 37 hair (18.6)
  71. 81 smoke (18.5)
  72. 8 not (18.3)
  73. 4 this (18.2)
  74. 24 seed (18.2)
  75. 16 woman (17.9)
  76. 98 round (17.9)
  77. 14 long (17.4)
  78. 69 stand (17.1)
  79. 97 good (16.9)
  80. 17 man (16.7)
  81. 94 cold (16.6)
  82. 29 flesh (16.4)
  83. 50 neck (16.0)
  84. 71 say (16.0)
  85. 84 burn (15.5)
  86. 35 Thai (14.9)
  87. 78 sand (14.9)
  88. 5 that (14.7)
  89. 65 walk (14.4)
  90. 68 sit (14.3)
  91. 10 many (14.2)
  92. 9 all (14.1)
  93. 59 know (14.1)
  94. 80 cloud (13.9)
  95. 63 swim (13.6)
  96. 49 belly (13.5)
  97. 13 big (13.4)
  98. 93 hot (11.6)
  99. 67 lie (11.2)
  100. 15 small (6.3)

The words marked with an asterisk would make it possible to build a list of 40 terms, which is statistically as significant as the original Swadesh list of 100 terms, and therefore this shorter list would represent an advantage over the Swadesh list -Yakhontov.

Contenido relacionado

Μ

My or my is the twelfth letter of the Greek alphabet. Called mu, in an extraordinary way, in the scientific field.[citation required] Its phoneme corresponds...

Italic languages

The Italic languages constitute a group of Indo-European languages with a series of common features. It includes Latin together with its descendants, the...

Κ

Kappa or cappa is the tenth letter of the Greek...
Más resultados...
Tamaño del texto:
undoredo
format_boldformat_italicformat_underlinedstrikethrough_ssuperscriptsubscriptlink
save