Distans inter europano nacionale linguas

Post by Zé do Rock »

LA DISTANSE INTER LINGUAS EUROPAN
THE DISTANCE BETWEEN EUROPEAN LANGUAGES

Yo conta na momento similaritees inter europano linguas, mas solo nacionale linguas. Co 2 exepcion: i zel au plattdüütsh un braziliano portugalian als eigene spraken - dat sein eben die spraken/dialekte dat i kenn: wil ik brasilianer bin, snack ik brasiliaansh portugeesh, un ik heb en "metaplatt" kreert, das heet de wörd de in de meeste wördböker un texten zo finnen sind, un wo ok de körz en roll speelt, un wo ok das nich helft, de form de alfabetish to erst kaamt.

At the moment i'm counting the similarities between european languages, but only national languages. With 2 exceptions: i also count plattdüütsh (low german) and brazilian portuguese as separate languages - these are the languages/dialects i know: because i'm brazilian, i speak brazilian portuguese, and i have created a "metaplatt" (meta-low-german), where i take the most frequent words in the many low german dialects, and when there is a tie, brevity is decisive (and when even that doesnt help, the form that comes first in alphabetical order).

xxx

It is all básd on google transláter, wich is dêfinitly not flawless, but i didnt se anuther alternativ. Mi canai fazer meyor co la linguas ki mi coness, ma lu serai tendencial in relacion au linguas ki mi parla no.

It is all based on google translator, which is definitely not flawless, but i didnt see another alternative. I could do better with the languages i know, but that would skew the comparison with the languages i dont speak.

xxx

Yo solo studa la similaridee in pronunciacion, no na lingua scrivee. Un i fergleike nich die gramatik, nur wortshaz. Un ik heb 2 zeelmetoden: in method 1 i compare simply the number of équal or similer sounds, wich méns that if a werd with 2 fónèms is the sám in 2 languajes, thés 2 languajes wil hav 4 points similarity (2 points for éch letter), and if a werd with 10 fónèms is the sám in 2 languajes, thay get 20 points similarity. Na metodo 2, cada palavra hav un maximu de 10 posible punt et un minimu de 5 punto (die 50% der mögliche punkte entspreke wür - mid weniger as das, de eenligheid is kum erkennbar).

I just study the similarity in pronunciation, not in written language. And i dont compare grammar, only vocabulary. And i have 2 counting methods: in method 1 i compare simply the number of equal or similar sounds, which means that if a word with 2 phonemes is the same in 2 languages, these 2 languages will have 4 points similarity (2 points for each letter), and if a word with 10 phonemes is the same in 2 languages, they get 20 points similarity. In method 2, every word has a maximum of 10 possible points and a minimum of 5 points (which would be 50% of the possible points - with less than that, the similarity is hardly recognizable).

xxx

In method 1, i count 2 points for the sám letter, 1,5 point for the sám letter in a rong pozicion (1 pozicion tu the left or tu the rít), 1 point for 2 pozicions rong, 0,5 point for 3 pozicions rong. Pois ai puntos pro similare letras, as I e Y, D e T, shwa et un otre vocal. Si 1 pozicion fals, 0,5 punt. Für foneme one entspreku in die wort in el andre sprake, 2 punkte werd abnommen.

In method 1, i count 2 points for the same letter, 1,5 point for the same letter in a wrong position (1 position to the left or to the right), 1 point for 2 positions wrong, 0,5 point for 3 positions wrong. Then there are points for similar letters, like I and Y, D and T, or schwa and another vowel; a similar letter that is 1 position off gets 0,5 point. For phonemes without correspondence in the word of the other language, 2 points are taken off.
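A rough sketch of how method 1 could be coded, not the procedure actually used for the table. The post doesnt say how phonemes are paired up, how many points a similar letter gets in the same position, or whether unmatched phonemes are counted in both words, so the greedy pairing, the value 1.0 and the two-sided penalty below are assumptions. The names `SIMILAR_PAIRS` and `method1` are made up for the illustration, and words are given as sequences of phoneme symbols.

```python
# Points for the same phoneme, indexed by how many positions it is displaced.
SAME = {0: 2.0, 1: 1.5, 2: 1.0, 3: 0.5}
# Points for a similar phoneme. Only the 1-position value (0.5) is in the post;
# the same-position value of 1.0 is an assumption.
SIMILAR = {0: 1.0, 1: 0.5}
# Hypothetical similarity pairs, taken from the examples I/Y and D/T.
SIMILAR_PAIRS = {frozenset("iy"), frozenset("dt")}
# Points taken off per phoneme without correspondence.
PENALTY = 2.0


def pair_points(x, y, offset):
    """Points for pairing phoneme x with phoneme y, `offset` positions apart."""
    if x == y:
        return SAME.get(offset, 0.0)
    if frozenset((x, y)) in SIMILAR_PAIRS:
        return SIMILAR.get(offset, 0.0)
    return 0.0


def method1(a, b):
    """Score two words given as sequences of phoneme symbols."""
    # List every possible pairing, best first, then pair greedily (assumption).
    candidates = sorted(
        ((pair_points(x, y, abs(i - j)), i, j)
         for i, x in enumerate(a) for j, y in enumerate(b)),
        key=lambda c: (-c[0], c[1], c[2]),
    )
    used_a, used_b, total = set(), set(), 0.0
    for points, i, j in candidates:
        if points > 0 and i not in used_a and j not in used_b:
            used_a.add(i)
            used_b.add(j)
            total += points
    # Assumption: unmatched phonemes in either word each cost 2 points.
    unmatched = (len(a) - len(used_a)) + (len(b) - len(used_b))
    return total - PENALTY * unmatched


print(method1("da", "da"))  # identical word with 2 phonemes
print(method1("da", "ta"))  # D and T count as similar
```

With these assumptions an identical 2-phoneme word gets 4 points and an identical 10-phoneme word gets 20, as described for method 1.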

xxx

In metod 2, wenn en word zo bispil 7 fonemen het, bedeud es dass 14 punkte möglig weren. If the werds in the tu languajes giv 10 points, it méns that 71% of the posible points ar mád, wich méns that for the maximum of 10 points it gets 7 points. In ta metodo nenun punt is abstrait.

In method 2, if a word has say 7 phonemes, it means that 14 points could be made. If the words in the two languages give 10 points, it means that 71% of the possible points are made, which means that for the maximum of 10 points it gets 7 points. In this method, no points are taken off.
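The arithmetic of method 2 can be sketched like this. The function name `method2` is invented, and two details are assumptions because the post doesnt spell them out: the result is rounded to the nearest whole point, and a word that ends up below the minimum of 5 counts as 0.

```python
import math


def method2(raw_points, phoneme_count):
    """Scale the points made by a word pair to a 0-10 range.

    raw_points: points made for matching phonemes (nothing is taken off).
    phoneme_count: number of phonemes in the word, so 2 * phoneme_count
    points are possible.
    """
    possible = 2 * phoneme_count
    # Assumption: round to the nearest whole point, halves rounding up.
    scaled = math.floor(10 * raw_points / possible + 0.5)
    scaled = min(scaled, 10)
    # Assumption: below 5 points the similarity is not counted at all.
    return scaled if scaled >= 5 else 0


print(method2(10, 7))  # 10 of 14 possible points is 71%, which gives 7
```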

xxx

Je mer punkte, desto starker el eenliheit. Unnersheden von 10, 20 or 30 punkt sind nich statistish signifikant. The abreviations: bi - bielarusski, bra - braziliano portugaliano, bu - bulgarski, cze - ceski, da - danishe, de - deutsh, en - english, es - espanian, et - eestiano, fra - franciano, gre - grekian, is - islandish, it - italiano, ju - serbokrovatski, la - latviano, li - lietuviano, mag - magyariano, mak - makedonski, mal - maltiano, ne - hollandishe, no - norgishe, pla - plattdüütsh, pol - polski, por - portugalian (europano), ro - romeniano, ru - russki, shi - shqiperiano, sko - slovaski, sno - slovenski, su - suomiano, sve - sverigishe, tü - türkian, uk - ukrainski.

The more points, the stronger the similarity. Differences of 10, 20 or 30 points arent statistically significant. The abbreviations: bi - belarusian, bra - brazilian portuguese, bu - bulgarian, cze - czech, da - danish, de - german, en - english, es - spanish, et - estonian, fra - french, gre - greek, is - icelandic, it - italian, ju - serbo-croatian, la - latvian, li - lithuanian, mag - hungarian, mak - macedonian, mal - maltese, ne - dutch, no - norwegian, pla - low german (plattdüütsh), pol - polish, por - portuguese (european), ro - romanian, ru - russian, shi - albanian, sko - slovak, sno - slovenian, su - finnish, sve - swedish, tü - turkish, uk - ukrainian.

Ai tamben una tabel onlain, co colores pro la diverso niveles de similaritee: http://www.zedorock.net/distans.pdf.
There is also a table online, with colors for the various levels of similarity: www.zedorock.net/distans.pdf


BI-uk369-ru321-pol238-ju189-sko153-cze151-sno136-bu117-mak98-la64-li61-it50-bra49-de45-ro44-es43-pla/shi40-por39-ne34-su30-gre/sve24-mal/no22-fra20-et/is16-tü13-da12-en10-mag6 - 2.447 - FA 196 - AF 30 = 113 (12) - 1.772-675

BRA-por495-es342-it270-fra169-mal131-ro115-shi91-sno65-ru63-pol61-da58-bu57-no55-en/uk54-mak53-sve51-bi49-ju45-de42-la/li/ne40-sko39-pla33-cze32-su26-is24-et20-gre17-mag14-tü12 - 2.160 - FA 224 - AF 46 = 135 (12)

BU-mak379-ru236-ju203-uk183-sno165-pol128-bi117-cze/sko114-bra57-it51-la49-es/por45-ro42-de33-li31-pla30-su29-shi28-da/mal22-et18-en17-fra/mag/ne/no15-tü11-sve10-gre/is9 - 2.200 - FA 182 - AF 25 = 103 (11)

CZE-sko450-pol262-sno204-ju177-ru164-uk155-bi151-mak118-bu114-la54-es44-li39-su38-bra32-it/por30-et29-de/ro27-mag26-fra/pla25-ne22-da/shi15-mal13-no/sve12-gre11-tü9-is6-en1 - 2.267 - FA 199 - AF 21 = 110 (11)

DA-no460-sve385-pla193-ne191-de189-is155-en126-ro72-fra/shi64-bra58-it57-es51-la/mal49-por46-mak43-su36-pol34-mag32-uk31-gre28-li27-ju/ru26-et24-bu22-sko/sno20-tü19-cze15-bi12 - 2.566 - FA 242 - AF 36 = 139 (12)

DE-pla408-ne333-da189-sve174-en/no165-it92-ro91-is85-la79-es67-shi57-mak/ru52-fra/pol51-uk50-li47-bi/mag45-bra/ju42-et41-por40-mal38-sno37-bu/su33-cze27-sko26-gre24-tü14 - 2.653 - FA 217 - AF 47 = 132 (11)

EN-ne/pla185-de165-sve136-no131-da126-ro72-fra64-es/it60-mal57-por55-bra54-is52-shi47-la38-mag25-mak23-ru22-ju/pol19-bu/et17-uk16-gre15-sno14-li12-bi10-sko8-su/tü5-cze1 - 1.713 - FA 140 - AF 30 = 85 (2)

ES-por385-it360-bra342-fra196-ro169-mal106-pla90-shi86-de/ne67-ru61-en60-la55-da51-pol49-bu45-cze/sno/uk44-bi/sve43-no41-ju40-gre39-mak37-et34-li/sko31-su29-mag18-is16-tü8 - 2.344 - FA 277 - AF 45 = 161 (11)

ET-su165-de41-sno38-ro36-la35-es34-ru33-ne/pla/sve32-sko31-pol30-cze29-it/mag28-is/no26-da24-fra/uk22-ju/li21-bra20-bu18-en/por17-bi/shi16-mak15-tü11-gre10-mal8 - 884 - FA 96 - AF 23 = 59 (0)

FRA-it220-es196-por192-ro175-bra169-shi99-da/en64-ne/sve61-no56-mal53-de51-ru50-pla45-gre41-la40-sno29-pol28-cze25-uk24-et/mak22-bi20-ju/sko19-tü18-li17-bu15-is/mag12-su10 - 1.743 - FA 195 - AF 36 = 115 (6)

GRE-fra41-es39-shi36-it/su30-ru29-da/la28-ne27-pol25-bi/de/pla24-uk22-mal/no20-ro/sve19-ju18-bra17-en/li15-is/tü13-por12-cze/mak/sno11-et10-bu9-sko8-mag4 - 615 - FA XX - AF 19 = 20 (-3)

IS-sve233-no200-da155-de85-ne84-pla77-en52-li32-et26-ro25-bra/su24-shi/uk22-it20-la18-bi/es/sno16-pol15-mal14-gre/mag/por13-fra/mak12-ju10-bu/ru9-sko8-tü7-cze6 - 1.244 - FA 126 - AF 14 = 70 (0)

IT-es360-bra270-por268-ro225-fra220-mal143-shi109-de92-pla85-ne78-la63-ru61-en60-sve58-da/uk57-no53-bu51-bi50-sno46-pol43-mak40-ju39-li36-cze/gre/su30-et/mag28-sko27-is20-tü14 - 2.411 - FA 268 - AF 49 = 158 (13)

JU-sno296-mak285-uk236-sko211-bu203-bi/ru189-pol178-cze177-li61-la58-bra45-de42-es40-it/pla39-shi38-por33-ro32-da26-mag/no/su/sve25-ne22-et21-en/fra19-gre18-mal15-is10-tü5 - 2.601 - FA 218 - AF 28 = 123 (13)

LA-li192-ro82-de79-pol70-pla68-bi64-it63-mak62-ne59-ju/ru58-es55-cze/uk54-shi50-bu/da49-no/sve48-mal46-sno45-sko42-bra/fra40-en38-et35-por34-gre28-su24-mag21-is18-tü6 - 1.639 - FA 192 - AF 48 = 120 (0) - 192-1447

LI-la192-uk69-pol65-bi/ju/sko61-no49-mak48-de/sno47-ru46-ro45-sve42-bra40-cze39-it/shi36-is32-bu/es31-pla30-por29-ne28-da27-su22-et/mal21-fra17-gre15-en12-mag11-tü6 - 1.273 - FA 192 - AF 42 = 117 (0)

MAG-de/ne45-pla/su40-no33-da32-sve31-et/it/ro/sno28-mak/shi27-cze26-en/ju25-la/sko21-por19-es18-pol17-bu15-bra/uk14-is13-fra12-li11-mal10-ru/tü8-bi6-gre4 - 703 - FA 34 - AF 21 = 27 (-6)

MAK-bu379-ju285-sno218-uk190-ru171-pol151-sko137-cze118-bi98-la62-bra/shi53-de52-li/pla48-por45-da43-it40-no38-es37-sve36-mal/ro33-ne31-mag27-en23-fra22-su17-et15-is12-gre11-tü8 - 2.481 - FA 194 - AF 33 = 113 (11)

MAL-it143-bra131-por115-es106-shi65-ro59-en57-fra53-da49-ne/no48-la46-sve43-pla42-de38-mak33-pol/uk26-bi/bu/sko22-li/su21-gre20-sno19-ru16-ju/tü15-is14-cze13-mag10-et8 - 1.094 - FA XX - AF 35 = 35 (3)

NE-pla405-de333-no/sve205-da191-en185-ro95-is84-it78-es67-shi65-fra61-la59-mal48-mag45-uk42-bra40-sno/su38-ru37-bi34-et32-mak31-sko30-li/pol/por28-gre27-cze/ju22-bu15-tü14 - 2.561 - FA 229 - AF 39 = 134 (13)

NO-sve511-da460-ne205-is200-pla196-de165-en131-shi64-ro57-fra56-bra55-it53-li49-la/mal48-es/pol41-mak38-uk36-mag33-ru31-sno30-por29-su27-et26-ju25-bi/sko22-gre20-tü18-bu15-cze12 - 2.682 - FA 266 - AF 33 = 149 (15)

PLA-de408-ne405-no196-sve194-da193-en185-es90-it85-is77-ro75-la68-mak48-shi46-fra45-ru/uk44-sno43-mal/pol42-bi/mag40-ju39-bra/por33-et32-bu/li30-cze25-gre/su24-sko23-tü13 - 2.683 - FA 236 - AF 42 = 139 (12)

POL-sko289-cze262-uk246-bi238-ru234-sno185-ju178-mak151-bu128-la70-li65-bra61-por58-de51-es49-it/shi43-pla42-no/sve41-ro36-da34-et/su30-fra/ne28-mal26-gre25-tü21-en19-mag17-is15 - 2.723 - FA 212 - AF 36 = 124 (14)

POR-bra495-es385-it268-fra192-ro130-mal/shi115-pol58-en55-uk50-da46-bu/mak45-sno43-ru42-de40-bi39-la34-ju/pla33-sko31-cze30-li/no29-ne28-sve24-tü22-mag/su19-et17-is13-gre12 - 2.041 - FA 243 - AF 39 = 141 (13)

RO-it225-fra175-es169-por130-bra115-shi108-ne95-de91-la82-pla75-da/en72-sve62-ru60-mal59-no57-uk51-li45-bi44-bu42-et/pol36-mak33-ju32-mag28-cze27-is25-sko/su23-sno21-gre19-tü9 - 2.023 - FA 174 - AF 49 = 111 (6)

RU-bi321-uk266-bu236-pol234-sko197-ju189-sno179-mak171-cze164-bra63-es/it61-ro60-la58-de52-fra50-shi48-li46-pla44-por42-ne37-sve34-et33-no31-gre29-da26-su24-en22-mal16-tü11-is9-mag8 - 2.759 - FA 217 - AF 36 = 126 (12)

SHI-por115-it109-ro108-fra99-bra91-es86-mal/ne65-da/no64-de57-mak53-sve52-la50-ru48-en47-pla/uk46-pol43-bi40-ju38-gre/li36-sko29-bu28-mag27-su26-is22-sno21-tü20-et16-cze15 - 1.570 - FA XX - AF 50 = 50 (3)

SKO-cze450-pol289-sno245-uk222-ju211-ru197-bi153-mak137-bu114-li61-la42-bra39-es/et/por31-ne30-shi29-it/su27-de26-pla/ro23-mal/no/sve22-mag21-da20-fra19-tü12-en/gre/is8 - 2.533 - FA 224 - AF 23 = 123 (13)

SNO-ju296-sko245-mak218-uk216-cze204-pol185-ru179-bu165-bi136-bra65-li47-it46-la45-es44-pla/por43-et/ne38-de37-sve33-su32-no30-fra29-mag28-ro/shi21-da20-mal19-is16-en14-gre11-tü8 - 2.474 - FA 204 - AF 28 = 116 (13)

SU-et165-mag/uk40-cze/ne38-sve37-da36-de33-sno32-bi/gre/it/pol30-bu/es29-no/sko27-bra/shi26-ju25-is/la/pla/ru24-ro23-li22-mal21-por19-mak17-tü12-fra10-en5 - 967 - FA 102 - AF 26 = 64 (0)

SVE-no511-da385-is233-ne205-pla194-de174-en136-ro62-fra61-it58-shi52-bra51-la48-es/mal43-li42-pol41-uk38-su37-mak36-ru34-sno33-et32-mag31-ju25-bi/por24-sko22-gre19-tü18-cze12-bu10 - 2.683 - FA 262 - AF 35 = 148 (14)

TÜ-por22-pol21-shi20-da19-fra/no/sve18-mal15-de/it/ne/uk14-bi/gre/pla13-bra/sko/su12-bu/et/ru11-cze/ro9-es/mag/mak/sno8-is7-la/li6-en/ju5 - 374 - FA XX - AF 11 = 11 (-11)

UK-bi369-ru266-pol246-ju236-sko222-sno216-mak190-bu183-cze155-li69-it57-bra/la54-ro51-de/por50-shi46-es/pla44-ne42-su40-sve38-no36-da31-mal26-fra24-et/gre/is22-en16-mag/tü14 - 2.895 - FA 231 - AF 36 = 133 (16)
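For anyone who wants to work with the rows above, here is a small sketch that reads one row into a dictionary. The function name `parse_row` is made up, and it only reads the part before the totals; the layout it expects (codes joined by "/" sharing one score, entries joined by "-") is simply what the rows look like.

```python
import re


def parse_row(row):
    """Split one table row into (language code, {other language: points})."""
    # Everything after the first " - " is totals and averages; drop it here.
    head = row.split(" - ")[0]
    code, *entries = head.split("-")
    scores = {}
    for entry in entries:
        # An entry is one or more codes joined by "/" followed by a number,
        # e.g. "pla/shi40"; a stray space before the number is tolerated.
        match = re.fullmatch(r"([^\d\s]+)\s*(\d+)", entry.strip())
        if match is None:
            raise ValueError(f"unrecognised entry: {entry!r}")
        for lang in match.group(1).split("/"):
            scores[lang] = int(match.group(2))
    return code.lower(), scores


# A shortened version of the BI row, just to show the shape of the result.
code, scores = parse_row("BI-uk369-ru321-pla/shi40-mag6 - 2.447 - FA 196 - AF 30 = 113 (12)")
print(code, scores)
```

Once all rows are read this way, it is easy to check that every pair has the same score in both rows.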

xxx

Cuant in comun la linguas ha co linguas de su propra familias (baltica, fino-ugrica, germanishe, romanic e slavski):
How much in common the languages have with languages of their own families (baltic, finno-ugric, germanic, romanic and slavic):

FA - es277 - it268 - no266 - sve262 - por243 - da242 - pla236 - uk231 - ne229 - bra/sko224 - ju218 - de/ru217 - pol212 - sno204 - cze199 - bi196 - fra195 - mak194 - la/li192 - bu182 - ro174 - en140 - is126 - su102 - et96 - mag34

Wifil gemeinsam ha die spraken mit spraken fon andre familien:
How much in common the languages have with languages of other families:

AF - shi50 - it/ro49 - la48 - de47 - bra46 - es45 - li/pla42 - ne/por39 - da/fra/pol/ru/uk36 - mal/sve35 - mak/no33 - bi/en30 - ju/sno28 - su26 - bu25 - et/sko23 - cze/mag21 - gre19 - is14 - tü11

Wuveel gemeensam hebben de spraken mid all annere spraken (in dörsnid):
How much in common the languages have with all other languages (on average):

MED - es161 - it158 - no149 - sve148 - por141 - da/pla139 - bra135 - ne134 - uk133 - de132 - ru126 - pol124 - ju/sko123 - la120 - li117 - sno116 - fra115 - bi/mak113 - ro111 - cze110 - bu103 - en85 - is70 - su64 - et59 - shi50 - mal35 - mag27 - gre20 - tü11
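How FA, AF and MED relate can be checked against the BI and LA rows, which carry two extra subtotals at the end (1.772-675 and 192-1447). The sketch below assumes these are the family and non-family sums, and that the averages are rounded down; the divisors (9 and 22 for BI, 1 and 30 for LA) are not given in the post but are the ones that reproduce the posted figures, which seems to mean the bra score is left out of the sums. The function name `averages` is invented.

```python
def averages(family_total, family_count, other_total, other_count):
    """Return (FA, AF, MED) from the two subtotals of a row.

    Assumption: every average is rounded down to a whole number.
    """
    fa = family_total // family_count   # average inside the family
    af = other_total // other_count     # average outside the family
    med = (fa + af) // 2                # average of the two averages
    return fa, af, med


print(averages(1772, 9, 675, 22))   # BI row: FA 196 - AF 30 = 113
print(averages(192, 1, 1447, 30))   # LA row: FA 192 - AF 48 = 120
```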

xxx

The germanishe languaj with the best conection tu its famly: norgish. La lingua co la pir coneccion: islandish.
The germanic language with the best connection to its family: norwegian. The one with the worst connection: icelandic.

De romanico sprak mid de beste verbinn zo eer familie: espanian. La lingua co la pir coneccion: romenian.
The romanic language with the best connection to its family: spanish. The one with the worst connection: romanian.

Die slavski sprake mit der beste ferbindu zu ir familie: ukrainski. The wun with the werst conection: bulgarski.
The slavic language with the best connection to its family: ukrainian. The one with the worst connection: bulgarian.

xxx

La lingua germanishe co la meyor coneccion co linguas fora su propra familia: deutsh. De slechteste: islandish.
The germanic language with the best connection with languages outside its own family: german. The worst: icelandic.

La romanico lingua co la meyor coneccion co linguas fora su propra familie: italiano/romenian. Die slechteste: francian.
The romanic language with the best connection with languages outside its own family: italian/romanian. The worst: french.

The slavski languaj with the best conection with languajes outsíd its ón famly: polski/russki/ukrainski. La peor: ceski.
The slavic language with the best connection with languages outside its own family: polish/russian/ukrainian. The worst: czech.

xxx

De germanishe sprak mid de beste dörsnid von punkten in un ut de familie: norgish. La pir: islandish.
The germanic language with the best average of family and outside-family points: norwegian. The worst: icelandic.

Die romanico sprake mit die beste dursnitt fon punkte in un aussa die familie: espanian. The werst: romenian.
The romanic language with the best average of family and outside-family points: spanish. The worst: romanian.

La slavski lingua co la meyor media na familia y fora: ukrainski. De slechteste: bulgarski.
The slavic language with the best average of family and outside-family points: ukrainian. The worst: bulgarian.

xxx

La lingua co la maximo numero de punto (mas eso no signifi multu, porke slavski linguas, ki è mais numeroz, ten un vantag): ukrainski (2.895).
The language with the maximum number of points (but it doesnt mean much, because slavic languages, being more numerous, have an advantage): ukrainian (2.895).

De sprak mid der kleinste zall punkten: türkian (374).
The language with the smallest number of points: turkish (374).

La lingua co la meyor media na familie: espanian (277).
The language with the best average inside the family: spanish (277).

Die sprake mit die slechteste dursnitt in der familie: magyar (34).
The language with the worst average in the family: hungarian (34).

The best avraj outsíd the famly: italian (52).
The best average outside the family: italian (52).

La peor media fora du familia: türkian (11).
The worst average outside the family: turkish (11).

De beste dörsnidd all in all: italian (160).
The best average altogether: italian (160).

La pir media in total: türkian (11).
The worst average altogether: turkish (11).

Die 2 spraken die nahest zueinanda sei: norgish un sverigish (511).
The 2 languages that are the nearest to each other: norwegian and swedish (511).

The 2 languajes of the sám famly that ar the móst distant from éch uther: eestian e magyarian (28).
The 2 languages of the same family that are the most distant from each other: estonian and hungarian (28).

La 2 lingua de familias diferenta ki è la mas proxim un del otru: italian un maltian (143).
The 2 languages from different families that are the nearest to each other: italian and maltese (143).

xxx

Een kunn fragen wie es kaamt dass francian un inglish, woll de influssrikste spraken in Europa, hebben so minn gemeensam mid annere spraken. I gess that the rézon is that thay evolvd a lot, and lost meny sounds in thare history. Meme la coneccion inter lus mem è pluto debil, lo ki serai no la cazo si la lingua analizee serai la scrivee. Mas pensa pro exemplo na palavra 'points', ki è scrived exactli na mesma manera na dos lingua: in der sribene sprake würd es 12 punkte bedeut, aber in der sprokene sprake el inglish un die franciano förme ha nur 1 punkt in der metod 1 un kei punkt in metode 2.

One might ask why french and english, supposedly the most influential languages in Europe, have rather little in common with other languages. I guess that the reason is that they evolved a lot, and lost many sounds in their history. Even the connection between the two is rather weak, which wouldnt be the case if it was the written language that was analysed. But consider for example the word 'points', which is written exactly the same in both languages: in the written language it would get 12 points, but in the spoken language the english and the french forms have only 1 point in method 1 and no points at all in method 2.

xxx

Das könnte sik en bisschen ennern wenn ik en anner text neem: for this text i took the begining of a Harry Potter book, wich is rather a simple text, éven if it has a rather elaboràted languaj, so the proportion of germanic werds is much hier in inglish than in a rather academic text. Mi próximo texto va ser noticias de un jornal, inton la proximitee af inglish a la romanico linguas devai deveni plus hai.

This might change a bit when i take another text: for this text i took the beginning of a Harry Potter book, which is a rather simple text, even if its language is rather elaborate, so the proportion of germanic words in english is much higher than in a more academic text. My next text will be news from a newspaper, so the proximity of english to the romanic languages should get higher.