Knowledge

:Knowledge Signpost/2019-11-29/Special report - Knowledge

Source 📝

1309:
is the PRC. Thirdly, the records are likely to be skewed not simply towards native speakers (as the article hypothesises), but also towards fluent non-native speakers. That would explain, eg, the high performance of the Netherlands and Germany (the latter of which has more fluent speakers of English than Australia), and possibly also Brazil, a country with a large population (about 2 1/2 times that of Germany) and mandatory learning of at least one foreign language for all 12 grades of compulsory schooling. Fourthly, one needs to be cautious about Canada, where only about 56% of the population speaks English as a mother tongue, and about 21% use French as their mother tongue. Fifthly, the figures indicate that some countries with smaller populations have disproportionately large numbers of contributors. So, eg, New Zealand and Ireland both have populations of about one fifth of that of Australia, but New Zealand has more than one fifth as many prolific contributors, and Ireland has more than one fifth as many small contributors. Similarly, the number of contributors from the UK (both prolific and small) is disproportionately large by comparison with the USA.
1331:
the data. This dataset has some special quirks, designed right-in to hide identities, and even the most basic measure - what is an "edit"? - is pretty vague covering everything from changing a comma to a semi-colon, to adding in 1,000 words to an article. Beyond simple curiosity, I suppose my motivation has to do mostly with so-called "political bias". A lot of Americans seem to think WP has a liberal bias, but is that due to age, gender, or country of residence of editors? I do think that this dataset will be examined in detail, so getting out all the quirks, biases, and hypotheses now is a worthwhile exercise. Thanks.
110: 130: 1067: 191:. It allows the public to see, more or less, how many active editors (5–99 edits in a month) and very active editors (100+ edits) from about 180 individual countries contribute to active Knowledge versions, each month from January 2019 onward. For example, if you wanted to know how many people editing from the UK made more than 99 edits to the French version of Knowledge in September, you can look it up in this dataset. The answer is somewhere between 11 and 20. 90: 120: 36: 140: 100: 601:
Dominican Republic. Note that Venezuela and Cuba are excluded by the WMF from the dataset. The population rankings for native English-speaking countries are almost identical to the rankings in Knowledge contributions of the same countries. But the population rankings for native Spanish-speaking countries are much less similar to their rankings in Knowledge Spanish-language contributions.
150: 103: 1308:
be a record of the nationality of the contributor. Secondly, the assumed location may not be correct. So, eg, if a contributor in the PRC is using a VPN that says that the contributor's location is the USA, then the record will show the USA, not the PRC, as the location, even though the true location
1037:
Another area of interest might involve combining this dataset with other datasets. For example, say a program is undertaken to increase the quality – rather than the quantity – of articles about country Z. Using this data in conjunction with data on readership might give a more complete understanding
587:
Six rich European Union countries where English is not the mother tongue, Germany, the Netherlands, Italy, Sweden, France and Spain, together account for 8.4% of the reported very active editors. Of the countries in this table, only the rankings of Brazil and perhaps South Africa do not appear to be
1330:
Thanks for this - all datasets have limits or quirks of course, and it's important for everybody to understand the limits. Also you're starting to get into some new hypotheses about the data (e.g. from foreign students, fluent non-native speakers, or VPNs), which is the start of really understanding
1303:
These are interesting figures, but they need to be viewed with caution, for a number of reasons, including the following. For a start, the records are of the assumed location, and not necessarily the nationality, of the contributor. So, eg, if a contributor is a foreign student in the USA, the UK or
600:
Nevertheless, wealth – or perhaps dialect – may be playing a stronger role in eswiki than it does in enwiki. The 12 largest countries by native Spanish-speaking population are, in order, Mexico, Colombia, Spain, Argentina, the United States, Venezuela, Peru, Chile, Ecuador, Cuba, Guatemala, and the
596:
Table 2 shows analogous rankings for the Spanish language Knowledge. While Spain and Argentina combine for slightly over half of the reported very active editors, the very active editors are distributed more evenly over all the reported countries. Only one country without Spanish as its predominant
564:
The countries with the most very active editors in enwiki are the US (43%) and the UK (17%) , or almost 60% of the total reported editors between them. The two large rich countries predominate. Two rich but less populous countries, Canada and Australia, are also well-represented with almost 12% of
1034:
will likely be of greater interest. For example, let's say that there was a new program introduced intended to increase the number of editors from country Y. The full effects of the program might not be seen after 9 months, but after 2 or 3 years hopefully any effects could be seen in the data.
1033:
Time is the main variable of interest that was left out of the above examinations. Right now we could see how edit contributions from different countries change over the nine months from January through September 2019. As time goes by, more months of data will be released, and the effect of time
198:
are excluded, e.g. China, Kazakhstan, Russia, Saudi Arabia and Venezuela. Exact data on the number of editors in each category (editors from country x who edited Knowledge version y) are not given. Rather these numbers are only given in “buckets” of ten: 1–10, 11–20, 21–30, 31–40, etc. Technical
172:
Let's say you are interested in how many active editors from France are editing the English-language Knowledge; or conversely, you'd like to know how many editors from the UK are editing the French-language Knowledge. All the necessary information needed to calculate these numbers is recorded, at
210:
But enough for the preliminaries! What questions can the dataset answer that I’ve been dying to know the answer to? The following analysis is only the briefest overview of data from one month, September, quickly done. It’s not in any sense academic research, but hopefully will allow people to
884:
Table 3 shows how very active editors from the US and the UK edit the non-English Wikipedias. Altogether very active editors from the US edit in 44 different Knowledge versions. Those from the UK edit in 29 versions. Among those versions with 11–20 very active editors from the US are an
1162: 583:
but the first language of only a small fraction. The Philippines, with nearly 2% of the reported very active editors, may be affected by similar factors as India. The percentages of reported active editors (5–99 edits) appear to be similar to the percentages for very active editors.
143: 113: 597:
language, the United States, has a fairly large proportion of the very active editors. The same three factors that seem to explain the rankings for enwiki editors, mother tongue, population, and wealth, may very well explain the rankings for eswiki as well.
70: 234:
Table 1 shows the 11 countries with the most active editors and the 11 with the most very active editors to enwiki (14 countries total), plus two other large English-speaking countries, Ireland and South Africa. Numbers marked * are not in the largest 11.
177:
database you could never find those numbers. The WMF did not wish to disclose this data out of concerns that the numbers were precise enough that governments or others could back out material that might lead to the identification of individual editors.
153: 133: 218:
What countries contribute most to the English-language Knowledge (enwiki)? Are they the richer, or the more populous English-speaking countries? Or perhaps those countries where English is widely spoken as a second
568:
The much smaller but still relatively rich New Zealand and Ireland, with about 1% of the total reported very active editors each, trail among those countries where English is the predominant first language.
579:
India, which has the 5th largest group of very active editors (4%) and third largest group of active editors (9%), has a very large population, for whom English is an important
576:. The four countries with the largest native English-speaking populations are also the largest four contributors to enwiki – in the same order: USA, UK, Canada, and Australia. 76: 222:
Do these relations differ across different Knowledge language versions? Answering the above questions for the Spanish-language Knowledge (eswiki) allows a simple comparison.
1198:
Well I am a Knowledge editor from Sri Lanka and of course English is not a popular language though it is one of the official languages of the country. It is regarded as a
1344: 1318: 1177: 1296: 1226: 1111: 1106: 879: 1141: 1121: 123: 1131: 1091: 885:
interesting mix of the Chinese, Spanish, Farsi (Persian), Japanese, and Russian Wikipedias. The similar data from UK editors only includes the French Knowledge.
1116: 1096: 1054: 1045: 1101: 195: 1277: 1084: 1211: 1028: 1146: 1078: 55: 44: 1202:
in the country. Surprised to Portugese speaking Brazil in the top 15 list even ahead of South Africa for gaining popularity in English Knowledge.
1126: 1136: 1409: 1182: 225:
And finally, how do contributions across countries to different language versions compare. Edits from the US and UK are examined here.
1189: 21: 1385: 1166: 93: 1380: 1375: 1370: 1365: 1066: 49: 35: 17: 211:
understand what type of data the dataset contains and what type of questions it can be used to address.
573: 173:
least temporarily, by the Wikimedia Foundation, but unless you worked for the WMF and had access to the
591: 229: 200: 183: 1222: 580: 1391: 1337: 1270: 30:
How many people edit in your favorite language? Where are they from?: Only now can we say!
8: 1314: 174: 1292: 1173: 1218: 1207: 588:
directly explained by the three factors of mother tongue, population, and wealth.
1332: 1284: 1265: 163: 1325: 1310: 1304:
Australia, all of which have big foreign student populations, the records will
1403: 1199: 204: 1288: 1203: 194:
Because of privacy concerns exact numbers are not given. Data from
181:
This month a new dataset was made public by the Wikimedia Foundation
71:
How many people edit in your favorite language? Where are they from?
572:
The proportion of native English speakers by country is shown at
1217:
Fascinating. I'm surprised Nigeria did not make the list. -
1241:language . country quant. lower limit upper limit 1187:If your comment has not appeared here, you can try 880:
US and UK editors editing on non-English Wikipedias
1401: 214:My main questions – of personal interest – are: 565:the total very active editors between them. 1235:This is all there is from the September file: 161: 1029:So what else can you do with this dataset? 1190: 14: 1402: 574:English language#Pluricentric English 54: 29: 1410:Knowledge Signpost archives 2019-11 27: 1065: 56: 34: 28: 1421: 1251:enwiki Nigeria 5 to 99 251 260 1248:enwiki Nigeria 100 or more 11 20 1172:These comments are automatically 148: 138: 128: 118: 108: 98: 88: 1360:: doing it for free since 2005. 1038:of the effects of the program. 1264:yowiki is prob. yorba (sp?) . 1260:yowiki Nigeria 5 to 99 1 10 1257:jawiki Nigeria 5 to 99 1 10 1254:hawiki Nigeria 5 to 99 1 10 1245:arwiki Nigeria 5 to 99 1 10 1183:add the page to your watchlist 13: 1: 1297:15:13, 30 November 2019 (UTC) 1278:14:12, 30 November 2019 (UTC) 1227:08:04, 30 November 2019 (UTC) 1212:03:36, 30 November 2019 (UTC) 1345:17:19, 2 December 2019 (UTC) 1319:09:46, 2 December 2019 (UTC) 1158: 18:Knowledge:Knowledge Signpost 7: 10: 1426: 203:. The data are available 199:information is available 189:Active Editors by country 1180:. To follow comments, 1070: 39: 1069: 581:medium of instruction 187:, or more informally 38: 1176:from this article's 840:Dominican Republic 1167:Discuss this story 1112:Arbitration report 1107:On the bright side 1071: 1058:"Special report" → 175:Geoeditors Monthly 45:← Back to Contents 40: 1191:purging the cache 1142:From the archives 1122:Technology report 1026: 1025: 877: 876: 592:Who edits eswiki? 562: 561: 230:Who edits enwiki? 184:Geoeditors/Public 50:View Latest Issue 1417: 1394: 1340: 1329: 1273: 1194: 1192: 1186: 1165: 1089: 1081: 1079:29 November 2019 1074: 1057: 1050:"Special report" 1049: 888: 887: 858:Total (in table) 604: 603: 543:Total (in table) 238: 237: 166: 152: 151: 142: 141: 132: 131: 122: 121: 112: 111: 102: 101: 92: 91: 62: 60: 58: 57:29 November 2019 1425: 1424: 1420: 1419: 1418: 1416: 1415: 1414: 1400: 1399: 1398: 1397: 1396: 1395: 1390: 1388: 1383: 1378: 1373: 1368: 1361: 1353: 1352: 1343: 1338: 1323: 1285:Yoruba language 1276: 1271: 1196: 1188: 1181: 1170: 1169: 1163:+ Add a comment 1161: 1157: 1156: 1155: 1132:Recent research 1092:From the editor 1082: 1077: 1075: 1072: 1061: 1060: 1055: 1052: 1047: 1041: 1040: 1031: 1019:United Kingdom 1008:United Kingdom 997:United Kingdom 891:Version edited 882: 631:total reported 619:total reported 594: 287:United Kingdom 265:total reported 253:total reported 232: 168: 167: 160: 159: 158: 149: 139: 129: 119: 109: 99: 89: 83: 80: 69: 65: 63: 53: 52: 47: 41: 31: 26: 25: 24: 12: 11: 5: 1423: 1413: 1412: 1389: 1384: 1379: 1374: 1369: 1364: 1363: 1362: 1355: 1354: 1351: 1350: 1349: 1348: 1347: 1335: 1300: 1299: 1268: 1262: 1261: 1258: 1255: 1252: 1249: 1246: 1239: 1238: 1237: 1236: 1230: 1229: 1171: 1168: 1160: 1159: 1154: 1152:Special report 1149: 1144: 1139: 1134: 1129: 1124: 1119: 1117:Traffic report 1114: 1109: 1104: 1099: 1097:News and notes 1094: 1088: 1076: 1064: 1063: 1062: 1053: 1044: 1043: 1042: 1030: 1027: 1024: 1023: 1020: 1017: 1013: 1012: 1009: 1006: 1002: 1001: 998: 995: 991: 990: 987: 986:United States 984: 980: 979: 976: 975:United States 973: 969: 968: 965: 964:United States 962: 958: 957: 954: 953:United States 951: 947: 946: 943: 942:United States 940: 936: 935: 932: 931:United States 929: 925: 924: 921: 920:United States 918: 914: 913: 910: 909:United States 907: 903: 902: 901:(lower bound) 900: 898: 895: 892: 881: 878: 875: 874: 869: 867: 862: 860: 854: 853: 850: 847: 844: 841: 837: 836: 833: 830: 827: 824: 820: 819: 816: 813: 810: 807: 803: 802: 799: 796: 793: 790: 786: 785: 782: 779: 776: 773: 772:United States 769: 768: 765: 762: 759: 756: 752: 751: 748: 745: 742: 739: 735: 734: 731: 728: 725: 722: 718: 717: 714: 711: 708: 705: 701: 700: 697: 694: 691: 688: 684: 683: 680: 677: 674: 671: 667: 666: 663: 660: 657: 654: 650: 649: 646: 643: 640: 637: 633: 632: 630: 627: 626:(lower bound) 625: 623: 620: 618: 615: 614:(lower bound) 613: 611: 608: 593: 590: 560: 559: 554: 552: 547: 545: 539: 538: 535: 532: 529: 526: 522: 521: 518: 515: 512: 509: 505: 504: 501: 498: 495: 492: 488: 487: 484: 481: 478: 475: 471: 470: 467: 464: 461: 458: 454: 453: 450: 447: 444: 441: 437: 436: 433: 430: 427: 424: 420: 419: 416: 413: 410: 407: 403: 402: 399: 396: 393: 390: 386: 385: 382: 379: 376: 373: 369: 368: 365: 362: 359: 356: 352: 351: 348: 345: 342: 339: 335: 334: 331: 328: 325: 322: 318: 317: 314: 311: 308: 305: 301: 300: 297: 294: 291: 288: 284: 283: 280: 277: 274: 271: 270:United States 267: 266: 264: 261: 260:(lower bound) 259: 257: 254: 252: 249: 248:(lower bound) 247: 245: 242: 231: 228: 227: 226: 223: 220: 170: 169: 157: 156: 146: 136: 126: 116: 106: 96: 85: 84: 81: 75: 74: 73: 72: 68:Special report 67: 66: 64: 61: 48: 43: 42: 33: 32: 15: 9: 6: 4: 3: 2: 1422: 1411: 1408: 1407: 1405: 1393: 1387: 1382: 1377: 1372: 1367: 1359: 1346: 1341: 1334: 1327: 1322: 1321: 1320: 1316: 1312: 1307: 1302: 1301: 1298: 1294: 1290: 1286: 1282: 1281: 1280: 1279: 1274: 1267: 1259: 1256: 1253: 1250: 1247: 1244: 1243: 1242: 1234: 1233: 1232: 1231: 1228: 1224: 1220: 1216: 1215: 1214: 1213: 1209: 1205: 1201: 1200:link language 1193: 1184: 1179: 1175: 1164: 1153: 1150: 1148: 1145: 1143: 1140: 1138: 1135: 1133: 1130: 1128: 1125: 1123: 1120: 1118: 1115: 1113: 1110: 1108: 1105: 1103: 1100: 1098: 1095: 1093: 1090: 1086: 1080: 1073:In this issue 1068: 1059: 1051: 1039: 1035: 1021: 1018: 1015: 1014: 1010: 1007: 1004: 1003: 999: 996: 993: 992: 988: 985: 982: 981: 977: 974: 971: 970: 966: 963: 960: 959: 955: 952: 949: 948: 944: 941: 938: 937: 933: 930: 927: 926: 922: 919: 916: 915: 911: 908: 905: 904: 897:Editors with 896: 893: 890: 889: 886: 873: 870: 868: 866: 863: 861: 859: 856: 855: 851: 848: 845: 842: 839: 838: 834: 831: 828: 825: 822: 821: 817: 814: 811: 808: 805: 804: 800: 797: 794: 791: 788: 787: 783: 780: 777: 774: 771: 770: 766: 763: 760: 757: 754: 753: 749: 746: 743: 740: 737: 736: 732: 729: 726: 723: 720: 719: 715: 712: 709: 706: 703: 702: 698: 695: 692: 689: 686: 685: 681: 678: 675: 672: 669: 668: 664: 661: 658: 655: 652: 651: 647: 644: 641: 638: 635: 634: 628: 622:Editors with 621: 616: 610:Editors with 609: 607:Editors from 606: 605: 602: 598: 589: 585: 582: 577: 575: 570: 566: 558: 555: 553: 551: 548: 546: 544: 541: 540: 536: 533: 530: 527: 525:South Africa 524: 523: 519: 516: 513: 510: 507: 506: 502: 499: 496: 493: 490: 489: 485: 482: 479: 476: 473: 472: 468: 465: 462: 459: 456: 455: 451: 448: 445: 442: 439: 438: 434: 431: 428: 425: 422: 421: 417: 414: 411: 408: 405: 404: 400: 397: 394: 391: 388: 387: 383: 380: 377: 374: 371: 370: 366: 363: 360: 357: 354: 353: 349: 346: 343: 340: 337: 336: 332: 329: 326: 323: 320: 319: 315: 312: 309: 306: 303: 302: 298: 295: 292: 289: 286: 285: 281: 278: 275: 272: 269: 268: 262: 256:Editors with 255: 250: 244:Editors with 243: 241:Editors from 240: 239: 236: 224: 221: 217: 216: 215: 212: 208: 206: 202: 197: 192: 190: 186: 185: 179: 176: 165: 155: 147: 145: 137: 135: 127: 125: 117: 115: 107: 105: 97: 95: 87: 86: 78: 59: 51: 46: 37: 23: 19: 1357: 1305: 1263: 1240: 1197: 1151: 1102:In the media 1085:all comments 1036: 1032: 883: 871: 864: 857: 599: 595: 586: 578: 571: 567: 563: 556: 549: 542: 423:New Zealand 389:Netherlands 372:Philippines 233: 213: 209: 196:30 countries 193: 188: 182: 180: 171: 94:PDF download 1392:Suggestions 1219:Indy beetle 1174:transcluded 972:simplewiki 144:X (Twitter) 1333:Smallbones 1266:Smallbones 1016:27 others 983:37 others 899:100+ edits 755:Nicaragua 653:Argentina 624:5–99 edits 612:100+ edits 321:Australia 258:5–99 edits 246:100+ edits 164:Smallbones 82:Share this 77:Contribute 22:2019-11-29 1386:Subscribe 1339:smalltalk 1326:Bahnfrend 1311:Bahnfrend 1272:smalltalk 1178:talk page 704:Colombia 219:language? 1404:Category 1381:Newsroom 1376:Archives 1358:Signpost 1147:In focus 1048:Previous 823:Bolivia 806:unknown 789:Uruguay 738:Ecuador 474:Ireland 355:Germany 134:Facebook 124:LinkedIn 114:Mastodon 20:‎ | 1289:MPS1992 1127:Gallery 1005:frwiki 994:enwiki 961:ruwiki 950:jawiki 939:fawiki 928:eswiki 917:zhwiki 906:enwiki 670:Mexico 508:Brazil 457:France 440:Sweden 304:Canada 279:25,401 1204:Abishe 687:Chile 682:13.4% 679:1,471 676:12.1% 665:12.0% 662:1,421 659:17.2% 648:35.4% 645:3,881 642:35.9% 636:Spain 491:Spain 406:Italy 381:1,021 364:1,281 347:5,241 338:India 330:2,491 313:3,321 299:12.1% 296:7,491 293:16.7% 282:41.0% 276:42.9% 273:1,881 154:Reddit 104:E-mail 1371:About 1137:Essay 912:1881 894:From 872:91.8% 865:96.0% 852:0.9% 846:0.2% 835:0.9% 829:0.2% 818:0.1% 812:1.9% 801:1.3% 795:1.9% 784:2.3% 778:1.9% 767:0.7% 761:1.9% 750:2.1% 744:1.9% 733:5.8% 727:5.3% 721:Peru 716:7.8% 710:7.0% 699:7.6% 693:8.7% 629:% of 617:% of 557:83.6% 550:88.8% 537:0.5% 534:291* 531:0.5% 520:1.2% 514:0.7% 503:1.3% 497:0.9% 486:1.1% 483:661* 480:0.9% 469:1.3% 463:0.9% 452:0.7% 449:431* 446:1.2% 435:0.7% 432:441* 429:1.2% 418:1.3% 412:1.2% 401:1.0% 398:621* 395:1.4% 384:1.6% 378:1.8% 367:2.1% 361:2.8% 350:8.5% 344:4.4% 333:4.0% 327:5.3% 316:5.4% 310:6.2% 263:% of 251:% of 16:< 1366:Home 1356:The 1315:talk 1293:talk 1283:See 1223:talk 1208:talk 1056:Next 1000:731 849:101 832:101 815:11* 798:211 781:251 764:81* 747:231 730:631 713:851 696:831 656:101 639:211 528:21* 517:721 511:31* 500:681 494:41* 477:41* 466:791 460:41* 415:831 358:121 341:191 324:231 307:271 290:731 205:here 201:here 1306:not 1022:27 1011:11 989:37 978:11 967:11 956:11 945:11 934:11 923:51 843:1* 826:1* 809:11 792:11 775:11 758:11 741:11 724:31 707:41 690:51 673:71 443:51 426:51 409:51 392:61 375:81 162:By 79:— 1406:: 1317:) 1295:) 1287:. 1225:) 1210:) 1046:← 207:. 1342:) 1336:( 1328:: 1324:@ 1313:( 1291:( 1275:) 1269:( 1221:( 1206:( 1195:. 1185:. 1087:) 1083:(

Index

Knowledge:Knowledge Signpost
2019-11-29
The Signpost
← Back to Contents
View Latest Issue
29 November 2019
Contribute
PDF download
E-mail
Mastodon
LinkedIn
Facebook
X (Twitter)
Reddit
Smallbones
Geoeditors Monthly
Geoeditors/Public
30 countries
here
here
English language#Pluricentric English
medium of instruction
Previous "Special report"
Next "Special report" →
S
29 November 2019
all comments
From the editor
News and notes
In the media

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.