1309:
is the PRC. Thirdly, the records are likely to be skewed not simply towards native speakers (as the article hypothesises), but also towards fluent non-native speakers. That would explain, eg, the high performance of the
Netherlands and Germany (the latter of which has more fluent speakers of English than Australia), and possibly also Brazil, a country with a large population (about 2 1/2 times that of Germany) and mandatory learning of at least one foreign language for all 12 grades of compulsory schooling. Fourthly, one needs to be cautious about Canada, where only about 56% of the population speaks English as a mother tongue, and about 21% use French as their mother tongue. Fifthly, the figures indicate that some countries with smaller populations have disproportionately large numbers of contributors. So, eg, New Zealand and Ireland both have populations of about one fifth of that of Australia, but New Zealand has more than one fifth as many prolific contributors, and Ireland has more than one fifth as many small contributors. Similarly, the number of contributors from the UK (both prolific and small) is disproportionately large by comparison with the USA.
1331:
the data. This dataset has some special quirks, designed right-in to hide identities, and even the most basic measure - what is an "edit"? - is pretty vague covering everything from changing a comma to a semi-colon, to adding in 1,000 words to an article. Beyond simple curiosity, I suppose my motivation has to do mostly with so-called "political bias". A lot of
Americans seem to think WP has a liberal bias, but is that due to age, gender, or country of residence of editors? I do think that this dataset will be examined in detail, so getting out all the quirks, biases, and hypotheses now is a worthwhile exercise. Thanks.
110:
130:
1067:
191:. It allows the public to see, more or less, how many active editors (5–99 edits in a month) and very active editors (100+ edits) from about 180 individual countries contribute to active Knowledge versions, each month from January 2019 onward. For example, if you wanted to know how many people editing from the UK made more than 99 edits to the French version of Knowledge in September, you can look it up in this dataset. The answer is somewhere between 11 and 20.
90:
120:
36:
140:
100:
601:
Dominican
Republic. Note that Venezuela and Cuba are excluded by the WMF from the dataset. The population rankings for native English-speaking countries are almost identical to the rankings in Knowledge contributions of the same countries. But the population rankings for native Spanish-speaking countries are much less similar to their rankings in Knowledge Spanish-language contributions.
150:
103:
1308:
be a record of the nationality of the contributor. Secondly, the assumed location may not be correct. So, eg, if a contributor in the PRC is using a VPN that says that the contributor's location is the USA, then the record will show the USA, not the PRC, as the location, even though the true location
1037:
Another area of interest might involve combining this dataset with other datasets. For example, say a program is undertaken to increase the quality – rather than the quantity – of articles about country Z. Using this data in conjunction with data on readership might give a more complete understanding
587:
Six rich
European Union countries where English is not the mother tongue, Germany, the Netherlands, Italy, Sweden, France and Spain, together account for 8.4% of the reported very active editors. Of the countries in this table, only the rankings of Brazil and perhaps South Africa do not appear to be
1330:
Thanks for this - all datasets have limits or quirks of course, and it's important for everybody to understand the limits. Also you're starting to get into some new hypotheses about the data (e.g. from foreign students, fluent non-native speakers, or VPNs), which is the start of really understanding
1303:
These are interesting figures, but they need to be viewed with caution, for a number of reasons, including the following. For a start, the records are of the assumed location, and not necessarily the nationality, of the contributor. So, eg, if a contributor is a foreign student in the USA, the UK or
600:
Nevertheless, wealth – or perhaps dialect – may be playing a stronger role in eswiki than it does in enwiki. The 12 largest countries by native
Spanish-speaking population are, in order, Mexico, Colombia, Spain, Argentina, the United States, Venezuela, Peru, Chile, Ecuador, Cuba, Guatemala, and the
596:
Table 2 shows analogous rankings for the
Spanish language Knowledge. While Spain and Argentina combine for slightly over half of the reported very active editors, the very active editors are distributed more evenly over all the reported countries. Only one country without Spanish as its predominant
564:
The countries with the most very active editors in enwiki are the US (43%) and the UK (17%) , or almost 60% of the total reported editors between them. The two large rich countries predominate. Two rich but less populous countries, Canada and
Australia, are also well-represented with almost 12% of
1034:
will likely be of greater interest. For example, let's say that there was a new program introduced intended to increase the number of editors from country Y. The full effects of the program might not be seen after 9 months, but after 2 or 3 years hopefully any effects could be seen in the data.
1033:
Time is the main variable of interest that was left out of the above examinations. Right now we could see how edit contributions from different countries change over the nine months from
January through September 2019. As time goes by, more months of data will be released, and the effect of time
198:
are excluded, e.g. China, Kazakhstan, Russia, Saudi Arabia and
Venezuela. Exact data on the number of editors in each category (editors from country x who edited Knowledge version y) are not given. Rather these numbers are only given in “buckets” of ten: 1–10, 11–20, 21–30, 31–40, etc. Technical
172:
Let's say you are interested in how many active editors from France are editing the
English-language Knowledge; or conversely, you'd like to know how many editors from the UK are editing the French-language Knowledge. All the necessary information needed to calculate these numbers is recorded, at
210:
But enough for the preliminaries! What questions can the dataset answer that I’ve been dying to know the answer to? The following analysis is only the briefest overview of data from one month, September, quickly done. It’s not in any sense academic research, but hopefully will allow people to
884:
Table 3 shows how very active editors from the US and the UK edit the non-English Wikipedias. Altogether very active editors from the US edit in 44 different Knowledge versions. Those from the UK edit in 29 versions. Among those versions with 11–20 very active editors from the US are an
1162:
583:
but the first language of only a small fraction. The Philippines, with nearly 2% of the reported very active editors, may be affected by similar factors as India. The percentages of reported active editors (5–99 edits) appear to be similar to the percentages for very active editors.
143:
113:
597:
language, the United States, has a fairly large proportion of the very active editors. The same three factors that seem to explain the rankings for enwiki editors, mother tongue, population, and wealth, may very well explain the rankings for eswiki as well.
70:
234:
Table 1 shows the 11 countries with the most active editors and the 11 with the most very active editors to enwiki (14 countries total), plus two other large English-speaking countries, Ireland and South Africa. Numbers marked * are not in the largest 11.
177:
database you could never find those numbers. The WMF did not wish to disclose this data out of concerns that the numbers were precise enough that governments or others could back out material that might lead to the identification of individual editors.
153:
133:
218:
What countries contribute most to the English-language Knowledge (enwiki)? Are they the richer, or the more populous English-speaking countries? Or perhaps those countries where English is widely spoken as a second
568:
The much smaller but still relatively rich New Zealand and Ireland, with about 1% of the total reported very active editors each, trail among those countries where English is the predominant first language.
579:
India, which has the 5th largest group of very active editors (4%) and third largest group of active editors (9%), has a very large population, for whom English is an important
576:. The four countries with the largest native English-speaking populations are also the largest four contributors to enwiki – in the same order: USA, UK, Canada, and Australia.
76:
222:
Do these relations differ across different Knowledge language versions? Answering the above questions for the Spanish-language Knowledge (eswiki) allows a simple comparison.
1198:
Well I am a Knowledge editor from Sri Lanka and of course English is not a popular language though it is one of the official languages of the country. It is regarded as a
1344:
1318:
1177:
1296:
1226:
1111:
1106:
879:
1141:
1121:
123:
1131:
1091:
885:
interesting mix of the Chinese, Spanish, Farsi (Persian), Japanese, and Russian Wikipedias. The similar data from UK editors only includes the French Knowledge.
1116:
1096:
1054:
1045:
1101:
195:
1277:
1084:
1211:
1028:
1146:
1078:
55:
44:
1202:
in the country. Surprised to Portugese speaking Brazil in the top 15 list even ahead of South Africa for gaining popularity in English Knowledge.
1126:
1136:
1409:
1182:
225:
And finally, how do contributions across countries to different language versions compare. Edits from the US and UK are examined here.
1189:
21:
1385:
1166:
93:
1380:
1375:
1370:
1365:
1066:
49:
35:
17:
211:
understand what type of data the dataset contains and what type of questions it can be used to address.
573:
173:
least temporarily, by the Wikimedia Foundation, but unless you worked for the WMF and had access to the
591:
229:
200:
183:
1222:
580:
1391:
1337:
1270:
30:
How many people edit in your favorite language? Where are they from?: Only now can we say!
8:
1314:
174:
1292:
1173:
1218:
1207:
588:
directly explained by the three factors of mother tongue, population, and wealth.
1332:
1284:
1265:
163:
1325:
1310:
1304:
Australia, all of which have big foreign student populations, the records will
1403:
1199:
204:
1288:
1203:
194:
Because of privacy concerns exact numbers are not given. Data from
181:
This month a new dataset was made public by the Wikimedia Foundation
71:
How many people edit in your favorite language? Where are they from?
572:
The proportion of native English speakers by country is shown at
1217:
Fascinating. I'm surprised Nigeria did not make the list. -
1241:language . country quant. lower limit upper limit
1187:If your comment has not appeared here, you can try
880:
US and UK editors editing on non-English Wikipedias
1401:
214:My main questions – of personal interest – are:
565:the total very active editors between them.
1235:This is all there is from the September file:
161:
1029:So what else can you do with this dataset?
1190:
14:
1402:
574:English language#Pluricentric English
54:
29:
1410:Knowledge Signpost archives 2019-11
27:
1065:
56:
34:
28:
1421:
1251:enwiki Nigeria 5 to 99 251 260
1248:enwiki Nigeria 100 or more 11 20
1172:These comments are automatically
148:
138:
128:
118:
108:
98:
88:
1360:: doing it for free since 2005.
1038:of the effects of the program.
1264:yowiki is prob. yorba (sp?) .
1260:yowiki Nigeria 5 to 99 1 10
1257:jawiki Nigeria 5 to 99 1 10
1254:hawiki Nigeria 5 to 99 1 10
1245:arwiki Nigeria 5 to 99 1 10
1183:add the page to your watchlist
13:
1:
1297:15:13, 30 November 2019 (UTC)
1278:14:12, 30 November 2019 (UTC)
1227:08:04, 30 November 2019 (UTC)
1212:03:36, 30 November 2019 (UTC)
1345:17:19, 2 December 2019 (UTC)
1319:09:46, 2 December 2019 (UTC)
1158:
18:Knowledge:Knowledge Signpost
7:
10:
1426:
203:. The data are available
199:information is available
189:Active Editors by country
1180:. To follow comments,
1070:
39:
1069:
581:medium of instruction
187:, or more informally
38:
1176:from this article's
840:Dominican Republic
1167:Discuss this story
1112:Arbitration report
1107:On the bright side
1071:
1058:"Special report" →
175:Geoeditors Monthly
45:← Back to Contents
40:
1191:purging the cache
1142:From the archives
1122:Technology report
1026:
1025:
877:
876:
592:Who edits eswiki?
562:
561:
230:Who edits enwiki?
184:Geoeditors/Public
50:View Latest Issue
1417:
1394:
1340:
1329:
1273:
1194:
1192:
1186:
1165:
1089:
1081:
1079:29 November 2019
1074:
1057:
1050:"Special report"
1049:
888:
887:
858:Total (in table)
604:
603:
543:Total (in table)
238:
237:
166:
152:
151:
142:
141:
132:
131:
122:
121:
112:
111:
102:
101:
92:
91:
62:
60:
58:
57:29 November 2019
1425:
1424:
1420:
1419:
1418:
1416:
1415:
1414:
1400:
1399:
1398:
1397:
1396:
1395:
1390:
1388:
1383:
1378:
1373:
1368:
1361:
1353:
1352:
1343:
1338:
1323:
1285:Yoruba language
1276:
1271:
1196:
1188:
1181:
1170:
1169:
1163:+ Add a comment
1161:
1157:
1156:
1155:
1132:Recent research
1092:From the editor
1082:
1077:
1075:
1072:
1061:
1060:
1055:
1052:
1047:
1041:
1040:
1031:
1019:United Kingdom
1008:United Kingdom
997:United Kingdom
891:Version edited
882:
631:total reported
619:total reported
594:
287:United Kingdom
265:total reported
253:total reported
232:
168:
167:
160:
159:
158:
149:
139:
129:
119:
109:
99:
89:
83:
80:
69:
65:
63:
53:
52:
47:
41:
31:
26:
25:
24:
12:
11:
5:
1423:
1413:
1412:
1389:
1384:
1379:
1374:
1369:
1364:
1363:
1362:
1355:
1354:
1351:
1350:
1349:
1348:
1347:
1335:
1300:
1299:
1268:
1262:
1261:
1258:
1255:
1252:
1249:
1246:
1239:
1238:
1237:
1236:
1230:
1229:
1171:
1168:
1160:
1159:
1154:
1152:Special report
1149:
1144:
1139:
1134:
1129:
1124:
1119:
1117:Traffic report
1114:
1109:
1104:
1099:
1097:News and notes
1094:
1088:
1076:
1064:
1063:
1062:
1053:
1044:
1043:
1042:
1030:
1027:
1024:
1023:
1020:
1017:
1013:
1012:
1009:
1006:
1002:
1001:
998:
995:
991:
990:
987:
986:United States
984:
980:
979:
976:
975:United States
973:
969:
968:
965:
964:United States
962:
958:
957:
954:
953:United States
951:
947:
946:
943:
942:United States
940:
936:
935:
932:
931:United States
929:
925:
924:
921:
920:United States
918:
914:
913:
910:
909:United States
907:
903:
902:
901:(lower bound)
900:
898:
895:
892:
881:
878:
875:
874:
869:
867:
862:
860:
854:
853:
850:
847:
844:
841:
837:
836:
833:
830:
827:
824:
820:
819:
816:
813:
810:
807:
803:
802:
799:
796:
793:
790:
786:
785:
782:
779:
776:
773:
772:United States
769:
768:
765:
762:
759:
756:
752:
751:
748:
745:
742:
739:
735:
734:
731:
728:
725:
722:
718:
717:
714:
711:
708:
705:
701:
700:
697:
694:
691:
688:
684:
683:
680:
677:
674:
671:
667:
666:
663:
660:
657:
654:
650:
649:
646:
643:
640:
637:
633:
632:
630:
627:
626:(lower bound)
625:
623:
620:
618:
615:
614:(lower bound)
613:
611:
608:
593:
590:
560:
559:
554:
552:
547:
545:
539:
538:
535:
532:
529:
526:
522:
521:
518:
515:
512:
509:
505:
504:
501:
498:
495:
492:
488:
487:
484:
481:
478:
475:
471:
470:
467:
464:
461:
458:
454:
453:
450:
447:
444:
441:
437:
436:
433:
430:
427:
424:
420:
419:
416:
413:
410:
407:
403:
402:
399:
396:
393:
390:
386:
385:
382:
379:
376:
373:
369:
368:
365:
362:
359:
356:
352:
351:
348:
345:
342:
339:
335:
334:
331:
328:
325:
322:
318:
317:
314:
311:
308:
305:
301:
300:
297:
294:
291:
288:
284:
283:
280:
277:
274:
271:
270:United States
267:
266:
264:
261:
260:(lower bound)
259:
257:
254:
252:
249:
248:(lower bound)
247:
245:
242:
231:
228:
227:
226:
223:
220:
170:
169:
157:
156:
146:
136:
126:
116:
106:
96:
85:
84:
81:
75:
74:
73:
72:
68:Special report
67:
66:
64:
61:
48:
43:
42:
33:
32:
15:
9:
6:
4:
3:
2:
1422:
1411:
1408:
1407:
1405:
1393:
1387:
1382:
1377:
1372:
1367:
1359:
1346:
1341:
1334:
1327:
1322:
1321:
1320:
1316:
1312:
1307:
1302:
1301:
1298:
1294:
1290:
1286:
1282:
1281:
1280:
1279:
1274:
1267:
1259:
1256:
1253:
1250:
1247:
1244:
1243:
1242:
1234:
1233:
1232:
1231:
1228:
1224:
1220:
1216:
1215:
1214:
1213:
1209:
1205:
1201:
1200:link language
1193:
1184:
1179:
1175:
1164:
1153:
1150:
1148:
1145:
1143:
1140:
1138:
1135:
1133:
1130:
1128:
1125:
1123:
1120:
1118:
1115:
1113:
1110:
1108:
1105:
1103:
1100:
1098:
1095:
1093:
1090:
1086:
1080:
1073:In this issue
1068:
1059:
1051:
1039:
1035:
1021:
1018:
1015:
1014:
1010:
1007:
1004:
1003:
999:
996:
993:
992:
988:
985:
982:
981:
977:
974:
971:
970:
966:
963:
960:
959:
955:
952:
949:
948:
944:
941:
938:
937:
933:
930:
927:
926:
922:
919:
916:
915:
911:
908:
905:
904:
897:Editors with
896:
893:
890:
889:
886:
873:
870:
868:
866:
863:
861:
859:
856:
855:
851:
848:
845:
842:
839:
838:
834:
831:
828:
825:
822:
821:
817:
814:
811:
808:
805:
804:
800:
797:
794:
791:
788:
787:
783:
780:
777:
774:
771:
770:
766:
763:
760:
757:
754:
753:
749:
746:
743:
740:
737:
736:
732:
729:
726:
723:
720:
719:
715:
712:
709:
706:
703:
702:
698:
695:
692:
689:
686:
685:
681:
678:
675:
672:
669:
668:
664:
661:
658:
655:
652:
651:
647:
644:
641:
638:
635:
634:
628:
622:Editors with
621:
616:
610:Editors with
609:
607:Editors from
606:
605:
602:
598:
589:
585:
582:
577:
575:
570:
566:
558:
555:
553:
551:
548:
546:
544:
541:
540:
536:
533:
530:
527:
525:South Africa
524:
523:
519:
516:
513:
510:
507:
506:
502:
499:
496:
493:
490:
489:
485:
482:
479:
476:
473:
472:
468:
465:
462:
459:
456:
455:
451:
448:
445:
442:
439:
438:
434:
431:
428:
425:
422:
421:
417:
414:
411:
408:
405:
404:
400:
397:
394:
391:
388:
387:
383:
380:
377:
374:
371:
370:
366:
363:
360:
357:
354:
353:
349:
346:
343:
340:
337:
336:
332:
329:
326:
323:
320:
319:
315:
312:
309:
306:
303:
302:
298:
295:
292:
289:
286:
285:
281:
278:
275:
272:
269:
268:
262:
256:Editors with
255:
250:
244:Editors with
243:
241:Editors from
240:
239:
236:
224:
221:
217:
216:
215:
212:
208:
206:
202:
197:
192:
190:
186:
185:
179:
176:
165:
155:
147:
145:
137:
135:
127:
125:
117:
115:
107:
105:
97:
95:
87:
86:
78:
59:
51:
46:
37:
23:
19:
1357:
1305:
1263:
1240:
1197:
1151:
1102:In the media
1085:all comments
1036:
1032:
883:
871:
864:
857:
599:
595:
586:
578:
571:
567:
563:
556:
549:
542:
423:New Zealand
389:Netherlands
372:Philippines
233:
213:
209:
196:30 countries
193:
188:
182:
180:
171:
94:PDF download
1392:Suggestions
1219:Indy beetle
1174:transcluded
972:simplewiki
144:X (Twitter)
1333:Smallbones
1266:Smallbones
1016:27 others
983:37 others
899:100+ edits
755:Nicaragua
653:Argentina
624:5–99 edits
612:100+ edits
321:Australia
258:5–99 edits
246:100+ edits
164:Smallbones
82:Share this
77:Contribute
22:2019-11-29
1386:Subscribe
1339:smalltalk
1326:Bahnfrend
1311:Bahnfrend
1272:smalltalk
1178:talk page
704:Colombia
219:language?
1404:Category
1381:Newsroom
1376:Archives
1358:Signpost
1147:In focus
1048:Previous
823:Bolivia
806:unknown
789:Uruguay
738:Ecuador
474:Ireland
355:Germany
134:Facebook
124:LinkedIn
114:Mastodon
20: |
1289:MPS1992
1127:Gallery
1005:frwiki
994:enwiki
961:ruwiki
950:jawiki
939:fawiki
928:eswiki
917:zhwiki
906:enwiki
670:Mexico
508:Brazil
457:France
440:Sweden
304:Canada
279:25,401
1204:Abishe
687:Chile
682:13.4%
679:1,471
676:12.1%
665:12.0%
662:1,421
659:17.2%
648:35.4%
645:3,881
642:35.9%
636:Spain
491:Spain
406:Italy
381:1,021
364:1,281
347:5,241
338:India
330:2,491
313:3,321
299:12.1%
296:7,491
293:16.7%
282:41.0%
276:42.9%
273:1,881
154:Reddit
104:E-mail
1371:About
1137:Essay
912:1881
894:From
872:91.8%
865:96.0%
852:0.9%
846:0.2%
835:0.9%
829:0.2%
818:0.1%
812:1.9%
801:1.3%
795:1.9%
784:2.3%
778:1.9%
767:0.7%
761:1.9%
750:2.1%
744:1.9%
733:5.8%
727:5.3%
721:Peru
716:7.8%
710:7.0%
699:7.6%
693:8.7%
629:% of
617:% of
557:83.6%
550:88.8%
537:0.5%
534:291*
531:0.5%
520:1.2%
514:0.7%
503:1.3%
497:0.9%
486:1.1%
483:661*
480:0.9%
469:1.3%
463:0.9%
452:0.7%
449:431*
446:1.2%
435:0.7%
432:441*
429:1.2%
418:1.3%
412:1.2%
401:1.0%
398:621*
395:1.4%
384:1.6%
378:1.8%
367:2.1%
361:2.8%
350:8.5%
344:4.4%
333:4.0%
327:5.3%
316:5.4%
310:6.2%
263:% of
251:% of
16:<
1366:Home
1356:The
1315:talk
1293:talk
1283:See
1223:talk
1208:talk
1056:Next
1000:731
849:101
832:101
815:11*
798:211
781:251
764:81*
747:231
730:631
713:851
696:831
656:101
639:211
528:21*
517:721
511:31*
500:681
494:41*
477:41*
466:791
460:41*
415:831
358:121
341:191
324:231
307:271
290:731
205:here
201:here
1306:not
1022:27
1011:11
989:37
978:11
967:11
956:11
945:11
934:11
923:51
843:1*
826:1*
809:11
792:11
775:11
758:11
741:11
724:31
707:41
690:51
673:71
443:51
426:51
409:51
392:61
375:81
162:By
79:—
1406::
1317:)
1295:)
1287:.
1225:)
1210:)
1046:←
207:.
1342:)
1336:(
1328::
1324:@
1313:(
1291:(
1275:)
1269:(
1221:(
1206:(
1195:.
1185:.
1087:)
1083:(
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.