413:
562:
218:
443:
614:
Good evening, I wanted to ask about a problem I'm having. In this account (MarianoMora23) I can move articles to the mainspace with no problem after barely making 10 edits. However in this SAME account (MarianoMora23) but in the SPANISH wikipedia I have more than 20 edits and still can't move from my
883:
shortcut at some point and it hasn't been reverted yet. It doesn't really make sense to faithfully reproduce simple mistakes made by others when they are irrelevant and only distract imo. Your approach does affect the hitrate tho. Are there others who I should contact? I assume the 16789 typos above
768:
to extract the 3000+ article names and the alleged typos, and have begun an AWB run to detect those words in those articles. So far I've saved 23 edits and have skipped 25 other articles - not a bad hit rate, by my standards, so I'll press on with this over the next few days. "Gettig" is a surname;
1039:
I make the lists with Java and then I use
Javascript to actually make the edits. When I improved the url regex in Javascript I forgot to add it to the Java code as well. I had a bunch of ideas to improve my workflow so I am cooking up a fresh batch for you. Might take a while, even on a modern pc.
254:
page and would like you to review the authenticity of the template that reads "This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed."
723:
We could use a custom AWB module in C# or perhaps just use some custom
Selenium-based tool (which would be pretty damn similar, not radically different). Or perhaps a JWB-like interface on wiki. Haven't really decided how to approach that
716:
I take a list of the most frequently used words, create typos with a
Levenshtein distance of 1, and check which occur in the dump. Then I do a bunch of filtering and I check which exist in the live version of
910:
803:. When my Raspberry Pi is done I will have another ~60.000. The typos already have very similar regex ran on them as you saw in typo.js so much of the WONTFIX stuff has been filtered out already.
569:
990:. This makes it easier for me, as the fixes for the same target word turn up together, and perhaps for you, since you can compare my contribution list with the list I'm working from.
861:
AWB has two checkboxes at the top left of the "Find & Replace" configuration, which aim to cover the "certain situations". I run with those turned off, though, so that I
800:
796:
832:
792:
275:
260:
632:
616:
909:. Fortunately they say the same thing! I do fix typos in quotations if I think they are "insignificant" or are likely to have been copying errors. See
695:
Interesting. I'm finding typos by running regular expressions on a database dump; how are you creating your work list? What's your false positive rate?
638:
986:
I've restarted the list after telling AWB not to sort the pages alphabetically, so I'm now processing them in the same order as they were listed in
698:
I confess I'm so used to working with AWB and my 4000+ regular expressions that I'm unlikely to switch to a radically different method. --
38:
451:
382:
73:
209:
1158:
884:
will keep you busy for a while but you know where to find me when you want more. Perhaps I should stick the lists in a subpage of
405:
1065:\b((?:https?://|www\.)(?:\S+(?::\S*)?@)?(?:(?:{1,3}\.){3}{1,3}|(?:(?:-*)*+)(?:\.(?:-*)*+)*(?:\.(?:{2,})))(?::\d{2,5})?(?:\S*)?\b)
658:
502:
477:
428:
368:
603:
279:
264:
1170:
1144:
1107:
1049:
1034:
958:
933:
897:
874:
856:
812:
782:
752:
707:
465:
79:
550:
217:
205:
201:
197:
193:
189:
185:
181:
177:
173:
169:
165:
161:
157:
153:
149:
145:
141:
137:
133:
1112:
Are the URL regexes running with "ignore case" turned on? If not, the first URL regex fails to match the whole URL in the
764:
Languages? Assembler, BCPL, C, C++ - all unused for a decade, I'm afraid. But I've used regular expressions on a copy of
498:
401:
271:
256:
129:
125:
121:
117:
113:
109:
105:
101:
97:
669:
You like typofixing? I got tens of thousands of typos and I can't fix em all alone. Perhaps we can combine our forces?
376:
727:
I never really bothered to create stats of the amount of skips vs the amount of fixes but that is a good idea to have.
356:
1127:
prefix because it is being used as an infobox parameter. To exclude those, you'll either have to look backwards for
302:
297:
624:
581:
352:
306:
245:
865:
fix errors in quotations, references, foreign-language text and so on - with appropriate care and checking. --
741:
I have at least 60.000 potential typos left to fix so it is probably worth it to create a decent tool for that.
24:
943:
524:
289:
68:
1157:. I have added range_map to the list of disallowed parameters. I am currently trying to figure out whether
642:
311:
464:. There aren't many redirects, and they aren't used much, so I looked through them all manually. I fixed
348:
59:
682:
1120:
1013:
92:
538:
323:
1084:
and I haven't really decided how to improve on that. Not all of them have file extensions. Perhaps
921:
917:
946:. If you want some, please delete them from the list so that its clear that they've been handled.
938:
Thank you, redirect target improved. I combined typolist, typolist2 and typolist3 above (but not
840:
735:
620:
494:
397:
880:
1095:
911:
User:John of
Reading/Typo fixing with AutoWikiBrowser#Editing quotes, book titles and such like
577:
334:
924:
project? That's another attempt at co-ordinated checking using data-crunching techniques. --
987:
939:
788:
765:
670:
250:
Hi, I'm not an experienced editor here, though I did contribute significantly lately to the
1166:
1136:
1103:
1045:
1026:
954:
925:
902:
893:
866:
852:
808:
774:
748:
699:
678:
650:
595:
542:
469:
420:
360:
319:
20:
843:) to not fix typos in certain situations. Do you know how we can get closer to that goal?
8:
49:
649:
edits to be autoconfirmed - as opposed to only 10 at the
English-language Knowledge. --
609:
556:
520:
510:
485:
435:
388:
315:
64:
589:
573:
293:
45:
949:
I added Moss and the (code behind the) AWB checkboxes to my todolist, thanks again!
770:
458:
450:- pretty speedy now I have the uncompressed dump on an SSD drive. At the bottom of
1162:
1099:
1041:
981:
950:
889:
848:
804:
759:
744:
690:
674:
342:
232:
885:
385:. Please could you repeat that exercise (feel free to overwrite the original).
228:
844:
532:
516:
1113:
1003:
285:
251:
839:) as a list generator source. And AWB would contain code (very similar to
1085:
664:
412:
359:, and many new sources have been added. I'm going to remove the tag. --
233:
338:
561:
906:
230:
1089:
1008:
In some cases the typo is embedded within a file name - example
330:
Thank you for the birthday wishes - that's a few weeks ago now.
1016:. I exclude those by peeking ahead for a known image suffix -
234:
920:
you may attract more helpers. Oh, and are you aware of the
720:
Which programming languages, if any, are you familiar with?
1058:
for URLs but a lot of them escaped the wrath of the regex.
1018:(?!*\.(?i:(?:gif|jpe?g|ogg|ogv|pdf|png|svg|tiff?|webm))\b)
998:
In many cases the typo is embedded within a URL - example
641:, it says you have to be autoconfirmed to move a page; at
993:
Two of your "don't fix" tests aren't working correctly:
905:
is marked as an essay; the authoritative guide is at
734:
of regex to avoid typos that shouldn't be fixed, see
572:
were found precious. That's what you are, always. --
454:
there's a short list of articles using redirects to
15:
537:You can read about Knowledge's deletion policy at
637:Each version of Knowledge sets its own rules. At
1161:can help identify typos better than a coinflip.
1020:- this regular expression isn't perfect, I know.
27:, where you can send him messages and comments.
639:es:Ayuda:Cómo cambiar el nombre de una página
1056:((http|https)://)(www.)?{2,256}\.{2,26}\b(*)
560:
1116:example because parts of it are uppercase.
452:User:Pigsonthewing/Direct calls to Infobox
383:User:Pigsonthewing/Direct calls to Infobox
615:sandbox to the mainspace. Any idea why?
1069:instead unless you have a better idea.
1061:I am considering using something like:
847:lists some developers in the infobox.
888:? I'll dive in the AWB code, thanks.
466:Federal College of Agriculture, Akure
942:, which you imported into AWB) into
831:In an ideal world, AWB would accept
515:Hello, what is the deletion policy?
381:A decade(!) ago, you kindly created
355:. Since then, yes, the article has
13:
769:"protectin" is a kind of protein;
594:How the time flies! Thank you. --
14:
1188:
1131:or similar, or look forwards for
837:christmas|chirstmas|My Christmas
441:
411:
216:
39:Click here to start a new topic.
1086:Commons Special:MediaStatistics
773:is a stage name; and so on. --
377:Direct uses of Template:Infobox
1171:07:47, 10 September 2024 (UTC)
1145:07:01, 10 September 2024 (UTC)
1108:03:41, 10 September 2024 (UTC)
1050:03:33, 10 September 2024 (UTC)
1:
1035:07:26, 9 September 2024 (UTC)
959:04:30, 9 September 2024 (UTC)
944:User:Polygnotus/Data/Typolist
934:20:14, 8 September 2024 (UTC)
898:19:40, 8 September 2024 (UTC)
875:18:50, 8 September 2024 (UTC)
857:18:44, 8 September 2024 (UTC)
813:18:15, 8 September 2024 (UTC)
783:18:08, 8 September 2024 (UTC)
753:17:14, 8 September 2024 (UTC)
708:16:47, 8 September 2024 (UTC)
683:16:21, 8 September 2024 (UTC)
36:Put new text under old text.
643:es:Knowledge:Autoconfirmados
246:Removing Template Assistance
7:
659:06:45, 27 August 2024 (UTC)
645:, it says you have to make
625:03:46, 27 August 2024 (UTC)
44:New to Knowledge? Welcome!
10:
1193:
1121:Lesser blue-eared starling
1014:Lesser blue-eared starling
916:If you post your links at
604:07:25, 3 August 2024 (UTC)
582:09:37, 2 August 2024 (UTC)
551:07:25, 3 August 2024 (UTC)
539:Knowledge:Deletion policy
525:19:48, 24 July 2024 (UTC)
482:Very helpful. Thank you.
74:Be welcoming to newcomers
1151:Pattern.CASE_INSENSITIVE
1140:
1030:
929:
922:Knowledge:Typo Team/moss
918:Knowledge talk:Typo Team
870:
778:
703:
654:
599:
546:
503:16:50, 7 June 2024 (UTC)
478:17:17, 6 June 2024 (UTC)
473:
429:16:43, 6 June 2024 (UTC)
424:
406:16:32, 6 June 2024 (UTC)
369:10:48, 26 May 2024 (UTC)
364:
280:05:04, 24 May 2024 (UTC)
265:05:03, 24 May 2024 (UTC)
736:User:Polygnotus/typo.js
333:Let's see. The tag was
565:
270:Also, happy birthday!
69:avoid personal attacks
1098:is steadily growing.
988:User:Polygnotus/typos
940:User:Polygnotus/typos
766:User:Polygnotus/typos
671:User:Polygnotus/typos
564:
357:changed substantially
210:Auto-archiving period
1155:Pattern.UNICODE_CASE
1119:The filename in the
1075:File:(.*?)(\\.|\\|)"
903:Knowledge:Quotations
879:I boldy created the
351:) when the article
1054:Originally I used
566:
80:dispute resolution
41:
1081:Category:(.*?)\\.
1072:For files I used:
241:
240:
60:Assume good faith
37:
1184:
1156:
1152:
1134:
1130:
1126:
1066:
1057:
1019:
1011:
1001:
985:
835:in this format (
791:and then we got
771:Supremme de Luxe
763:
694:
636:
593:
536:
501:
492:
488:
463:
457:
449:
445:
444:
439:
415:
404:
395:
391:
353:looked like this
327:
309:
235:
221:
220:
211:
16:
1192:
1191:
1187:
1186:
1185:
1183:
1182:
1181:
1154:
1150:
1137:John of Reading
1135:or similar. --
1132:
1128:
1124:
1064:
1055:
1027:John of Reading
1017:
1009:
999:
979:
926:John of Reading
867:John of Reading
775:John of Reading
757:
700:John of Reading
688:
667:
651:John of Reading
630:
612:
610:Knowledge edits
596:John of Reading
587:
568:Ten years ago,
559:
557:Always precious
543:John of Reading
530:
513:
511:Deletion policy
490:
484:
483:
470:John of Reading
461:
455:
442:
440:
433:
421:John of Reading
393:
387:
386:
379:
361:John of Reading
300:
284:
248:
237:
236:
231:
208:
86:
85:
55:
21:John of Reading
12:
11:
5:
1190:
1180:
1179:
1178:
1177:
1176:
1175:
1174:
1173:
1117:
1093:
1082:
1079:
1078:Image:(.*?)\\.
1076:
1073:
1070:
1067:
1062:
1059:
1052:
1022:
1021:
1006:
995:
994:
991:
976:
975:
974:
973:
972:
971:
970:
969:
968:
967:
966:
965:
964:
963:
962:
961:
947:
914:
822:
821:
820:
819:
818:
817:
816:
815:
742:
739:
728:
725:
721:
718:
711:
710:
696:
666:
663:
662:
661:
611:
608:
607:
606:
558:
555:
554:
553:
512:
509:
508:
507:
506:
505:
431:
378:
375:
374:
373:
372:
371:
331:
328:
247:
244:
239:
238:
229:
227:
226:
223:
222:
88:
87:
84:
83:
76:
71:
62:
56:
54:
53:
42:
33:
32:
29:
28:
9:
6:
4:
3:
2:
1189:
1172:
1168:
1164:
1160:
1148:
1147:
1146:
1142:
1138:
1122:
1118:
1115:
1111:
1110:
1109:
1105:
1101:
1097:
1094:
1091:
1087:
1083:
1080:
1077:
1074:
1071:
1068:
1063:
1060:
1053:
1051:
1047:
1043:
1038:
1037:
1036:
1032:
1028:
1024:
1023:
1015:
1007:
1005:
997:
996:
992:
989:
983:
978:
977:
960:
956:
952:
948:
945:
941:
937:
936:
935:
931:
927:
923:
919:
915:
912:
908:
904:
901:
900:
899:
895:
891:
887:
882:
878:
877:
876:
872:
868:
864:
860:
859:
858:
854:
850:
846:
842:
838:
834:
830:
829:
828:
827:
826:
825:
824:
823:
814:
810:
806:
802:
798:
794:
790:
787:Yeah that is
786:
785:
784:
780:
776:
772:
767:
761:
756:
755:
754:
750:
746:
743:
740:
737:
733:
729:
726:
722:
719:
715:
714:
713:
712:
709:
705:
701:
697:
692:
687:
686:
685:
684:
680:
676:
672:
660:
656:
652:
648:
644:
640:
634:
633:MarianoMora23
629:
628:
627:
626:
622:
618:
617:MarianoMora23
605:
601:
597:
591:
586:
585:
584:
583:
579:
575:
571:
563:
552:
548:
544:
540:
534:
529:
528:
527:
526:
522:
518:
504:
500:
496:
491:Pigsonthewing
487:
481:
480:
479:
475:
471:
467:
460:
453:
448:
437:
436:Pigsonthewing
432:
430:
426:
422:
418:
414:
410:
409:
408:
407:
403:
399:
394:Pigsonthewing
390:
384:
370:
366:
362:
358:
354:
350:
347:
344:
340:
336:
335:added in 2018
332:
329:
325:
321:
317:
313:
308:
304:
299:
295:
291:
287:
283:
282:
281:
277:
273:
272:144.86.34.230
269:
268:
267:
266:
262:
258:
257:144.86.34.230
253:
243:
225:
224:
219:
215:
207:
203:
199:
195:
191:
187:
183:
179:
175:
171:
167:
163:
159:
155:
151:
147:
143:
139:
135:
131:
127:
123:
119:
115:
111:
107:
103:
99:
96:
94:
90:
89:
81:
77:
75:
72:
70:
66:
63:
61:
58:
57:
51:
47:
46:Learn to edit
43:
40:
35:
34:
31:
30:
26:
22:
18:
17:
1114:Merle Miller
1092:can be used?
1004:Merle Miller
881:WP:QUOTETYPO
862:
836:
731:
668:
646:
613:
590:Gerda Arendt
574:Gerda Arendt
567:
514:
499:Andy's edits
495:Talk to Andy
486:Andy Mabbett
446:
416:
402:Andy's edits
398:Talk to Andy
389:Andy Mabbett
380:
345:
286:Zahran tribe
252:Zahran tribe
249:
242:
213:
91:
1129:range_map =
1096:My todolist
1010:distribuion
1163:Polygnotus
1100:Polygnotus
1042:Polygnotus
982:Polygnotus
951:Polygnotus
890:Polygnotus
849:Polygnotus
805:Polygnotus
797:9300 there
789:3489 typos
760:Polygnotus
745:Polygnotus
717:Knowledge.
691:Polygnotus
675:Polygnotus
1090:local one
907:MOS:QUOTE
801:1200 here
793:2800 here
82:if needed
65:Be polite
25:talk page
1088:and the
730:I use a
665:Hi John!
533:Gdfctjmm
517:Gdfctjmm
417:Doing...
349:contribs
93:Archives
50:get help
19:This is
1123:has no
1012:within
1002:within
1000:mmiller
886:WP:TYPO
841:typo.js
459:Infobox
303:protect
298:history
214:21 days
1159:Ollama
1149:I use
845:WP:AWB
307:delete
1125:File:
833:lists
541:. --
468:. --
339:Bradv
324:views
316:watch
312:links
78:Seek
1167:talk
1153:and
1141:talk
1133:.png
1104:talk
1046:talk
1031:talk
955:talk
930:talk
894:talk
871:talk
853:talk
809:talk
799:and
795:and
779:talk
749:talk
724:yet.
704:talk
679:talk
655:talk
621:talk
600:talk
578:talk
547:talk
521:talk
474:talk
447:Done
425:talk
365:talk
343:talk
320:logs
294:talk
290:edit
276:talk
261:talk
67:and
1025:--
732:lot
570:you
493:);
419:--
396:);
337:by
23:'s
1169:)
1143:)
1106:)
1048:)
1033:)
957:)
932:)
896:)
873:)
863:do
855:)
811:)
781:)
751:)
706:)
681:)
673:.
657:)
647:50
623:)
602:)
580:)
549:)
523:)
497:;
476:)
462:}}
456:{{
427:)
400:;
367:)
322:|
318:|
314:|
310:|
305:|
301:|
296:|
292:|
278:)
263:)
212::
206:28
204:,
202:27
200:,
198:26
196:,
194:25
192:,
190:24
188:,
186:23
184:,
182:22
180:,
178:21
176:,
174:20
172:,
170:19
168:,
166:18
164:,
162:17
160:,
158:16
156:,
154:15
152:,
150:14
148:,
146:13
144:,
142:12
140:,
138:11
136:,
134:10
132:,
128:,
124:,
120:,
116:,
112:,
108:,
104:,
100:,
48:;
1165:(
1139:(
1102:(
1044:(
1029:(
984::
980:@
953:(
928:(
913:.
892:(
869:(
851:(
807:(
777:(
762::
758:@
747:(
738:.
702:(
693::
689:@
677:(
653:(
635::
631:@
619:(
598:(
592::
588:@
576:(
545:(
535::
531:@
519:(
489:(
472:(
438::
434:@
423:(
392:(
363:(
346:·
341:(
326:)
288:(
274:(
259:(
130:9
126:8
122:7
118:6
114:5
110:4
106:3
102:2
98:1
95::
52:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.