Knowledge

User talk:John of Reading

Source 📝

413: 562: 218: 443: 614:
Good evening, I wanted to ask about a problem I'm having. In this account (MarianoMora23) I can move articles to the mainspace with no problem after barely making 10 edits. However in this SAME account (MarianoMora23) but in the SPANISH wikipedia I have more than 20 edits and still can't move from my
883:
shortcut at some point and it hasn't been reverted yet. It doesn't really make sense to faithfully reproduce simple mistakes made by others when they are irrelevant and only distract imo. Your approach does affect the hitrate tho. Are there others who I should contact? I assume the 16789 typos above
768:
to extract the 3000+ article names and the alleged typos, and have begun an AWB run to detect those words in those articles. So far I've saved 23 edits and have skipped 25 other articles - not a bad hit rate, by my standards, so I'll press on with this over the next few days. "Gettig" is a surname;
1039:
I make the lists with Java and then I use Javascript to actually make the edits. When I improved the url regex in Javascript I forgot to add it to the Java code as well. I had a bunch of ideas to improve my workflow so I am cooking up a fresh batch for you. Might take a while, even on a modern pc.
254:
page and would like you to review the authenticity of the template that reads "This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed."
723:
We could use a custom AWB module in C# or perhaps just use some custom Selenium-based tool (which would be pretty damn similar, not radically different). Or perhaps a JWB-like interface on wiki. Haven't really decided how to approach that
716:
I take a list of the most frequently used words, create typos with a Levenshtein distance of 1, and check which occur in the dump. Then I do a bunch of filtering and I check which exist in the live version of
910: 803:. When my Raspberry Pi is done I will have another ~60.000. The typos already have very similar regex ran on them as you saw in typo.js so much of the WONTFIX stuff has been filtered out already. 569: 990:. This makes it easier for me, as the fixes for the same target word turn up together, and perhaps for you, since you can compare my contribution list with the list I'm working from. 861:
AWB has two checkboxes at the top left of the "Find & Replace" configuration, which aim to cover the "certain situations". I run with those turned off, though, so that I
800: 796: 832: 792: 275: 260: 632: 616: 909:. Fortunately they say the same thing! I do fix typos in quotations if I think they are "insignificant" or are likely to have been copying errors. See 695:
Interesting. I'm finding typos by running regular expressions on a database dump; how are you creating your work list? What's your false positive rate?
638: 986:
I've restarted the list after telling AWB not to sort the pages alphabetically, so I'm now processing them in the same order as they were listed in
698:
I confess I'm so used to working with AWB and my 4000+ regular expressions that I'm unlikely to switch to a radically different method. --
38: 451: 382: 73: 209: 1158: 884:
will keep you busy for a while but you know where to find me when you want more. Perhaps I should stick the lists in a subpage of
405: 1065:\b((?:https?://|www\.)(?:\S+(?::\S*)?@)?(?:(?:{1,3}\.){3}{1,3}|(?:(?:-*)*+)(?:\.(?:-*)*+)*(?:\.(?:{2,})))(?::\d{2,5})?(?:\S*)?\b) 658: 502: 477: 428: 368: 603: 279: 264: 1170: 1144: 1107: 1049: 1034: 958: 933: 897: 874: 856: 812: 782: 752: 707: 465: 79: 550: 217: 205: 201: 197: 193: 189: 185: 181: 177: 173: 169: 165: 161: 157: 153: 149: 145: 141: 137: 133: 1112:
Are the URL regexes running with "ignore case" turned on? If not, the first URL regex fails to match the whole URL in the
764:
Languages? Assembler, BCPL, C, C++ - all unused for a decade, I'm afraid. But I've used regular expressions on a copy of
498: 401: 271: 256: 129: 125: 121: 117: 113: 109: 105: 101: 97: 669:
You like typofixing? I got tens of thousands of typos and I can't fix em all alone. Perhaps we can combine our forces?
376: 727:
I never really bothered to create stats of the amount of skips vs the amount of fixes but that is a good idea to have.
356: 1127:
prefix because it is being used as an infobox parameter. To exclude those, you'll either have to look backwards for
302: 297: 624: 581: 352: 306: 245: 865:
fix errors in quotations, references, foreign-language text and so on - with appropriate care and checking. --
741:
I have at least 60.000 potential typos left to fix so it is probably worth it to create a decent tool for that.
24: 943: 524: 289: 68: 1157:. I have added range_map to the list of disallowed parameters. I am currently trying to figure out whether 642: 311: 464:. There aren't many redirects, and they aren't used much, so I looked through them all manually. I fixed 348: 59: 682: 1120: 1013: 92: 538: 323: 1084:
and I haven't really decided how to improve on that. Not all of them have file extensions. Perhaps
921: 917: 946:. If you want some, please delete them from the list so that its clear that they've been handled. 938:
Thank you, redirect target improved. I combined typolist, typolist2 and typolist3 above (but not
840: 735: 620: 494: 397: 880: 1095: 911:
User:John of Reading/Typo fixing with AutoWikiBrowser#Editing quotes, book titles and such like
577: 334: 924:
project? That's another attempt at co-ordinated checking using data-crunching techniques. --
987: 939: 788: 765: 670: 250:
Hi, I'm not an experienced editor here, though I did contribute significantly lately to the
1166: 1136: 1103: 1045: 1026: 954: 925: 902: 893: 866: 852: 808: 774: 748: 699: 678: 650: 595: 542: 469: 420: 360: 319: 20: 843:) to not fix typos in certain situations. Do you know how we can get closer to that goal? 8: 49: 649:
edits to be autoconfirmed - as opposed to only 10 at the English-language Knowledge. --
609: 556: 520: 510: 485: 435: 388: 315: 64: 589: 573: 293: 45: 949:
I added Moss and the (code behind the) AWB checkboxes to my todolist, thanks again!
770: 458: 450:- pretty speedy now I have the uncompressed dump on an SSD drive. At the bottom of 1162: 1099: 1041: 981: 950: 889: 848: 804: 759: 744: 690: 674: 342: 232: 885: 385:. Please could you repeat that exercise (feel free to overwrite the original). 228: 844: 532: 516: 1113: 1003: 285: 251: 839:) as a list generator source. And AWB would contain code (very similar to 1085: 664: 412: 359:, and many new sources have been added. I'm going to remove the tag. -- 233: 338: 561: 906: 230: 1089: 1008:
In some cases the typo is embedded within a file name - example
330:
Thank you for the birthday wishes - that's a few weeks ago now.
1016:. I exclude those by peeking ahead for a known image suffix - 234: 920:
you may attract more helpers. Oh, and are you aware of the
720:
Which programming languages, if any, are you familiar with?
1058:
for URLs but a lot of them escaped the wrath of the regex.
1018:(?!*\.(?i:(?:gif|jpe?g|ogg|ogv|pdf|png|svg|tiff?|webm))\b) 998:
In many cases the typo is embedded within a URL - example
641:, it says you have to be autoconfirmed to move a page; at 993:
Two of your "don't fix" tests aren't working correctly:
905:
is marked as an essay; the authoritative guide is at
734:
of regex to avoid typos that shouldn't be fixed, see
572:
were found precious. That's what you are, always. --
454:
there's a short list of articles using redirects to
15: 537:You can read about Knowledge's deletion policy at 637:Each version of Knowledge sets its own rules. At 1161:can help identify typos better than a coinflip. 1020:- this regular expression isn't perfect, I know. 27:, where you can send him messages and comments. 639:es:Ayuda:Cómo cambiar el nombre de una página 1056:((http|https)://)(www.)?{2,256}\.{2,26}\b(*) 560: 1116:example because parts of it are uppercase. 452:User:Pigsonthewing/Direct calls to Infobox 383:User:Pigsonthewing/Direct calls to Infobox 615:sandbox to the mainspace. Any idea why? 1069:instead unless you have a better idea. 1061:I am considering using something like: 847:lists some developers in the infobox. 888:? I'll dive in the AWB code, thanks. 466:Federal College of Agriculture, Akure 942:, which you imported into AWB) into 831:In an ideal world, AWB would accept 515:Hello, what is the deletion policy? 381:A decade(!) ago, you kindly created 355:. Since then, yes, the article has 13: 769:"protectin" is a kind of protein; 594:How the time flies! Thank you. -- 14: 1188: 1131:or similar, or look forwards for 837:christmas|chirstmas|My Christmas 441: 411: 216: 39:Click here to start a new topic. 1086:Commons Special:MediaStatistics 773:is a stage name; and so on. -- 377:Direct uses of Template:Infobox 1171:07:47, 10 September 2024 (UTC) 1145:07:01, 10 September 2024 (UTC) 1108:03:41, 10 September 2024 (UTC) 1050:03:33, 10 September 2024 (UTC) 1: 1035:07:26, 9 September 2024 (UTC) 959:04:30, 9 September 2024 (UTC) 944:User:Polygnotus/Data/Typolist 934:20:14, 8 September 2024 (UTC) 898:19:40, 8 September 2024 (UTC) 875:18:50, 8 September 2024 (UTC) 857:18:44, 8 September 2024 (UTC) 813:18:15, 8 September 2024 (UTC) 783:18:08, 8 September 2024 (UTC) 753:17:14, 8 September 2024 (UTC) 708:16:47, 8 September 2024 (UTC) 683:16:21, 8 September 2024 (UTC) 36:Put new text under old text. 643:es:Knowledge:Autoconfirmados 246:Removing Template Assistance 7: 659:06:45, 27 August 2024 (UTC) 645:, it says you have to make 625:03:46, 27 August 2024 (UTC) 44:New to Knowledge? Welcome! 10: 1193: 1121:Lesser blue-eared starling 1014:Lesser blue-eared starling 916:If you post your links at 604:07:25, 3 August 2024 (UTC) 582:09:37, 2 August 2024 (UTC) 551:07:25, 3 August 2024 (UTC) 539:Knowledge:Deletion policy 525:19:48, 24 July 2024 (UTC) 482:Very helpful. Thank you. 74:Be welcoming to newcomers 1151:Pattern.CASE_INSENSITIVE 1140: 1030: 929: 922:Knowledge:Typo Team/moss 918:Knowledge talk:Typo Team 870: 778: 703: 654: 599: 546: 503:16:50, 7 June 2024 (UTC) 478:17:17, 6 June 2024 (UTC) 473: 429:16:43, 6 June 2024 (UTC) 424: 406:16:32, 6 June 2024 (UTC) 369:10:48, 26 May 2024 (UTC) 364: 280:05:04, 24 May 2024 (UTC) 265:05:03, 24 May 2024 (UTC) 736:User:Polygnotus/typo.js 333:Let's see. The tag was 565: 270:Also, happy birthday! 69:avoid personal attacks 1098:is steadily growing. 988:User:Polygnotus/typos 940:User:Polygnotus/typos 766:User:Polygnotus/typos 671:User:Polygnotus/typos 564: 357:changed substantially 210:Auto-archiving period 1155:Pattern.UNICODE_CASE 1119:The filename in the 1075:File:(.*?)(\\.|\\|)" 903:Knowledge:Quotations 879:I boldy created the 351:) when the article 1054:Originally I used 566: 80:dispute resolution 41: 1081:Category:(.*?)\\. 1072:For files I used: 241: 240: 60:Assume good faith 37: 1184: 1156: 1152: 1134: 1130: 1126: 1066: 1057: 1019: 1011: 1001: 985: 835:in this format ( 791:and then we got 771:Supremme de Luxe 763: 694: 636: 593: 536: 501: 492: 488: 463: 457: 449: 445: 444: 439: 415: 404: 395: 391: 353:looked like this 327: 309: 235: 221: 220: 211: 16: 1192: 1191: 1187: 1186: 1185: 1183: 1182: 1181: 1154: 1150: 1137:John of Reading 1135:or similar. -- 1132: 1128: 1124: 1064: 1055: 1027:John of Reading 1017: 1009: 999: 979: 926:John of Reading 867:John of Reading 775:John of Reading 757: 700:John of Reading 688: 667: 651:John of Reading 630: 612: 610:Knowledge edits 596:John of Reading 587: 568:Ten years ago, 559: 557:Always precious 543:John of Reading 530: 513: 511:Deletion policy 490: 484: 483: 470:John of Reading 461: 455: 442: 440: 433: 421:John of Reading 393: 387: 386: 379: 361:John of Reading 300: 284: 248: 237: 236: 231: 208: 86: 85: 55: 21:John of Reading 12: 11: 5: 1190: 1180: 1179: 1178: 1177: 1176: 1175: 1174: 1173: 1117: 1093: 1082: 1079: 1078:Image:(.*?)\\. 1076: 1073: 1070: 1067: 1062: 1059: 1052: 1022: 1021: 1006: 995: 994: 991: 976: 975: 974: 973: 972: 971: 970: 969: 968: 967: 966: 965: 964: 963: 962: 961: 947: 914: 822: 821: 820: 819: 818: 817: 816: 815: 742: 739: 728: 725: 721: 718: 711: 710: 696: 666: 663: 662: 661: 611: 608: 607: 606: 558: 555: 554: 553: 512: 509: 508: 507: 506: 505: 431: 378: 375: 374: 373: 372: 371: 331: 328: 247: 244: 239: 238: 229: 227: 226: 223: 222: 88: 87: 84: 83: 76: 71: 62: 56: 54: 53: 42: 33: 32: 29: 28: 9: 6: 4: 3: 2: 1189: 1172: 1168: 1164: 1160: 1148: 1147: 1146: 1142: 1138: 1122: 1118: 1115: 1111: 1110: 1109: 1105: 1101: 1097: 1094: 1091: 1087: 1083: 1080: 1077: 1074: 1071: 1068: 1063: 1060: 1053: 1051: 1047: 1043: 1038: 1037: 1036: 1032: 1028: 1024: 1023: 1015: 1007: 1005: 997: 996: 992: 989: 983: 978: 977: 960: 956: 952: 948: 945: 941: 937: 936: 935: 931: 927: 923: 919: 915: 912: 908: 904: 901: 900: 899: 895: 891: 887: 882: 878: 877: 876: 872: 868: 864: 860: 859: 858: 854: 850: 846: 842: 838: 834: 830: 829: 828: 827: 826: 825: 824: 823: 814: 810: 806: 802: 798: 794: 790: 787:Yeah that is 786: 785: 784: 780: 776: 772: 767: 761: 756: 755: 754: 750: 746: 743: 740: 737: 733: 729: 726: 722: 719: 715: 714: 713: 712: 709: 705: 701: 697: 692: 687: 686: 685: 684: 680: 676: 672: 660: 656: 652: 648: 644: 640: 634: 633:MarianoMora23 629: 628: 627: 626: 622: 618: 617:MarianoMora23 605: 601: 597: 591: 586: 585: 584: 583: 579: 575: 571: 563: 552: 548: 544: 540: 534: 529: 528: 527: 526: 522: 518: 504: 500: 496: 491:Pigsonthewing 487: 481: 480: 479: 475: 471: 467: 460: 453: 448: 437: 436:Pigsonthewing 432: 430: 426: 422: 418: 414: 410: 409: 408: 407: 403: 399: 394:Pigsonthewing 390: 384: 370: 366: 362: 358: 354: 350: 347: 344: 340: 336: 335:added in 2018 332: 329: 325: 321: 317: 313: 308: 304: 299: 295: 291: 287: 283: 282: 281: 277: 273: 272:144.86.34.230 269: 268: 267: 266: 262: 258: 257:144.86.34.230 253: 243: 225: 224: 219: 215: 207: 203: 199: 195: 191: 187: 183: 179: 175: 171: 167: 163: 159: 155: 151: 147: 143: 139: 135: 131: 127: 123: 119: 115: 111: 107: 103: 99: 96: 94: 90: 89: 81: 77: 75: 72: 70: 66: 63: 61: 58: 57: 51: 47: 46:Learn to edit 43: 40: 35: 34: 31: 30: 26: 22: 18: 17: 1114:Merle Miller 1092:can be used? 1004:Merle Miller 881:WP:QUOTETYPO 862: 836: 731: 668: 646: 613: 590:Gerda Arendt 574:Gerda Arendt 567: 514: 499:Andy's edits 495:Talk to Andy 486:Andy Mabbett 446: 416: 402:Andy's edits 398:Talk to Andy 389:Andy Mabbett 380: 345: 286:Zahran tribe 252:Zahran tribe 249: 242: 213: 91: 1129:range_map = 1096:My todolist 1010:distribuion 1163:Polygnotus 1100:Polygnotus 1042:Polygnotus 982:Polygnotus 951:Polygnotus 890:Polygnotus 849:Polygnotus 805:Polygnotus 797:9300 there 789:3489 typos 760:Polygnotus 745:Polygnotus 717:Knowledge. 691:Polygnotus 675:Polygnotus 1090:local one 907:MOS:QUOTE 801:1200 here 793:2800 here 82:if needed 65:Be polite 25:talk page 1088:and the 730:I use a 665:Hi John! 533:Gdfctjmm 517:Gdfctjmm 417:Doing... 349:contribs 93:Archives 50:get help 19:This is 1123:has no 1012:within 1002:within 1000:mmiller 886:WP:TYPO 841:typo.js 459:Infobox 303:protect 298:history 214:21 days 1159:Ollama 1149:I use 845:WP:AWB 307:delete 1125:File: 833:lists 541:. -- 468:. -- 339:Bradv 324:views 316:watch 312:links 78:Seek 1167:talk 1153:and 1141:talk 1133:.png 1104:talk 1046:talk 1031:talk 955:talk 930:talk 894:talk 871:talk 853:talk 809:talk 799:and 795:and 779:talk 749:talk 724:yet. 704:talk 679:talk 655:talk 621:talk 600:talk 578:talk 547:talk 521:talk 474:talk 447:Done 425:talk 365:talk 343:talk 320:logs 294:talk 290:edit 276:talk 261:talk 67:and 1025:-- 732:lot 570:you 493:); 419:-- 396:); 337:by 23:'s 1169:) 1143:) 1106:) 1048:) 1033:) 957:) 932:) 896:) 873:) 863:do 855:) 811:) 781:) 751:) 706:) 681:) 673:. 657:) 647:50 623:) 602:) 580:) 549:) 523:) 497:; 476:) 462:}} 456:{{ 427:) 400:; 367:) 322:| 318:| 314:| 310:| 305:| 301:| 296:| 292:| 278:) 263:) 212:: 206:28 204:, 202:27 200:, 198:26 196:, 194:25 192:, 190:24 188:, 186:23 184:, 182:22 180:, 178:21 176:, 174:20 172:, 170:19 168:, 166:18 164:, 162:17 160:, 158:16 156:, 154:15 152:, 150:14 148:, 146:13 144:, 142:12 140:, 138:11 136:, 134:10 132:, 128:, 124:, 120:, 116:, 112:, 108:, 104:, 100:, 48:; 1165:( 1139:( 1102:( 1044:( 1029:( 984:: 980:@ 953:( 928:( 913:. 892:( 869:( 851:( 807:( 777:( 762:: 758:@ 747:( 738:. 702:( 693:: 689:@ 677:( 653:( 635:: 631:@ 619:( 598:( 592:: 588:@ 576:( 545:( 535:: 531:@ 519:( 489:( 472:( 438:: 434:@ 423:( 392:( 363:( 346:· 341:( 326:) 288:( 274:( 259:( 130:9 126:8 122:7 118:6 114:5 110:4 106:3 102:2 98:1 95:: 52:.

Index

John of Reading
talk page
Click here to start a new topic.
Learn to edit
get help
Assume good faith
Be polite
avoid personal attacks
Be welcoming to newcomers
dispute resolution
Archives
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.