User talk:Tokenzero/Archive 1
From Wikipedia, the free encyclopedia
| This is an archive of past discussions with User:Tokenzero. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
| Archive 1 |
New lists
To be done in the same manner
- User:Headbomb/Academic Knowledge and Research Publishing
- User:Headbomb/Academic and Scientific Publishing
- User:Headbomb/American Research Publications
- User:Headbomb/Asian and American Research Publishing Group
- User:Headbomb/British Open Research Publications
- User:Headbomb/Eurasian Research Publishing
- User:Headbomb/European Union Research Publishing
- User:Headbomb/North American Research Publishing
- User:Headbomb/Research and Knowledge Publication
- User:Headbomb/Science and Technology Publishing
- User:Headbomb/World Current Research Publishing
Headbomb {t · c · p · b} 21:24, 12 October 2018 (UTC)
- The bot seems to have chocked / needs a kick in the bucket of bolts to get restarted. Headbomb {t · c · p · b} 11:39, 17 October 2018 (UTC)
- @Headbomb: Actually I paused the bot yesterday when I realized the number of redirects created, to discuss it today. The number at this moment is about 20k pages created (of which 20% are talk pages). With the further lists here, this is more than the total number of redirects created otherwise in all of (English) Wikipedia within a month (~37k), and it is getting close to 1% of all existing redirects (stats). By itself it's not a technical problem and not a big maintenance problem either (though the ISO-4 category and the list of redirects in WP:JOURNALS are useless for humans now). But I'd like to at least understand if all of these are really necessary.
- I mean what are the goals? Is it to make them reachable when typing the title? If so, I would vote to skip the and/ampersand variants and dotless variants, as Search will find them easily. Is it to have some other bot work based on redirect data? Then maybe that data could be provided directly. To be clear, I'm not objecting, just asking, maybe what I need to find peace :) is just some documentation of how WP:CRAPWATCH works (like does it use redirects?) and how it is supposed to be used (like editors sifting through the list, or getting notified when adding/editing/viewing a dubious citation?), because that's a bit unclear to me now. Tokenzero (talk) 17:18, 17 October 2018 (UTC)
- The objectives are twofold, the first is if someone searches for/links to Open Journal of Crap & Foo, there is a reasonable target at the end of it. That's the 'encyclopedic' benefit of it. The second is that it hugely facilitates what WP:JCW does. It looks for things that links to the same place, variants of those links, and typos of those links. Having say Open Journal of Crap / Open J. Crap / Open J Crap existing means they won't show up as potential typos of say Open Journal of the CRA / Open J. CRA / Open J CRA. And they'll also show up in things like (search for 'Journals cited by Wikipedia).Headbomb {t · c · p · b} 19:47, 17 October 2018 (UTC)
- Talk page tagging though, is pretty pointless. As far as I'm concerned, all abbreviations/ISO 4/NLM/MathSciNet/Bluebook redirects could be de-tagged without any loss.Headbomb {t · c · p · b} 20:03, 17 October 2018 (UTC)
- OK, now that I read the updated Questionable1 list, I see people use those dotless and ampersand variants much more that I thought, for the 'encyclopedic' side. On the other hand, from the 5000 titles and variants in the search you gave, only 2 are actually cited, apparently. I don't see any variants ever used as links, and as for Search as I said it handles variants well anyway. For JCW the bot could in principle just read, say, subpages of User:JL-Bot/Questionable.cfg that I could fill with lists of titles and abbrevs, one for each publisher. I don't see the value of WhatLinksHere, since the same is much better presented by the JCW lists. The red-blue distinction in links in JCW could be easily simulated, and I don't see links outside of WP:JCW. But I guess there is some value to making individual redirects, because e.g. when editors handle some abbreviation collision between a crap and non-crap journal as usual, their decision is automatically used by JL-Bot too.
- So I'm continuing the bot run. But if you plan to increase the numbers more than 10× then I would seriously consider doing subpages of JL-Bot instead, or creating only redirects for variants that ever appeared in JCW. (As for talk pages, I only create them for the main, unabbreviated variant now, since at least the beginning of this task; that's why they make 20% and not 50%). Tokenzero (talk) 10:36, 18 October 2018 (UTC)
- For that link, 3 were cited (as created by TokenzeroBot). Part of the point for the bot run is that we don't know what variants exist (and those are still all likely search terms). For instance, there could be a lot of Open journal of crap, or Open J Crap., or even a Open Journal of Crap: Official Journal of the Crap Society and those aren't picked up because we need a new dump before they shows up.
- Talk page tagging though, is pretty pointless. As far as I'm concerned, all abbreviations/ISO 4/NLM/MathSciNet/Bluebook redirects could be de-tagged without any loss.Headbomb {t · c · p · b} 20:03, 17 October 2018 (UTC)
- The objectives are twofold, the first is if someone searches for/links to Open Journal of Crap & Foo, there is a reasonable target at the end of it. That's the 'encyclopedic' benefit of it. The second is that it hugely facilitates what WP:JCW does. It looks for things that links to the same place, variants of those links, and typos of those links. Having say Open Journal of Crap / Open J. Crap / Open J Crap existing means they won't show up as potential typos of say Open Journal of the CRA / Open J. CRA / Open J CRA. And they'll also show up in things like (search for 'Journals cited by Wikipedia).Headbomb {t · c · p · b} 19:47, 17 October 2018 (UTC)
- Could we set these up as subpages of /JL-Bot? Theoretically yes, but that would require new code, and would be less efficient at reducing false positives for other similarly named journals. And we'd lack the encyclopedic benefits for the reader too. Headbomb {t · c · p · b} 13:31, 18 October 2018 (UTC)
You could omit all redirect talk page tagging if you want. I think it was User:Randykitty who requested that, likely out of misunderstanding of how WP:AALERTS work. I can't think of any benefit to that tagging, and multiple drawbacks. Headbomb {t · c · p · b} 13:34, 18 October 2018 (UTC)