Commons:Bots/Work requests
This is a page for requesting work to be done by a bot. It is also an appropriate place to simply put ideas for bots. However, be aware of the various tools available to all users, which can be used to accomplish the work without the need for a bot.
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days.
Remove extraneous "I, " in author param of PD-self
@Pelikana noticed that there are a lot of erroneous uses of {{PD-self}} which insert "I, " before the author. Could someone please replace {{PD-self|author=I,  with {{PD-self|author= in the following pages? —CalendulaAsteraceae (talk • contribs) 06:52, 24 July 2024 (UTC)
- I can do that -- DaxServer (talk) 09:46, 24 July 2024 (UTC)
- @DaxServer: I think we've resolved the issues with this request. Are you still up for doing the replacement? —CalendulaAsteraceae (talk • contribs) 20:06, 15 October 2024 (UTC)
- @CalendulaAsteraceae Yes, let me know the final replacements to be done -- DaxServer (talk) 19:27, 10 November 2024 (UTC)
- @DaxServer, please replace {{PD-self|author=I, with {{PD-self|author= in these pages. —CalendulaAsteraceae (talk • contribs) 15:34, 11 November 2024 (UTC)
- @CalendulaAsteraceae and @Pelikana: "I, " is there to make the assertion first person.   — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 10:21, 24 July 2024 (UTC)
- Hi @Jeff. Yes, it obviously used to be there to make the assertion first person. But I think at some point the text lines were changed and now, IMHO, it is a displaced element; it is also very odd that it is the only untranslated text element in the template, at least in these use cases. Do you mean to say the results are completely correct this way and need no change? Both lines seem grammatically faulty to me: "... door de auteur, I, JohnDoe" ("... by the author, I, JohnDoe") and "I, JohnDoe allows ...". The last one should read (in Dutch) "Ik, JohnDoe sta ...". It should not read "I, JohnDoe staat ..." because this line starts in first person and ends in third person. In later days (after 2007-2008) the "I, " no longer appears in the templates, it seems. Peli (talk) 10:52, 24 July 2024 (UTC)
- Indeed. The template uses {{int:Wm-license-pd-author-with-author-text}}, which produces the text "This work has been released into the public domain by its author, $1. This applies worldwide." The appropriate way to make this first person would be to edit the page on TranslateWiki (well, the English one needs to be changed in MW code, but for other languages this is where you'd edit it), not to manually put "I, " in the author parameter. —CalendulaAsteraceae (talk • contribs) 20:50, 24 July 2024 (UTC)
- I think it is a good idea to add "I, " as a prefix if the uploader is also the work's creator. Please don't replace that. For example, it may not be clear to many, or people may check only (or first) the author field, where this is useful metadata, especially if the author name differs from the username, in which case they would also need to check the license template. Prototyperspective (talk) 12:04, 25 July 2024 (UTC)
- This is a good thing to handle in {{PD-self}} (which is a template only intended to be used by the uploader). Adding it manually means it's a huge pain to update if the wording of the template changes, and also doesn't work with internationalization. Right now
- {{int:Wm-license-pd-author-with-author-text|I, Calendula}} 
- produces
- This work has been released into the public domain by its author, I, Calendula. This applies worldwide. 
- in English, which is ungrammatical and frankly silly. If I switch my display language to Spanish, it instead produces
- Este trabajo ha sido liberado al dominio público por su autor, I, Calendula. Esto aplica para todo el mundo. 
- which is even worse. If you want to change the wording of {{PD-self}}, probably the way to go is switching in the template from int:Wm-license-pd-author-with-author-text to something like int:Wm-license-pd-author-self-text that incorporates the author's name. —CalendulaAsteraceae (talk • contribs) 19:28, 25 July 2024 (UTC)
- I couldn't find an existing piece of text, so I submitted a feature request at phabricator:T371057. I think that further discussion of updates to the text of {{PD-self}} should go to the template talk page, and also that this bot request should go ahead because manually adding "I, " before the author's name is a terrible way to make the template first-person. —CalendulaAsteraceae (talk • contribs) 20:28, 25 July 2024 (UTC)
- You're absolutely right. Sorry, I misunderstood. It's not really clear in your initial post that this would be added to the template instead. Prototyperspective (talk) 21:02, 25 July 2024 (UTC)
- I agree that this is just about cleaning up a tiny bit of lost and redundant text on a limited number of pages, and I would be glad if @DaxServer got the green light to fix this series of typos on these old pages by deleting "I, ". Thanks. Peli (talk) 21:44, 4 August 2024 (UTC)
 
 
 
- Bot request filed - Commons:Bots/Requests/DaxBot (7) -- DaxServer (talk) 20:56, 11 November 2024 (UTC)
- Thank you! —CalendulaAsteraceae (talk • contribs) 07:03, 16 November 2024 (UTC)
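The replacement requested in this section can be sketched as a single regular-expression substitution. This is only a minimal illustration, not the code DaxBot runs: the function name is hypothetical, and the tolerance for extra whitespace and a lowercase first letter in the template name is an assumption about how the affected pages are written.

```python
import re

# Opening of {{PD-self}} whose author parameter starts with "I, ".
# Assumption: optional whitespace around "|" and "=", and the first letter
# of the template name may be lower case (MediaWiki treats it the same).
PD_SELF_I = re.compile(r"(\{\{\s*[Pp]D-self\s*\|\s*author\s*=\s*)I,\s*")


def strip_first_person_prefix(wikitext: str) -> str:
    """Remove a leading "I, " from the author parameter of {{PD-self}}."""
    return PD_SELF_I.sub(r"\1", wikitext)


if __name__ == "__main__":
    print(strip_first_person_prefix("{{PD-self|author=I, JohnDoe}}"))
```

Author names that merely start with the letter I (for example "Ian Smith") are left alone, because the pattern requires the comma directly after the "I".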
 
Add OCR output to jpg
From the discussion at VP/T, I found a solution to a problem identified earlier: we frequently have images of streets and other subjects with some text in them. Sometimes this text is of interest, but it's not necessarily included in the filename or description.
https://ocr.wmcloud.org/ would allow such text to be extracted and made editable on Commons.
Ideally a bot would go through new uploads (and also some maintenance category for older files) and run https://ocr.wmcloud.org/ on them. The output (if any) could be added to the file description page, either with a template or as structured data.
Sample file:
Input:
Output:
- "PER PONTEM AD FORTUNAM GOURNAY-SUR-MARNE RUE DES LAURIERS"
Enhancing999 (talk) 15:16, 2 August 2024 (UTC)
- Also please see the discussion at VP/T linked above. Just briefly adding support to this wish and two notes: firstly, it would likely be a problem to scan all files on WMC and/or all new uploads; instead one could let the bot run only on categories where this may be useful. Secondly, rather than writing a new bot it would probably be better to add this functionality to some bot that already writes e.g. structured data to lots of files (however SD can't be searched on WMC can it?) like SchlurcherBot. Prototyperspective (talk) 10:50, 4 August 2024 (UTC)
- I think there are already some bots who scan all uploads .. it could obviously be added to those. If SD is used, we should make sure it's searchable. Enhancing999 (talk) 13:47, 4 August 2024 (UTC)
- I think the place to put OCR results would be a new field for the file summary box that is collapsed by default. This way all users can easily find and see this info and it can be searched. The mentioned VP/T thread is now archived to here. I think adding a way to categorize based on OCR results would be quite useful. However, not extremely useful so I don't know if it's necessarily worth the effort to develop a categorization-based-on-OCR tool or extension for the ocr-tool. This is why after briefly asking about it here I only listed the task I meant to use this for at the new page Commons:Categorization requests. Prototyperspective (talk) 11:45, 21 September 2024 (UTC)
- I don't think "collapsed by default" allows users to " easily [..] see this info". ∞∞ Enhancing999 (talk) 08:36, 16 November 2024 (UTC)
- Disagree, it's just a button to click; also the search would index the content in just the normal way (optimally with some ocr:"text" search operator). For example, what about images that contain a lot of text? It makes the information box very large and pushes the other content down, usually without being useful. Prototyperspective (talk) 10:53, 16 November 2024 (UTC)
- I'm not saying you should place it on top. It's always confusing when text one is searching for isn't visible in Wikipedia articles, because someone somewhere hid it. Some Wikipedias have explicit rules against such hiding, but they aren't always applied. ∞∞ Enhancing999 (talk) 10:55, 16 November 2024 (UTC)
- Well, one can simply click the well-visible Expand button. If these are put into a template that is consistently at the bottom / somewhere beneath the licensing template then that may be as good. It would be worth discussing this, but no OCR output is currently added anyway. Prototyperspective (talk) 11:12, 16 November 2024 (UTC)
- How do you know where the "well visible expand button" is and where it's placed? If you just search for the text on the open Wikipedia article, you will never find it. ∞∞ Enhancing999 (talk) 11:20, 16 November 2024 (UTC)
- It's in the quite small/short {{Information}} template. A Wikipedia article is long; also, highlighting the element underneath which the searched-for text is hidden would be very useful. Prototyperspective (talk) 11:37, 16 November 2024 (UTC)
Move "Historical images of" to "History of"
Per note at Category:Historical images by country (as conclusion from Commons:Categories for discussion/2019/09/Category:Historical images), the content of the categories at Special:PrefixIndex/Category:Historical images of should be moved to "History of". This seems to involve more than 10,000 categories, see PetScan:29034509. I think the resulting redirects could afterwards be tagged for speedy deletion. Enhancing999 (talk) 18:59, 2 August 2024 (UTC)
- i don't think it's a good idea to handle this problem without human supervision.
- i would rather do these instead:
- prohibit new categories with the word from being created.
- let users slowly move the files to the appropriate categories (by time).
 
- RZuo (talk) 20:42, 2 August 2024 (UTC)
- "history of ..." is not any better. everything is history. RZuo (talk) 20:43, 2 August 2024 (UTC)
- Right, any cutoff for "history" will change every second/minute/hour/week/month/year/century/millennium. See also Commons:Categories for discussion/2024/08/Category:History by country.   — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 10:20, 4 August 2024 (UTC)
- there's specific interest related to "history" of something.
- for example, historians of asian history should go under "history of asia".
- but to dump files into "history of xx" is no better than dumping them in "xx" or "historical images of xx". all files of xx can perfectly fit into all those three variations.
- most of these "historical images of xx" basically contain all photographs before the advent of digital photography, especially black and white photographs.
- so i'd rather users move these cats to or create for example "xx in the 19th/20th century". RZuo (talk) 12:49, 4 August 2024 (UTC)
- i have an idea of a bot moving files according to the time/date, but i need probably 1 or 2 years to code something like that up. RZuo (talk) 12:53, 4 August 2024 (UTC)
- I don't think this is the place to re-discuss the CfD. If you think the closure is problematic, ask an admin to re-open it. Enhancing999 (talk) 12:57, 4 August 2024 (UTC)
 
 
- There is just no way this can be done manually. If there are cases you think would be problematic, please state them here. Enhancing999 (talk) 20:56, 2 August 2024 (UTC)
Support per Enhancing999. There are currently 33732 categories for "historical images", which is way too many for anyone to deal with manually. This also isn't the place to relitigate the CfD. Nor do I think doing so would go anywhere anyway, since it was open for 4 years and has been closed since last year. So there has been plenty of time for people to raise concerns about it. Most of these categories only contain a couple of images to begin with and they aren't "historical" either. The idea that we should let users slowly move the files to the appropriate categories when it's only a couple of images per category to begin with is totally ridiculous and would just waste everyone's time. There's no reason people can't better categorize the images once they are moved to "history of xx" categories. That's where most of the images were in the first place. Regardless, this should totally be done by a bot instead of forcing users to waste time doing it manually. --Adamant1 (talk) 07:27, 10 August 2024 (UTC)
- Instead of tagging the redirects for speedy deletion, a bot may rename the categories without leaving a redirect if the corresponding category 'history of...' does not yet exist. Wikiwerner (talk) 12:12, 17 November 2024 (UTC)
- If a cat "historical images of xx" has < 5 (or a similarly small number) files, all files should be moved to "xx". it's not necessary to make a separate subcat for just a handful of files. RoyZuo (talk) 14:14, 17 November 2024 (UTC)
- e.g. Category:Historical images of Sababurg which has 6 files, while Category:Sababurg has only ~50 files. RoyZuo (talk) 14:40, 17 November 2024 (UTC)
- But how does a bot decide when a category should be kept? Wikiwerner (talk) 17:09, 18 November 2024 (UTC)
- This request probably requires hundreds of thousands of edits. Is "History of ..." a better categorization? Category:History is the subject of a CfD too: see Commons:Categories for discussion/2024/06/Category:History. We had better wait for a verdict there. Otherwise, perhaps another few hundred thousand edits will be necessary after that verdict. Wikiwerner (talk) 19:59, 21 November 2024 (UTC)
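The renaming rule discussed in this section, together with the suggestion to merge very small categories into "xx", can be sketched as a pure title-mapping function. This is a hedged illustration only: the function name, the default threshold of 5 files and the decision to return `None` for non-matching titles are assumptions made for the sketch, and the thread has not agreed on the small-category rule.

```python
from typing import Optional

PREFIX = "Category:Historical images of "


def target_title(title: str, file_count: int, min_files: int = 5) -> Optional[str]:
    """Return the proposed new category title, or None if the title is out of scope.

    Hypothetical rule set taken from the discussion:
    - fewer than `min_files` files: merge into the plain subject category ("xx")
    - otherwise: rename to "History of xx"
    """
    if not title.startswith(PREFIX):
        return None
    subject = title[len(PREFIX):]
    if not subject:
        return None
    if file_count < min_files:
        # MediaWiki capitalises the first letter of a page title.
        return "Category:" + subject[:1].upper() + subject[1:]
    return "Category:History of " + subject


if __name__ == "__main__":
    print(target_title("Category:Historical images of Sababurg", 6))
```

Keeping the mapping separate from the actual move makes it easy to produce a dry-run list for human review before any edit is made.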
There are about 339,000 files in the category Media missing infobox template. Even using add_information.php (or the gadget), the task is too huge to be done manually. I assume that would be a nice job for a bot. A simple search/replace wouldn't be sufficient, since some file pages contain {{Filedesc}} and {{License-header}}, which should be preserved. Additionally, some files have information on sources, e.g. 1884 South Penn RR.jpg. Those should be used for the source parameter of the information template. Fl.schmitt (talk) 19:22, 10 August 2024 (UTC)
- Maybe a list of the most used files could be generated from the category and these be done manually? Also, please keep in mind COM:GOF. Enhancing999 (talk) 07:09, 12 August 2024 (UTC)
- Good idea - restricting on the most used files is reasonable. Additionally, i thought about grouping by uploader / author which would facilitate automatic editing. Fl.schmitt (talk) 07:25, 12 August 2024 (UTC)
- I tried Special:Search/switzerland incategory:"Media_missing_infobox_template" and then used Petscan:29082230 to find the uploaders.
- This found images like File:Runs_Kapelle.jpeg by "Ikiwaner" who uploaded plenty of own pictures which is clearly indicated, but even add-information can't complete it.
- One would think that we'd have more pictures of these places almost 20 years later, but sometimes we don't. Enhancing999 (talk) 08:30, 12 August 2024 (UTC)
- Looks very interesting! The problem with add-information.php is that it has to transform arbitrary input, which is IMO almost impossible. With pre-structured data (known author/uploader, known structure of file description), maybe the task can be automated to a certain extent. Limiting the input by location is a good idea! Fl.schmitt (talk) 08:55, 12 August 2024 (UTC)
- add-information.php seems relatively good based on the input, but a review seems necessary.
- Even filtering by uploader can give a large range of complicated cases (especially old imports from other wikis). Adding a search for "own photograph" (or similar) can simplify things. Enhancing999 (talk) 12:02, 12 August 2024 (UTC)
Maybe we could list groups of similar cases somewhere, so someone else can determine if they want to assess them further (or whether they are all actually similar). Samples:
- Mapmakers
- Tschubby maps (1100), see Category:Media missing infobox template (maps t1)
- AHoerstemeier map (275)
- Vardion map (288)
 
- Photographers/image sources
- Famfamfam flag icons (250)
- Peter Berger (50)
- Picswiss (294)
- NASA (4794)
- Lienhard Schulz (270)
- Dake Switzerland (82)
- Carlo Ponti (31)
- Giorgio Sommer (75)
- Arnaud Gaillard (105)
- Qwertzy2 (81)
- CdaMVvWgS (65)
- Marcel.C (7)
- Flyout (46)
- Francis Frith (28)
- Matthäus Merian (160)
- Marc Mongenet (68)
- Markus Bernet (40)
- Julo (524): possibly several
- Flickr (4483)
- Crops from Mathematikerkongress, Zürich 1932 (all done)
 
- Copyright status
- PD-Old (22630): Category:PD Old
- PD Official documents (115)
 
- Added complexity
- original upload log-header (10450): Template:original upload log (initial upload at Wikipedia)
- transferred from (458): Template:Transferred from (not included in previous)
- Files_moved_to_Commons_from_Wikipedia: Category:Files_moved_to_Commons_from_Wikipedia (possibly included in previous, likely not)
- derived from (91): Template:derived from
- extracted from (1641): Template:Extracted from
- Bilderwerkstatt (387): Template:Bilderwerkstatt
- images with annotations (1765): Category:Images with annotations (doesn't work well with add_information)
- Files in need of review (sources) (4705): Category:Files in need of review (sources) (possibly this doesn't go beyond the file not having the information template)
 
- Personal templates, see Category:Media missing infobox template (personal templates)
- RHaworth personal template (25): user:RHaworth/mylic
- IUCN category/Pengo (362): Template:Pengo IUCN
- Fb78 (122): User:Fb78/Licence
- Twice25 (270): user:Twice25/Crediti
- CNG (343): Template:CNG
 
- Image types
- coat of arms (16323), some in Category:Media missing infobox template (coats of arms)
- insignia (11356)
- flag (13852)
- currency (423), see Category:Media missing infobox template (currency)
- logo (7826)
- Wikipedia brand (957): Category:Trademarks and logos of Wikimedia (also included in previous)
- kit body (769)
- ChemDraw (490)
 
Enhancing999 (talk) 15:01, 12 August 2024 (UTC) updated
- We could do subcategories of Media missing infobox template for maps, logos, coats of arms, insignia, currency, flags and personal templates. There is already one for artwork.
- Interesting to compare the early digital photos with others we have: sometimes it still looks the same, others lack any comparable one, sometimes it's clearly aged, sometimes it gives a historic comparison, sometimes in a larger set we lack clearly better ones.
- BTW, image notes seem to be handled badly by add-information (they get mixed into the description). Headers handling could be improved too. I don't think I ever had one that didn't need editing (that seems to be the idea anyways).. besides, I try to complete them. Enhancing999 (talk) 13:38, 13 August 2024 (UTC)
- @Enhancing999: Great work, this is very helpful. I've started with the maps provided by Tschubby, because it seems that most of the file description shares the same structure. Please check Revision #909185535 of Karte Gemeinde Troinex.png for a regex-based replacement by pywikibot. IMO, this looks ok. Fl.schmitt (talk) 16:49, 13 August 2024 (UTC)
- The problem with File:Karte Gemeinde Troinex.png is that it wasn't uploaded by Tschubby, so {{Own}} isn't applicable.
- Supposedly that file and File:Carte Commune Troinex.png are based on a file that was initially uploaded at de:File:Karte Gemeinde Troinex.png, see https://de.wikipedia.org/w/index.php?title=Spezial:Logbuch&logid=283755 . Normally the file description page would include copy of the upload log from dewiki, but it doesn't. File:Glacier.zermatt.arp.750pix.jpg had some details I added after "own".
- BTW, Tschubby is still very active, so he might have a view how he prefers them to be done or do them directly himself. Enhancing999 (talk) 17:03, 13 August 2024 (UTC)
- If it's the same file, initial upload was: [1]. Enhancing999 (talk) 17:09, 13 August 2024 (UTC)
- hmm - ok - yes, seems I was too optimistic... it's clear that getting this done by a bot will never reach the quality of manually checking / editing all the parameters. So we will have to decide which grade of completeness is achievable / required. Searching for other / derived / source versions can only be done manually, I think. So if this is a requirement, there's no way to get this task done by a bot, not even a small part of this task.
- What's possible IMO is to group the files by the structure of their description, maybe additionally by uploader and year/month of upload, and do a regex-based replacement. This may lead to incomplete Information/Map/Artwork templates, e.g. if there's no information regarding the source.
- Regarding the parameters:
- Setting the source parameter may be possible (1) if the source is stated in the description or (2) if uploader is identical with author. In other cases, the source can't be set automatically.
- Setting the exact upload date will be very difficult if we use pywikibot's replace script. If using the upload's year and month is sufficient, one could group the files accordingly, based on a PetScan search. This depends on the required/acceptable grade of precision.
 
- Fl.schmitt (talk) 18:13, 13 August 2024 (UTC)
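The grouping idea above (by uploader and by year/month of upload) can be sketched as follows. This is a minimal sketch under stated assumptions: the input is a hypothetical list of `(title, uploader, timestamp)` tuples, for example exported from a PetScan or API query, with timestamps in the ISO form the MediaWiki API returns (`2006-03-14T09:26:53Z`). The file names and users in the demo are made up.

```python
from collections import defaultdict
from datetime import datetime


def group_by_uploader_and_month(records):
    """Group file titles by (uploader, "YYYY-MM" of upload).

    `records` is an iterable of (title, uploader, timestamp) tuples;
    this record layout is an assumption made for the sketch.
    """
    groups = defaultdict(list)
    for title, uploader, timestamp in records:
        uploaded = datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%SZ")
        groups[(uploader, uploaded.strftime("%Y-%m"))].append(title)
    return dict(groups)


if __name__ == "__main__":
    demo = [
        ("File:Example A.png", "Foo", "2006-03-14T09:26:53Z"),
        ("File:Example B.png", "Foo", "2006-03-20T10:00:00Z"),
        ("File:Example C.png", "Bar", "2007-01-02T00:00:00Z"),
    ]
    for key, titles in group_by_uploader_and_month(demo).items():
        print(key, titles)
```

Each resulting group can then be inspected by hand before deciding whether one regex-based replacement fits all of its members.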
 
 
- Still trying to get the {{Upload date}} template working... Fl.schmitt (talk) 16:51, 13 August 2024 (UTC)
- I try to avoid upload date. Weirdly, add-information tends to get even the exif date wrong. For Tschubby's municipality maps, it may be sufficient to add the year they are meant to be current (borders don't change that frequently). Enhancing999 (talk) 15:00, 15 August 2024 (UTC)
 
 
- when you can identify some common pattern in some file sets, Commons:AWB or jwb might be a good tool. RZuo (talk) 22:04, 14 August 2024 (UTC)
- What would be cool for add-information is if one could use it with some defaults (description language, author, date, {{Taken on}}-location, source, other fields, license, etc) for a given subset.
- Also, a few bugs might be worth fixing (licence header formatting, keeping image annotations together, placement of coordinates template, exif dates) if others plan to use it (I'm mostly done with the subset I'm looking into). Enhancing999 (talk) 15:05, 15 August 2024 (UTC)
- Enhancing999, thank you for tackling this long-neglected problem. I like your divide-and-conquer approach, and I agree with RZuo that Commons:AWB might be a good tool to use. That is what I used when I was adding infoboxes some years ago. Another possible approach might be to start adding com:SDC data like author, description and date with the QuickStatements tool. If you do that, then you can just add the {{Information}} template with no parameters and it will display SDC data. See File:Indoor_Climbing_Kid.jpg for example. If you have any questions about this approach I can explain in more detail. --Jarekt (talk) 04:21, 16 August 2024 (UTC)
- Good idea indeed. This could simplify adding only one aspect at a time (not everything can be determined with the same ease). Once sufficient data for {{Information}} is available, the template could be added. We just need to be careful that basic information available as statements is also otherwise visible.
- BTW, one would think that it's an old issue, but sometimes even recent uploads don't have a template (or someone deleted it).
- If it's thought helpful for others, I can create subcategories for some or most of the above groups (obviously they should be deleted easily once empty or if a better one can be found).
- If it's easy to add by bot, a subcategory for frequently used files could be helpful. (it's doable with PetScan for a relatively small set, but not for all 337000 files in the category). In the subset I checked few had more than 30 main namespace uses (sample, now with template). Enhancing999 (talk) 11:21, 16 August 2024 (UTC)
- Flickr might be a good start for adding {{Information}} through statements only. We currently have ca. 4500 files mentioning Flickr. Some 2100 have both creator and source. An issue with some of these seems to be that they are blank. I brought this up at Schlurcherbot. Wouldn't the various Flickr templates also include source and creator? Enhancing999 (talk) 15:01, 16 August 2024 (UTC)
- @Enhancing999 thank you for creating the hiddencat - this makes it easier to get a clearly defined set of files as input for bulk modifications! I'm currently working on a bot that should be able to work through the grouped files, preferably writing SDC data wherever possible. But there are some points I'm not sure about:
- Date: We may simply take the year (as you've proposed earlier), but I found it would be quite easy for a bot (from a technical point of view) to use the oldest upload date. Is there a way to use inception (P571), qualifying the date as {{Upload date}} in SDC?
- Source: We can use either original creation by uploader (Q66458942) (if uploader and creator seem to be identical) or own work by the original uploader (Q87402110) (in other cases). I wonder if there's a way to additionally point to the source wikipedia (e.g. german wikipedia)?
 
- Fl.schmitt (talk) 16:11, 22 August 2024 (UTC)
- Commons_talk:Structured_data might find you help on the question specifically for structured data.
- If {{Information}} has no date, the line doesn't even appear as missing. Sample: Special:Diff/914493166.
- I noticed some uploaders use {{Own}} and link directly their username at Wikipedia. Not sure how bots handle this.
- Reimports from Wikipedia are tricky in general. See also: Commons:Village_pump#c-Jarekt-20240817151300-Asclepias-20240817140600 Enhancing999 (talk) 19:29, 22 August 2024 (UTC)
- @Enhancing999 - it took some time, but my bot solution is almost ready for action. Since handling weakly structured data is tricky, the bot first prepares (and has already prepared) just a "simulation" result, without any "live" modifications of Commons pages. This "simulation" result shows the proposed modifications for a certain set of file pages lacking {{Information}}. The bot tries to add as much information as possible via SDC (esp. Date and Author) and doesn't repeat those values in the generated {{Information}} template, since the template uses those SDC values by default. So, the template may look "incomplete" (for reference, see e.g. File:Karte Bodensee Birnau.png, where I manually added as much SDC as possible, leaving the respective fields in the Information template empty). The simulation result is available on gitlab in two formats: plain txt and SQL (sqlite). Before filing a bot request, I would be glad to receive any critical feedback regarding the proposed modifications. Fl.schmitt (talk) 18:31, 4 September 2024 (UTC)
 
 
 
- @Enhancing999 thank you for creating the hiddencat - this makes it easier to get a clearly defined set of files as input for bulk modifications! I'm currently working on a bot that should be able to work through the grouped files, preferably writing SDC data wherever possible. But there are some points where I'm not sure about:
 
- Flickr might be a good start to add {{Information}} through statements only. We currently have ca. 4500 files mentioning Flickr. Some 2100 have both creator and source. An issue with some of these seems to be that they are blank. I brought this up at Schlurcherbot. Wouldn't the various Flickr templates also include source and creator? Enhancing999 (talk) 15:01, 16 August 2024 (UTC)
 
- Good idea indeed. This could simplify adding only one aspect at a time (not everything can be determined with the same ease). 
 
- Enhancing999, Thank you for tackling this long neglected problem. I like your divide-and-conquer approach, and I agree with RZuo that Commons:AWB might be a good tool to use. That is what I used when some years ago I was adding infoboxes. Another possible approach might be to start adding com:SDC data like author, description and date with the QuickStatements tool. If you do that, then you can just add the {{Information}} template with no parameters and it will display SDC data. See File:Indoor_Climbing_Kid.jpg for example. If you have any questions about this approach I can explain in more detail. --Jarekt (talk) 04:21, 16 August 2024 (UTC)
 
Comment Has anyone looked at who the uploader was? Sometimes you can be lucky and find users who uploaded hundreds (or thousands) of files using the same way of adding information. So if user Foo uploaded 1000 photos with "<description>Taken by me.<license>" then it is possible to add an information template and put the <description> in the description field and add "Taken by me" in source field (or add {{Own}} instead) and add User Foo as author. If Foo is still active and did not add a good source/author the files could be added in a category called "Files uploaded by Foo" and then Foo could be asked to check the files and confirm to be the photographer. --MGA73 (talk) 18:49, 1 November 2024 (UTC)
- Give it a try. Please make sure to skip people who frequently imported from other wikis. 
 ∞∞ Enhancing999 (talk) 19:59, 1 November 2024 (UTC)
- If imported with FileImporter it can still work. I fixed thousands of files this way on various wikis before they were imported to Commons but also some after they were imported. --MGA73 (talk) 17:43, 2 November 2024 (UTC)
 
- Special:Search/"featured picture" incategory:"Media missing infobox template" (118) might be worth doing in priority. 
 ∞∞ Enhancing999 (talk) 08:57, 16 November 2024 (UTC)
Generate a daily database report equivalent of Special:UncategorizedCategories
| initial request and related discussion | 
|---|
| Generate a daily database report equivalent of Special:UncategorizedCategories For each page, output: 
 Ideally formatted in a template. Enhancing999 (talk) 14:27, 24 August 2024 (UTC) 
 | 
- Updated request (the reports were created a while ago and manually updated)
The following reports should be updated by bot:
- Commons:Report_Special:UncategorizedCategories (based on Quarry:query/86077, takes >10 minutes to run)
- Commons:Report_UncategorizedCategories_with_infobox (Quarry:query/85877, takes ∼1 minute to run)
Notes:
- When updating, after running the query, the resulting categories need to be null-edited and then the queries run again. Otherwise we get false positives due to template based categorizations (notably {{Wikidata Infobox}}).
- The count by user is added when it's formatted.
- The lines should be in a template for easier formatting.
- If it's easier to update, I could merge the two reports.
- Ideally, the reports are updated 6AM and 6PM UTC, so Europeans and Americans don't get too many entries that have already been dealt with.
The reports may appear short now, but not too long ago they were at 4000 categories total. I think this was partially due to Special:UncategorizedCategories having run only once a month.
The reports would be similar to w:Wikipedia:Database_reports/Uncategorized_categories. 
 ∞∞ Enhancing999 (talk) 12:08, 29 September 2024 (UTC)
- You can choose to download the results as a wikitable. Does that resemble the desired output? Wikiwerner (talk) 17:46, 20 November 2024 (UTC)
- A bit (compare with the pages). If you can automate that part, it would be a good start. 
 ∞∞ Enhancing999 (talk) 19:36, 24 November 2024 (UTC)
- I have given it a try. I let a script request the wikitable download URL and perform two regex replacements. (And now I see that you piped the Wikidata search link, unlike my script. That's fixed easily next time.) Wikiwerner (talk) 20:29, 27 November 2024 (UTC)
- Looks good. Thanks! 
 ∞∞ Enhancing999 (talk) 22:41, 27 November 2024 (UTC)
- The next step is running the query again. How do you do that? Wikiwerner (talk) 14:21, 1 December 2024 (UTC)
- If it's your own query, you have to log in to Quarry and click "submit query". There is a feature that makes forking other people's queries easy.
- As for pybot, I asked at mw:Topic:Yd8qqsrjykawj9v9. I couldn't get far with the "superset" solution mentioned there.
- In the meantime, I found that loading the most recent run is possible per m:Research:Quarry#Downloading_a_resultset.
- If you ask for access to toolserver, you could use m:Research:Quarry#Querying_ToolsDB_public_databases. 
 ∞∞ Enhancing999 (talk) 14:35, 1 December 2024 (UTC)
- Thank you very much. Now I can run the same script, with the same HTTP request, after each query run. The only thing we need is a way to trigger a new query run... Wikiwerner (talk) 17:02, 1 December 2024 (UTC)
- I have discovered how to do that. I have run the query and updated the report. Wikiwerner (talk) 11:21, 8 December 2024 (UTC)
 
 
Report update request (#2)
- Please also update these new reports with a bot:
- I suggest that these are updated twice a month at first. Frequency could be increased as needed.
- Here's how I update the reports manually (info on how this is done for the two reports above doesn't seem to be included): I go to the query page, click Download data and select csv. Then I open the csv in VSCodium (Visual Studio Code) and use this to add [[:Category: to the start and ]], to the end of every line as well as replacing all linebreaks. There is also a page 2 with only the first 500 items. I requested the queries here, so thanks to Matěj Suchánek. Changing the output to be ordered alphabetically would improve it. "redcats" refers to nonexisting categories – further explanations are at the top of these reports.
- By the way, I think "the resulting categories need to be null-edited" is too unclear. Prototyperspective (talk) 16:45, 7 October 2024 (UTC)
- @Wikiwerner: could you also look into these reports? I started this thread as a subthread of the thread right above where you participated. Prototyperspective (talk) 17:29, 31 January 2025 (UTC)
- I am running the query now, which I can post-process to update the report. I do not yet have a tool to update the word count. Does that matter? Wikiwerner (talk) 17:16, 2 February 2025 (UTC)
- Thank you. No, the paragraph about top word counts is not important. Enhancing999 added it afterwards. If you update the report, please simply remove that part. Prototyperspective (talk) 17:27, 2 February 2025 (UTC)
- Well, I have saved the output and removed the word count. The second report you mentioned, already appears to be updated every two weeks. Wikiwerner (talk) 19:03, 2 February 2025 (UTC)
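The manual post-processing described above (wrap every CSV line in [[:Category: and ]], and replace the linebreaks) could be scripted roughly as follows. This is only a sketch under assumptions: the function name `csv_to_category_links` is made up, the category title is assumed to sit in the first CSV column, the download is assumed to start with a header row, and linebreaks are assumed to be replaced by a single space.

```python
import csv
import io


def csv_to_category_links(csv_text: str, has_header: bool = True) -> str:
    """Turn a downloaded CSV of category titles into one line of wiki links."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    if has_header:
        # Assumption: the first row holds column names, not a category.
        rows = rows[1:]
    # Assumption: the category title is in the first column; skip empty rows.
    titles = [row[0] for row in rows if row and row[0].strip()]
    # Add "[[:Category:" to the start and "]]," to the end of every entry,
    # and join with spaces instead of linebreaks.
    return " ".join(f"[[:Category:{title}]]," for title in titles)


if __name__ == "__main__":
    sample = "cat_title\nFoo\nBar_baz\n"
    print(csv_to_category_links(sample))
```

Using the `csv` module rather than plain string replacement keeps titles that contain commas intact, since those are quoted in the download.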
 
 
Monuments database in Russia
Per discussion at Commons:Village pump#Monuments database in Russia.
There are >25K sub-categories of Category:Galleries of cultural heritage monuments in Russia (and about 275 in its subcategory, Category:Galleries of cultural heritage monuments in Crimea) named in the format (for example) Category:WLM/1010021052. That example duplicates Category:Threshing barn from Berezovaya Selga. The corresponding Wikidata item, Threshing barn from Berezovaya Selga (Q106488771), has a Wiki Loves Monuments ID (P2186) value of RU-1010021052 (note the "RU-" prefix). That Wikidata item is linked to the alphanumerically named, not numbered, category.
For each of those 25K categories, we need a bot to do the following:
1. Find the Wikidata item with the Wiki Loves Monuments ID (P2186) value (e.g. RU-1010021052)
   1.1. If no Wikidata item is found, write a log entry and skip to the next category
2. Find the Commons category that the Wikidata item is linked to
   2.1. If no Commons category is found, or if the linked category is of the numeric type, write a log entry and skip to the next category
3. Redirect the numeric category (e.g. Category:WLM/1010021052) to the latter category (e.g. Category:Threshing barn from Berezovaya Selga)
4. Ensure that the latter category transcludes {{Wikidata infobox}}
An alternative at 1.1 would be to create a Wikidata item; populating with data from e.g. https://ru-monuments.toolforge.org/wikivoyage.php?id=1010021052 - but this could be done later. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:14, 24 September 2024 (UTC)
- @Pigsonthewing How does this diff look on the WLM cat with redirect? -- DaxServer (talk) 11:23, 19 January 2025 (UTC)
- @DaxServer: Thank you. Looks good to me. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:44, 19 January 2025 (UTC)
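The per-category decision described in the request can be sketched as a small pure function. This is an illustration under assumptions, not the bot that was actually run: `plan_redirect` is a hypothetical name, and the two dictionaries stand in for the Wikidata lookups (P2186 value to item, item to linked Commons category), which a real bot would have to query. The infobox check in the last step is not covered.

```python
import re

# Numeric categories look like "Category:WLM/1010021052".
NUMERIC_CATEGORY = re.compile(r"^Category:WLM/(\d+)$")


def plan_redirect(category, item_by_wlm_id, category_by_item):
    """Return ("redirect", target) or ("log", reason) for one numeric category.

    item_by_wlm_id and category_by_item are stand-ins (assumptions) for
    lookups against Wikidata; they are plain dicts here.
    """
    match = NUMERIC_CATEGORY.match(category)
    if match is None:
        return ("log", f"{category} is not a numeric WLM category")
    # The Wikidata value carries an "RU-" prefix that the category name lacks.
    wlm_id = "RU-" + match.group(1)
    item = item_by_wlm_id.get(wlm_id)
    if item is None:
        return ("log", f"no Wikidata item for {wlm_id}")
    target = category_by_item.get(item)
    if target is None:
        return ("log", f"{item} has no Commons category")
    if NUMERIC_CATEGORY.match(target):
        return ("log", f"{item} is linked to numeric category {target}")
    return ("redirect", target)


if __name__ == "__main__":
    items = {"RU-1010021052": "Q106488771"}
    cats = {"Q106488771": "Category:Threshing barn from Berezovaya Selga"}
    print(plan_redirect("Category:WLM/1010021052", items, cats))
```

Keeping the decision separate from the editing makes it possible to dry-run the whole set of categories and review the log before any redirect is saved.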
 
geocode Sanborn maps
We have thousands of fairly local maps in Category:Sanborn maps: Special:Search/File: "Sanborn Fire Insurance Maps" (also Special:Search/File: "Sanborn Insurance Maps" or Special:Search/File: "Sanborn Maps").
It would be helpful if we had some geocoding for these. Not entirely sure where to get coordinates from, but it should be doable. 
 ∞∞ Enhancing999 (talk) 14:31, 15 October 2024 (UTC)
Sanborn maps add link to LOC description
Many Sanborn maps from LOC only have a link to the actual source of the map, which doesn't provide much context (Sample).
It would be helpful, if each file also had a link to the LOC description page for the series of map sheets.
Samples:
- this map has it both in file name and on the file description page.
- This table has a table of such links (first column: remove the "manifest.json"-part, individual files are in the last column).
Notes:
- This should concern the following files: Special:Search/File: Sanborn insource:tile.loc.gov/image-service -insource:"loc.gov/item/sanborn"
- Some of these files did include the LOC details in the filename, but were renamed.
∞∞ Enhancing999 (talk) 09:54, 19 October 2024 (UTC), updated 10:22, 26 October 2024 (UTC)
add Sanborn sheet numbers to files from LOC (possibly others)
Maps in some categories, e.g. Category:Sanborn Fire Insurance Map from Brooklyn, Kings County, New York, include the LOC detail, but no sheet numbers.
Sample:
- File:Sanborn Fire Insurance Map from Brooklyn, Kings County, New York. LOC sanborn05791 001-10.jpg shows part of sheet #4 (note the large "4" in the top right corner), but this isn't anywhere on the file description page or file name.
Notes:
- Some sheets are uploaded as single files, others are split into left and right sides (as in the sample), where only the right side shows the large sheet number.
- Renames solved this for some files, but the LOC details were not kept anywhere in the file description.
 ∞∞ Enhancing999 (talk) 10:22, 26 October 2024 (UTC)
move Sanborn tiffs to Category:LC TIF images with categorized JPGs
Files like this with a categorized jpg version like that can be moved to Category:LC TIF images with categorized JPGs. 
 ∞∞ Enhancing999 (talk) 20:35, 23 October 2024 (UTC)
Sanborn maps are suitable to be rectified with Wikimaps Warper. To do so, the files need to use {{Map}} instead of {{Information}}. This can be done simply by replacing {{Information with {{Map at the beginning of file description pages. 
Sample:
Notes:
- A few Sanborn pages are not maps and should remain with the information template: e.g. Sanborn maps title pages.
- The change could be included in other updates (see other requests).
- A good series of files to start with could be Sanborn index maps.
 ∞∞ Enhancing999 (talk) 08:16, 30 October 2024 (UTC)
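The template swap itself is a one-line text replacement. A minimal sketch, assuming the page wikitext is already available as a string; `information_to_map` is a hypothetical helper name, and the exclusion of non-map pages (title pages etc.) mentioned in the notes is not handled here.

```python
import re

# Match the opening of {{Information}} itself, not longer template names
# such as {{Information field}}: the name must be followed by "|", "}" or
# a line break (optionally after whitespace).
INFORMATION_OPEN = re.compile(r"\{\{\s*[Ii]nformation(?=\s*[|\n}])")


def information_to_map(wikitext: str) -> str:
    """Replace the first {{Information opening with {{Map."""
    return INFORMATION_OPEN.sub("{{Map", wikitext, count=1)


if __name__ == "__main__":
    page = "== {{int:filedesc}} ==\n{{Information\n|description=Sheet 1\n}}"
    print(information_to_map(page))
```

Only the first occurrence is replaced, so a nested or later {{Information}} on the same page would be left alone.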
Sending automated messages to users who upload suspicious SVG files claiming to be their authors
I've noticed there are a *lot* of users who mistakenly think that making a vector version of a file grants them authorship status. I was thinking as a means to reduce how often this happens, it would be helpful to have a bot that would promptly flag likely cases and inform the uploader about relevant Commons policies.
I'd like there to be a bot that monitors new uploads to Commons:
- For each new SVG file that is uploaded:
- Check the file description to see if ANY of the following are true: Author field is equal to [[User:UPLOADERNAME|UPLOADERNAME]], license tag contains self (e.g. {{PD-self}}, {{self|cc-by-sa-4.0}})
- If any of those cases are True:
- Check to see if there is a file already on Commons with the same name but different file extension (e.g. File:MyCityFlag.svg was just uploaded, and File:MyCityFlag.jpg already exists)
- If there is a filename match, add an appropriate tag on the image's page and leave an automated message on the user's talk page about it
There are a couple of possible tags that could be added to the file's page. I'm thinking {{Disputed}} or {{Wrong license}} but there may be something more suitable.
As for the automated message to leave on users' talk pages, I'm thinking something along the lines of:
Dear USERNAME, thank you for your contribution to Commons! I am an automated bot and am responding to your upload of FILENAME. It appears you have listed yourself as the author, or used a license tag that implies you are the copyright holder of this file.
Making a vector version of an existing work of art is, in copyright terms, seen as a derivative work. While creating a vector version requires skill and effort, it is still legally considered derivative.
It appears your upload is a vector version of RASTER-FILENAME. The description of UPLOAD should be updated to ensure the original designer of the work is credited as author, and that the license tag reflects the copyright status held by the original designer.
You can credit yourself as the vectorizer using the Igen template. For example, add: |other fields={{Igen|Inkscape|+|u=[[User:USERNAME|USERNAME]]}} (replace Inkscape with relevant software as needed, information at Template:Image generation).
Please update the description of FILENAME promptly. Your contribution to Commons is appreciated!
I don't have any experience with running bots on wikis so I'm afraid I don't know how technically difficult this will be. The automated message should probably be refined - it's just a first draft. But I'm hoping this sort of thing will help new users figure this out much sooner and reduce how many files with inappropriate authorship/licensing need to be fixed. Intervex (talk) 21:37, 26 November 2024 (UTC)
- I think better yet since *other* users might not know the name of the authors:
Dear USERNAME, thank you for your contribution to Commons! I am an automated bot and am responding to your upload of FILENAME.
It appears you have listed yourself as the author, or used a license tag that implies you are the copyright holder of this file.
Making a vector version of an existing work of art is, in copyright terms, seen as a derivative work.
While creating a vector version requires skill and effort, it is still legally considered derivative. It appears your upload is a vector version of RASTER-FILENAME.
The description of UPLOAD should be updated to ensure the original designer of the work is credited as author, and that the license tag reflects the copyright status held by the original designer.
You can credit yourself as the vectorizer using the Igen template. For example, add: |other fields={{Igen|Inkscape|+|u=[[User:USERNAME|USERNAME]]}} (replace Inkscape with relevant software as needed, information at Template:Image generation).
If you are unable to find the name of the author, feel free to add: Template:Unknown author.
Please update the description of FILENAME promptly. Your contribution to Commons is appreciated!
SpinnerLaserzthe2nd (talk) 02:30, 28 November 2024 (UTC)
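For anyone picking this up, the two checks from the request (self-authorship claim, then a same-name file with another extension) could be prototyped without any wiki access first. The sketch below works on plain strings and a set of existing titles; all function names are hypothetical, the list of raster extensions is an assumption (the request only says "different file extension"), and the regexes are rough heuristics that would need testing against real file pages.

```python
import re

# Heuristic: any template whose name contains "self",
# e.g. {{PD-self}} or {{self|cc-by-sa-4.0}}.
SELF_LICENSE = re.compile(r"\{\{[^{}|]*self[^{}|]*[|}]", re.IGNORECASE)
AUTHOR_FIELD = re.compile(r"\|\s*author\s*=\s*(.*)", re.IGNORECASE)

# Assumption: which extensions count as a raster original.
RASTER_EXTENSIONS = (".jpg", ".jpeg", ".png", ".gif")


def claims_own_authorship(wikitext: str, uploader: str) -> bool:
    """True if the author field links the uploader or a self licence is used."""
    author_link = f"[[User:{uploader}|{uploader}]]"
    author = AUTHOR_FIELD.search(wikitext)
    author_is_uploader = author is not None and author.group(1).strip() == author_link
    return author_is_uploader or SELF_LICENSE.search(wikitext) is not None


def find_raster_twin(svg_title: str, existing_titles):
    """Return an existing title with the same name but a raster extension."""
    stem, dot, ext = svg_title.rpartition(".")
    if not dot or ext.lower() != "svg":
        return None
    for raster_ext in RASTER_EXTENSIONS:
        if stem + raster_ext in existing_titles:
            return stem + raster_ext
    return None


def raster_to_report(wikitext, uploader, svg_title, existing_titles):
    """Return the raster title to mention in the message, or None."""
    if not claims_own_authorship(wikitext, uploader):
        return None
    return find_raster_twin(svg_title, existing_titles)
```

For example, `raster_to_report(text, "Foo", "File:MyCityFlag.svg", {"File:MyCityFlag.jpg"})` returns `"File:MyCityFlag.jpg"` when `text` credits User:Foo as author, and that title could then fill the RASTER-FILENAME slot in the draft message.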
Adding cat "Animated GIF files" to all instances of such
Category:Animated GIF files is a fairly flat category that probably contains more than half of all animated GIF files, including most of those in use. However, I noticed it's quite unreliable and is missing a large fraction of animated GIF files. Could a bot please add this inferrable category to all files with the GIF filetype that are animated?
This can be useful to later have a filter for animated GIF files, to complete Animations of xyz categories, for deepcategory searches and Petscans for animated GIF files specifically, and for allowing searching of all animated GIF files (e.g. via the category search box at the top of that page).
I don't know how one could check whether a GIF file is animated, but there probably is a way (maybe using a machine vision package, but quite possibly in a much easier way). If somebody knows how that could be done, please add info about that here.
See this search query.
 Prototyperspective (talk) 12:10, 2 December 2024 (UTC)
- According to Google it's fairly straightforward in most programming languages to check if a gif file is animated. Is there a way or wiki API call that can tell if a Gif is animated without downloading it first? --Schlurcher (talk) 17:21, 2 December 2024 (UTC)
- I'll probably implement this during the Christmas break. --Schlurcher (talk) 14:36, 12 December 2024 (UTC)
- Sounds great! I don't know if there is an API to check whether or not it's animated but that service may need to download the full thing as well (don't know if it can just selectively download some of its metadata). Prototyperspective (talk) 15:55, 12 December 2024 (UTC)
 
 
- Short status update. I've now updated my bot to add instance of (P31) → animated GIF (Q11201061) to all animated Gif files it touches. However, adding the category Category:Animated GIF files sounds straightforward, but it is not. For one, it is not a flat category, as it has subcategories that should be excluded. I also think that structured data is the correct way to proceed here. Any thoughts? --Schlurcher (talk) 12:46, 15 December 2024 (UTC)
- Thanks for that. If there is a way to autoadd categories, it does need a way to exclude subcategories (so as not to add a category above a category that's already set). However, that may be the only thing that's needed and such a way would be very useful. Maybe just having some structured data set would be best for metadata like this that is about the kind of file. However, currently I think it's not because
- that SD is not yet added automatically and otherwise people need a way to quickly conveniently add categories using HotCat or CataLot to specify this (maybe this could be changed with the bot)
- one can't search / filter via the SD as far as I know which is I think the current main use of this cat – one can do things like deepcategory:"Animated_GIF_files" time-lapse -deepcategory:"Time-lapse animations" or use the search box at the top of the Animated_GIF_files cat.
 
- Prototyperspective (talk) 13:43, 15 December 2024 (UTC)
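On the question of how to tell whether a GIF is animated: no machine vision is needed, since a GIF with more than one image block is animated. A minimal sketch in plain Python that walks the block structure of the file and counts image descriptors is shown below. It assumes the file bytes have already been downloaded (it does not answer whether an API call can avoid the download), the function names are made up for illustration, and truncated or damaged files are not handled gracefully.

```python
def _skip_sub_blocks(data: bytes, pos: int) -> int:
    """Skip a chain of data sub-blocks; return the position after the terminator."""
    while pos < len(data):
        size = data[pos]
        pos += 1
        if size == 0:  # zero-length sub-block terminates the chain
            break
        pos += size
    return pos


def count_gif_frames(data: bytes) -> int:
    """Count the image descriptor blocks (frames) in a GIF byte string."""
    if len(data) < 13 or data[:6] not in (b"GIF87a", b"GIF89a"):
        raise ValueError("not a GIF file")
    packed = data[10]
    pos = 13  # 6-byte header + 7-byte logical screen descriptor
    if packed & 0x80:  # global colour table present
        pos += 3 * (2 ** ((packed & 0x07) + 1))
    frames = 0
    while pos < len(data):
        block = data[pos]
        pos += 1
        if block == 0x3B:  # trailer: end of file
            break
        if block == 0x21:  # extension: label byte, then sub-blocks
            pos = _skip_sub_blocks(data, pos + 1)
        elif block == 0x2C:  # image descriptor: one frame
            frames += 1
            local_packed = data[pos + 8]
            pos += 9
            if local_packed & 0x80:  # local colour table present
                pos += 3 * (2 ** ((local_packed & 0x07) + 1))
            pos = _skip_sub_blocks(data, pos + 1)  # +1 skips LZW code size
        else:
            raise ValueError(f"unexpected block type {block:#x}")
    return frames


def is_animated_gif(data: bytes) -> bool:
    """A GIF with more than one frame is treated as animated."""
    return count_gif_frames(data) > 1
```

With a local file this would be called as `is_animated_gif(open(path, "rb").read())`. Image libraries offer the same check with less code; the hand-written parser is used here only to keep the example free of dependencies.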
 
Mass changing WD statements about files
I have my own bot, but I need to manage WD statements linked with files; they aren't stored in wikitext, so my bot can't change them. I need, for several categories, to do a job: remove one WD property and add another, with a different value for different categories. Could someone here do that? MBH 08:23, 12 December 2024 (UTC)
- @Schlurcher or Mike Peel: Could you help out with this?   — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 09:35, 12 December 2024 (UTC)
- Some more detail would be needed. Also sounds like a job that could be done with AC/DC gadget. --Schlurcher (talk) 14:32, 12 December 2024 (UTC)
- @Schlurcher my case is described on phab:T381945. On categories like Category:Views_from_The_First_Tower_observation_deck it's needed to set P1071 for all files and, if it exists, remove P180, because earlier I was uploading such batches setting the name of the summit in the P180 property instead of P1071. MBH 02:50, 13 December 2024 (UTC)
- @PMG looks like you're doing this automatically, how did you do it? MBH 02:54, 13 December 2024 (UTC)
- These edits were done with Help:Gadget-ACDC which was also my first suggestion. --Schlurcher (talk) 06:50, 13 December 2024 (UTC)
- @MBH - I am using AC/DC. There is also an option to remove properties so you can both remove and add something. PMG (talk) 19:12, 15 December 2024 (UTC)
 
 
- I've not figured out bot editing of SDC yet, I suggest asking @Multichill: . Thanks. Mike Peel (talk) 17:41, 12 December 2024 (UTC)
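Whatever tool ends up doing the edits, the per-file change requested above (drop the mistaken P180 value, make sure P1071 is set) can be written down as a small planning function and checked offline first. This is a sketch under assumptions: `plan_statement_changes` is a hypothetical name, the file's statements are modelled as a plain dict of property ID to list of item IDs, only the P180 statement with the mistakenly used value is removed (other depicts statements are kept), and the QIDs in the example are placeholders, not the real items.

```python
def plan_statement_changes(statements, wrong_depicts_qid, location_qid):
    """Return a list of (action, property, value) tuples for one file.

    statements: dict mapping property IDs to lists of item IDs, a simplified
    stand-in (assumption) for the file's structured data.
    """
    operations = []
    # Remove P180 only where it carries the value that was set by mistake.
    if wrong_depicts_qid in statements.get("P180", []):
        operations.append(("remove", "P180", wrong_depicts_qid))
    # Add P1071 unless the file already has it with the wanted value.
    if location_qid not in statements.get("P1071", []):
        operations.append(("add", "P1071", location_qid))
    return operations


if __name__ == "__main__":
    # Placeholder QIDs for illustration only.
    current = {"P180": ["Q111", "Q222"]}
    print(plan_statement_changes(current, "Q111", "Q333"))
```

Running this over all files in a category gives a list of operations that can be reviewed before anything is written, and the values can differ per category as asked.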
 
US GOV accounts on Flickr
I am requesting an upload of all USGOV Flickr accounts.
Unfortunately, many of them are locked behind the copyright tag. The C copyright tag is (unfortunately) the default tag on Flickr and most likely was never changed to the proper tag of being public domain as a USGOV work. A change to USGOV means it has to be manually changed, which someone never did.
I say this because there was a recent change in administration that seemed to aim to gut the govt, including shuttering US A.I.D. I’m just concerned the Flickr images will get deleted. Thank you. SeichanGant (talk) 17:57, 17 February 2025 (UTC)
- I would suggest asking for a bot to review the account and if there is a copyright tag, make a list somewhere on site and let a human examine it. Otherwise download the details. Leave the bot operator to handle the bot tasks and let us humans do what we can. Ricky81682 (talk) 20:33, 17 February 2025 (UTC)
Requesting edit or deletion on all 1,500+ map files in a category
Hello! I created NPMaps.com in my spare time and have digitized hundreds of maps that have since been scraped and added to Wikimedia Commons (see: Category:Files_from_the_National Park Service uploaded by RKBot). While I’m happy to support the discovery of these maps, there are a few issues with the automatically generated file summaries for almost all files. Let’s use the Arches map as an example: NPS_arches-map.jpg.
- Source Field: Currently, it links directly to the file hosted on my server. I’d like this changed to the page where the map is featured—in this case, https://npmaps.com/arches/. I do not want my file uploads linked directly.
- Source Field: The images were actually sourced from my site, not the National Park Service. Therefore, the source should reflect the name of my website—“National Park Maps”—or the specific source page’s title. So a way forward could be: “Arches National Park Maps (https://npmaps.com/arches)”.
- Author Field: Although the author information is correct, I do not want my personal name appearing. I'd rather it refer to the site, e.g.: “U.S. National Park Service, restoration/cleanup by National Park Maps (https://npmaps.com)”.
I’m not entirely sure what the policy is regarding including URLs in the Source and Author fields, but I’ve spent hundreds of hours digitizing and cleaning up these maps, so including the source credits as requested seems appropriate. If this isn’t possible, I’d be fine with having the files marked for deletion and removed—instead, a bot could scrape nps.gov to find the actual original source where they are truly public domain maps, and not ones I've spent many hours providing.
I’ve experimented with the batch edit tools but sadly haven’t been able to figure it out myself, so I'd appreciate it if someone can make these changes if possible. Thank you for your help! Npmaps (talk) 23:52, 18 February 2025 (UTC)
