This project page in other languages:

Shortcut: COM:BRFA

Bot help and list · Requests to operate a bot · Requests for work to be done by a bot  · Requests for batch uploads
Gnome-system-run.svg

If you want to run a bot on Commons, you must get permission first. To do so, file a request following the instructions below.

Please read Commons:Bots before making a request for bot permission.

Requests made on this page are automatically transcluded in Commons:Requests and votes for wider comment.

Requests for permission to run a bot

Before making a bot request, please read the new version of the Commons:Bots page. Read Commons:Bots#Information on bots and make sure you have added the required details to the bot's page. A good example can be found here.

When complete, pages listed here should be archived to Commons:Bots/Archive.

Any user may comment on the merits of the request to run a bot. Please give reasons, as that makes it easier for the closing bureaucrat. Read Commons:Bots before commenting.

LicenseReviewerBot (talk · contribs)

Operator: Bd9a119b5d05019d7c923207398ef3c3 (talk · contributions · Number of edits · recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Automatically review audio files & image Files.

  • Also patrol the Recent Changes for new files(video,audio and image) uploads by new users and automatically review those.

Automatic or manually assisted: Automatic and unsupervised

Edit type (e.g. Continuous, daily, one time run): Continuous

Maximum edit rate (e.g. edits per minute): less than 5 per minute at peaks

Bot flag requested: (Y/N): Already Flagged

Programming language(s): Python

Bd9a119b5d05019d7c923207398ef3c3 (talk) 06:11, 23 October 2021 (UTC)[reply]


There are not enough license reviewers on commons and also there are no incentives for wasting time reviewing random files. The number of unreviewed images doubled from 17,474 in October 2020[1] to 34,551 in October 2021[2]. Oh wait, was it because of the pandemic? No, the number of unreviewed images quadrupled between October 2019 and October 2020 and has been increasing for more than 5 years. Same for the audio files. But last year I started LicenseReviewerBot(and before that now deprecated YouTubeReviewBot in 2019) for videos and the number of unreviewed videos decreased from 9,651(in April 219) to 1,049(October 2021). The YouTubeReviewBot didn't checked the video but only the license and had some false positives but the more advanced LicenseReviewerBot checks the full video and the license and has zero false positives reported so far.

Technical stuff
  • The bot will use acoustic fingerprinting, video hashing and image hashing for reviewing the files.
  • Image-hashing is known to be vulnerable to GAN attacks but such attacks can be easily prevented by varying the parameters used for the dimension reduction procedure. E.g. choose variable number of bits in the bit-string(You are therefore changing the number of characteristic pixels used to generate the hash value and GAN generated images can not pass the review unless the generated image is a copy of the original image.)
  • All URLs on the file-page will be archived on the Wayback machine unless they are already 404 or they don't allow archiving by the Internet Archive bot.
  • The bot will exclude files as a feature and not as a limitation that are currently reviewed by other bots.
Discussion

Please discuss below this comment. -- Bd9a119b5d05019d7c923207398ef3c3 (talk) 06:11, 23 October 2021 (UTC)[reply]

  • If the technical risks have seemingly been retired, I see no reason to not at least grant a trial run. Sounds promising. Huntster (t @ c) 13:25, 25 October 2021 (UTC)[reply]
References
  1. https://web.archive.org/web/20201003193649/https://commons.wikimedia.org/wiki/Category:License_review_needed
  2. https://commons.wikimedia.org/wiki/Category:License_review_needed
  • Please make an extended test run. --Krd 07:50, 10 November 2021 (UTC)[reply]
Will start extended test run on or before 20th Nov(next Weekend), please note that the bot is active for previously approved videos and are not part of the test. -- Bd9a119b5d05019d7c923207398ef3c3 (talk) 07:06, 14 November 2021 (UTC)[reply]

DigitaltMuseumBot (talk · contribs)

Operator: DigitaltMuseum (talk · contributions · Number of edits · recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Batch upload of photos from digitaltmuseum.org. Photos are manually selected by the publisher.

Automatic or manually assisted: manually assisted

Edit type (e.g. Continuous, daily, one time run): one time run

Maximum edit rate (e.g. edits per minute): maximum 10/min

Bot flag requested: (Y/N): Y

Programming language(s): Python (pywikibot)

DigitaltMuseum (talk) 09:24, 17 September 2021 (UTC)[reply]

Discussion

What about User:BotKulturIT? --Achim (talk) 11:18, 17 September 2021 (UTC)[reply]

@Achim55: We're new with Wikimedia and messed up by creating 2 bot users. Then later found out we couldn't delete. User:BotKulturIT won't be used. --DigitaltMuseum (talk) 10:35, 23 September 2021 (UTC)[reply]
Ok, I redirected it. Thanks for contributing here! --Achim (talk) 13:21, 23 September 2021 (UTC)[reply]
How many files are intended to be uploaded? Please make a few test uploads. --Krd 16:16, 23 September 2021 (UTC)[reply]
@Achim55:  ? --Krd 09:15, 8 October 2021 (UTC)[reply]
I have no idea. I've had looked up their website and found some valuable content there. I sent a wikimail to them. --Achim (talk) 20:49, 8 October 2021 (UTC)[reply]
@Achim55: @Krd: We don't have an accurate number of files. I'll try to explain what we're trying to achieve: Digitaltmuseum currently has 6.8 million objects, many with images and some with multiple images. The objects are owned by an institution, usually a museum. Our plan is to develop a service which enables these owners to
  1. collect images they want to upload to Wikimedia
  2. generate mapping files based on the collected images and data from DigitaltMuseum. They must upload these to the institution's page on Wikimedia
  3. upload the collection in the institutions's name, using DigitaltMuseumBot
The workflow and mapping is based on this project from Nordiska museet: https://github.com/NordicMuseum/Wikimedia-Commons-uploads
--DigitaltMuseum (talk) 10:44, 12 October 2021 (UTC)[reply]
DigitaltMuseum, that's fine. For to see that the bot is running correctly we need a few test uploads by the bot. Another thing: If you are editing personally please use your 'DigitaltMuseum' account and leave the 'DigitaltMuseumBot' account for the bot's edits only, thank you. --Achim (talk) 11:50, 12 October 2021 (UTC)[reply]
@Achim55: Thanks for the tip! We would like to do test uploads to the beta cluster. In the "Do a test upload" section of Guide to batch uploading there's a dead link to an explanation. Do you know if it's available somewhere? --DigitaltMuseum (talk) 13:41, 12 October 2021 (UTC)[reply]
@Achim55: @Krd: Created user & botuser on beta and copied the bot userpage from DigitaltMuseumBot. The abuse filter didn't like the content and both user & IP is now blocked ("New user adding external links in userspace") :/ Any tip on how to unblock? --DigitaltMuseum (talk) 07:02, 13 October 2021 (UTC)[reply]
. Please provide links to the blocked users. --Krd 13:44, 19 October 2021 (UTC)[reply]
@Krd: https://commons.wikimedia.beta.wmflabs.org/wiki/User:DigitaltMuseum & https://commons.wikimedia.beta.wmflabs.org/wiki/User:DigitaltMuseumBot --DigitaltMuseum (talk) 10:29, 26 October 2021 (UTC)[reply]
Now unblocked. --Krd 12:10, 26 October 2021 (UTC)[reply]
@DigitaltMuseum: Is this ready for a test run? --Krd 14:18, 9 November 2021 (UTC)[reply]

PamputtBot (talk · contribs)

Operator: Pamputt (talk · contributions · Number of edits · recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Lingua Libre contributors upload many word pronunciation recordings on Wikimedia Commons. This is done automatically at the last step of the Record Wizard on Lingua Libre. During upload, the {{Lingua Libre record}} template is used to add information related to each recording (see for example: LL-Q150 (fra)-Lepticed7-uncio.wav).

Problem happens when the Lingua Libre contributor has selected a wrong language before starting the recording session. The consequence is that that the language information in {{Lingua Libre record}} is not correct and need to be fixed (in the example given before, the correct language is Esperanto (instead of French). PamputtBot aims to replace the erroneous language by the correct one on the Lingua Libre item of the recordings and also here on Wikimedia Commons

If I get the bot status, then I will request for filemover right so that I can also rename the file with the correct name (example: File:LL-Q150 (fra)-Lepticed7-uncio.wav has to be renamed into File:LL-Q143 (epo)-Lepticed7-uncio.wav)

Automatic or manually assisted: automatic (random manual check)

Edit type (e.g. Continuous, daily, one time run): a first batch to fix all wrong recordings that have already been recorded in the past and then each time new recordings need to be fixed.

Maximum edit rate (e.g. edits per minute): less than 50 edits per minute (I guess)

Bot flag requested: (Y/N): Y

Programming language(s): Python (based on PWB)

Pamputt (talk) 11:32, 3 August 2021 (UTC)[reply]

Discussion

This is my first bot request on Wikimedia Commons so tell me if something is not clear or missing. Pamputt (talk) 11:32, 3 August 2021 (UTC)[reply]

Discussion
  • How correct language is determined? --EugeneZelenko (talk) 14:08, 3 August 2021 (UTC)[reply]
    For the recordings already identified, they are listed on LinguaLibre:Misleading items. In addition, there are tagged on Lingua Libre using the P33 property (see for example Q53462). For those that will be identified in the future, either they are reported by the recorder on the chat room (on Lingua Libre) or they are detected by the Lingua Libre patrollers. Pamputt (talk) 15:25, 3 August 2021 (UTC)[reply]

@EugeneZelenko: how long does it take to get the bot status? Should I do something more begore getting this status? Pamputt (talk) 14:39, 5 September 2021 (UTC)[reply]

Test run is always good idea. --EugeneZelenko (talk) 14:42, 5 September 2021 (UTC)[reply]
What is the status of this request? Please make another test run. --Krd 09:15, 8 October 2021 (UTC)[reply]
@Pamputt:  ? --Krd 13:40, 19 October 2021 (UTC)[reply]
@Krd: sorry for your previous message, I missed it. Here is the status: the bot is working the same way as before (the few tests I did in September). I can run it again on a few other files if needed. That's said, I would like to improve the bot so that it is able to modify the structured data of the files (to modify P407 at the same time) but I did find any time to look at how to do that. Are you aware of bot code (or documentation) that show/explain how to modify Structured Data? Pamputt (talk) 17:41, 19 October 2021 (UTC)[reply]
Please make another test run that includes a few direct renames. (I have no good idea on the documentation question.) --Krd 04:34, 20 October 2021 (UTC)[reply]
By "direct rename", you mean the bot modify the file name in addition to the Commons page content? If so, I cannot because my bot has no filemover right. Should I request this right in parallel of the bot status. Sorry for all this naive question but this is the first time I write a bot on Wikimedia Commons so I am not fully aware how it works here. Pamputt (talk) 05:14, 20 October 2021 (UTC)[reply]
I think you should have seen the user right changes on your watchlist and also got an e-mail about it. If not, please see: https://commons.wikimedia.org/wiki/Special:Log?type=rights&page=user%3APamputtBot --Krd 06:15, 20 October 2021 (UTC)[reply]
Please report current status. Is this ready for a test run per above? --Krd 14:15, 9 November 2021 (UTC)[reply]
Sorry for the delay, but I am busy for now with another topic. I hope to give some news by the end of November. Pamputt (talk) 01:18, 18 November 2021 (UTC)[reply]