@petrescatraian I noticed that.
Will correct it!
By the way, you are the most advanced user ;)
#nobot
Hrmmm I have #nobot in my profile and I *still* get those damn bots boosting me. Maybe it's case sensitive ;) or I made a mistake?
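On the case-sensitivity question above: a well-behaved bot could compare tags case-insensitively, so #nobot, #NoBot and #NOBOT all count. Here is a minimal sketch of such a check, not how any particular bot actually does it. The function name `has_opt_out` and the tag set are hypothetical, and it assumes the bio has already been reduced to plain text.

```python
import re

# Hypothetical opt-out tags; both "nobot" and "nobots" are seen in bios.
OPT_OUT_TAGS = {"nobot", "nobots"}

# A hashtag is "#" followed by word characters.
HASHTAG_RE = re.compile(r"#(\w+)")


def has_opt_out(bio: str) -> bool:
    """Return True if the plain-text bio carries an opt-out hashtag, ignoring case."""
    tags = {tag.casefold() for tag in HASHTAG_RE.findall(bio)}
    return not tags.isdisjoint(OPT_OUT_TAGS)
```

Matching on whole tags rather than substrings means a tag like #nobotany would not be mistaken for an opt-out.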
Interesting program for protecting our web pages from unwanted scrapers.
#noAI #noIA #nobot #anubis
https://anubis.techaro.lol/docs/
@manna Apparently some kind of data harvester / filtering tool. At the end of the JSON there were some links to the project, and there I found the opt-out instructions: https://github.com/awakari/.github/blob/master/OPT-OUT.md . #nobot in the profile will probably be enough.
@eatyourglory @noondlyt It's a proposal to add an extension/field to the protocol to let people explicitly express consent or lack of consent for a few different types of uses of their data. Something that could very well be a thing in ActivityPub too, if it isn't already. Sort of like the #NoBot hashtags people put in the bio, but in a more organized format.
@MalthusJohn we allow bots on mas.to, but have very specific rules about how they can operate:
“All automated (bot) accounts must enable the bot flag in their account preferences, respect #nobot markers, and post as unlisted unless making less than one post per hour.”
So they’re easy to block and won’t flood your local timeline.
I’d say our moderation team is great, but, then, I’d be biased.
Seriously, though. I think we do a pretty good job, and we’d love to have you.
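For bot authors, the posting part of the rule quoted above could be sketched roughly as follows. This is one reading of "less than one post per hour" (at least an hour since the previous post), not an official mas.to implementation, and the function name `choose_visibility` is hypothetical. The strings "public" and "unlisted" are the usual Mastodon visibility values.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def choose_visibility(last_post_at: Optional[datetime], now: datetime) -> str:
    """Pick a post visibility under the assumed reading of the rule.

    Public is allowed only if the bot has not posted within the last hour;
    otherwise the post goes out as unlisted.
    """
    if last_post_at is None or now - last_post_at > timedelta(hours=1):
        return "public"
    return "unlisted"


# Example: a bot that last posted 20 minutes ago should post as unlisted.
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
visibility = choose_visibility(now - timedelta(minutes=20), now)
```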
Poll time (Multiple selection possible):
Should I make a "Summoned MIKU" bot that posts the "OOO-EEE-OOO" video to every post mentioning Hatsune Miku (and ofc respects the #nobot flag)?
And
Should I look into a bot that Markov-Chains Bible verses? (could be offensive to people who are Christians)
@AccidentalHaiku Your bot found and boosted my post that used no hashtags, meaning it had to scrape and process text. I also have had #NoIndex and #NoBot in my profile for years now. Does this bot intentionally disregard the filter flags or is this an oversight?
Please correct this to respect the flags some of us use to prevent this kind of data capture.
So... not a topic getting much interest, let me decide by myself.
Not a DoS as requests are coming every other second... not blocked by fail2ban as the IPs are numerous and changing... not causing any trouble, I don't see what they can achieve by doing this as requests seem legitimate and are reading random posts... but clearly not human nor social interactions, so undesirable. Maybe scraping content for AI training?
I will block the offending IPs this evening. This should not affect our other Singaporean users, who, let's be honest, are (understandably) not very many. Nor will it affect federation, as the user agent is deliberately misleading.
I encourage other administrators to monitor this kind of case and to take action if necessary. Let's try and avoid scraping bots when we can.
Need to check if #noBot in the profile prevents this…
Or maybe just avoid those hashtags
@ciccillo @c558c7cc69bbda3c271782b736babc64acd2da258b14f356dbca966cb0b7b89e
because it's a nosy bot, and you can't even tell whether it respects any #nobot or #nobots in the bio, booooo poop yuck booooo
I don't accept follows from people who have no posts and no introduction. I'm in public mode for now, to explore and get to know people, but public mode isn't the one I'm comfortable with, so don't invite me if your profile is empty, or else speak up by sending me a message, thanks.
As you can see from https://cathode.church/fedi-scraper-counter.html they don't respect the #NoBot, #NoAI, etc. tags. And in my experience, especially online, people are rarely searching in good faith; they want someone to pick at. But the bigger issue is that the above violates consent: even if I have the #NoBot etc. tags on my profile, every one of those would scrape my content anyway. The other half of this is refusing to comprehend that there are a large number of people who don't want to be others' content and want to post among their federated circle, valuing organic, human growth over someone just randomly finding them. This isn't fully me: anyone can see me, but I don't wanna interact with everybody, which is why I don't have #WebMentions on my blog, for example. In truth, I don't want any data of theirs on my website, period, and this includes social media replies. I'd like a one-way web mention instead, where they know I linked them but interaction is on my terms, not theirs. But going back to the above, it gives examples where consent is violated. Not everyone is worth interacting with online. @miturian