Mathers
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
3dcadmin@lemmy.relayeasy.com to Selfhosted@lemmy.worldEnglish · 1 year ago

Cloudflare blocking AI crawlers

lemmy.relayeasy.com

message-square
33
fedilink
352

Cloudflare blocking AI crawlers

lemmy.relayeasy.com

3dcadmin@lemmy.relayeasy.com to Selfhosted@lemmy.worldEnglish · 1 year ago
message-square
33
fedilink

Cloudflare trying to stop AI crawling somehow!

https://arstechnica.com/tech-policy/2025/07/pay-up-or-stop-scraping-cloudflare-program-charges-bots-for-each-crawl/

  • irmadlad@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    1 year ago

    I’m pretty sure some auto drive company is getting the advantage

    I’d recon that a lot of that is spliced from pictures captured from Google Map vehicles.

    • w3dd1e@lemmy.zip
      link
      fedilink
      English
      arrow-up
      13
      ·
      edit-2
      1 year ago

      Both you and @DoucheBagMcSwag@lemmy.dbzer0.com are correct. Google bought reCAPTCHA in 2012.

      Here’s an article about it from 2018.

      (╯°□°)╯︵ ┻━┻

      Captcha if you can: how you’ve been training AI for years without realising it

      And another from 2019! Captchas got harder for us because the AI had learned from our training.

      Why CAPTCHAs have gotten so difficult

      • irmadlad@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 year ago

        A few years ago I picked up an online gig with a company that trained AI. You’d log in to your dashboard and be presented with questions you had to answer in the best way, such as ‘Is the earth round?’. Well, it’s round in nature but is not perfectly round. So you’d have to pick the best solution from the answer list. It was interesting, but tedious. It put taters on the table, so I got that going for me…which is nice.

      • DoucheBagMcSwag@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        Fucking hell

Selfhosted@lemmy.world

selfhosted@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !selfhosted@lemmy.world

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

Detailed Rules Post

  1. Be civil.

  2. No spam.

  3. Posts are to be related to self-hosting.

  4. Don’t duplicate the full text of your blog or readme if you’re providing a link.

  5. Submission headline should match the article title.

  6. No trolling.

  7. Promotion posts require active participation, with an account that is at least 30 days old. F/LOSS without a paywall has exceptions, with requirements. See the rules link for details.

Resources:

  • selfh.st Newsletter and index of selfhosted software and apps
  • awesome-selfhosted software
  • awesome-sysadmin resources
  • Self-Hosted Podcast from Jupiter Broadcasting

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 241 users / day
  • 2.49K users / week
  • 6.38K users / month
  • 16.5K users / 6 months
  • 1 local subscriber
  • 60.4K subscribers
  • 4.37K Posts
  • 104K Comments
  • Modlog
  • mods:
  • Ruud@lemmy.world
  • Loki@lemmy.world
  • CannaVet@lemmy.world
  • devve@lemmy.world
  • ayyy@sh.itjust.works
  • curbstickle@anarchist.nexus
  • curbstickle_lw@lemmy.world
  • BE: 0.19.4
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org