• leopoldEnglish
    272 months ago
    link
    fedilink

    this is just going to cause indexers to ignore robots.txt

    • gedaliyahOPEnglish
      212 months ago
      link
      fedilink

      “We always obey the robots.txt”

      • A bunch of corporations that have no accountability and plenty of incentive to just ignore it and have all been caught training AI on off-limits data.
    • capitalEnglish
      62 months ago
      link
      fedilink

      Rate limiting could “fix” that unfortunately.

    • KairosEnglish
      52 months ago
      link
      fedilink

      They’re likely blocking user agents too, which I think also doesn’t have legal enforcement (as in DuckDuckGo can just use “Google” unless they said otherwise.