• melroy
    23 days ago

    Also, you don’t want to block legit search engines that are not scraping your data for AI.

    • gravitas_deficiency
      23 days ago

      Again: it's hard to differentiate all those bots, because you have to trust that they are what they say they are, and they often are not.
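One way to avoid trusting the User-Agent string alone is forward-confirmed reverse DNS: look up the hostname for the client IP, check that it belongs to a crawler domain you trust, then resolve that hostname back and confirm it returns the same IP. Below is a minimal sketch of that idea. The function name `is_verified_crawler`, the `TRUSTED_SUFFIXES` allow-list, and the injectable lookup parameters are all assumptions made for illustration; the default lookups handle IPv4 only.

```python
import socket

# Hypothetical allow-list: hostname suffixes of crawlers you choose to trust.
# Adjust it to match the documentation of the search engines you care about.
TRUSTED_SUFFIXES = (".googlebot.com", ".google.com", ".search.msn.com")


def is_verified_crawler(ip, reverse_lookup=None, forward_lookup=None,
                        suffixes=TRUSTED_SUFFIXES):
    """Return True only if `ip` passes forward-confirmed reverse DNS."""
    # Default to real DNS lookups; tests can inject fakes instead.
    reverse_lookup = reverse_lookup or (lambda addr: socket.gethostbyaddr(addr)[0])
    forward_lookup = forward_lookup or (lambda host: socket.gethostbyname_ex(host)[2])

    # Step 1: reverse lookup (IP -> hostname).
    try:
        host = reverse_lookup(ip).rstrip(".").lower()
    except OSError:
        return False

    # Step 2: the hostname must end in a trusted crawler domain.
    if not host.endswith(tuple(suffixes)):
        return False

    # Step 3: forward lookup (hostname -> IPs) must include the original IP,
    # otherwise anyone controlling their own PTR record could spoof step 1.
    try:
        return ip in forward_lookup(host)
    except OSError:
        return False
```

A request claiming to be a search engine crawler would only be let through if `is_verified_crawler(client_ip)` returns True. This does nothing against scrapers that never claim to be a known bot, which is the harder half of the problem.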

        • vinnymac
          23 days ago

          It certainly can be a cat-and-mouse game, but scraping at scale tends to stay ahead of security teams. Some examples of services built for it:

          https://brightdata.com/

          https://oxylabs.io/

          Requiring an account with strict access rules can curb the vast majority of scraping; then your only bad actors are the rich venture capitalists.
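As a rough illustration of what "strict access rules" per account could look like, here is a small token-bucket rate limiter keyed by account ID. This is only a sketch under assumptions: the class name `AccountRateLimiter`, the default limits, and the in-memory dictionary are hypothetical, and a real deployment would likely keep this state in a shared store.

```python
import time


class AccountRateLimiter:
    """Token bucket per account: each request spends one token,
    and tokens refill at a steady rate up to a fixed capacity."""

    def __init__(self, capacity=60, refill_per_sec=1.0, clock=time.monotonic):
        self.capacity = capacity              # maximum burst size (assumed value)
        self.refill_per_sec = refill_per_sec  # sustained request rate (assumed value)
        self.clock = clock                    # injectable clock for testing
        self._buckets = {}                    # account_id -> (tokens, last_seen)

    def allow(self, account_id):
        now = self.clock()
        # New accounts start with a full bucket.
        tokens, last = self._buckets.get(account_id, (self.capacity, now))
        # Refill based on elapsed time, capped at capacity.
        tokens = min(self.capacity, tokens + (now - last) * self.refill_per_sec)
        if tokens < 1:
            self._buckets[account_id] = (tokens, now)
            return False
        self._buckets[account_id] = (tokens - 1, now)
        return True
```

Each authenticated request would call `limiter.allow(account_id)` and be rejected (for example with HTTP 429) when it returns False. Limits like this raise the cost of scraping through accounts, though a well-funded scraper can still spread load across many accounts.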