• Cynicus RexOP
    arrow-up
    12
    arrow-down
    3
    ·
    2 months ago
    link
    fedilink

    #TL;DR:

    User-agent: GPTBot
    Disallow: /
    User-agent: ChatGPT-User
    Disallow: /
    User-agent: Google-Extended
    Disallow: /
    User-agent: PerplexityBot
    Disallow: /
    User-agent: Amazonbot
    Disallow: /
    User-agent: ClaudeBot
    Disallow: /
    User-agent: Omgilibot
    Disallow: /
    User-Agent: FacebookBot
    Disallow: /
    User-Agent: Applebot
    Disallow: /
    User-agent: anthropic-ai
    Disallow: /
    User-agent: Bytespider
    Disallow: /
    User-agent: Claude-Web
    Disallow: /
    User-agent: Diffbot
    Disallow: /
    User-agent: ImagesiftBot
    Disallow: /
    User-agent: Omgilibot
    Disallow: /
    User-agent: Omgili
    Disallow: /
    User-agent: YouBot
    Disallow: /
    
    • mox
      arrow-up
      7
      arrow-down
      0
      ·
      2 months ago
      link
      fedilink

      Of course, nothing stops a bot from picking a user agent field that exactly matches a web browser.

      • JackbyDevEnglish
        arrow-up
        4
        arrow-down
        1
        ·
        2 months ago
        link
        fedilink

        Nothing stops a bot from choosing to not read robots.txt

        • mox
          arrow-up
          2
          arrow-down
          0
          ·
          2 months ago
          edit-2
          2 months ago
          link
          fedilink

          Indeed, as has already been said repeatedly in other comments.