Reddit has a new AI training deal to sell user content::Reddit has reportedly made a deal with an unnamed AI company to allow access to its platform’s content for the purposes of AI model training.

  • LvxferreEnglish
    arrow-up
    9
    arrow-down
    0
    ·
    8 months ago
    edit-2
    8 months ago
    link
    fedilink

    For anyone looking for a gibberish generator to replace their Reddit content with, here’s one. This shit is like poison for those large models.

    For automatic edition I’m not sure on what people can use nowadays; back then just before the APIcalypse I’ve used power delete suite, I’m not sure if it still works and I’m not creating a Reddit account just to test it out.

    • greaprrEnglish
      arrow-up
      1
      arrow-down
      0
      ·
      8 months ago
      link
      fedilink

      Not that I’m against telling Reddit to fuck off in no uncertain terms, but won’t providing this kind of poisoning to AI training just make it more resilient to exactly this kind of thing?

      • LvxferreEnglish
        arrow-up
        1
        arrow-down
        0
        ·
        8 months ago
        edit-2
        8 months ago
        link
        fedilink

        I don’t think so. It’s really hard to sort the poison out of the data, unless you actually have enough reading comprehension to know that it’s gibberish - humans do, bots don’t. And even if they discard 80% of the poison, the 20% there are already screwing with the model.

        They could prevent you from editing your posts/comments, but that would cause an uproar.