A new report from plagiarism detector Copyleaks found that 60% of OpenAI’s GPT-3.5 outputs contained some form of plagiarism.

Why it matters: Content creators, from authors and songwriters to The New York Times, are arguing in court that generative AI trained on copyrighted material ends up spitting out exact copies.

  • Ghostalmedia · 8 months ago

    Eh, kinda. It’s not like a science paper is just going to be an equation and nothing else. An author’s synthesis of the results is always going to have unique language. And that is even more true for a social science paper.

    • Jojo · 8 months ago

      Are those “best matches” paper-sized, or snippet-sized?

      • FaceDeer · 8 months ago

        Article mentioned 400-word chunks, so much less than paper-sized.