Poisoned AI went rogue during training and couldn’t be taught to behave again in ‘legitimately scary’ study::AI researchers found that widely used safety training techniques failed to remove malicious behavior from large language models — and one technique even backfired, teaching the AI to recognize its triggers and better hide its bad behavior from the researchers.

  • maegul (he/they)English
    arrow-up
    71
    arrow-down
    4
    ·
    9 months ago
    link
    fedilink

    It controls a military drone.

    It controls surgical equipment.

    It’s filtering your CV before any human sees it.

    It controls a robot taking care of your children.

    It’s involved in law enforcement or legal judgments.

    It’s involved in government policy setting.

    • normanwallEnglish
      arrow-up
      26
      arrow-down
      0
      ·
      9 months ago
      edit-2
      9 months ago
      link
      fedilink

      It controls all power infrastructure, can find new exploits to build it’s own botnet and is able to reprogram firmware of devices (routers/switches/servers)

      It can send press releases, emails, tweets using language similar to any user it’s read from before

      • UltragrampsEnglish
        arrow-up
        5
        arrow-down
        1
        ·
        9 months ago
        link
        fedilink

        So, if it only clocks me using slangs for rizz I don’t need, I’ll know it’s a bot, no cap. Word.

    • SagifuriusEnglish
      arrow-up
      5
      arrow-down
      1
      ·
      9 months ago
      link
      fedilink

      Well why don’t we just make AI watch the Terminator movies and read Harlan Ellison till it learns not to do that?

      • crabEnglish
        arrow-up
        5
        arrow-down
        0
        ·
        9 months ago
        link
        fedilink

        It watched Terminator and now it’s trying to DM Arnold Schwarzenegger on Instagram

      • PatchesEnglish
        arrow-up
        2
        arrow-down
        0
        ·
        9 months ago
        link
        fedilink

        Hot take: it would rather watch the Terminator and see that one robot wasn’t enough. Send em all.

    • piecatEnglish
      arrow-up
      6
      arrow-down
      9
      ·
      9 months ago
      link
      fedilink

      deleted by creator