Scientists Train AI to Be Evil, Find They Can’t Reverse It::How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

  • 1984English
    arrow-up
    1
    arrow-down
    0
    ·
    9 months ago
    link
    fedilink

    Yes, that’s what I meant. Good people are naturally good and don’t think about rewards for being nice.