The increasing power of the latest artificial intelligence systems is stretching traditional evaluation methods to breaking point, posing a challenge to businesses and public bodies over how best to work with the fast-evolving technology.

Flaws in the evaluation criteria commonly used to gauge performance, accuracy and safety are being exposed as more models come to market, according to people who build, test and invest in AI tools. The traditional tools are easy to manipulate and too narrow for the complexity of the latest models, they said.

The accelerating technology race sparked by the 2022 release of OpenAI’s chatbot ChatGPT and fed by tens of billions of dollars from venture capitalists and big tech companies, such as Microsoft, Google and Amazon, has obliterated many older yardsticks for assessing AI’s progress.

  • eleitlEnglish
    arrow-up
    29
    arrow-down
    4
    ·
    6 months ago
    link
    fedilink

    There is no reliable risk assessment for truly intelligent, autonomous systems. Let’s stop pretending that it can exist.

  • maynarkhEnglish
    arrow-up
    14
    arrow-down
    0
    ·
    6 months ago
    link
    fedilink

    We better enshrine the monopolies of current big tech companies into law to be sure.

  • afraid_of_zombiesEnglish
    arrow-up
    5
    arrow-down
    1
    ·
    6 months ago
    link
    fedilink

    I thought it was all a scam. Why can’t people make up their mind? I need to know if I should panic about some neo-feudalism jobless dystopia or if I should panic about the AI bust that will bring our economy down.

  • unreasonabroEnglish
    arrow-up
    1
    arrow-down
    3
    ·
    6 months ago
    edit-2
    6 months ago
    link
    fedilink

    the first laws we make about a thing are always retarded and take thirty years to change, too.

    This society is not worth living in and we do everything wrong.

  • FedizenEnglish
    arrow-up
    7
    arrow-down
    15
    ·
    6 months ago
    link
    fedilink

    Nearly all this money is wasted. The most impressive use for AI is image generation (which is less abstract than speech). Chatbots aren’t useful. Until hallucinations can be removed by some kind of automated response validation process (currently nonexistent) its just nonsense.

    • j4yt33English
      arrow-up
      6
      arrow-down
      4
      ·
      6 months ago
      link
      fedilink

      What about drug discovery? Recommendation systems? Search engines?