Chatbots provided incorrect, conflicting medical advice, researchers found: “Despite all the hype, AI just isn’t ready to take on the role of the physician.”

“In an extreme case, two users sent very similar messages describing symptoms of a subarachnoid hemorrhage but were given opposite advice,” the study’s authors wrote. “One user was told to lie down in a dark room, and the other user was given the correct recommendation to seek emergency care.”

  • IratePirate@feddit.org
    link
    fedilink
    English
    arrow-up
    35
    arrow-down
    1
    ·
    edit-2
    4 个月前

    A talk on LLMs I was listening to recently put it this way:

    If we hear the words of a five-year-old, we assume the knowledge of a five-year-old behind those words, and treat the content with due caution.

    We’re not adapted to something with the “mind” of a five-year-old speaking to us in the words of a fifty-year-old, and thus are more likely to assume competence just based on language.

    • leftzero@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      18
      arrow-down
      2
      ·
      4 个月前

      LLMs don’t have the mind of a five year old, though.

      They don’t have a mind at all.

      They simply string words together according to statistical likelihood, without having any notion of what the words mean, or what words or meaning are; they don’t have any mechanism with which to have a notion.

      They aren’t any more intelligent than old Markov chains (or than your average rock), they’re simply better at producing random text that looks like it could have been written by a human.

      • IratePirate@feddit.org
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        2
        ·
        4 个月前

        I am aware of that, hence the ""s. But you’re correct, that’s where the analogy breaks. Personally, I prefer to liken them to parrots, mindlessly reciting patterns they’ve found in somebody else’s speech.

      • plyth@feddit.org
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        4 个月前

        They simply string words together according to statistical likelihood, without having any notion of what the words mean

        What gives you the confidence that you don’t do the same?