• chaogomu@lemmy.world
        1 month ago

        Someone posted yesterday with a question asked to AI.

        What weighs more, 20 pounds of bricks or 20 feathers?

The useless chatbot will always answer with “they both weigh 20 pounds” because that’s what the training data always says when asked about bricks and feathers.

              • chaogomu@lemmy.world
                1 month ago

                I’ll have to find the post, but you did it in two steps and changed both the unit of mass and the object.

                The post, which is extremely hard to find with the latest slop release from Nvidia, asked the chatbot to consider the exact wording, without babying it into the correct answer. This is all because close variations of the phrase “X pounds of bricks and X pounds of feathers weigh the exact same” have been used in various textbooks and such for at least the last hundred years or so.

                That means that the chatbot has seen that exact combo of words, in roughly that order, quite a bit more than your use of “100 kilograms of rice”. At least in English.

                You can baby it through when the training data is sparse, but not when the same phrase shows up hundreds of times in the training data.

                  • chaogomu@lemmy.world
                    1 month ago

                    Yup, it’s actually an interesting demonstration of the power of training data on a chatbot. After all, they’re just feeding it back to you.