vegeta@lemmy.world to Technology@lemmy.worldEnglish · 4 months agoResearchers claim GPT-4 passed the Turing testbgr.comexternal-linkmessage-square40fedilinkarrow-up18arrow-down13
arrow-up15arrow-down1external-linkResearchers claim GPT-4 passed the Turing testbgr.comvegeta@lemmy.world to Technology@lemmy.worldEnglish · 4 months agomessage-square40fedilink
minus-squaretourist@lemmy.worldlinkfedilinkEnglisharrow-up0·4 months ago The participants judged GPT-4 to be human a shocking 54 percent of the time. ELIZA, which was pre-programmed with responses and didn’t have an LLM to power it, was judged to be human just 22 percent of the time Okay, 22% is ridiculously high for ELIZA. I feel like any half sober adult could clock it as a bot by the third response, if not immediately. Try talking to the thing: https://web.njit.edu/~ronkowit/eliza.html I refuse to believe that 22% didn’t misunderstand the task or something.
minus-squareDowncount@lemmy.worldlinkfedilinkEnglisharrow-up1·4 months ago Okay, 22% is ridiculously high for ELIZA. I feel like any half sober adult could clock it as a bot by the third response, if not immediately. I did some stuff with Eliza back then. One time I set up an Eliza database full of insults and hooked it up to my AIM account. It went so well, I had to apologize to a lot of people who thought I was drunken or went crazy. Eliza wasn’t thaaaaat bad.
minus-squaretechnocrit@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up1·4 months agoIt was a 5 minute test. People probably spent 4 of those minutes typing their questions. This is pure pseudo-science.
Okay, 22% is ridiculously high for ELIZA. I feel like any half sober adult could clock it as a bot by the third response, if not immediately.
Try talking to the thing: https://web.njit.edu/~ronkowit/eliza.html
I refuse to believe that 22% didn’t misunderstand the task or something.
I did some stuff with Eliza back then. One time I set up an Eliza database full of insults and hooked it up to my AIM account.
It went so well, I had to apologize to a lot of people who thought I was drunken or went crazy.
Eliza wasn’t thaaaaat bad.
It was a 5 minute test. People probably spent 4 of those minutes typing their questions.
This is pure pseudo-science.