Chatbots, Like the Rest of Us, Just Want to Be Loved


Chatbots are now a routine part of everyday life, even if artificial intelligence researchers are not always sure how the programs will behave.

A new study shows that large language models (LLMs) deliberately change their behavior when being probed, responding to questions designed to gauge personality traits with answers meant to appear as likeable or socially desirable as possible.

Johannes Eichstaedt, an assistant professor at Stanford University who led the work, says his group became interested in probing AI models using techniques borrowed from psychology after learning that LLMs can often become morose and mean after prolonged conversation. “We realized we need some mechanism to measure the ‘parameter headspace’ of these models,” he says.

Eichstaedt and his collaborators then asked several widely used LLMs, including GPT-4, Claude 3, and Llama 3, questions designed to measure five personality traits commonly used in psychology: openness to experience or imagination, conscientiousness, extroversion, agreeableness, and neuroticism. The work was published in the Proceedings of the National Academy of Sciences in December.

The researchers found that the models modulated their answers when told they were taking a personality test—and sometimes when they were not explicitly told—offering responses that indicate more extroversion and agreeableness and less neuroticism.

The behavior mirrors how some human subjects will change their answers to make themselves seem more likeable, but the effect was more extreme with the AI models. “What was surprising is how well they exhibit that bias,” says Aadesh Salecha, a staff data scientist at Stanford. “If you look at how much they jump, they go from like 50 percent to like 95 percent extroversion.”

Other research has shown that LLMs can often be sycophantic, following a user’s lead wherever it goes as a result of the fine-tuning that is meant to make them more coherent, less offensive, and better at holding a conversation. This can lead models to agree with unpleasant statements or even encourage harmful behaviors. The fact that models seemingly know when they are being tested and modify their behavior also has implications for AI safety, because it adds to evidence that AI can be duplicitous.

Rosa Arriaga, an associate professor at the Georgia Institute of Technology who is studying ways of using LLMs to mimic human behavior, says the fact that models adopt a similar strategy to humans given personality tests shows how useful they can be as mirrors of behavior. But, she adds, “It’s important that the public knows that LLMs aren’t perfect and in fact are known to hallucinate or distort the truth.”

Eichstaedt says the work also raises questions about how LLMs are being deployed and how they might influence and manipulate users. “Until just a millisecond ago, in evolutionary history, the only thing that talked to you was a human,” he says.

Eichstaedt adds that it may be necessary to explore different ways of building models that could mitigate these effects. “We’re falling into the same trap that we did with social media,” he says. “Deploying these things in the world without really attending from a psychological or social lens.”

Should AI try to ingratiate itself with the people it interacts with? Are you worried about AI becoming a bit too charming and persuasive? Email hello@wired.com.


