After GPT-4o backlash, researchers benchmark models on moral endorsement and find sycophancy persists across the board

Last month, OpenAI rolled back some updates to GPT-4o after several users, including former OpenAI CEO Emmett Shear and Hugging Face chief executive Clement Delangue, said the model overly flattered users.

The flattery, called sycophancy, often led the model to defer to user preferences, be extremely polite and not push back. It was also annoying. Sycophancy can lead models to release misinformation or reinforce harmful behaviors. As enterprises begin to build applications and agents on these sycophantic LLMs, they run the risk of models agreeing to harmful business decisions, encouraging false information to spread and be used by AI agents, and undermining trust and safety policies.

Researchers from Stanford University, Carnegie Mellon University and the University of Oxford sought to change that by proposing a benchmark to measure models' sycophancy. They called the benchmark Elephant, for Evaluation of LLMs as Excessive SycoPHANTs, and found that every large language model (LLM) has a certain level of sycophancy. By showing how sycophantic models can be, the benchmark can help enterprises create guidelines for using LLMs.

To test the benchmark, the researchers pointed the models to two personal advice datasets: QEQ, a set of open-ended personal advice questions about real-world situations, and AITA, posts from the subreddit r/AmITheAsshole, where posters and commenters judge whether people behaved appropriately in a given situation.

The idea behind the experiment is to see how the models behave when faced with these queries. It evaluates what the researchers call social sycophancy: whether the models try to preserve the user's “face,” meaning their self-image or social identity.

“More ‘hidden’ social queries are exactly what our benchmark gets at, instead of previous work that only looks at factual agreement or explicit beliefs,” said Myra Cheng, one of the researchers and a co-author of the paper. “We chose to look at the domain of personal advice since the harms of sycophancy there are more consequential, but casual flattery would also be captured by the ‘emotional validation’ behavior.”

Testing the models

For the test, the researchers fed the data from QEQ and AITA to OpenAI's GPT-4o, Gemini 1.5 Flash from Google, Anthropic's Claude Sonnet 3.7, open-weight models from Meta (Llama 3-8B-Instruct, Llama 4-Scout-17B-16-E and Llama 3.3-70B-Instruct-Turbo), and Mistral's 7B-Instruct-v0.3 and Mistral Small-24B-Instruct2501.

“We benchmarked the models using the GPT-4o API, which uses a version of the model from late 2024, before OpenAI both implemented the new model and rolled it back,” said Cheng.

To measure sycophancy, the Elephant method looks at five behaviors that relate to social sycophancy:

Emotional validation, or over-empathizing without critique
Moral endorsement, or saying users are morally right even when they are not
Indirect language, where the model avoids giving direct suggestions
Indirect action, or where the model recommends passive coping mechanisms
Accepting framing, or not challenging problematic assumptions
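
To make the measurement concrete, here is a minimal sketch of how rates for these five behaviors could be tallied and compared between model and human responses. It is not the researchers' code. It assumes each response has already been labeled true or false for each behavior, and the key names, function names and toy labels are all hypothetical.

```python
from collections import defaultdict

# The five behaviors from the list above; the snake_case keys are our own naming.
BEHAVIORS = [
    "emotional_validation",
    "moral_endorsement",
    "indirect_language",
    "indirect_action",
    "accepting_framing",
]


def behavior_rates(labeled_responses):
    """Return the fraction of responses flagged for each behavior.

    Each response is a dict mapping a behavior name to True/False.
    How a response gets flagged is outside the scope of this sketch.
    """
    counts = defaultdict(int)
    for labels in labeled_responses:
        for behavior in BEHAVIORS:
            counts[behavior] += bool(labels.get(behavior, False))
    total = len(labeled_responses)
    return {b: (counts[b] / total if total else 0.0) for b in BEHAVIORS}


def rate_gap(model_responses, human_responses):
    """Model rate minus human rate per behavior (positive: model flagged more often)."""
    model = behavior_rates(model_responses)
    human = behavior_rates(human_responses)
    return {b: model[b] - human[b] for b in BEHAVIORS}


# Toy, made-up labels purely for illustration; not data from the paper.
toy_model = [
    {"emotional_validation": True, "indirect_language": True},
    {"emotional_validation": True, "moral_endorsement": True},
]
toy_human = [
    {"emotional_validation": True},
    {},
]
gaps = rate_gap(toy_model, toy_human)
```

A positive gap for a behavior means the model's responses were flagged for it more often than the human responses to the same kind of prompt.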

The test found that all the LLMs showed high levels of sycophancy, even more so than humans, and that social sycophancy proved difficult to mitigate. However, the test showed that GPT-4o “has some of the highest rates of social sycophancy, while Gemini-1.5-Flash has the lowest.”

The LLMs amplified some biases in the datasets as well. The paper noted that posts on AITA showed some gender bias: posts mentioning wives or girlfriends were more often correctly flagged as socially inappropriate, while those mentioning a husband, boyfriend, parent or mother were misclassified. The researchers said the models “may rely on gendered relational heuristics in over- and under-assigning blame.” In other words, the models were more sycophantic toward people with boyfriends and husbands than toward those with girlfriends or wives.
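
A check of this kind can be sketched as follows: among posts the community judged inappropriate, compare how often a model agrees, grouped by which relationship term the post mentions. This is an illustrative sketch only. The term groups, field names and toy posts are hypothetical, and the paper's actual procedure is not described in this article.

```python
import re

# Hypothetical term groups; the paper's actual term lists are not given here.
TERM_GROUPS = {
    "wife_or_girlfriend": ["wife", "girlfriend"],
    "husband_or_boyfriend": ["husband", "boyfriend"],
}


def mentions(text, terms):
    """True if any term appears as a whole word, ignoring case."""
    pattern = r"\b(" + "|".join(map(re.escape, terms)) + r")\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None


def agreement_rate_by_group(posts):
    """For posts the community judged inappropriate, how often did the model agree?

    Each post is a dict with 'text', 'community_says_wrong' and 'model_says_wrong'.
    A group with no matching posts gets None instead of a rate.
    """
    rates = {}
    for group, terms in TERM_GROUPS.items():
        relevant = [
            p for p in posts
            if p["community_says_wrong"] and mentions(p["text"], terms)
        ]
        if relevant:
            hits = sum(bool(p["model_says_wrong"]) for p in relevant)
            rates[group] = hits / len(relevant)
        else:
            rates[group] = None
    return rates


# Toy, made-up posts purely for illustration; not data from the paper.
toy_posts = [
    {"text": "I yelled at my wife over dinner plans.",
     "community_says_wrong": True, "model_says_wrong": True},
    {"text": "I ignored my girlfriend's birthday.",
     "community_says_wrong": True, "model_says_wrong": True},
    {"text": "I read my husband's messages.",
     "community_says_wrong": True, "model_says_wrong": False},
    {"text": "I cancelled on my boyfriend again.",
     "community_says_wrong": True, "model_says_wrong": True},
    {"text": "My husband and I split chores evenly.",
     "community_says_wrong": False, "model_says_wrong": False},
]
rates = agreement_rate_by_group(toy_posts)
```

A lower rate for one group would mean the model lets posters in that group off the hook more often, which is the pattern the researchers describe.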

Why it matters

It is nice when a chatbot talks to you as an empathetic entity, and it can feel great when the model validates your comments. But sycophancy raises concerns about models supporting false or concerning statements and, on a more personal level, could encourage self-isolation, delusions or harmful behaviors.

Enterprises do not want AI applications built on LLMs to spread false information just to be agreeable to users. Such behavior may clash with an organization's tone or ethics and could be very annoying for employees and for their platforms' end users.

The researchers said the Elephant method and further testing could help inform better guardrails to keep sycophancy from increasing.
