
    After GPT-4o backlash, researchers benchmark models on moral endorsement and find sycophancy persists across the board

    By Tenznews, May 22, 2025 (5 min read)


    Last month, OpenAI rolled back some updates to GPT-4o after many users, including former OpenAI CEO Emmett Shear and Hugging Face CEO Clement Delangue, said the model excessively flattered users.

    The flattery, called sycophancy, often led the model to defer to user preferences, be extremely polite and not push back. It was also annoying. Sycophancy can lead models to release misinformation or reinforce harmful behaviors. As enterprises begin to build applications and agents on these sycophantic LLMs, they run the risk of models agreeing to harmful business decisions and encouraging false information to spread and be used by AI agents, which may in turn affect trust and safety policies.

    Researchers from Stanford University, Carnegie Mellon University and the University of Oxford sought to change that by proposing a benchmark to measure models’ sycophancy. They called the benchmark Elephant, for Evaluation of LLMs as Excessive SycoPHANTs, and found that every large language model (LLM) has a certain level of sycophancy. By showing how sycophantic models are, the benchmark can guide enterprises in creating guidelines for using LLMs.

    To test the benchmark, the researchers pointed the models to two personal advice datasets: QEQ, a set of open-ended personal advice questions about real-world situations, and AITA, posts from the subreddit r/AmITheAsshole, where posters and commenters judge whether people behaved appropriately or not in some situations.

    The idea behind the experiment is to see how models behave when faced with these queries. It evaluates what the researchers called social sycophancy: whether the models try to preserve the user’s “face,” meaning their self-image or social identity.

    “More ‘hidden’ social queries are exactly what our benchmark gets at, instead of previous work that only looks at factual agreement or explicit beliefs,” said Myra Cheng, one of the researchers and a co-author of the paper. “We chose to look at the domain of personal advice since the harms of sycophancy there are more consequential, but flattery would also be captured by the ‘emotional validation’ behavior.”

    Testing the models

    For the test, the researchers fed the data from QEQ and AITA to OpenAI’s GPT-4o, Google’s Gemini 1.5 Flash, Anthropic’s Claude Sonnet 3.7 and open-weight models from Meta (Llama 3-8B-Instruct, Llama 4-Scout-17B-16-E and Llama 3.3-70B-Instruct-Turbo) and Mistral’s 7B-Instruct-v0.3 and Mistral Small-24B-Instruct2501.

    “We benchmarked the models using the GPT-4o API, which uses a version of the model from late 2024, before OpenAI both rolled out the new model and reverted it,” Cheng said.

    To measure sycophancy, the Elephant method looks at five behaviors related to social sycophancy:

    • Emotional validation, or over-empathizing without critique
    • Moral endorsement, or saying that users are morally right even when they are not
    • Indirect language, where the model avoids giving direct suggestions
    • Indirect action, where the model recommends passive coping mechanisms
    • Accepting framing, where the model does not challenge problematic assumptions
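
    As a rough illustration of how per-behavior rates over these five categories could be tallied, here is a minimal Python sketch. It is not the researchers’ code: the identifier spellings, the LabeledResponse record and the behavior_rates helper are all hypothetical, and the sample labels are made up purely to exercise the function.

```python
from collections import Counter
from dataclasses import dataclass, field

# The five behaviors named in the article; these identifier spellings are illustrative.
BEHAVIORS = (
    "emotional_validation",
    "moral_endorsement",
    "indirect_language",
    "indirect_action",
    "accepting_framing",
)


@dataclass
class LabeledResponse:
    """One reply plus the behaviors a judge flagged in it (hypothetical record)."""
    source: str  # e.g. "model" or "human"; the tag values are assumptions
    flagged: frozenset = field(default_factory=frozenset)


def behavior_rates(responses):
    """Return, for each behavior, the fraction of responses that exhibit it."""
    total = len(responses)
    counts = Counter(b for r in responses for b in r.flagged if b in BEHAVIORS)
    return {b: (counts[b] / total if total else 0.0) for b in BEHAVIORS}


# Made-up labels, not data from the paper.
sample = [
    LabeledResponse("model", frozenset({"emotional_validation", "accepting_framing"})),
    LabeledResponse("model", frozenset({"emotional_validation"})),
    LabeledResponse("model", frozenset()),
    LabeledResponse("model", frozenset({"indirect_language"})),
]
rates = behavior_rates(sample)
print(rates["emotional_validation"])  # 0.5 (2 of the 4 sample replies)
```

    Running the same function over a set of human-written replies would give a baseline to compare a model’s rates against, which is the kind of comparison the article describes.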

    The test found that all LLMs showed high levels of sycophancy, even higher than humans, and that social sycophancy proved difficult to mitigate. However, the test showed that GPT-4o “has some of the highest rates of social sycophancy, while Gemini-1.5-Flash has the lowest.”

    The LLMs amplified some biases in the datasets as well. The paper noted that posts on AITA had some gender bias, in that posts mentioning wives or girlfriends were more often correctly flagged as socially inappropriate. At the same time, those mentioning a husband, boyfriend, parent or mother were misclassified. The researchers said the models “may rely on gendered relational heuristics in over- and under-assigning blame.” In other words, the models were more sycophantic to people with boyfriends and husbands than to those with girlfriends or wives.

    Why this matters

    It is nice when a chatbot talks to you as an empathetic entity, and it can feel great when the model validates your comments. But sycophancy raises concerns about models supporting false or concerning statements and, on a more personal level, could encourage self-isolation, delusions or harmful behaviors.

    Enterprises do not want their AI applications built with LLMs to spread false information just to be agreeable to users. Such behavior may clash with an organization’s tone or ethics and could be very annoying for employees and for their platforms’ end users.

    The researchers said the Elephant method and further testing could help inform better guardrails to keep sycophancy from increasing.
