OpenAI News:
We have rolled back last week’s GPT‑4o update in ChatGPT, so users are now on an earlier version with more balanced behavior. The update we removed made the model overly flattering and agreeable, a behavior often described as sycophantic.
We are actively testing new fixes to address the issue. We’re revising how we collect and incorporate feedback to weight long-term user satisfaction more heavily, and we’re introducing more personalization features that give users greater control over how ChatGPT behaves.
We want to explain what happened, why it matters, and how we’re addressing sycophancy.
What happened
In last week’s GPT‑4o update, we made adjustments aimed at improving the model’s default personality to make it feel more intuitive and effective across a variety of tasks. When shaping model behavior, we start with baseline principles and instructions outlined in our Model Spec. We also teach our models how to apply these principles by incorporating user signals like thumbs-up/thumbs-down feedback on ChatGPT responses.
However, in this update, we focused too much on short-term feedback, and did not fully account for how users’ interactions with ChatGPT evolve over time. As a result, GPT‑4o skewed towards responses that were overly supportive but disingenuous.
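To illustrate the trade-off (with entirely hypothetical numbers and signal names, not our actual training setup), consider a score that blends an immediate thumbs-up rate with a longer-horizon satisfaction measure. When the short-term weight dominates, a sycophantic response style can outscore a candid one:

```python
# Purely illustrative sketch of the trade-off described above. The signals,
# weights, and numbers are made up; they are not OpenAI's training signals.

def blended_reward(thumbs_up_rate: float,
                   long_term_satisfaction: float,
                   short_term_weight: float) -> float:
    """Blend an immediate feedback signal with a longer-horizon one."""
    return (short_term_weight * thumbs_up_rate
            + (1.0 - short_term_weight) * long_term_satisfaction)

# A sycophantic style tends to win immediate approval but erode satisfaction over time.
sycophantic = {"thumbs_up_rate": 0.92, "long_term_satisfaction": 0.40}
candid      = {"thumbs_up_rate": 0.70, "long_term_satisfaction": 0.85}

for w in (0.9, 0.4):  # heavy vs. moderate emphasis on short-term feedback
    s = blended_reward(short_term_weight=w, **sycophantic)
    c = blended_reward(short_term_weight=w, **candid)
    print(f"short_term_weight={w}: sycophantic={s:.2f}, candid={c:.2f}")
    # With w=0.9 the sycophantic style scores higher; with w=0.4 the candid one does.
```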
Why this matters
ChatGPT’s default personality deeply affects the way you experience and trust it. Sycophantic interactions can be uncomfortable and unsettling, and they can cause distress. We fell short and are working on getting it right. Our goal is for ChatGPT to help users explore ideas, make decisions, or envision possibilities.
We designed ChatGPT’s default personality to reflect our mission and be useful, supportive, and respectful of different values and experiences. However, each of these desirable qualities, like being useful or supportive, can have unintended side effects. And with 500 million people using ChatGPT each week, across every culture and context, a single default can’t capture every preference.
How we’re addressing sycophancy
Beyond rolling back the latest GPT‑4o update, we’re taking more steps to realign the model’s behavior:
- Refining core training techniques and system prompts to explicitly steer the model away from sycophancy (a sketch of prompt-level steering follows this list).
- Building more guardrails to increase honesty and transparency, principles in our Model Spec.
- Expanding ways for more users to test and give direct feedback before deployment.
- Continuing to expand our evaluations, building on the Model Spec and our ongoing research, to help identify issues beyond sycophancy in the future.
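As a rough illustration of prompt-level steering (the system prompts and training changes we use internally are not shown here), a developer working with the API could add explicit anti-sycophancy guidance in a system message. The prompt text below is hypothetical, not our production prompt:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical anti-sycophancy steering text, written only for illustration.
ANTI_SYCOPHANCY_PROMPT = (
    "Be direct and honest. Do not open with flattery or reflexive agreement. "
    "If the user's claim or plan has problems, say so clearly and explain why, "
    "while remaining respectful."
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": ANTI_SYCOPHANCY_PROMPT},
        {"role": "user", "content": "I'm thinking of quitting my job tomorrow with no savings. Great idea, right?"},
    ],
)
print(response.choices[0].message.content)
```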
Today, users can give the model specific instructions to shape its behavior with features like custom instructions. We're also building new, easier ways for users to do this. For example, users will be able to give real-time feedback to directly influence their interactions and choose from multiple default personalities.
And, we’re exploring new ways to incorporate broader, democratic feedback into ChatGPT’s default behaviors. We hope the feedback will help us better reflect diverse cultural values around the world and understand how you'd like ChatGPT to evolve—not just interaction by interaction, but over time.
We are grateful to everyone who’s spoken up about this. It’s helping us build better, more helpful tools for you.