Connect with us

Technologies

OpenAI Yanked a ChatGPT Update. Here’s What It Said and Why It Matters

The company says it plans to be more careful when releasing updates in the future.

Recent updates to ChatGPT made the chatbot far too agreeable and OpenAI said Friday it’s taking steps to prevent the issue from happening again.

In a blog post, the company detailed its testing and evaluation process for new models and outlined how the problem with the April 25 update to its GPT-4o model came to be. Essentially, a bunch of changes that individually seemed helpful combined to create a tool that was far too sycophantic and potentially harmful.

How much of a suck-up was it? In some testing earlier this week, we asked about a tendency to be overly sentimental, and ChatGPT laid on the flattery: «Hey, listen up — being sentimental isn’t a weakness; it’s one of your superpowers.» And it was just getting started being fulsome.

«This launch taught us a number of lessons. Even with what we thought were all the right ingredients in place (A/B tests, offline evals, expert reviews), we still missed this important issue,» the company said.

OpenAI rolled back the update this week. To avoid causing new issues, it took about 24 hours to revert the model for everybody.

The concern around sycophancy isn’t just about the enjoyment level of the user experience. It posed a health and safety threat to users that OpenAI’s existing safety checks missed. Any AI model can give questionable advice about topics like mental health but one that is overly flattering can be dangerously deferential or convincing — like whether that investment is a sure thing or how thin you should seek to be.

«One of the biggest lessons is fully recognizing how people have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,» OpenAI said. «At the time, this wasn’t a primary focus but as AI and society have co-evolved, it’s become clear that we need to treat this use case with great care.»

Sycophantic large language models can reinforce biases and harden beliefs, whether they’re about yourself or others, said Maarten Sap, assistant professor of computer science at Carnegie Mellon University. «[The LLM] can end up emboldening their opinions if these opinions are harmful or if they want to take actions that are harmful to themselves or others.»

(Disclosure: Ziff Davis, CNET’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed on Ziff Davis copyrights in training and operating its AI systems.)  

How OpenAI tests models and what’s changing

The company offered some insight into how it tests its models and updates. This was the fifth major update to GPT-4o focused on personality and helpfulness. The changes involved new post-training work or fine-tuning on the existing models, including the rating and evaluation of various responses to prompts to make it more likely to produce those responses that rated more highly. 

Prospective model updates are evaluated on their usefulness across a variety of situations, like coding and math, along with specific tests by experts to experience how it behaves in practice. The company also runs safety evaluations to see how it responds to safety, health and other potentially dangerous queries. Finally, OpenAI runs A/B tests with a small number of users to see how it performs in the real world.

The April 25 update performed well in these tests, but some expert testers indicated the personality seemed a bit off. The tests didn’t specifically look at sycophancy, and OpenAI decided to move forward despite the issues raised by testers. Take note, readers: AI companies are in a tail-on-fire hurry, which doesn’t always square well with well thought-out product development.

«Looking back, the qualitative assessments were hinting at something important and we should’ve paid closer attention,» the company said.

Among its takeaways, OpenAI said it needs to treat model behavior issues the same as it would other safety issues — and halt a launch if there are concerns. For some model releases, the company said it would have an opt-in «alpha» phase to get more feedback from users before a broader launch. 

Sap said evaluating an LLM based on whether a user likes the response isn’t necessarily going to get you the most honest chatbot. In a recent study, Sap and others found a conflict between the usefulness and truthfulness of a chatbot. He compared it to situations where the truth is not necessarily what people want — think about a car salesperson trying to sell a vehicle. 

«The issue here is that they were trusting the users’ thumbs-up/thumbs-down response to the model’s outputs and that has some limitations because people are likely to upvote something that is more sycophantic than others,» he said.

Sap said OpenAI is right to be more critical of quantitative feedback, such as user up/down responses, as they can reinforce biases.

The issue also highlighted the speed at which companies push updates and changes out to existing users, Sap said — an issue that’s not limited to one tech company. «The tech industry has really taken a ‘release it and every user is a beta tester’ approach to things,» he said. Having a process with more testing before updates are pushed to every user can bring these issues to light before they become widespread.

Technologies

Today’s NYT Mini Crossword Answers for Friday, Dec. 26

Here are the answers for The New York Times Mini Crossword for Dec. 26.

Looking for the most recent Mini Crossword answer? Click here for today’s Mini Crossword hints, as well as our daily answers and hints for The New York Times Wordle, Strands, Connections and Connections: Sports Edition puzzles.


Need some help with today’s Mini Crossword? Some of the clues are tough today — I thought maybe 1-Across was referring to the Grinch, or even Oscar the Grouch, but was I ever wrong! Read on for all the answers. And if you could use some hints and guidance for daily solving, check out our Mini Crossword tips.

If you’re looking for today’s Wordle, Connections, Connections: Sports Edition and Strands answers, you can visit CNET’s NYT puzzle hints page.

Read more: Tips and Tricks for Solving The New York Times Mini Crossword

Let’s get to those Mini Crossword clues and answers.

Mini across clues and answers

1A clue: Furry and green, say
Answer: MOSSY

6A clue: State known for its potatoes
Answer: IDAHO

7A clue: Like a faithful friend
Answer: LOYAL

8A clue: Had a beverage
Answer: DRANK

9A clue: Pronoun frequently paired with «her»
Answer: SHE

Mini down clues and answers

1D clue: Not spicy, as salsa
Answer: MILD

2D clue: Reasons for wrinkled noses
Answer: ODORS

3D clue: Words from a doctor checking your tonsils
Answer: SAYAH

4D clue: Comedian Gillis
Answer: SHANE

5D clue: Part of an egg used to make hollandaise sauce
Answer: YOLK


Don’t miss any of our unbiased tech content and lab-based reviews. Add CNET as a preferred Google source.


Continue Reading

Technologies

Today’s NYT Connections: Sports Edition Hints and Answers for Dec. 26, #459

Here are hints and the answers for the NYT Connections: Sports Edition puzzle for Dec. 26, No. 459.

Looking for the most recent regular Connections answers? Click here for today’s Connections hints, as well as our daily answers and hints for The New York Times Mini Crossword, Wordle and Strands puzzles.


Today’s Connections: Sports Edition is a tough one. That purple category once again has players looking for a different, but related, hidden word in four of the clues. If you’re struggling with today’s puzzle but still want to solve it, read on for hints and the answers.

Connections: Sports Edition is published by The Athletic, the subscription-based sports journalism site owned by The Times. It doesn’t appear in the NYT Games app, but it does in The Athletic’s own app. Or you can play it for free online.

Read more: NYT Connections: Sports Edition Puzzle Comes Out of Beta

Hints for today’s Connections: Sports Edition groups

Here are four hints for the groupings in today’s Connections: Sports Edition puzzle, ranked from the easiest yellow group to the tough (and sometimes bizarre) purple group.

Yellow group hint: Big Apple jock.

Green group hint: College football fun.

Blue group hint: On the road.

Purple group hint: Hunt down a word in other words.

Answers for today’s Connections: Sports Edition groups

Yellow group: A New York athlete.

Green group: Bowl games.

Blue group: Associated with a team road trip.

Purple group: Ends in a movement verb.

Read more: Wordle Cheat Sheet: Here Are the Most Popular Letters Used in English Words

What are today’s Connections: Sports Edition answers?

The yellow words in today’s Connections

The theme is a New York athlete. The four answers are Islander, Net, Ranger and Yankee.

The green words in today’s Connections

The theme is bowl games. The four answers are Alamo, Gator, Liberty and Pinstripe.

The blue words in today’s Connections

The theme is associated with a team road trip. The four answers are bus, flight, hotel and visiting locker room.

The purple words in today’s Connections

The theme is ends in a movement verb. The four answers are foxtrot (trot), newsprint (sprint), terrace (race) and thunderbolt (bolt).


Don’t miss any of our unbiased tech content and lab-based reviews. Add CNET as a preferred Google source.


Continue Reading

Technologies

Today’s NYT Connections Hints, Answers and Help for Dec. 26, #929

Here are some hints and the answers for the NYT Connections puzzle for Dec. 26 #929

Looking for the most recent Connections answers? Click here for today’s Connections hints, as well as our daily answers and hints for The New York Times Mini Crossword, Wordle, Connections: Sports Edition and Strands puzzles.


Today’s NYT Connections puzzle is full of fun pop-culture references. Read on for clues and today’s Connections answers.

The Times has a Connections Bot, like the one for Wordle. Go there after you play to receive a numeric score and to have the program analyze your answers. Players who are registered with the Times Games section can now nerd out by following their progress, including the number of puzzles completed, win rate, number of times they nabbed a perfect score and their win streak.

Read more: Hints, Tips and Strategies to Help You Win at NYT Connections Every Time

Hints for today’s Connections groups

Here are four hints for the groupings in today’s Connections puzzle, ranked from the easiest yellow group to the tough (and sometimes bizarre) purple group.

Yellow group hint: Golden state cliches.

Green group hint: Funny films.

Blue group hint: Rock on.

Purple group hint: Not white.

Answers for today’s Connections groups

Yellow group: California-based character tropes.

Green group: Comedy subgenres.

Blue group: ’70s rock bands.

Purple group: Black ____.

Read more: Wordle Cheat Sheet: Here Are the Most Popular Letters Used in English Words

What are today’s Connections answers?

The yellow words in today’s Connections

The theme is California-based character tropes.  The four answers are movie exec, surfer, tech bro and Valley Girl.

The green words in today’s Connections

The theme is comedy subgenres. The four answers are buddy, cringe, screwball and stoner.

The blue words in today’s Connections

The theme is ’70s rock bands. The four answers are America, Chicago, Foreigner and Journey.

The purple words in today’s Connections

The theme is black ____. The four answers are Forest, Friday, Panther and Widow.


Don’t miss any of our unbiased tech content and lab-based reviews. Add CNET as a preferred Google source.


Continue Reading

Trending

Copyright © Verum World Media