Connect with us

Technologies

AI Is Bad at Sudoku. It’s Even Worse at Showing Its Work

Researchers did more than ask chatbots to play games. They tested whether AI models could describe their thinking. The results were troubling.

Chatbots are genuinely impressive when you watch them do things they’re good at, like writing a basic email or creating weird, futuristic-looking images. But ask generative AI to solve one of those puzzles in the back of a newspaper, and things can quickly go off the rails.

That’s what researchers at the University of Colorado at Boulder found when they challenged large language models to solve sudoku. And not even the standard 9×9 puzzles. An easier 6×6 puzzle was often beyond the capabilities of an LLM without outside help (in this case, specific puzzle-solving tools).

A more important finding came when the models were asked to show their work. For the most part, they couldn’t. Sometimes they lied. Sometimes they explained things in ways that made no sense. Sometimes they hallucinated and started talking about the weather.

If gen AI tools can’t explain their decisions accurately or transparently, that should cause us to be cautious as we give these things more control over our lives and decisions, said Ashutosh Trivedi, a computer science professor at the University of Colorado at Boulder and one of the authors of the paper published in July in the Findings of the Association for Computational Linguistics.

«We would really like those explanations to be transparent and be reflective of why AI made that decision, and not AI trying to manipulate the human by providing an explanation that a human might like,» Trivedi said.


Don’t miss any of our unbiased tech content and lab-based reviews. Add CNET as a preferred Google source.


The paper is part of a growing body of research into the behavior of large language models. Other recent studies have found, for example, that models hallucinate in part because their training procedures incentivize them to produce results a user will like, rather than what is accurate, or that people who use LLMs to help them write essays are less likely to remember what they wrote. As gen AI becomes more and more a part of our daily lives, the implications of how this technology works and how we behave when using it become hugely important.

When you make a decision, you can try to justify it, or at least explain how you arrived at it. An AI model may not be able to accurately or transparently do the same. Would you trust it?

Why LLMs struggle with sudoku

We’ve seen AI models fail at basic games and puzzles before. OpenAI’s ChatGPT (among others) has been totally crushed at chess by the computer opponent in a 1979 Atari game. A recent research paper from Apple found that models can struggle with other puzzles, like the Tower of Hanoi.

It has to do with the way LLMs work and fill in gaps in information. These models try to complete those gaps based on what happens in similar cases in their training data or other things they’ve seen in the past. With a sudoku, the question is one of logic. The AI might try to fill each gap in order, based on what seems like a reasonable answer, but to solve it properly, it instead has to look at the entire picture and find a logical order that changes from puzzle to puzzle. 

Read more: 29 Ways You Can Make Gen AI Work for You, According to Our Experts

Chatbots are bad at chess for a similar reason. They find logical next moves but don’t necessarily think three, four or five moves ahead — the fundamental skill needed to play chess well. Chatbots also sometimes tend to move chess pieces in ways that don’t really follow the rules or put pieces in meaningless jeopardy. 

You might expect LLMs to be able to solve sudoku because they’re computers and the puzzle consists of numbers, but the puzzles themselves are not really mathematical; they’re symbolic. «Sudoku is famous for being a puzzle with numbers that could be done with anything that is not numbers,» said Fabio Somenzi, a professor at CU and one of the research paper’s authors.

I used a sample prompt from the researchers’ paper and gave it to ChatGPT. The tool showed its work, and repeatedly told me it had the answer before showing a puzzle that didn’t work, then going back and correcting it. It was like the bot was turning in a presentation that kept getting last-second edits: This is the final answer. No, actually, never mind, this is the final answer. It got the answer eventually, through trial and error. But trial and error isn’t a practical way for a person to solve a sudoku in the newspaper. That’s way too much erasing and ruins the fun.

AI struggles to show its work

The Colorado researchers didn’t just want to see if the bots could solve puzzles. They asked for explanations of how the bots worked through them. Things did not go well.

Testing OpenAI’s o1-preview reasoning model, the researchers saw that the explanations — even for correctly solved puzzles — didn’t accurately explain or justify their moves and got basic terms wrong. 

«One thing they’re good at is providing explanations that seem reasonable,» said Maria Pacheco, an assistant professor of computer science at CU. «They align to humans, so they learn to speak like we like it, but whether they’re faithful to what the actual steps need to be to solve the thing is where we’re struggling a little bit.»

Sometimes, the explanations were completely irrelevant. Since the paper’s work was finished, the researchers have continued to test new models released. Somenzi said that when he and Trivedi were running OpenAI’s o4 reasoning model through the same tests, at one point, it seemed to give up entirely. 

«The next question that we asked, the answer was the weather forecast for Denver,» he said.

(Disclosure: Ziff Davis, CNET’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)

Explaining yourself is an important skill

When you solve a puzzle, you’re almost certainly able to walk someone else through your thinking. The fact that these LLMs failed so spectacularly at that basic job isn’t a trivial problem. With AI companies constantly talking about «AI agents» that can take actions on your behalf, being able to explain yourself is essential.

Consider the types of jobs being given to AI now, or planned for in the near future: driving, doing taxes, deciding business strategies and translating important documents. Imagine what would happen if you, a person, did one of those things and something went wrong.

«When humans have to put their face in front of their decisions, they better be able to explain what led to that decision,» Somenzi said.

It isn’t just a matter of getting a reasonable-sounding answer. It needs to be accurate. One day, an AI’s explanation of itself might have to hold up in court, but how can its testimony be taken seriously if it’s known to lie? You wouldn’t trust a person who failed to explain themselves, and you also wouldn’t trust someone you found was saying what you wanted to hear instead of the truth. 

«Having an explanation is very close to manipulation if it is done for the wrong reason,» Trivedi said. «We have to be very careful with respect to the transparency of these explanations.»

Technologies

Today’s NYT Mini Crossword Answers for Friday, Sept. 19

Here are the answers for The New York Times Mini Crossword for Sept. 19.

Looking for the most recent Mini Crossword answer? Click here for today’s Mini Crossword hints, as well as our daily answers and hints for The New York Times Wordle, Strands, Connections and Connections: Sports Edition puzzles.


I didn’t get off to a great start with today’s Mini Crossword, completely blanking on 1-Across. Thankfully, the other clues were easy, and that answer filled itself in. Need some help? Read on. And if you could use some hints and guidance for daily solving, check out our Mini Crossword tips.

If you’re looking for today’s Wordle, Connections, Connections: Sports Edition and Strands answers, you can visit CNET’s NYT puzzle hints page.

Read more: Tips and Tricks for Solving The New York Times Mini Crossword

Let’s get to those Mini Crossword clues and answers.

Mini across clues and answers

1A clue: Cancel on plans at the last moment
Answer: FLAKE

6A clue: Shade of light purple
Answer: LILAC

7A clue: ___ acid (protein builder)
Answer: AMINO

8A clue: Sarcastic «Yeah, sure»
Answer: IBET

9A clue: Sardonic boss on «Parks and Recreation»
Answer: RON

Mini down clues and answers

1D clue: Stylish panache
Answer: FLAIR

2D clue: Party game that tests how low you can go
Answer: LIMBO

3D clue: Visitor from outer space
Answer: ALIEN

4D clue: Philosopher who posed the question «What can I know?»
Answer: KANT

5D clue: Environmentally friendly prefix
Answer: ECO

Continue Reading

Technologies

Your Old Android Isn’t Dead. These Tweaks Can Bring It Back to Life

Clear space, optimize your battery and update the basics. These quick changes can make an old Android phone feel snappier.

You don’t need the latest Android flagship to get good performance. 

Thanks to longer software support from brands like Google and Samsung, older models can still run smoothly, as long as you give them a little attention. Clearing out unused apps, updating your software and tweaking a few settings can breathe new life into a device that feels sluggish. These quick fixes can help your phone last longer and save you from spending on an early upgrade.

Before you start shopping for a replacement, try a few simple adjustments. You might be surprised by how much faster your phone feels once you free up space, optimize battery use and turn off background drains.

Whether you use a Samsung Galaxy, Motorola or OnePlus phone, chances are you can still improve battery life and overall speed without buying something new. Just remember that Android settings vary slightly from brand to brand, so the menus may look a little different depending on your phone.

Don’t miss any of CNET’s unbiased tech content and lab-based reviews. Add us as a preferred Google source on Chrome.

Settings to improve your battery life

Living with a phone that has poor battery life can be infuriating, but there are some steps you can take to maximize each charge right from the very beginning:

1. Turn off auto screen brightness or adaptive brightness and set the brightness level slider to under 50%

The brighter your screen, the more battery power it uses. 

To get to the setting, pull down the shortcut menu from the top of the screen and adjust the slider, if it’s there. Some phones may have a toggle for auto brightness in the shortcut panel; otherwise, you need to open the settings app and search for «brightness» to find the setting and turn it off.

2. Use Adaptive Battery and Battery Optimization

These features focus on learning how you use your phone, including which apps you use and when, and then optimizing the apps and the amount of battery they use. 

Some Android phones have a dedicated Battery section in the Settings app, while other phones (looking at you, Samsung) bury these settings. It’s a little different for each phone. I recommend opening your settings and searching for «battery» to find the right screen. Your phone may also have an adaptive charging setting that can monitor how quickly your phone battery charges overnight to preserve its health.

Why you should use dark mode more often

Another way to improve battery life while also helping save your eyes is to use Android’s dedicated dark mode. Any Android phone running Android 10 or newer will have a dedicated dark mode option. 

According to Google, dark mode not only reduces the strain that smartphone displays cause on our eyes but also improves battery life because it takes less power to display dark backgrounds on OLED displays (used in most flagship phones) than a white background. 

Depending on which version of Android your phone is running, and what company made your phone, you may have to dig around the settings app to find a dark mode. If your phone runs Android 10 or newer, you’ll be able to turn on system-wide dark mode. If it runs Android 9, don’t despair. Plenty of apps have their own dark mode option in the settings that you can use, whether or not you have Android 10. 

To turn it on dark mode, open the Settings app and search for Dark Mode, Dark Theme or even Night Mode (as Samsung likes to call it). I suggest using dark mode all the time, but if you’re not sure, you can always set dark mode to automatically turn on based on a schedule, say from 7 p.m. to 7 a.m. every day, or allow it to automatically switch based on your location at sunset and sunrise. 

Keep your home screen free of clutter

Planning to hit up the Google Play Store for a bunch of new Android apps? Be prepared for a lot of icon clutter on your home screen, which is where shortcuts land every time you install something.

If you don’t want that, there’s a simple way out of this: Long-press on an empty area of your home screen and tap Settings. Find the option labeled something along the lines of Add icon to Home Screen or Add new apps to Home Screen and turn it off. 

Presto! No more icons on the home screen when you install new apps. You can still add shortcuts by dragging an app’s icon out of the app drawer, but they won’t appear on your home screen unless you want them to.

Read more: Best Android Phones You Can Buy in 2024

Set up Do Not Disturb so that you can better focus

If your phone routinely spends the night on your nightstand, you probably don’t want it beeping or buzzing every time there’s a call, message or Facebook alert — especially when you’re trying to sleep. Android offers a Do Not Disturb mode that will keep the phone more or less silent during designated hours. On some phones, this is referred to as the Downtime setting or even Quiet Time.

Head to Settings > Sounds (or Notifications), then look for Do Not Disturb or a similar name. If you can’t find it, search for it using the built-in search feature in your settings.

Using the feature, you can set up a range of hours when you want to turn off the digital noise. Don’t worry, any notifications you get while Do Not Disturb is turned on will still be waiting for you when you wake up. Also, you can typically make an exception that allows repeat callers and favorite contacts’ calls to go through. Turn that on. If someone is calling you in an emergency, odds are they are going to keep trying.

Always be prepared in case you lose your phone or it’s stolen

Is there anything worse than a lost or stolen phone? Only the knowledge that you could have tracked it down if you had turned on Google’s Find My Device feature.

To prepare for a successful recovery, here’s what you need to do: Open the Settings app and then search for Find My Device. It’s usually in the Security section of the Settings app.

If you have a Samsung device, you can use Samsung’s Find My Mobile service, which is found in Settings > Biometrics and security > Find My Mobile

Once that’s enabled, you can head to android.com/find from any PC or mobile device and sign in to your account. Samsung users can visit findmymobile.samsung.com to find a lost phone. 

If you have trouble setting any of this up, be sure to read our complete guide to finding a lost Android phone.

Assuming your phone is on and online, you should be able to see its location on a map. From there, you can make it ring, lock it, set a lock screen note to tell whoever has it how to get it back to you, or, worst-case scenario, remotely wipe the whole thing.

And always keep your phone up to date

As obvious as it may seem, a simple software update could fix bugs and other issues slowing down your Android device. 

Before you download and install the latest software update, make sure your device is connected to Wi-Fi, or else this won’t work.

Now, open the Settings application and type in Update. You’ll then either see Software update or System update — choose either one. Then just download the software, wait for a few minutes and install it when it’s ready. Your Android device will reboot and install the latest software update available.

There’s a lot more to learn about a new phone. Here are the best ways to boost your cell signal, and here’s a flagship phone head-to-head comparison. Plus, check out CNET’s list of the best cases for your Samsung phone. More of an Apple fan? We have tips for boosting your iPhone’s performance, too.

Continue Reading

Technologies

Your Pixel 10 Might Have Issues With Older Wireless Chargers

You might want to try taking the case off your phone in order to successfully charge it.

When Google introduced the Pixel 10 lineup in August, it became one of the first major Android phones to receive the Qi 2 wireless charging standard, which Google calls Pixelsnap. However, users noticed issues with wireless charging on the Pixel 10  almost immediately after its release. 

Some people are having trouble charging their phone with the new Pixelsnap charger, and others are having issues with older wireless chargers, including Google’s own Pixel Stands. The bulk of the problems happen when a case is on the phone — whether it has the magnet array or not.

I own both the first and second generation Pixel Stands and both will charge my Pixel 10 Pro XL without an issue if there’s no case on it. However, when I add a case to my phone, the problems begin. 

I have three cases for my phone, the Mous Super Thin Clear Case, the Magnetic Slim Case Fit by Grecazo, and a no-name soft TPU case. If my phone has any of those cases on and I attempt to charge it while it’s vertical, it starts to charge and then stops after a second or two, and keeps doing that. 

I can fix this for the first-generation Pixel Stand by turning the phone horizontal, but it will still charge very slowly. I can’t seem to fix it at all for the Pixel Stand 2 — vertical, horizontal, it doesn’t charge. 

Not everyone has this issue

The problem doesn’t seem to be universal. CNET editor Patrick Holland said he had no issues charging the Pixel 10 Pro during his time with it. 

A Google spokesperson told me the Pixel 10 lineup is not optimized for older Qi wireless charging standards, but that doesn’t necessarily mean the phones won’t work with older wireless chargers. 

Qi 2 is backwards-compatible with older standards, but the phone’s height and charging coil placement on both the phone and the charger are still factors. If you’re having problems, you might see if removing the case helps.

The prospect of potentially needing to replace your older wireless chargers with newer ones isn’t ideal, especially if you shelled out $80 for one or both of Google’s own Pixel Stands. Still, if you want the best wireless charging speed for your brand new Pixel 10 phone, it won’t be with wireless chargers that only support older Qi standards.

Continue Reading

Trending

Copyright © Verum World Media