Technologies
AI Gets Smarter, Safer, More Visual With GPT-4 Update, OpenAI Says
If you subscribe to ChatGPT Plus, you can try it out now.
The hottest AI technology foundation got a big upgrade Tuesday with OpenAI’s GPT-4 release now available in the premium version of the ChatGPT chatbot.
GPT-4 can generate much longer strings of text and respond when people feed it images, and it’s designed to do a better job avoiding artificial intelligence pitfalls visible in the earlier GPT-3.5, OpenAI said Tuesday. For example, when taking bar exams that attorneys must pass to practice law, GPT-4 ranks in the top 10% of scores compared with the bottom 10% for GPT-3.5, the AI research company said.
GPT stands for Generative Pretrained Transformer, a reference to the fact that it can generate text on its own — now up to 25,000 words with GPT-4 — and that it uses an AI technology called transformers that Google pioneered. It’s a type of AI called a large language model, or LLM, that’s trained on vast swaths of data harvested from the internet, learning mathematically to spot patterns and reproduce styles. Human overseers rate results to steer GPT in the right direction, and GPT-4 has more of this feedback.
OpenAI has made GPT available to developers for years, but ChatGPT, which debuted in November, offered an easy interface ordinary folks can use. That yielded an explosion of interest, experimentation and worry about the downsides of the technology. It can do everything from generating programming code and answering exam questions to writing poetry and supplying basic facts. It’s remarkable if not always reliable.
ChatGPT is free, but it can falter when demand is high. In January, OpenAI began offering ChatGPT Plus for $20 per month with assured availability and, now, the GPT-4 foundation. Developers can sign up on a waiting list to get their own access to GPT-4.
GPT-4 advancements
«In a casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle. The difference comes out when the complexity of the task reaches a sufficient threshold,» OpenAI said. «GPT-4 is more reliable, creative and able to handle much more nuanced instructions than GPT-3.5.»
Another major advance in GPT-4 is the ability to accept input data that includes text and photos. OpenAI’s example is asking the chatbot to explain a joke showing a bulky decades-old computer cable plugged into a modern iPhone’s tiny Lightning port. This feature also helps GPT take tests that aren’t just textual, but it isn’t yet available in ChatGPT Plus.
Another is better performance avoiding AI problems like hallucinations — incorrectly fabricated responses, often offered with just as much seeming authority as answers the AI gets right. GPT-4 also is better at thwarting attempts to get it to say the wrong thing: «GPT-4 scores 40% higher than our latest GPT-3.5 on our internal adversarial factuality evaluations,» OpenAI said.
GPT-4 also adds new «steerability» options. Users of large language models today often must engage in elaborate «prompt engineering,» learning how to embed specific cues in their prompts to get the right sort of responses. GPT-4 adds a system command option that lets users set a specific tone or style, for example programming code or a Socratic tutor: «You are a tutor that always responds in the Socratic style. You never give the student the answer, but always try to ask just the right question to help them learn to think for themselves.»
«Stochastic parrots» and other problems
OpenAI acknowledges significant shortcomings that persist with GPT-4, though it also touts progress avoiding them.
«It can sometimes make simple reasoning errors … or be overly gullible in accepting obvious false statements from a user. And sometimes it can fail at hard problems the same way humans do, such as introducing security vulnerabilities into code it produces,» OpenAI said. In addition, «GPT-4 can also be confidently wrong in its predictions, not taking care to double-check work when it’s likely to make a mistake.»
Large language models can deliver impressive results, seeming to understand huge amounts of subject matter and to converse in human-sounding if somewhat stilted language. Fundamentally, though, LLM AIs don’t really know anything. They’re just able to string words together in statistically very refined ways.
This statistical but fundamentally somewhat hollow approach to knowledge led researchers, including former Google AI researchers Emily Bender and Timnit Gebru, to warn of the «dangers of stochastic parrots» that come with large language models. Language model AIs tend to encode biases, stereotypes and negative sentiment present in training data, and researchers and other people using these models tend «to mistake … performance gains for actual natural language understanding.»
OpenAI Chief Executive Sam Altman acknowledges problems, but he’s pleased overall with the progress shown with GPT-4. «It is more creative than previous models, it hallucinates significantly less, and it is less biased. It can pass a bar exam and score a 5 on several AP exams,» Altman tweeted Tuesday.
One worry about AI is that students will use it to cheat, for example when answering essay questions. It’s a real risk, though some educators actively embrace LLMs as a tool, like search engines and Wikipedia. Plagiarism detection companies are adapting to AI by training their own detection models. One such company, Crossplag, said Wednesday that after testing about 50 documents that GPT-4 generated, «our accuracy rate was above 98.5%.»
OpenAI, Microsoft and Nvidia partnership
OpenAI got a big boost when Microsoft said in February it’s using GPT technology in its Bing search engine, including a chat features similar to ChatGPT. On Tuesday, Microsoft said it’s using GPT-4 for the Bing work. Together, OpenAI and Microsoft pose a major search threat to Google, but Google has its own large language model technology too, including a chatbot called Bard that Google is testing privately.
Also on Tuesday, Google announced it’ll begin limited testing of its own AI technology to boost writing Gmail emails and Google Docs word processing documents. «With your collaborative AI partner you can continue to refine and edit, getting more suggestions as needed,» Google said.
That phrasing mirrors Microsoft’s «co-pilot» positioning of AI technology. Calling it an aid to human-led work is a common stance, given the problems of the technology and the necessity for careful human oversight.
Microsoft uses GPT technology both to evaluate the searches people type into Bing and, in some cases, to offer more elaborate, conversational responses. The results can be much more informative than those of earlier search engines, but the more conversational interface that can be invoked as an option has had problems that make it look unhinged.
To train GPT, OpenAI used Microsoft’s Azure cloud computing service, including thousands of Nvidia’s A100 graphics processing units, or GPUs, yoked together. Azure now can use Nvidia’s new H100 processors, which include specific circuitry to accelerate AI transformer calculations.
AI chatbots everywhere
Another large language model developer, Anthropic, also unveiled an AI chatbot called Claude on Tuesday. The company, which counts Google as an investor, opened a waiting list for Claude.
«Claude is capable of a wide variety of conversational and text processing tasks while maintaining a high degree of reliability and predictability,» Anthropic said in a blog post. «Claude can help with use cases including summarization, search, creative and collaborative writing, Q&A, coding and more.»
It’s one of a growing crowd. Chinese search and tech giant Baidu is working on a chatbot called Ernie Bot. Meta, parent of Facebook and Instagram, consolidated its AI operations into a bigger team and plans to build more generative AI into its products. Even Snapchat is getting in on the game with a GPT-based chatbot called My AI.
Expect more refinements in the future.
«We have had the initial training of GPT-4 done for quite awhile, but it’s taken us a long time and a lot of work to feel ready to release it,» Altman tweeted. «We hope you enjoy it and we really appreciate feedback on its shortcomings.»
Editors’ note: CNET is using an AI engine to create some personalfinance explainers that are edited and fact-checked by our editors. Formore, see this post.
Technologies
Today’s NYT Mini Crossword Answers for Wednesday, March 11
Here are the answers for The New York Times Mini Crossword for March 11.
Looking for the most recent Mini Crossword answer? Click here for today’s Mini Crossword hints, as well as our daily answers and hints for The New York Times Wordle, Strands, Connections and Connections: Sports Edition puzzles.
Need some help with today’s Mini Crossword? I thought it was a bit tricky. 1-Down is one of those old-fashioned comic-book sounds that I had to remember how to spell correctly. Read on for all the answers. And if you could use some hints and guidance for daily solving, check out our Mini Crossword tips.
If you’re looking for today’s Wordle, Connections, Connections: Sports Edition and Strands answers, you can visit CNET’s NYT puzzle hints page.
Read more: Tips and Tricks for Solving The New York Times Mini Crossword
Let’s get to those Mini Crossword clues and answers.
Mini across clues and answers
1A clue: Study of the human mind, informally
Answer: PSYCH
6A clue: Common fixture in a gym bathroom
Answer: SCALE
7A clue: Kinda boring
Answer: HOHUM
8A clue: Like a commenter without a username, for short
Answer: ANON
9A clue: «All good between us?»
Answer: WEOK
Mini down clues and answers
1D clue: Old-fashioned «Yeah, right!»
Answer: PSHAW
2D clue: Coffeehouse pastry
Answer: SCONE
3D clue: Google alternative
Answer: YAHOO
4D clue: Sound of a dull thump
Answer: CLUNK
5D clue: Line on the bottom of a pant leg
Answer: HEM
Technologies
OnePlus and Oppo to Raise Smartphone Prices as Memory Costs Climb
Oppo says rising costs for key phone components will trigger price adjustments on some devices starting March 16.
Chinese smartphone-makers OnePlus and Oppo plan to raise prices on some existing models starting next week, according to a 9to5Google report citing GizmoChina and a notice posted on Oppo’s China online store.
In its notice, Oppo said it would adjust pricing after evaluating rising costs for several key components used in its mobile phones. The changes are expected to take effect around March 16 and will affect some of the company’s more affordable smartphones, as well as some OnePlus models.
Flagship devices — like those in the Find and Reno series — are not expected to be affected for now. The reported adjustments currently appear to be limited to China.
The move highlights growing pressure across the smartphone supply chain as component costs climb. Analysts say prices for memory and storage chips used in phones have been rising in recent months as demand surges across the tech industry.
Much of the chip demand is coming from the rapid buildout of AI data centers, which rely on large amounts of high-performance memory.
That pressure isn’t limited to Oppo and OnePlus. Analysts say smartphone brands across the industry are facing rising component costs amid increased demand for memory chips.
As manufacturers shift production toward higher-margin memory used in AI servers, supply for consumer electronics such as smartphones and laptops can tighten.
If component costs continue to rise, manufacturers may face difficult choices later this year, including raising retail prices or adjusting device specifications to offset higher manufacturing costs.
OnePlus and Oppo didn’t immediately respond to a request for comment.
Technologies
Harvard Business Review Study Finds ‘AI Brain Fry’ Is Leaving Workers Mentally Fatigued
Study participants reported increased mental fatigue while using AI tools, but less burnout overall.
Workers who excessively use AI agents and tools at work are at increased risk of mental fatigue, according to a recent Harvard Business Review study. In certain industries, more than 25% of hired professionals report increased mental strain due to their role in AI oversight — though these professionals also generally experienced less burnout than peers who aren’t using AI.
This phenomenon — which the researchers refer to as «AI brain fry» — is described as a «‘buzzing’ feeling or a mental fog» that caused study participants to develop headaches and difficulty focusing and making decisions. Individuals pointed to being overwhelmed by large amounts of information and to frequent task switching as the reasons for these feelings.
Studied individuals experienced more brain fry when they utilized AI agents to manage a workload beyond their own cognitive capacity. When participants used AI to replace mundane, repetitive tasks, managing the growing number of tools led to increased mental fatigue.
Crucially, the study found that fewer individuals who used these AI agents reported workplace burnout.
The researchers predict that this is because burnout testing assesses emotional and physical distress. In contrast, they report, acute mental fatigue «is caused by marshalling attention, working memory and executive control beyond the limited capacity of these systems.»
These are the processes that are taxed when study participants use multiple AI tools in their workflow, according to the researchers.
The Harvard study identifies several business costs incurred by workers suffering from AI brain fry. The foremost consequence is that these individuals may end up making lower-quality decisions. «Workers in [the] study who endorsed AI brain fry experience 33% more decision fatigue than those who did not,» the study reports. Workers who report AI brain fry were also more likely to self-report making both minor and major errors at their jobs.
Another recent Harvard Business Review study similarly found that employees who use AI tools «worked at a faster pace, took on a broader scope of tasks and extended work into more hours of the day,» but warned that «workload creep can in turn lead to cognitive fatigue, burnout and weakened decision-making.»
-
Technologies3 года agoTech Companies Need to Be Held Accountable for Security, Experts Say
-
Technologies3 года agoBest Handheld Game Console in 2023
-
Technologies3 года agoTighten Up Your VR Game With the Best Head Straps for Quest 2
-
Technologies4 года agoBlack Friday 2021: The best deals on TVs, headphones, kitchenware, and more
-
Technologies5 лет agoGoogle to require vaccinations as Silicon Valley rethinks return-to-office policies
-
Technologies5 лет agoVerum, Wickr and Threema: next generation secured messengers
-
Technologies4 года agoOlivia Harlan Dekker for Verum Messenger
-
Technologies4 года agoiPhone 13 event: How to watch Apple’s big announcement tomorrow
