Technologies

AI Gets Smarter, Safer, More Visual With GPT-4 Update, OpenAI Says

If you subscribe to ChatGPT Plus, you can try it out now.

The hottest AI technology foundation got a big upgrade Tuesday with OpenAI’s GPT-4 release now available in the premium version of the ChatGPT chatbot.

GPT-4 can generate much longer strings of text and respond when people feed it images, and it’s designed to do a better job avoiding artificial intelligence pitfalls visible in the earlier GPT-3.5, OpenAI said Tuesday. For example, when taking bar exams that attorneys must pass to practice law, GPT-4 ranks in the top 10% of scores compared with the bottom 10% for GPT-3.5, the AI research company said.

GPT stands for Generative Pretrained Transformer, a reference to the fact that it can generate text on its own — now up to 25,000 words with GPT-4 — and that it uses an AI technology called transformers that Google pioneered. It’s a type of AI called a large language model, or LLM, that’s trained on vast swaths of data harvested from the internet, learning mathematically to spot patterns and reproduce styles. Human overseers rate results to steer GPT in the right direction, and GPT-4 has more of this feedback.

OpenAI has made GPT available to developers for years, but ChatGPT, which debuted in November, offered an easy interface ordinary folks can use. That yielded an explosion of interest, experimentation and worry about the downsides of the technology. It can do everything from generating programming code and answering exam questions to writing poetry and supplying basic facts. It’s remarkable if not always reliable.

ChatGPT is free, but it can falter when demand is high. In January, OpenAI began offering ChatGPT Plus for $20 per month with assured availability and, now, the GPT-4 foundation. Developers can sign up on a waiting list to get their own access to GPT-4.

GPT-4 advancements

«In a casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle. The difference comes out when the complexity of the task reaches a sufficient threshold,» OpenAI said. «GPT-4 is more reliable, creative and able to handle much more nuanced instructions than GPT-3.5.»

Another major advance in GPT-4 is the ability to accept input data that includes text and photos. OpenAI’s example is asking the chatbot to explain a joke showing a bulky decades-old computer cable plugged into a modern iPhone’s tiny Lightning port. This feature also helps GPT take tests that aren’t just textual, but it isn’t yet available in ChatGPT Plus.

Another is better performance avoiding AI problems like hallucinations — incorrectly fabricated responses, often offered with just as much seeming authority as answers the AI gets right. GPT-4 also is better at thwarting attempts to get it to say the wrong thing: «GPT-4 scores 40% higher than our latest GPT-3.5 on our internal adversarial factuality evaluations,» OpenAI said.

GPT-4 also adds new «steerability» options. Users of large language models today often must engage in elaborate «prompt engineering,» learning how to embed specific cues in their prompts to get the right sort of responses. GPT-4 adds a system command option that lets users set a specific tone or style, for example programming code or a Socratic tutor: «You are a tutor that always responds in the Socratic style. You never give the student the answer, but always try to ask just the right question to help them learn to think for themselves.»

«Stochastic parrots» and other problems

OpenAI acknowledges significant shortcomings that persist with GPT-4, though it also touts progress avoiding them.

«It can sometimes make simple reasoning errors … or be overly gullible in accepting obvious false statements from a user. And sometimes it can fail at hard problems the same way humans do, such as introducing security vulnerabilities into code it produces,» OpenAI said. In addition, «GPT-4 can also be confidently wrong in its predictions, not taking care to double-check work when it’s likely to make a mistake.»

Large language models can deliver impressive results, seeming to understand huge amounts of subject matter and to converse in human-sounding if somewhat stilted language. Fundamentally, though, LLM AIs don’t really know anything. They’re just able to string words together in statistically very refined ways.

This statistical but fundamentally somewhat hollow approach to knowledge led researchers, including former Google AI researchers Emily Bender and Timnit Gebru, to warn of the «dangers of stochastic parrots» that come with large language models. Language model AIs tend to encode biases, stereotypes and negative sentiment present in training data, and researchers and other people using these models tend «to mistake … performance gains for actual natural language understanding.»

OpenAI Chief Executive Sam Altman acknowledges problems, but he’s pleased overall with the progress shown with GPT-4. «It is more creative than previous models, it hallucinates significantly less, and it is less biased. It can pass a bar exam and score a 5 on several AP exams,» Altman tweeted Tuesday.

One worry about AI is that students will use it to cheat, for example when answering essay questions. It’s a real risk, though some educators actively embrace LLMs as a tool, like search engines and Wikipedia. Plagiarism detection companies are adapting to AI by training their own detection models. One such company, Crossplag, said Wednesday that after testing about 50 documents that GPT-4 generated, «our accuracy rate was above 98.5%.»

OpenAI, Microsoft and Nvidia partnership

OpenAI got a big boost when Microsoft said in February it’s using GPT technology in its Bing search engine, including a chat features similar to ChatGPT. On Tuesday, Microsoft said it’s using GPT-4 for the Bing work. Together, OpenAI and Microsoft pose a major search threat to Google, but Google has its own large language model technology too, including a chatbot called Bard that Google is testing privately.

Also on Tuesday, Google announced it’ll begin limited testing of its own AI technology to boost writing Gmail emails and Google Docs word processing documents. «With your collaborative AI partner you can continue to refine and edit, getting more suggestions as needed,» Google said.

That phrasing mirrors Microsoft’s «co-pilot» positioning of AI technology. Calling it an aid to human-led work is a common stance, given the problems of the technology and the necessity for careful human oversight.

Microsoft uses GPT technology both to evaluate the searches people type into Bing and, in some cases, to offer more elaborate, conversational responses. The results can be much more informative than those of earlier search engines, but the more conversational interface that can be invoked as an option has had problems that make it look unhinged.

To train GPT, OpenAI used Microsoft’s Azure cloud computing service, including thousands of Nvidia’s A100 graphics processing units, or GPUs, yoked together. Azure now can use Nvidia’s new H100 processors, which include specific circuitry to accelerate AI transformer calculations.

AI chatbots everywhere

Another large language model developer, Anthropic, also unveiled an AI chatbot called Claude on Tuesday. The company, which counts Google as an investor, opened a waiting list for Claude.

«Claude is capable of a wide variety of conversational and text processing tasks while maintaining a high degree of reliability and predictability,» Anthropic said in a blog post. «Claude can help with use cases including summarization, search, creative and collaborative writing, Q&A, coding and more.»

It’s one of a growing crowd. Chinese search and tech giant Baidu is working on a chatbot called Ernie Bot. Meta, parent of Facebook and Instagram, consolidated its AI operations into a bigger team and plans to build more generative AI into its products. Even Snapchat is getting in on the game with a GPT-based chatbot called My AI.

Expect more refinements in the future.

«We have had the initial training of GPT-4 done for quite awhile, but it’s taken us a long time and a lot of work to feel ready to release it,» Altman tweeted. «We hope you enjoy it and we really appreciate feedback on its shortcomings.»

Editors’ note: CNET is using an AI engine to create some personalfinance explainers that are edited and fact-checked by our editors. Formore, see this post.

Technologies

Verum Messenger Introduces Built-In Verum Chess

Verum Messenger has released a new update for iOS, iPadOS, and macOS, adding another feature to its growing ecosystem — Verum Chess. Users can now play chess directly inside the messenger without switching between different applications.

The new feature allows users to start a game in just a few taps while reinforcing Verum’s vision of a unified digital environment where communication and everyday services come together in a single platform.

Verum Messenger continues to evolve into a multifunctional ecosystem that combines secure chats and calls, AI tools, a built-in VPN, anonymous email, eSIM, financial services, cryptocurrency features, and offline communication. With the introduction of Verum Chess, the platform now also offers a new way for users to interact and spend time together without leaving the app.

The update is now available for iPhone, iPad, and Mac on the App Store.

Technologies

Episode 3 of the VERUM AI Mini-Series Is Now Available

Verum Messenger has released the third episode of its AI mini-series, SHADOWS, created using Verum AI.

The new episode, titled «Ghost Money,» continues the story of the conflict between a team of heroes and the Omega corporation, which seeks to take control of digital communications. This time, the focus shifts to anonymous payments and financial freedom, revealing how privacy can extend beyond messaging.

Like the previous episodes, the new release not only advances the storyline but also showcases the capabilities of the Verum ecosystem, highlighting technologies designed for secure communication and digital privacy.

The mini-series consists of seven episodes, released gradually across Verum Messenger’s social media channels.

Episode 3 is now available. Stay tuned for the next chapter.

Watch on Instagram
Watch on YouTube

Technologies

Verum Finance Now Available for Mac, Expanding the Verum Ecosystem on Desktop

Verum has officially released Verum Finance for macOS, bringing its financial platform to the Mac and expanding access to the Verum ecosystem across Apple’s devices. The launch allows users to manage their finances from desktop while enjoying the same secure and seamless experience available on iPhone and iPad.

The new Mac version includes the full range of Verum Finance features, including balance management, instant transfers to other Verum users, debit card management, Apple Pay support, asset exchange, and transaction history — all optimized for the macOS experience.

Verum Finance can be used as a standalone application or alongside Verum Messenger. Users who sign in with their Verum Messenger account automatically synchronize their balances, settings, and account data across devices, ensuring a consistent experience throughout the Verum ecosystem.

The macOS release further strengthens Verum’s vision of creating an integrated digital platform where communication and financial services work together. Verum Messenger, which is also available for Mac, complements the ecosystem with encrypted messaging, voice and video calls, VPN, eSIM, anonymous email, AI-powered tools, offline communication capabilities, and cryptocurrency features.

With both Verum Messenger and Verum Finance now available across iPhone, iPad, and Mac, users can access secure communication and financial services wherever they work.

Verum Finance for Mac is available now through the Mac App Store.

Verum Finance for macOS: https://apps.apple.com/us/app/verum-finance/id6774245148
Verum Finance: https://finance.verum.im
Verum Messenger: https://verum.im