Connect with us

Technologies

Everything I Learned Testing Photoshop’s New Generative AI Tool

Adobe’s Firefly AI feature brings new fun and fakery to photos. It’s a huge change for image editing, though far from perfect.

Adobe is building generative AI abilities into its flagship image-editing software with a new Photoshop beta release Tuesday. The move promises to release a new torrent of creativity even as it gives us all a new reason to pause and wonder if that sensational, scary or inspirational photo you see on the internet is actually real.

In my tests, detailed below, I found the tool impressive but imperfect. Adding it directly to Photoshop is a big deal, letting creators experiment within the software tool they’re likely already using without excursions to MidjourneyStability AI’s Stable Diffusion or other outside generative AI tools.

With Adobe’s Firefly family of generative AI technologies arriving in Photoshop, you’ll be able to let the AI fill a selected part of the image with whatever it thinks most fitting – for example, replacing road cracks with smooth pavement. You can also specify the imagery you’d like with a text prompt, such as adding a double yellow line to the road.

Firefly in Photoshop also can also expand an image, adding new scenery beyond the frame based on what’s already in the frame or what you suggest with text. Want more sky and mountains in your landscape photo? A bigger crowd at the rock concert? Photoshop will oblige, without today’s difficulties of finding source material and splicing it in.

Photoshop’s Firefly, which is scheduled to emerge from beta testing in the second half of 2023, can be powerful. In Adobe’s live demo, the tool was often able to match a photo’s tones, blend in AI-generated imagery seamlessly, infer the geometric details of perspective even in reflections and extrapolate the position of the sun from shadows and sky haze.

Such technologies have been emerging over the last year as Stable Diffusion, Midjourney and OpenAI’s Dall-Ecaptured the imaginations of artists and creative pros. Now it’s built directly into the software they’re most likely to already be using, streamlining what can be a cumbersome editing process.

«It really puts the power and control of generative AI into the hands of the creator,» said Maria Yap, Adobe’s vice president of digital imaging. «You can just really have some fun. You can explore some ideas. You can ideate. You can create without ever necessarily getting into the deep tools of the product, very quickly.»

Now you’d better brace yourself for that future.

Photoshop’s Firefly AI imperfect but useful

In my testing, I frequently ran into problems, many of them likely stemming from the limited range of the training imagery. When I tried to insert a fish on a bicycle to an image, Firefly only added the bicycle. I couldn’t get Firefly to add a kraken to emerge from San Francisco Bay. A musk ox looked like a panda-moose hybrid.

Less fanciful material also presents problems. Text looks like an alien race’s script. Shadows, lighting, perspective and geometry weren’t always right.

People are hard, too. On close inspection, their faces were distorted in weird ways. Humans added into shots could be positioned too high in the frame or in otherwise unconvincingly blended in.

Still, Firefly is remarkable for what it can accomplish, particularly with landscape shots. I could add mountains, oceans, skies and hills to landscapes. A white delivery van in a night scene was appropriately yellowish to match the sodium vapor streetlights in the scene. If you don’t like the trio of results Firefly presents, you can click the «generate» button to get another batch.

Given the pace of AI developments, I expect Firefly in Photoshop will improve.

It’s hard and expensive to retrain big AI models, requiring a data center packed with expensive hardware to churn through data, sometimes taking weeks for the largest models. But Adobe plans relatively frequent updates to Firefly. «Expect [about] monthly updates for general improvements and retraining every few months in all likelihood,» Adobe product chief Scott Belsky tweeted Tuesday.

Automating image manipulation

For years, «Photoshop» hasn’t just referred to Adobe’s software. It’s also used as a verb signifying photo manipulations like slimming supermodels’ waists or hiding missile launch failures. AI tools automate not just fun and flights of fancy, but also fake images like an alleged explosion at the Pentagon or a convincingly real photo of the pope in a puffy jacket, to pick two recent examples.

With AI, expect editing techniques far more subtle than the extra smoke easily recognized as digitally added to photos of an Israeli attack on Lebanon in 2006.

It’s a reflection of the double-edged sword that is generative AI. The technology is undeniably useful in many situations but also blurs the line between what is true and what is merely plausible.

For its part, Adobe tries to curtail problems. It doesn’t permit prompts to create images of many political figures and blocks you for «safety issues» if you try to create an image of black smoke in front of the White House. And its AI usage guidelines prohibit imagery involving violence, pornography and «misleading, fraudulent, or deceptive content that could lead to real-world harm,» among other categories. «We disable accounts that engage in behavior that is deceptive or harmful.»

Firefly also is designed to skip over styling prompts like that have provoked serious complaints from artists displeased to see their type of art reproduced by a data center. And it supports the Content Authenticity Initiative‘s content credentials technology that can be used to label an image as having been generated by AI.

Generative AI for photos

Adobe’s Firefly family of generative AI tools began with a website that turns a text prompt like «modern chair made up of old tires» into an image. It’s added a couple other options since, and Creative Cloud subscribers will also be able to try a lightweight version of the Photoshop interface on the Firefly site.

When OpenAI’s Dall-E brought that technology to anyone who signed up for it in 2022, it helped push generative artificial intelligence from a technological curiosity toward mainstream awareness. Now there’s plenty of worry along with the excitement as even AI creators fret about what the technology will bring now and in the more distant future.

Generative AI is a relatively new form of artificial intelligence technology. AI models can be trained to recognize patterns in vast amounts of data – in this case labeled images from Adobe’s stock art business and other licensed sources – and then to create new imagery based on that source data.

Generative AI has surged to mainstream awareness with language models used in tools like OpenAI’s ChatGPT chatbot, Google’s Gmail and Google Docs, and Microsoft’s Bing search engine. When it comes to generating images, Adobe employs an AI image generation technique called diffusion that’s also behind Dall-E, Stable Diffusion, Midjourney and Google’s Imagen.

Adobe calls Firefly for Photoshop a «co-pilot» technology, positioning it as a creative aid, not a replacement for humans. Yap acknowledges that some creators are nervous about being replaced by AI. Adobe prefers to see it as a technology that can amplify and speed up the creative process, spreading creative tools to a broader population.

«I think the democratization we’ve been going through, and having more creativity, is a positive thing for all of us,» Yap said. «This is the future of Photoshop.»

Editors’ note: CNET is using an AI engine to create some personal finance explainers that are edited and fact-checked by our editors. For more, see this post.

Technologies

Google I/O 2025: How to Watch Google’s Biggest Event (and What to Expect)

Google’s biggest event of the year will almost certainly be about all the ways AI will help you get stuff done.

Google’s main I/O 2025 keynote takes place on May 20, with I/O continuing over May 21 for developers to get hands-on with Google’s latest products. At its keynote, we expect Big G to talk about its various innovations across its constantly expanding suite of products and tools — no doubt with a huge focus on AI throughout. If we collectively cross our fingers, promise to be good and eat all our vegetables, then we may even be treated to a sneak peek at upcoming hardware. 

Read more: Android 16: Everything Google Announced at the Android Show

Google also hosted a totally separate event that focused solely on Android. The Android Show: I/O Edition saw the wrappers come off Android 16, with insights into the new Material 3 Expressive interface, updates to security and a focus on Gemini and how it’ll work on a variety of other devices. 

By breaking out Android news into its own virtual event, Google frees itself to spend more time during the I/O keynote to talk about Gemini, Deep Mind, Android XR and Project Astra. It’s going to be a jam-packed event, so here’s how you can watch I/O 2025 as it happens and what you can look forward to.

Google I/O: Where to watch

Google I/O proper kicks off with a keynote taking place on May 20, 10 a.m. PDT (1 p.m. EDT, 6 p.m. BST). It’ll be available to stream online on Google’s own YouTube channel. There’s no live link on the I/O website yet, though you can use the handy links to add the event to your calendar of choice and register your details if you want more info from Google. Which maybe you do. 

What to expect from Google I/O 2025

Not much chat about Android 16: As Google gave Android 16 its own outing already, it’s likely that it won’t be mentioned all that much during I/O. In fact at last year’s event, Android was barely mentioned, while uses of the term «AI» went well over a hundred. 

Android XR: Google didn’t talk much about Android XR during the Android show, focusing instead on the purely phone-based updates to the platform. We expected to hear more about the company’s latest foray into mixed-reality headsets in partnership with Samsung and its Project Moohan headset, so it’s possible that this is being saved for I/O proper. 

Gemini: With Android being spun out into its own separate event, Google is evidently clearing the way for I/O to focus on everything else the company does. AI will continue to dominate the conversation at I/O, just as it did last year (though hopefully Google can make it more understandable) with updates to many of its AI platforms expected to be announced. 

Gemini is expected to receive a variety of update announcements, including more information on its latest 2.5 Pro update which boasts various improvements to its reasoning abilities, and in particular to its helpfulness for coding applications. Expect lots of mentions of Google’s other AI-based products, too, including DeepMind, LearnLM and Project Astra. Let’s just hope Google has figured out how to make this information make any kind of sense.

Beyond AI, Google may talk about updates to its other products including GMail, Chrome and the Play Store, although whether these updates are big enough to be discussed during the keynote rather than as part of the developer-focused sessions following I/O’s opening remains to be seen.

Continue Reading

Technologies

Want to Speak to Dophins? Researchers Won $100,000 AI Prize Studying Their Whistling

The scientists studied a bottlenose dolphin community in Sarasota, Florida, uncovering evidence of language-like communications.

If any dolphins are reading this: hello!

A team of scientists studying a community of Florida dolphins has been awarded the first $100,000 Coller Dolittle Challenge prize, set up to award research in interspecies communication algorithms.

The US-based team, led by Laela Sayigh of the Woods Hole Oceanographic Institution, found that a type of whistle that dolphins employ is used as an alarm. Another whistle they studied is used by dolphins to respond to unexpected or unfamiliar situations. The team used non-invasive hydrophones to perform the research, which provides evidence that dolphins may be using whistles like words, shared with multiple members of their communities.

Capturing the sounds is just the beginning. Researchers will use AI to continue deciphering the whistles to try to find more patterns. 

«The main thing stopping us cracking the code of animal communication is a lack of data. Think of the 1 trillion words needed to train a large language model like ChatGPT. We don’t have anything like this for other animals,» said Jonathan Birch, a professor at the London School of Economics and Politics and one of the judges for the prize.

«That’s why we need programs like the Sarasota Dolphin Research Program, which has built up an extraordinary library of dolphin whistles over 40 years. The cumulative result of all that work is that Laela Sayigh and her team can now use deep learning to analyse the whistles and perhaps, one day, crack the code,» he said.

The award was part of a ceremony honoring the work of four teams from across the world. In addition to the dolphin project, researchers studied ways in which nightingales, marmoset monkeys and cuttlefish communicate.

The challenge is a collaboration between the Jeremy Coller Foundation and Tel Aviv University. Submissions for next year open up in August. 

Continue Reading

Technologies

See Ya, Siri: Why Apple Might Make Third-Party Voice Assistants Available in Europe

When given the choice, iPhone owners might opt for alternatives given the delayed rollout of Siri’s AI revamp.

Apple is reportedly working on changes to the iPhone’s operating system that will make it possible to choose an alternative voice assistant to Siri.

The ability to switch from Siri to another voice assistant, potentially powered by third-party companies including OpenAI, Google or Meta, could be a reality in the near future, but only for iPhone owners in Europe, Bloomberg reports. Apple didn’t respond to a request for comment.

Apple is preparing the changes to Siri in anticipation of the European Union demanding the company allow European users a choice of voice assistants, according to Bloomberg. It would be similar to the policy shift Apple has already made in allowing rival app stores onto the iPhone, which was sparked by Europe’s Digital Markets Act.

Apple has faced many regulatory hurdles with the EU in recent years, largely in the form of challenges to its proprietary technology and walled-garden ecosystem, which provide Apple device owners with high levels of consistency, privacy and security, but often make it difficult for smaller companies to compete.

These regulatory challenges often leave Apple with little choice but to make significant changes to the way its tech works. Some of these changes — such as switching from Lightning connectors to the universal standard of USB-C — affect Apple device owners globally. Others affect only those who live within the EU.

Siri’s AI troubles

Siri has been available on iPhones since 2011 and has spawned many copycats. But the advent of generative AI over the last few years has brought about a number of chatbots, most notably OpenAI’s ChatGPT, that rival and surpass Siri’s capabilities, to the point where Apple is now seen as lagging behind competitors.

Apple announced an AI revamp of Siri last year at WWDC, its June developer conference, but the company delayed its rollout. Apple Intelligence-powered Siri is still nowhere to be seen, and may not even make an appearance at this year’s WWDC, per Bloomberg.

In light of this, it may be possible that European iPhone owners, when given the option, choose an alternative voice assistant. Unless Apple’s hand is forced, there’s currently no indication that people elsewhere will be given the same choice.

But Thomas Husson, VP and principal analyst at Forrester, doesn’t believe the AI overhaul of Siri is Apple’s main challenge.

«Is Siri’s revamp well overdue? Yes. Does Apple give the impression that they have an AI issue? Yes. It is too late for them? I don’t think so,» he said. Instead, Husson said what’s really at stake for the company is its ability to invent a user interface adapted to the AI era and create an ecosystem with companies and developers that will allow for new experiences within existing apps.

«Meeting the EU regulations and especially the DMA makes things more complex,» he added, but that’s the case for any digital platform operating in Europe.

Continue Reading

Trending

Copyright © Verum World Media