Connect with us

Technologies

Gemini Live Gives You AI With Eyes, and It’s Awesome

When it works, Gemini Live’s new camera mode feels like the future in all the right ways. I put it to the test.

Google’s been rolling out the new Gemini Live camera mode to all Android phones using the Gemini app for free after a two-week exclusive for Pixel 9 (including the new Pixel 9A) and Galaxy S5 smartphones. In simpler terms, Google successfully gave Gemini the ability to see, as it can recognize objects that you put in front of your camera. 

It’s not just a party trick, either. Not only can it identify objects, but you can also ask questions about them — and it works pretty well for the most part. In addition, you can share your screen with Gemini so it can identify things you surface on your phone’s display. When you start a live session with Gemini, you now have the option to enable a live camera view, where you can talk to the chatbot and ask it about anything the camera sees. I was most impressed when I asked Gemini where I misplaced my scissors during one of my initial tests.

«I just spotted your scissors on the table, right next to the green package of pistachios. Do you see them?»

Gemini Live’s chatty new camera feature was right. My scissors were exactly where it said they were, and all I did was pass my camera in front of them at some point during a 15-minute live session of me giving the AI chatbot a tour of my apartment.

When the new camera feature popped up on my phone, I didn’t hesitate to try it out. In one of my longer tests, I turned it on and started walking through my apartment, asking Gemini what it saw. It identified some fruit, ChapStick and a few other everyday items with no problem. I was wowed when it found my scissors. 

That’s because I hadn’t mentioned the scissors at all. Gemini had silently identified them somewhere along the way and then  recalled the location with precision. It felt so much like the future, I had to do further testing. 

My experiment with Gemini Live’s camera feature was following the lead of the demo that Google did last summer when it first showed off these live video AI capabilities. Gemini reminded the person giving the demo where they’d left their glasses, and it seemed too good to be true. But as I discovered, it was very true indeed.

Gemini Live will recognize a whole lot more than household odds and ends. Google says it’ll help you navigate a crowded train station or figure out the filling of a pastry. It can give you deeper information about artwork, like where an object originated and whether it was a limited edition piece.

It’s more than just a souped-up Google Lens. You talk with it, and it talks to you. I didn’t need to speak to Gemini in any particular way — it was as casual as any conversation. Way better than talking with the old Google Assistant that the company is quickly phasing out.

Google also released a new YouTube video for the April 2025 Pixel Drop showcasing the feature, and there’s now a dedicated page on the Google Store for it.

To get started, you can go live with Gemini, enable the camera and start talking. That’s it.

Gemini Live follows on from Google’s Project Astra, first revealed last year as possibly the company’s biggest «we’re in the future» feature, an experimental next step for generative AI capabilities, beyond your simply typing or even speaking prompts into a chatbot like ChatGPT, Claude or Gemini. It comes as AI companies continue to dramatically increase the skills of AI tools, from video generation to raw processing power. Similar to Gemini Live, there’s Apple’s Visual Intelligence, which the iPhone maker released in a beta form late last year. 

My big takeaway is that a feature like Gemini Live has the potential to change how we interact with the world around us, melding our digital and physical worlds together just by holding your camera in front of almost anything.

I put Gemini Live to a real test

The first time I tried it, Gemini was shockingly accurate when I placed a very specific gaming collectible of a stuffed rabbit in my camera’s view. The second time, I showed it to a friend in an art gallery. It identified the tortoise on a cross (don’t ask me) and immediately identified and translated the kanji right next to the tortoise, giving both of us chills and leaving us more than a little creeped out. In a good way, I think.

I got to thinking about how I could stress-test the feature. I tried to screen-record it in action, but it consistently fell apart at that task. And what if I went off the beaten path with it? I’m a huge fan of the horror genre — movies, TV shows, video games — and have countless collectibles, trinkets and what have you. How well would it do with more obscure stuff — like my horror-themed collectibles?

First, let me say that Gemini can be both absolutely incredible and ridiculously frustrating in the same round of questions. I had roughly 11 objects that I was asking Gemini to identify, and it would sometimes get worse the longer the live session ran, so I had to limit sessions to only one or two objects. My guess is that Gemini attempted to use contextual information from previously identified objects to guess new objects put in front of it, which sort of makes sense, but ultimately, neither I nor it benefited from this.

Sometimes, Gemini was just on point, easily landing the correct answers with no fuss or confusion, but this tended to happen with more recent or popular objects. For example, I was surprised when it immediately guessed one of my test objects was not only from Destiny 2, but was a limited edition from a seasonal event from last year. 

At other times, Gemini would be way off the mark, and I would need to give it more hints to get into the ballpark of the right answer. And sometimes, it seemed as though Gemini was taking context from my previous live sessions to come up with answers, identifying multiple objects as coming from Silent Hill when they were not. I have a display case dedicated to the game series, so I could see why it would want to dip into that territory quickly.

Gemini can get full-on bugged out at times. On more than one occasion, Gemini misidentified one of the items as a made-up character from the unreleased Silent Hill: f game, clearly merging pieces of different titles into something that never was. The other consistent bug I experienced was when Gemini would produce an incorrect answer, and I would correct it and hint closer at the answer — or straight up give it the answer, only to have it repeat the incorrect answer as if it was a new guess. When that happened, I would close the session and start a new one, which wasn’t always helpful.

One trick I found was that some conversations did better than others. If I scrolled through my Gemini conversation list, tapped an old chat that had gotten a specific item correct, and then went live again from that chat, it would be able to identify the items without issue. While that’s not necessarily surprising, it was interesting to see that some conversations worked better than others, even if you used the same language. 

Google didn’t respond to my requests for more information on how Gemini Live works.

I wanted Gemini to successfully answer my sometimes highly specific questions, so I provided plenty of hints to get there. The nudges were often helpful, but not always. Below are a series of objects I tried to get Gemini to identify and provide information about. 

Technologies

Apple Launches Creator Studio Package as $13 a Month Subscription

Mac users can still buy the apps individually, but subscribers get access to Final Cut Pro and other Studio tools.

Apple is bundling its pro filmmaking and audio tools including Final Cut Pro with its productivity apps Keynote, Pages and Numbers into a subscription software suite called Apple Creator Studio.

The package, which includes apps for Mac, iPad and iPhone, includes Logic Pro, Pixelmator Pro, Motion, Compressor, MainStage and the whiteboard app Freeform. Creator Studio will be available starting Jan. 28 at a cost of $13 per month or $129 per year, or $3 per month or $30 per year for students and educators. Mac users will still have the option to purchase software like Final Cut Pro for a one-time free. The current price for Final Cut Pro in the Mac App Store is $300.

While apps such as Keynote and Pages are already free on Apple platforms, it appears that new versions of those apps will receive access to beta features that will roll out first to Creator Studio subscribers. The announcement by Apple alludes to «new AI features and premium content» in some of the apps it otherwise makes available to use for free.

What the Creator Studio bundle comes with

The star of the show in Creator Studio is Final Cut Pro, the video editing software that will now include Transcript Search on both Mac and iPad. There is also a new Beat Detection feature Apple says uses an AI model to analyze a music track and display a beat grid, making it easier to cut video to music rhythms. The software also will include a new Montage Maker on iPad for quick social video creation.

Motion, the 2D and 3D graphics tool, and Compressor also integrate with Final Cut Pro. Apple touted Motion’s Magnetic Mask feature for isolating objects or people without the need for a green screen.

Logic Pro has new features for musicians, including a Synth Player addition to AI Session Players. Chord ID, a new AI feature, can create chord progressions from audio or MIDI recordings. A new Sound Library will have hundreds of royalty-free clips, samples and loops.

A revamped MainStage app gives subscribers access to instrument, voice-professing and guitar rig tools. Pixelmator Pro arrives with new tools and filters, and there will be an iPad version in addition to the Mac tool.

Freeform in the Creator Studio package will add premium content, including curated photos, graphics and illustrations. It will also get new AI features that include image creation.

Continue Reading

Technologies

Reddit Outage Resolved: Here’s What Happened

Did you have trouble reading your favorite subreddits today? You weren’t alone.

If you had trouble accessing the news and discussion forum Reddit on Tuesday, you weren’t the only one. However, as of 10:15 a.m. PT, the site appears to be back up and running normally. Reddit’s status monitoring page, RedditStatus.com, notes that «all systems (are) operational» after the brief outage.

But earlier, at 9:30 a.m. PT, RedditStatus.com said the company was «investigating elevated errors across reddit.com and native apps.» RedditStatus.com reported degraded site performance for both desktop web use and native mobile apps.

Earlier on Tuesday, the site-monitoring service DownDetector also reported issues at Reddit, providing additional details. At one point on Tuesday, DownDetector received over 100,000 reports that the site was having problems. At 10:25 am PT, the report numbers fell to under 600. (Disclosure: Downdetector is owned by the same parent company as CNET, Ziff Davis.)


Don’t miss any of our unbiased tech content and lab-based reviews. Add CNET as a preferred Google source.


«Reddit is currently experiencing a significant internal outage causing widespread service disruptions,» the site said earlier Tuesday. «The impact is categorized as Very High, primarily affecting mobile app access (55%) and website connectivity (39%). While reports are heavily concentrated in major hubs like New York City and Chicago, the lack of ISP correlation suggests a broad, nationwide issue stemming from Reddit’s internal servers rather than external network providers.»

A representative for Reddit did not immediately respond to a request for comment. Another social media site, X, formerly Twitter, also showed problems on Tuesday, according to DownDetector. Those problems seemed to spike around 6:30 a.m. PT and improve after.

Continue Reading

Technologies

This 3-in-1 Charger Is a Must-Have for Travelers, and It Just Hit a Record-Low of $95

Snag it for $45 off and charge your iPhone, AirPods and Apple Watch at the same time.

If you’re a frequent traveler, then you know that outlets are a precious commodity in places like airports and coffee shops. So why waste one on a single device when you can charge up to three at once? Right now, you can grab this seriously sleek Ugreen Magflow three-in-one foldable charger for just $95 at Amazon. That’s a $45 discount and the all-time lowest price we’ve seen. Just don’t wait too long, as this deal could expire at any time.

At just 7.4 ounces, this compact charging station is designed to be taken on the go. But despite its size, it still supports 25-watt MagSafe charging for iPhones, as well as 5-watt wireless charging for AirPods and Apple Watches. The charging stand also tilts up to double as a stand, and it’s equipped with 16 magnets to keep your phone aligned and securely in place. Plus, it’s got built-in protections against overheating, overcharging, short-circuiting and more to prevent damage to your devices.

Why this deal matters

This folding Ugreen charger is great for juicing up your devices on the go, and it’s never been more affordable. Plus, Ugreen makes some of the best MagSafe chargers on the market right now, so don’t miss your chance to grab one at a record-low price.

Continue Reading

Trending

Copyright © Verum World Media