December 14, 2024

ChatGPT Can Finally See

By Jake Peterson

After months of testing, OpenAI rolled out “advanced voice” mode for ChatGPT back in September. The feature lets you have real-time conversations with ChatGPT: You can interrupt the bot while it’s “speaking” to ask another question, and it understands your tone of voice, which it uses to inform both its responses and the inflection it uses. (It’s very creepy when it laughs.)

One feature of advanced voice mode has been missing since launch, however. When OpenAI first announced the perk back in May, it showed off how ChatGPT would be able to access your camera and “see” the world around you. While chatting with the bot, you could point your camera at something, ask a question, and ChatGPT would answer as best it could. Seven months later, this capability is here, and it’s frighteningly impressive.

In order to access it, you’ll need to have a paid subscription to ChatGPT—either Plus ($20 per month), or Pro ($200 per month). ChatGPT Team subscribers are also eligible. The feature may not be available on your end right away, even if you pay, since OpenAI is rolling it out over time.

Testing out ChatGPT advanced voice mode’s vision feature

Accessing the camera is pretty straightforward once it rolls out to your account. You launch advanced voice mode the same way you always do, using the waveform icon in the bottom-right of the chat. From here, you’ll see a new camera icon, which, of course, launches the live camera feed. This doesn’t interrupt the chat: You can be in the middle of a conversation with ChatGPT, open the camera, and continue gabbing away, only now with the camera feed as part of the conversation.

The first time I used this, I pointed the camera at a Nintendo Switch box I had nearby, with an iPhone cable and my Magic Trackpad resting on top of it, and asked, “What is this?” ChatGPT said: “It looks like a Nintendo Switch OLED box with some cables and a laptop on top. Are you planning on setting it up?” Two out of three correct, as it mistook my trackpad for a laptop, but hey, close enough. Next up, I pointed it at my water bottle, and asked it to identify what I was highlighting: “That looks like a black Hydro Flask bottle. It’s great for keeping drinks cold or hot! Do you take it with you often?”

I asked a follow-up: “Do you know what model of Hydro Flask this is?” ChatGPT: “I can’t be certain of the exact model, but it looks like one of their wide-mouth bottles, probably around 32 ounces. It’s definitely designed to keep your drinks at the right temperature for hours.” That…is basically right. I’m not all that comfortable with ChatGPT guessing the size correctly, either.

I moved on to my keyboard, which ChatGPT accurately stated was an Apple Magic Keyboard. I asked which keys it could see, and it named a handful, but not all, of the keys I had in frame. So, I asked how many keys it could see, and it said “about 30,” when there were 26. So, again, close.

It was able to identify the MagSafe port on my MacBook, as well as the two USB ports and the headphone jack to its right. It recognized the air vent in my ceiling, and the specific type of boots I had by my front door. All in all, it basically recognized everything I tested it on—minus the trackpad.

Advanced voice mode’s sight is fast

But beyond recognition, I think what startled me the most was the speed of these responses. You ask ChatGPT to identify something, and it does, sometimes quicker than if you asked a real person to do it. Sometimes, the bot will hold onto a word for a moment (e.g. “I thiiiiiiiiink that’s a…”) which is probably a trick to let ChatGPT process the rest of what it wants to say. I’ve also caught it less sure of itself with its first response: I pointed it at my Magic Mouse, and its first guess was a computer mouse. But when I asked what brand it was, it didn’t just specify Apple, but said it was an Apple Magic Mouse, known for its “sleek design” and “touch-sensitive surface.”

All things considered, though, these responses are often near-instantaneous, which speaks to how powerful OpenAI’s models are these days. I’m still largely an AI skeptic, but this was the first development in a while that impressed me—and I’m torn about how I feel about that.

On the one hand, I could see this tech being used for good. Imagine how helpful something like this could be for users who are blind or have impaired vision, especially in a convenient device like smart glasses. Someone could ask their AI assistant what direction they’re facing, to read the menu at a restaurant, or whether it’s safe to cross the street. Tech like this could change search for the better, and make it easy to learn new things about the world by pointing our smartphone camera at a subject.

On the flip side, my mind turns to the negative, especially since AI is still prone to hallucination. As more and more people use this technology, they will inevitably experience the mistakes AI can make, and if they’re relying on the bot to help them with tasks—especially something that involves their safety—hallucinations can be dangerous. I didn’t experience any large errors; just the trackpad mixup. Anderson Cooper found that the bot made a mistake on a geometry problem (again, not a huge issue). But it’s a good reminder that as this tech improves rapidly, its inherent flaws raise the stakes for failure.

Perhaps that’s why every live camera session warns you not to use the feature for anything involving safety.

 

This article is written by: Nermeen Nabil Khear Abdelmalak

All rights reserved to: USAGOLDMINES. www.usagoldmines.com


