After months of testing, OpenAI rolled out “advanced voice” mode for ChatGPT back in September. The feature lets you have real-time conversations with ChatGPT: You can interrupt the bot while it’s “speaking” to ask another question, and it understands your tone of voice, which it uses both to inform its responses and to shape its own inflection. (It’s very creepy when it laughs.)
One feature of advanced voice mode has been missing since launch, however. When OpenAI first announced the perk back in May, it showed off how ChatGPT would be able to access your camera and “see” the world around you. While chatting with the bot, you could point your camera at something, ask a question, and ChatGPT would answer as best it could. Seven months later, this capability is here, and it’s frighteningly impressive.
In order to access it, you’ll need to have a paid subscription to ChatGPT—either Plus ($20 per month), or Pro ($200 per month). ChatGPT Team subscribers are also eligible. The feature may not be available on your end right away, even if you pay, since OpenAI is rolling it out over time.
Testing out ChatGPT advanced voice mode’s vision feature
Accessing the camera is pretty straightforward once it rolls out to your account. You launch advanced voice mode the same way you always do, using the waveform icon in the bottom-right of the chat. From here, you’ll see a new camera icon, which, of course, launches the live camera feed. This doesn’t interrupt the chat: You can be in the middle of a conversation with ChatGPT, open the camera, and continue gabbing away, only now with the camera feed as part of the conversation.
The first time I used this, I pointed the camera at a Nintendo Switch box I had nearby, with an iPhone cable and my Magic Trackpad resting on top of it, and asked, “What is this?” ChatGPT said: “It looks like a Nintendo Switch OLED box with some cables and a laptop on top. Are you planning on setting it up?” Two out of three correct, as it mistook my trackpad for a laptop, but hey, close enough. Next up, I pointed it at my water bottle, and asked it to identify what I was highlighting: “That looks like a black Hydro Flask bottle. It’s great for keeping drinks cold or hot! Do you take it with you often?”
I asked a follow-up: “Do you know what model of Hydro Flask this is?” ChatGPT: “I can’t be certain of the exact model, but it looks like one of their wide-mouth bottles, probably around 32 ounces. It’s definitely designed to keep your drinks at the right temperature for hours.” That…is basically right, and I’m not sure how comfortable I am with ChatGPT guessing the size correctly, either.
I moved on to my keyboard, which ChatGPT accurately stated was an Apple Magic Keyboard. I asked which keys it could see, and it named a handful, but not all, of the keys I had in frame. So, I asked how many keys it could see, and it said “about 30,” when there were actually 26. So, again, close.
It was able to identify the MagSafe port on my MacBook, as well as the two USB ports and the headphone jack to its right. It recognized the air vent in my ceiling, and the specific type of boots I had by my front door. All in all, it basically recognized everything I tested it on—minus the trackpad.
Advanced voice mode’s sight is fast
But beyond recognition, I think what startled me the most was the speed of these responses. You ask ChatGPT to identify something, and it does, sometimes quicker than if you asked a real person to do it. Sometimes, the bot will hold onto a word for a moment (e.g. “I thiiiiiiiiink that’s a…”), which is probably a trick to buy time while ChatGPT processes the rest of what it wants to say. I’ve also caught it being less sure of itself with its first response: I pointed it at my Magic Mouse, and its first guess was a generic computer mouse. But when I asked what brand it was, it didn’t just say Apple; it said it was an Apple Magic Mouse, known for its “sleek design” and “touch-sensitive surface.”
All things considered, though, these responses are often near-instantaneous, which speaks to how powerful OpenAI’s models are these days. I’m still largely an AI skeptic, but this was the first development in a while that impressed me—and I’m torn about how I feel about that.
On the one hand, I could see this tech being used for good. Imagine how helpful something like this could be for users who are blind or have impaired vision, especially in a convenient device like smart glasses. Someone could ask their AI assistant which direction they’re facing, have it read the menu at a restaurant, or check whether it’s safe to cross the street. Tech like this could change search for the better, and make it easy to learn new things about the world just by pointing our smartphone cameras at a subject.
On the flip side, my mind turns to the negative, especially since AI is still prone to hallucination. As more and more people use this technology, they will inevitably run into the mistakes AI can make, and if they’re relying on the bot to help them with tasks—especially anything involving their safety—those hallucinations can be dangerous. I didn’t experience any large errors myself, just the trackpad mixup, and Anderson Cooper found that the bot made a mistake on a geometry problem (again, not a huge issue). But it’s a good reminder that as this tech rapidly improves, its inherent flaws raise the stakes for failure.
Perhaps that’s why every live camera session warns you not to use the feature for anything involving safety.