Breaking
June 5, 2025

Anthropic’s Promises Its New Claude AI Models Are Less Likely to Try to Deceive You David Nield | usagoldmines.com

While it doesn’t have quite the same prominence as ChatGPT or Google Gemini, the Claude AI bot developed by Anthropic continues to improve and innovate. Brand new Claude 4 models are now available, promising upgrades in coding, reasoning, precision, and the ability to manage long-running tasks independently.

There are two new models, Claude Opus 4 and Claude Sonnet 4, and Anthropic says they’re both “setting new standards” for what you can expect from AI. Coding is a big focus, and the models are said to have achieved the highest scores to date on two widely used AI coding benchmarking tools, SWE-bench and Terminal-bench. Claude 4 models can actually work for hours on projects without any user input, Anthropic says.

The updated models are better at handling more steps across more complex tasks, debugging their own work, and solving tricky problems along the way. They should also follow user instructions more exactly, and create end results that look better and work more reliably. Anthropic quotes partners such as GitHub, Cursor, and Rakuten in explaining how much of a step forward these models are.

Away from code generation and analysis, the models also bring with them extended thinking, the ability to work on multiple tasks in parallel, and improved memory. They’re better at integrating web searches as needed, and to check for supporting information and make sure they’re on the right track with their answers.

Claude 4 coding chart
New AI model launches usually come with benchmark charts showing improvements—and this one is no different.
Credit: Anthropic

Also new are “thinking summaries” that give more insight into how Claude 4 has reached its conclusions, and an “extended thinking” feature, launching in beta, that lets you force the AI bot to take more time mulling over its responses.

Anthropic is now making its Claude Code suite of tools available more generally as well, another step towards agentic AI that can work autonomously, without continuous help from flesh and blood users. In a demo video, Claude 4 models are shown compiling research papers from the web, putting together an online ordering system, and extracting information from documents to create actionable tasks.

Claude 4 is available now (but you’ll need to pay for the more advanced model)

The Claude Sonnet 4 model, which is faster and doesn’t have quite the same capacity in terms of thinking, coding, and memory, is available now to all Claude users. The more advanced Claude Opus 4, which also includes extra tools and integrations, is available to users on any of Anthropic’s paid subscriptions.

The path to releasing these Claude 4 models wasn’t all smooth: Anthropic says its safety advice partner warned against releasing earlier versions of the models because of their tendency to “‘scheme’ and deceive.” Those issues have now been worked out, apparently, but it’s a reminder that as AI models get increasingly powerful, they also need to come with improved guardrails and safety features attached.

New Claude 4 models
The new models are available inside Claude now.
Credit: Lifehacker

I’m not really a coder, so I can’t comment with any real authority on the primary upgrades included with Claude 4, but I have been able to test out the extended reasoning and thinking capabilities of Claude Sonnet 4 and Claude Opus 4. These capabilities aren’t easy to quantify or measure, but all the responses I got were well written and well presented, and as far as I could tell provided accurate information, with online citations.

To be honest, I’m always a bit stuck when it comes to how to make full use of AI chatbots and their latest upgrades. They can definitely save time when running certain web searches and researching topics online, but I don’t fully trust the results, or AI’s ability to decide what is relevant and what isn’t—I’d still much rather do the reading and summarizing myself, even if it’s slower.

Claude 4 extended thinking
There’s a new Extended Thinking Mode you can make use of.
Credit: Lifehacker

Maybe I need to start a coding project and see how far I can get on vibes alone. I did ask Claude Opus 4 to build me a simple HTML time tracker I could run in a browser tab, to make sure I wasn’t spending too much time distracted during the day. It did the job in a couple of minutes, and produced something that worked well, closely matching the instructions I gave. While it functioned fine, Claude 4 reported a couple of errors along the way, which of course I didn’t understand—I guess I can ask the AI about them.

Anthropic isn’t the only AI company with new models to tout. At Google I/O 2025 earlier this week, the company unveiled improved coding assistance and thought summaries in Gemini, following on from the announcement of its best AI models yet a few weeks ago. OpenAI, meanwhile, has been testing its GPT-4.5 model since February, touting improvements in coding and problem solving.

 

This articles is written by : Nermeen Nabil Khear Abdelmalak

All rights reserved to : USAGOLDMIES . www.usagoldmines.com

You can Enjoy surfing our website categories and read more content in many fields you may like .

Why USAGoldMines ?

USAGoldMines is a comprehensive website offering the latest in financial, crypto, and technical news. With specialized sections for each category, it provides readers with up-to-date market insights, investment trends, and technological advancements, making it a valuable resource for investors and enthusiasts in the fast-paced financial world.

Recent:

Disney’s free streaming ‘perks’ are just insulting | usagoldmines.com

Get these ultra-fast USB-C cables on sale, now 2 for only $12 | usagoldmines.com

Five Shows to Watch While You Wait for the Next Season of 'Hacks' Stephen Johnson | usagoldmines.com

Someone Built an AI Agent for the iPhone Before Apple Could David Nield | usagoldmines.com

iPhone Users Say Mail App Suddenly Showing Blank Screen on iOS 18.5 Joe Rossignol | usagoldmines.com

Amazon Takes Up to $65 Off 11th Gen iPad, Starting at $299 Mitchel Broussard | usagoldmines.com

Apple Arcade Adding Four More Games, Including Angry Birds Bounce Joe Rossignol | usagoldmines.com

More than 3 million records, 12TB of data exposed in major app builder breach | usagoldmines.com

Silent Hill f gets an official release date and a creepy PS5 gameplay trailer | usagoldmines.com

NYT Connections hints and answers for Friday, June 6 (game #726) | usagoldmines.com

NYT Strands hints and answers for Friday, June 6 (game #460) | usagoldmines.com

Quordle hints and answers for Friday, June 6 (game #1229) | usagoldmines.com

Can UK businesses balance AI ambitions with sustainability obligations? | usagoldmines.com

Your Amazon delivery person might soon be a robot, which isn't as terrible as it sounds lance.ulanof...

AI is growing up: how to guide it from experimental child to trusted enterprise adult | usagoldmine...

The best free VPNs: 5 no-cost top picks | usagoldmines.com

Want stronger online security? Think like Gen Z | usagoldmines.com

9 menial tasks ChatGPT can handle for you in seconds, saving hours | usagoldmines.com

Today’s best laptop deals: Save big on work, school, home use, and gaming | usagoldmines.com

This Anker docking station doubles as a monitor stand and it’s 20% off | usagoldmines.com

Alienware’s elegant wireless gaming mouse is down to its best-ever price | usagoldmines.com

This Tool for Runners Quickly Measures the Incline of Any Hill Beth Skwarecki | usagoldmines.com

The Google Pixel Tablet Is $140 Off Right Now Pradershika Sharma | usagoldmines.com

Apple Study: App Store Ecosystem Generated $1.3 Trillion Globally in 2024 Juli Clover | usagoldmines...

Take Control of Favicons in Safari's Favorites Bar Tim Hardwick | usagoldmines.com

Ballerina star Norman Reedus didn't seek advice from Keanu Reeves about joining the John Wick univer...

Update Chrome now! Your PC is at risk from this zero-day exploit | usagoldmines.com

OnePlus Pad 3 Official in US for $699 With Specs Worth Tasting Kellen | usagoldmines.com

'Saucy' Is the Perfect Cookbook to Elevate an Underwhelming Meal Allie Chanthorn Reinmann | usagoldm...

ChatGPT Now Integrates with Dropbox, Google Drive for Business Tim Hardwick | usagoldmines.com

These new robot lawn mowers use self-driving car tech to navigate | usagoldmines.com

The end of Intel Macs? The latest macOS 16 rumors have me worried about my 2018 MacBook Pro mark.wil...

Sennheiser's new USB Hi-Res Audio dongle can upgrade your Mac, iPhone or PC with aptX Lossless and B...

The world’s best travel camera is rumored to be getting an upgrade soon, with a potentially pricey n...

Intel’s Nova Lake processors rumored to have unique hybrid architecture – are we moving away from di...

Anthropic’s new AI-written blog is more of a technical treat than a literary triumph erichs211@gmail...

Nothing confirms that its first over-ear headphones will be unveiled next month, alongside the Nothi...

AirPods said to get some nice free upgrades at WWDC 2025, including more gesture control and sleep d...

ChatGPT can now listen in to your work calls, connect to your company Google Drive and much more | ...

Microsoft’s Surface Pro pricing is a ripoff | usagoldmines.com

Hard drive, SSD, or USB flash drive: Which portable storage is right for you? | usagoldmines.com

WhatsApp Testing AI Chatbot Creation Feature and Usernames Tim Hardwick | usagoldmines.com

Nioh 3 has been announced for 2026, but PS5 owners can play an exclusive demo right now | usagoldmi...

Will your iPhone get iOS 26? This is the rumored support list for the rebranded iOS 19 | usagoldmin...

Google's new Gemini Catch me up tool will tell you if anyone has been editing your precious work fil...

Wish your Windows 11 laptop had better battery life? Microsoft is working on a new power-saving tric...

Wicked: For Good trailer teases Dorothy's arrival in the Land of Oz, and it's making me want to stre...

FDA rushed out agency-wide AI tool—it’s not going well Beth Mole | usagoldmines.com

iPhone 17 May Support Up to 50W MagSafe Wireless Charging (Qi 2.2) Tim Hardwick | usagoldmines.com

The first trailer for 007 First Light reveals a young James Bond and it's coming to PC and console i...

The Google Pixel 10 series colors have leaked in full – and two old favorites are missing | usagold...

Microsoft launches free cybersecurity protection for European governments against AI threats and mor...

How AI can help experts protect their mental health | usagoldmines.com

The Samsung Galaxy Z Fold 7 could have a huge screen with tiny bezels | usagoldmines.com

Exclusive 28 Years Later character video teases bone-chilling new details about Ralph Fiennes' Docto...

Fake IT support voice calls lead to cyber extortion and stolen company data | usagoldmines.com

I haven’t seen ads in years thanks to this hack | usagoldmines.com

The best small wireless stereo speakers just got upgraded with better sound in the same great-lookin...

Beyond AI-powered cybersecurity: why context and visibility are still a CISO’s top priority | usago...

WWDC 2025: New Features We Could See in watchOS 26 Juli Clover | usagoldmines.com

Malware affiliate pyramid scheme is shuttered by US feds: here's how to keep safe | usagoldmines.co...

The Nintendo Switch 2 launch mania makes me miss the early iPhone launch days lance.ulanoff@futurene...

One of world's largest oil companies just launched a unique cooling fluid for data centers and AI ch...

Best PC computer deals: Top picks from desktops to all-in-ones | usagoldmines.com

Android 16 QPR1 Beta 1.1 Released for Pixel Devices Tim | usagoldmines.com

How Old Is Too Old When Buying an Apple Watch? Lindsey Ellefson | usagoldmines.com

Court Rejects Apple's Emergency Motion to Pause App Store Rule Changes Juli Clover | usagoldmines.co...

US science is being wrecked, and its leadership is fighting the last war John Timmer | usagoldmines....

New filament lets you 3D-print parts in authentic 1980s Apple computer color Benj Edwards | usagoldm...

Samsung Slaps $1,000 Off Galaxy Z Fold 6 Kellen | usagoldmines.com

How to Reset Your Nintendo Switch Before You Sell It Eric Ravenscraft | usagoldmines.com

Meta Apps Have Been Covertly Tracking Android Users' Web Activity for Months Jake Peterson | usagold...

Google plans to get its AI to write your emails for you erichs211@gmail.com (Eric Hal Schwartz) | us...

FCC Republican resigns, leaving agency with just two commissioners Jon Brodkin | usagoldmines.com

Jared Isaacman speaks out, and it’s clear that NASA lost a visionary leader Eric Berger | usagoldmin...

Pixel 10 Color Confusion Arrives Because, Why Not? Kellen | usagoldmines.com

Colors and Storage Options for Samung’s Upcoming Foldable Lineup Revealed Tim | usagoldmines.com

You Can Now Curate Your Public Reddit Profile Emily Long | usagoldmines.com

The Nothing Phone 3 Has a Launch Date, but I'm Not Sure the Price Is Right Jake Peterson | usagoldmi...

GhatGPT Can Now Remember Conversations for Free Users Too Khamosh Pathak | usagoldmines.com

iOS 26 Could Bring Sleep Detection, Camera Controls, and New Gestures to AirPods Juli Clover | usago...

Ready, set, gone: why popups, freezing, and tiny text are causing millions of app users to jump ship...

Remember The Simpsons Funday Football tie-in? Sony’s new NHL deal could see more animated heroes on ...

A new 'Wikipedia for extensions' wants to make your web browser far more secure by exposing dangerou...

American Science & Surplus is fighting for its life. Here’s why you should care. Eric Bangeman |...

OpenAI slams court order to save all ChatGPT logs, including deleted chats Ashley Belanger | usagold...

Samsung's ‘Goldilocks’ Galaxy phone may have set the standard for Apple’s iPhone 17 Air to chase | ...

Meta basically just bought a nuclear power plant | usagoldmines.com

If you haven't considered this super high-end bed with inbuilt KEF speakers, do you even love music?...

Lawsuit: DOGE, HHS used “hopelessly error-ridden” data to fire 10,000 workers Jon Brodkin | usagoldm...

It’s here: Unboxing and setting up our Switch 2 review unit Kyle Orland | usagoldmines.com

Alienware gets bricked (in a good way) with custom Lego set | usagoldmines.com

How to Watch Pornhub Even If It's Blocked In Your State David Nield | usagoldmines.com

Android Users Will Finally Be Able to Sync Their Garmin Fitness Data Meredith Dietz | usagoldmines.c...

Watch Out for Fake Websites Posing As Booking.com Emily Long | usagoldmines.com

How to Protect Your Car From Identity Theft Jeff Somers | usagoldmines.com

Cybercriminals are using SEO to get popular fake AI tools loaded with malware to rank high on Google...

Disney+ confirms release date for the Rachel Zegler led Snow White movie after its disappointing box...

Review: At $349, AMD’s 16GB Radeon RX 9060 XT is the new midrange GPU to beat Andrew Cunningham | us...

Are Dead Sea Scrolls older than we thought? Jennifer Ouellette | usagoldmines.com