Nvidia rival claims DeepSeek world record as it delivers industry-first performance with 95% fewer chips

By Wayne Williams

SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
The SN40L RDU chip is reportedly 3X faster, 5X more efficient than GPUs
5X speed boost is promised soon, with 100X capacity by year-end on cloud

Chinese AI upstart DeepSeek has quickly made a name for itself in 2025. Its R1 model, a large open-source language model built for advanced reasoning tasks, performs on par with the industry’s top models while being more cost-efficient.

SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.

The company says it has achieved 198 tokens per second, per user, using just 16 custom-built chips, replacing the 40 racks holding a total of 320 Nvidia GPUs that would typically be required.
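The headline's "95% fewer chips" figure follows directly from the chip counts quoted above. As a rough check, the arithmetic can be sketched as follows; the function and variable names are illustrative, and the only inputs are the numbers SambaNova itself cites.

```python
def percent_reduction(baseline: int, replacement: int) -> float:
    """Return how much smaller `replacement` is than `baseline`, in percent."""
    return 100 * (baseline - replacement) / baseline


# Figures as quoted in the article (SambaNova's claims, not measurements).
GPU_COUNT = 320   # Nvidia GPUs said to be typically required
GPU_RACKS = 40    # racks said to house those GPUs
RDU_COUNT = 16    # SambaNova SN40L chips

reduction = percent_reduction(GPU_COUNT, RDU_COUNT)
gpus_per_rack = GPU_COUNT / GPU_RACKS

print(f"Chip reduction: {reduction:.0f}%")
print(f"Implied GPUs per rack in the comparison: {gpus_per_rack:.0f}")
```

The comparison counts chips only. It says nothing about cost, power draw, or throughput per chip, which the article covers separately.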

Independently verified

“Powered by the SN40L RDU chip, SambaNova is the fastest platform running DeepSeek,” said Rodrigo Liang, CEO and co-founder of SambaNova. “This will increase to 5X faster than the latest GPU speed on a single rack – and by year-end, we will offer 100X capacity for DeepSeek-R1.”

While Nvidia’s GPUs have traditionally powered large AI workloads, SambaNova argues that its reconfigurable dataflow architecture offers a more efficient solution. The company claims its hardware delivers three times the speed and five times the efficiency of leading GPUs while maintaining the full reasoning power of DeepSeek-R1.

“DeepSeek-R1 is one of the most advanced frontier AI models available, but its full potential has been limited by the inefficiency of GPUs,” said Liang. “That changes today. We’re bringing the next major breakthrough – collapsing inference costs and reducing hardware requirements from 40 racks to just one – to offer DeepSeek-R1 at the fastest speeds, efficiently.”

George Cameron, co-founder of AI evaluation firm Artificial Analysis, said his company had “independently benchmarked SambaNova’s cloud deployment of the full 671 billion parameter DeepSeek-R1 Mixture of Experts model at over 195 output tokens/s, the fastest output speed we have ever measured for DeepSeek-R1. High output speeds are particularly important for reasoning models, as these models use reasoning output tokens to improve the quality of their responses. SambaNova’s high output speeds will support the use of reasoning models in latency-sensitive use cases.”
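To see why output speed matters for reasoning models, consider how long a user waits while the model generates its reasoning tokens. The sketch below is a back-of-the-envelope estimate only: the 2,000-token reasoning length is a hypothetical value chosen for illustration, not a figure from SambaNova or Artificial Analysis, and the calculation ignores network latency and time to first token.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Time to stream `output_tokens` at a steady per-user output rate."""
    return output_tokens / tokens_per_second


# Lower bound of the rate Artificial Analysis reports measuring.
MEASURED_RATE = 195.0  # output tokens per second, per user

# Hypothetical length of a reasoning trace; an assumption for illustration.
ASSUMED_REASONING_TOKENS = 2_000

wait = generation_seconds(ASSUMED_REASONING_TOKENS, MEASURED_RATE)
print(f"{ASSUMED_REASONING_TOKENS} tokens at {MEASURED_RATE:.0f} tokens/s: {wait:.1f} s")
```

Because the wait scales inversely with output rate, halving the rate would double the delay for the same number of reasoning tokens.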

DeepSeek-R1 671B is now available on SambaNova Cloud, with API access offered to select users. The company is scaling capacity rapidly, and says it hopes to reach 20,000 tokens per second of total rack throughput “in the near future”.

DeepSeek R1 on SambaNova (Image credit: Artificial Analysis)

This article is written by: Nermeen Nabil Khear Abdelmalak
