Clicky

  • Login
  • Register
  • Submit Your Content
  • Contact Us
Tuesday, September 30, 2025
World Tribune
No Result
View All Result
  • Home
  • News
  • Business
  • Technology
  • Sports
  • Health
  • Food
Submit
  • Home
  • News
  • Business
  • Technology
  • Sports
  • Health
  • Food
No Result
View All Result
World Tribune
No Result
View All Result

What’s new in DeepSeek’s latest model: DeepSeek-V3.2-Exp

September 30, 2025
in News
Reading Time: 4 mins read
A A
What’s new in DeepSeek’s latest model: DeepSeek-V3.2-Exp
0
SHARES
ShareShareShareShareShare

READ ALSO

YouTube to pay Trump $24.5 Million to settle lawsuit over suspension

Wildfire Burns Over 1.9 Million Acres in Etosha National Park

Anna Barclay | Getty Images News | Getty Images

Chinese startup DeepSeek’s latest experimental model promises to increase efficiency and improve AI’s ability to handle a lot of information at a fraction of the cost, but questions remain over how effective and safe the architecture is.  

DeepSeek sent Silicon Valley into a frenzy when it launched its first model R1 out of nowhere last year, showing that it’s possible to train large language models (LLMs) quickly, on less powerful chips, using fewer resources.

The company released DeepSeek-V3.2-Exp on Monday, an experimental version of its current model DeepSeek-V3.1-Terminus, which builds further on its mission to increase efficiency in AI systems, according to a post on the AI forum Hugging Face.

“DeepSeek V3.2 continues the focus on efficiency, cost reduction, and open-source sharing,” Adina Yakefu, Chinese community lead at Hugging Face, told CNBC. “The big improvement is a new feature called DSA (DeepSeek Sparse Attention), which makes the AI better at handling long documents and conversations. It also cuts the cost of running the AI in half compared to the previous version.”

“It’s significant because it should make the model faster and more cost-effective to use without a noticeable drop in performance,” said Nick Patience, vice president and practice lead for AI at The Futurum Group. “This makes powerful AI more accessible to developers, researchers, and smaller companies, potentially leading to a wave of new and innovative applications.”

The pros and cons of sparse attention 

An AI model makes decisions based on its training data and new information, such as a prompt. Say an airline wants to find the best route from A to B, while there are many options, not all are feasible. By filtering out the less viable routes, you dramatically reduce the amount of time, fuel and, ultimately, money, needed to make the journey. That is exactly sparse attention does, it only factors in data that it thinks is important given the task at hand, as opposed to other models thus far which have crunched all data in the model.

“So basically, you cut out things that you think are not important,” said Ekaterina Almasque, the cofounder and managing partner of new venture capital fund BlankPage Capital.

Sparse attention is a boon for efficiency and the ability to scale AI given fewer resources are needed, but one concern is that it could lead to a drop in how reliable models are due to the lack of oversight in how and why it discounts information.

“The reality is, they [sparse attention models] have lost a lot of nuances,” said Almasque, who was an early supporter of Dataiku and Darktrace, and an investor in Graphcore. “And then the real question is, did they have the right mechanism to exclude not important data, or is there a mechanism excluding really important data, and then the outcome will be much less relevant?”

This could be particularly problematic for AI safety and inclusivity, the investor noted, adding that it may not be “the optimal one or the safest” AI model to use compared with competitors or traditional architectures. 

DeepSeek, however, says the experimental model works on par with its V3.1-Terminus. Despite speculation of a bubble forming, AI remains at the centre of geopolitical competition with the U.S. and China vying for the winning spot. Yakefu noted that DeepSeek’s models work “right out of the box” with Chinese-made AI chips, such as Ascend and Cambricon, meaning they can run locally on domestic hardware without any extra setup.

What’s new in DeepSeek’s latest model: DeepSeek-V3.2-Exp

DeepSeek also shared the actual programming code and tools needed to use the experimental model, she said. “This means other people can learn from it and build their own improvements.”

But for Almasque, the very nature of this means the tech may not be defensible. “The approach is not super new,” she said, noting the industry has been “talking about sparse models since 2015” and that DeepSeek is not able to patent its technology due to being open source. DeepSeek’s competitive edge, therefore, must lie in how it decides what information to include, she added.

The company itself acknowledges V3.2-Exp is an “intermediate step toward our next-generation architecture,” per the Hugging Face post.

As Patience pointed out, “this is DeepSeek’s value prop all over: efficiency is becoming as important as raw power.”

“DeepSeek is playing the long game to keep the community invested in their progress,” Yakefu added. “People will always go for what is cheap, reliable, and effective.”

Credit: Source link

ShareTweetSendSharePin
Previous Post

The Logitech MX Master 4 is here with haptic feedback, less rubber and the same shape

Related Posts

YouTube to pay Trump .5 Million to settle lawsuit over suspension
News

YouTube to pay Trump $24.5 Million to settle lawsuit over suspension

September 30, 2025
Wildfire Burns Over 1.9 Million Acres in Etosha National Park
News

Wildfire Burns Over 1.9 Million Acres in Etosha National Park

September 30, 2025
Etsy pops 13% as OpenAI announces ChatGPT Instant Checkout for the shopping site
News

Etsy pops 13% as OpenAI announces ChatGPT Instant Checkout for the shopping site

September 29, 2025
Anthropic launches Claude Sonnet 4.5, its latest AI model
News

Anthropic launches Claude Sonnet 4.5, its latest AI model

September 29, 2025
EA going private in deal that will pay shareholders 0 a share
News

EA going private in deal that will pay shareholders $210 a share

September 29, 2025
From Elon Musk to Microsoft’s Satya Nadella, these tech leaders were once H-1B visa holders 
News

From Elon Musk to Microsoft’s Satya Nadella, these tech leaders were once H-1B visa holders 

September 29, 2025

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

What's New Here!

The Fed ‘desperately’ wants to avoid a recession because it doesn’t want to get blamed: Zandi

The Fed ‘desperately’ wants to avoid a recession because it doesn’t want to get blamed: Zandi

September 13, 2025
Texas A&M fires English professor over children’s literature course that critics called ‘DEI and LGBTQ indoctrination’

Texas A&M fires English professor over children’s literature course that critics called ‘DEI and LGBTQ indoctrination’

September 11, 2025
Aaron Judge’s Yankees outfield situation still has more questions than answers

Aaron Judge’s Yankees outfield situation still has more questions than answers

September 3, 2025
China to stop claiming special WTO benefits that rankled U.S.

China to stop claiming special WTO benefits that rankled U.S.

September 24, 2025
CoreWeave stock jumps on disclosure of .3 billion order from Nvidia

CoreWeave stock jumps on disclosure of $6.3 billion order from Nvidia

September 15, 2025
Professors think students are prepared for the workforce— nearly half of students disagree and feel unready even for entry-level roles

Professors think students are prepared for the workforce— nearly half of students disagree and feel unready even for entry-level roles

September 15, 2025
Meet all 33 Silicon Valley power players at Trump’s high-profile tech dinner — and Elon Musk’s explanation for why he wasn’t there

Meet all 33 Silicon Valley power players at Trump’s high-profile tech dinner — and Elon Musk’s explanation for why he wasn’t there

September 5, 2025

About

World Tribune is an online news portal that shares the latest news on world, business, health, tech, sports, and related topics.

Follow us

Recent Posts

  • What’s new in DeepSeek’s latest model: DeepSeek-V3.2-Exp
  • The Logitech MX Master 4 is here with haptic feedback, less rubber and the same shape
  • New Jets day still looks far away in missed Aaron Glenn chance
  • Dolphins’ Darren Waller double dips in NFL return against Jets

Newslatter

Loading
  • Submit Your Content
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2024 World Tribune - All Rights Reserved!

No Result
View All Result
  • Home
  • News
  • Business
  • Technology
  • Sports
  • Health
  • Food

© 2024 World Tribune - All Rights Reserved!

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms below to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In