DeepSeek v3.1 is not having a moment

https://news.ycombinator.com/rss Hits: 2

Summary

What if DeepSeek released a model claiming 66 on SWE and almost no one tried using it? Would it be any good? Would you be able to tell? Or would we get the shortest post of the year? Why We Haven’t Seen v4 or r2 Why are we settling for v3.1 and have yet to see DeepSeek release v4 or r2 yet? Eleanor Olcott and Zijing Wu: Chinese artificial intelligence company DeepSeek delayed the release of its new model after failing to train it using Huawei’s chips, highlighting the limits of Beijing’s push to replace US technology. DeepSeek was encouraged by authorities to adopt Huawei’s Ascend processor rather than use Nvidia’s systems after releasing its R1 model in January, according to three people familiar with the matter. But the Chinese start-up encountered persistent technical issues during its R2 training process using Ascend chips, prompting it to use Nvidia chips for training and Huawei’s for inference, said the people. The issues were the main reason the model’s launch was delayed from May, said a person with knowledge of the situation, causing it to lose ground to rivals. The real world so often involves people acting so much stupider than you could write into fiction. America tried to sell China H20s and China decided they didn’t want them and now Nvidia is halting related orders with suppliers. DeepSeek says that the main restriction on their development is lack of compute, and the PRC responds not by helping them get better chips but by advising them to not use the chips that they have, greatly slowing things down at least for a while. Introducing DeepSeek v3.1 In any case, DeepSeek v3.1 exists now, and remarkably few people care? DeepSeek: Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 🧠 Hybrid inference: Think & Non-Think — one model, two modes ⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 🛠️ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks Try it now — toggle Think/Non...

First seen: 2025-08-22 21:29

Last seen: 2025-08-22 22:30

Read Full Article More from this Source

DeepSeek v3.1 is not having a moment

Summary

Related News

FFmpeg 8.0

Glyn: Type-safe PubSub and Registry for Gleam actors with distributed clustering

Launch HN: BlankBio (YC S25) - Making RNA Programmable

70% of Japan smartphone games bypass in-app payment to avoid US tech giants

Leaving Gmail for Mailbox.org