A new super-powered, open-source AI model called DeepSeek R1 is rattling the industry this week, after it was unexpectedly dropped into the laps of artificial intelligence experts — and the world — with seemingly valid challenges to OpenAI’s expensive AI model.
Released by Chinese AI startup DeepSeek, the DeepSeek R1 advanced reasoning model purports to outperform the most popular large language models (LLMs), including OpenAI’s o1. According to the company, DeepSeek R1 bested these black box offerings in several important benchmarks, and has a particular talent at mathematical, coding, and reasoning tasks, Mashable’s Stan Schroeder reports. Schroeder’s own tests have shown that it holds its own against rival ChatGPT in complex coding tasks.
Why DeepSeek is hitting tech stocks hard, including Nvidia’s
“In the background of all this are training costs which are orders of magnitude lower than for some competing models, as well as chips which aren’t as powerful as the chips that are on disposal for U.S. AI companies,” wrote Schroeder. “DeepSeek thus shows that extremely clever AI with reasoning ability doesn’t have to be extremely expensive to train — or to use.”
Support authors and subscribe to content
This is premium stuff. Subscribe to read the entire article.