Two years ago, when big-name Chinese technology companies like Baidu and Alibaba were chasing Silicon Valley's advances in artificial intelligence with splashy announcements and new chatbots, DeepSeek took a different approach. It zeroed in on research.
The strategy paid off.
The Chinese startup has jolted the tech world with its claim that it created a powerful AI model that was significantly cheaper to build than the offerings of its better-funded American rivals.
In the rivalry between China and the United States over domination of artificial intelligence, DeepSeek seemed to come out of nowhere. In fact, it has skyrocketed through China's tech world in recent years with a path that was anything but conventional.
Its mission to pursue research mirrors that of companies like OpenAI, the Silicon Valley firm that marked an American signature over AI in fall 2022. But the similarities mostly end there.
DeepSeek's origins are in finance, not technology for technology's sake. Its parent company, a Chinese hedge fund called High-Flyer, began not as a laboratory devoted to safeguarding humanity from AI like Open AI, but as a business using AI to make bets in the Chinese stock market.
High-Flyer had thrived by capitalizing on a market dominated by China's retail investors, who are known for jumping in and out of stocks impulsively. In 2021, High-Flyer found itself pressured by regulatory crackdowns in China on speculative trading, which authorities in Beijing felt was at odds with their attempts to keep markets calm.
So High-Flyer pursued a new opportunity that it said aligned better with Chinese government priorities: advanced AI.
"We want to do things with greater value and things that go beyond the investment industry, but it has been misinterpreted as AI stock speculation," High-Flyer CEO Lu Zhengzhe told Chinese state media in 2023. "We have set up a new team independent of investment, which is equivalent to a second startup."
DeepSeek was born. As with many other Chinese startups, DeepSeek came at an established market with a different business approach.
DeepSeek's latest model for artificial intelligence is believed to be nearly as powerful as American rivals but far more efficient. Its success suggests that Silicon Valley's AI lead has shrunk. DeepSeek's breakthrough, despite efforts by Washington to limit Chinese access to the advanced chips needed for AI, raises questions about how effective those controls can be over the long term -- although DeepSeek's founder has acknowledged that the chip restrictions are a limitation.
DeepSeek did not rely on making consumer-facing AI products for revenue, and only this month released its first chatbot, which allows anyone to generate text and photos with simple commands. Instead, the company used the money that High-Flyer made from stock trading to bankroll ambitious research. The approach set it apart from U.S. rivals, all of which are ultimately consumer technology companies.
This unconventional approach also allowed DeepSeek to sidestep stringent regulations the Chinese government has placed on AI use by the public. Because its focus was research and selling to businesses that use its model -- and, until the release of its chatbot this month, not consumer applications -- its early work did not face the same government restrictions.
DeepSeek is run by its CEO, Liang Wenfeng, a thin, bespectacled engineer who studied at Zhejiang University in the eastern city of Hangzhou. He has said repeatedly in the few interviews he has given to Chinese media that to catch up with American innovation, Chinese companies must put research before profits. DeepSeek and High-Flyer did not respond to requests for comment.
What Chinese technology companies "lack in innovation is certainly not capital, but a lack of confidence and knowledge about how to organize a high density of talent to achieve effective innovation," he said in a widely circulated interview with Chinese tech outlet 36Kr.
Those who have worked with Liang describe him as a capable manager with a deep technical background, according to interviews and public accounts.
"He's definitely an INTP," said Zihan Wang, a computer engineer who worked on an earlier DeepSeek model, referring to an introspective personality type from the Myers-Briggs test, a popular personality test among young people in China. "INTPs are really good researchers and they have a willingness to explore," Wang said. "He is not one of those people who wants to control everything."
Liang was not too bothered with details like project timelines, and occasionally sent thought-provoking research questions to the entire team of researchers, Wang said. But mostly, Liang seemed driven to advance the technology and was not focused on profits.
Unlike many Chinese companies, which tend to focus on hiring programmers, Liang has gained a reputation for employing people from outside computing. Poets and humanities majors from China's top universities on DeepSeek's staff train the model to write classical Chinese poetry and ace questions taken from the country's difficult college entrance examination.
"Most of the team graduated from the top universities in China," said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on the SGLang, a project not part of DeepSeek that helps people build on top of DeepSeek's system. "They are very smart and very young."
For years, Chinese tech companies pioneered AI applications used in computer vision, like facial recognition. But OpenAI's release of ChatGPT prompted a reckoning. When no Chinese company immediately released anything comparable, many concluded that American companies had a lead in advanced AI.
In China, computer scientists were determined to prove they could compete. In 2023, many companies in China released their own large language models, the technology that underpins chatbots like ChatGPT.
But making advanced models would require using a large number of chips that would cost hundreds of millions of dollars.
High-Flyer was spending, too. By 2021, it was one of just a handful of Chinese companies that had been able to stockpile more than 10,000 advanced Nvidia A100 chips.
Yet DeepSeek's research gave it a surprising advantage. Last year, it dramatically cut the prices it charged developers who build applications using its model, prompting a price war with larger rivals.
Wang said there was little discussion of commercial applications for the technology they were building. Instead, he said, the company was focused on making an AI system that could be used by a range of people for many purposes.
"During my time there, we did not talk much about how we make money," Wang said. "They just focused on making a great foundation model."
A crucial part of DeepSeek's popularity is that it has made its developers' work public. This kind of information sharing, called open source, has been a cornerstone of the development of computer software, the internet and now artificial intelligence.
In the United States, AI researchers and entrepreneurs have long followed the progress of DeepSeek's technology. Last year, the company turned heads when it released systems designed to generate their own computer programs.
A new challenge for the company may come with its new high profile. The same day it released R1, the model behind its new chatbot, last week, Liang appeared at a round table discussion with Li Qiang, China's premier.
DeepSeek's sudden popularity has thrust it to the center of the Chinese Communist Party's efforts to spur innovation, and that could prove difficult to manage, said Jimmy Goodrich, a senior adviser for technology analysis to the RAND Corp., a federally funded think tank. "It's a big predicament for DeepSeek. I'm sure they weren't on the government's five-year plan," he said.
"Can they maintain this chaotic carefree vision when both the party and the world is watching?" Goodrich asked.
Budget with ET
India, get ready for the modern warfare of tomorrow
Women, youth, farmers and poor can continue to be Budget 2025's ‘roti, kapda aur makan’
Modi govt has a key task in Budget 2025: Unlocking the PLI goldmine
The Chinese startup has jolted the tech world with its claim that it created a powerful AI model that was significantly cheaper to build than the offerings of its better-funded American rivals.
In the rivalry between China and the United States over domination of artificial intelligence, DeepSeek seemed to come out of nowhere. In fact, it has skyrocketed through China's tech world in recent years with a path that was anything but conventional.
Its mission to pursue research mirrors that of companies like OpenAI, the Silicon Valley firm that marked an American signature over AI in fall 2022. But the similarities mostly end there.
DeepSeek's origins are in finance, not technology for technology's sake. Its parent company, a Chinese hedge fund called High-Flyer, began not as a laboratory devoted to safeguarding humanity from AI like Open AI, but as a business using AI to make bets in the Chinese stock market.
High-Flyer had thrived by capitalizing on a market dominated by China's retail investors, who are known for jumping in and out of stocks impulsively. In 2021, High-Flyer found itself pressured by regulatory crackdowns in China on speculative trading, which authorities in Beijing felt was at odds with their attempts to keep markets calm.
So High-Flyer pursued a new opportunity that it said aligned better with Chinese government priorities: advanced AI.
"We want to do things with greater value and things that go beyond the investment industry, but it has been misinterpreted as AI stock speculation," High-Flyer CEO Lu Zhengzhe told Chinese state media in 2023. "We have set up a new team independent of investment, which is equivalent to a second startup."
DeepSeek was born. As with many other Chinese startups, DeepSeek came at an established market with a different business approach.
DeepSeek's latest model for artificial intelligence is believed to be nearly as powerful as American rivals but far more efficient. Its success suggests that Silicon Valley's AI lead has shrunk. DeepSeek's breakthrough, despite efforts by Washington to limit Chinese access to the advanced chips needed for AI, raises questions about how effective those controls can be over the long term -- although DeepSeek's founder has acknowledged that the chip restrictions are a limitation.
DeepSeek did not rely on making consumer-facing AI products for revenue, and only this month released its first chatbot, which allows anyone to generate text and photos with simple commands. Instead, the company used the money that High-Flyer made from stock trading to bankroll ambitious research. The approach set it apart from U.S. rivals, all of which are ultimately consumer technology companies.
This unconventional approach also allowed DeepSeek to sidestep stringent regulations the Chinese government has placed on AI use by the public. Because its focus was research and selling to businesses that use its model -- and, until the release of its chatbot this month, not consumer applications -- its early work did not face the same government restrictions.
DeepSeek is run by its CEO, Liang Wenfeng, a thin, bespectacled engineer who studied at Zhejiang University in the eastern city of Hangzhou. He has said repeatedly in the few interviews he has given to Chinese media that to catch up with American innovation, Chinese companies must put research before profits. DeepSeek and High-Flyer did not respond to requests for comment.
What Chinese technology companies "lack in innovation is certainly not capital, but a lack of confidence and knowledge about how to organize a high density of talent to achieve effective innovation," he said in a widely circulated interview with Chinese tech outlet 36Kr.
Those who have worked with Liang describe him as a capable manager with a deep technical background, according to interviews and public accounts.
"He's definitely an INTP," said Zihan Wang, a computer engineer who worked on an earlier DeepSeek model, referring to an introspective personality type from the Myers-Briggs test, a popular personality test among young people in China. "INTPs are really good researchers and they have a willingness to explore," Wang said. "He is not one of those people who wants to control everything."
Liang was not too bothered with details like project timelines, and occasionally sent thought-provoking research questions to the entire team of researchers, Wang said. But mostly, Liang seemed driven to advance the technology and was not focused on profits.
Unlike many Chinese companies, which tend to focus on hiring programmers, Liang has gained a reputation for employing people from outside computing. Poets and humanities majors from China's top universities on DeepSeek's staff train the model to write classical Chinese poetry and ace questions taken from the country's difficult college entrance examination.
"Most of the team graduated from the top universities in China," said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on the SGLang, a project not part of DeepSeek that helps people build on top of DeepSeek's system. "They are very smart and very young."
For years, Chinese tech companies pioneered AI applications used in computer vision, like facial recognition. But OpenAI's release of ChatGPT prompted a reckoning. When no Chinese company immediately released anything comparable, many concluded that American companies had a lead in advanced AI.
In China, computer scientists were determined to prove they could compete. In 2023, many companies in China released their own large language models, the technology that underpins chatbots like ChatGPT.
But making advanced models would require using a large number of chips that would cost hundreds of millions of dollars.
High-Flyer was spending, too. By 2021, it was one of just a handful of Chinese companies that had been able to stockpile more than 10,000 advanced Nvidia A100 chips.
Yet DeepSeek's research gave it a surprising advantage. Last year, it dramatically cut the prices it charged developers who build applications using its model, prompting a price war with larger rivals.
Wang said there was little discussion of commercial applications for the technology they were building. Instead, he said, the company was focused on making an AI system that could be used by a range of people for many purposes.
"During my time there, we did not talk much about how we make money," Wang said. "They just focused on making a great foundation model."
A crucial part of DeepSeek's popularity is that it has made its developers' work public. This kind of information sharing, called open source, has been a cornerstone of the development of computer software, the internet and now artificial intelligence.
In the United States, AI researchers and entrepreneurs have long followed the progress of DeepSeek's technology. Last year, the company turned heads when it released systems designed to generate their own computer programs.
A new challenge for the company may come with its new high profile. The same day it released R1, the model behind its new chatbot, last week, Liang appeared at a round table discussion with Li Qiang, China's premier.
DeepSeek's sudden popularity has thrust it to the center of the Chinese Communist Party's efforts to spur innovation, and that could prove difficult to manage, said Jimmy Goodrich, a senior adviser for technology analysis to the RAND Corp., a federally funded think tank. "It's a big predicament for DeepSeek. I'm sure they weren't on the government's five-year plan," he said.
"Can they maintain this chaotic carefree vision when both the party and the world is watching?" Goodrich asked.