Home > News > ChatGPT Maker Suspects China’s Dirt Cheap DeepSeek AI Models Were Built Using OpenAI Data — and the Irony Is Not Lost on the Internet

ChatGPT Maker Suspects China’s Dirt Cheap DeepSeek AI Models Were Built Using OpenAI Data — and the Irony Is Not Lost on the Internet

Author:Kristen Update:Mar 14,2025

OpenAI has voiced concerns that China's DeepSeek AI models, known for their remarkably low cost, may have been developed using OpenAI's data. This has prompted strong reactions, with Donald Trump calling DeepSeek a wake-up call for the U.S. tech industry following a significant drop in Nvidia's market value—a loss of nearly $600 billion.

The emergence of DeepSeek triggered a sharp decline in the stock prices of major AI companies. Nvidia, a key player in the GPU market crucial for AI model operation, experienced a dramatic 16.86% fall—the largest single-day loss in Wall Street history. Microsoft, Meta Platforms, Alphabet (Google's parent company), and Dell Technologies also saw substantial decreases.

DeepSeek's R1 model is marketed as a significantly cheaper alternative to Western AI models like ChatGPT, built upon the open-source DeepSeek-V3. This model reportedly requires less computing power and had an estimated training cost of just $6 million, although this figure has been contested. Despite the debate, DeepSeek's low cost and reported effectiveness have raised questions about the massive investments made by American tech companies in AI, unsettling investors and propelling the DeepSeek app to the top of U.S. download charts.

Bloomberg reported that OpenAI and Microsoft are investigating whether DeepSeek utilized OpenAI's API to integrate OpenAI's AI models into its own, a practice considered a violation of OpenAI's terms of service. OpenAI confirmed its awareness of such attempts by Chinese and other companies to leverage leading U.S. AI models through distillation—a technique for training AI models by extracting data from larger ones. OpenAI emphasized its commitment to protecting its intellectual property and collaborating with the U.S. government to safeguard its technology.

David Sacks, President Trump's AI czar, stated that evidence suggests DeepSeek employed distillation to extract knowledge from OpenAI models, a move OpenAI reportedly finds unacceptable. He anticipates countermeasures from leading AI companies in the coming months to prevent similar actions.

DeepSeek is accused of using OpenAI’s model to train its competitor using distillation. Image credit: Andrey Rudakov/Bloomberg via Getty Images.
DeepSeek is accused of using OpenAI’s model to train its competitor using distillation. Image credit: Andrey Rudakov/Bloomberg via Getty Images.

The situation has highlighted the irony of OpenAI's accusations, given previous criticisms of OpenAI itself for allegedly using copyrighted internet content to train ChatGPT. This hypocrisy was noted by tech writer Ed Zitron, who pointed out OpenAI's reliance on vast amounts of internet data in its model's creation.

OpenAI previously acknowledged its dependence on copyrighted material for training its models, stating in a submission to the UK's House of Lords that creating AI tools like ChatGPT without copyrighted material would be impossible. This statement follows lawsuits from the New York Times and 17 authors alleging the unlawful use of their work by OpenAI and Microsoft. OpenAI maintains that its training practices constitute "fair use." The debate underscores the complex legal and ethical challenges surrounding the use of copyrighted material in AI model training, further complicated by a 2018 U.S. Copyright Office ruling that AI-generated art cannot be copyrighted.