{"id":95191,"date":"2025-02-20T15:58:47","date_gmt":"2025-02-20T20:58:47","guid":{"rendered":"https:\/\/glgstagingsite.com\/?post_type=insight-article&#038;p=95191"},"modified":"2025-11-04T12:56:21","modified_gmt":"2025-11-04T17:56:21","slug":"deepseek-impact-on-the-ai-market","status":"publish","type":"insight-article","link":"https:\/\/glginsights.com\/ja\/articles\/deepseek-impact-on-the-ai-market\/","title":{"rendered":"DeepSeek \u2013 AI\u5e02\u5834\u3078\u306e\u5f71\u97ff"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"95191\" class=\"elementor elementor-95191 elementor-70649\" data-elementor-post-type=\"insight-article\">\n\t\t\t\t<div class=\"elementor-element elementor-element-acd0ea1 e-flex e-con-boxed e-con e-parent\" data-id=\"acd0ea1\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-2de517f elementor-widget elementor-widget-text-editor\" data-id=\"2de517f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>As companies around the world race to keep up with the latest developments in AI, Chinese firm DeepSeek jolted the space on January 23 when it debuted its \u201cR1\u201d large language model (LLM), sending reverberations across markets.<\/p><p>To understand the wide-ranging impact of DeepSeek-R1\u2019s launch, GLG\u2019s Evan Moore sat down with industry veteran William Fong, PhD, whose more than 26 years of experience at Microsoft included leading the company\u2019s AI and digital transformation efforts.<\/p><p>Here\u2019s a summary of the key takeaways from the conversation:<\/p><hr \/><h2>Can you unpack the \u201cmix-of-experts\u201d approach that DeepSeek used? What are its advantages and pitfalls compared to conventional large language models?<\/h2><p>Generally, you have a smaller language model, train it to be a specialized expert (like a mathematician, for example), and then, when you want to ask questions about that subject, you go to that specific small language model. But DeepSeek has managed to do that within the large model they published. They have multiple types of these experts within their 671 billion parameters. That is an advantage for users, because you don&#8217;t need to go to multiple models \u2013 you just go to one and you get really high-definition results from it.<\/p><p>There are other advantages. If you\u2019re looking at an expert inside the model, you&#8217;re not using all the parameters. You\u2019re only using whatever part that expert happens to live in. Your latency, your inferencing, your costs \u2013 all that goes down. You don&#8217;t need high-intensity chips to randomly take a stab in the dark on your 671 billion parameters. You know exactly where to go, because it knows exactly what your question is about.<\/p><p>There are some disadvantages, too. As you start to have more and more experts, computing gets complicated, expensive, and clogged up, because you have multiple experts struggling to do the work, and multiple parameters live at a time, because you&#8217;re not serving just one person. You have to remember this: You\u2019re serving whoever is inferencing from that model, whoever, in any given instant, happens to be on a server somewhere. If you have multiple experts all working together, it can get very difficult to manage and coordinate.<\/p><p>It&#8217;s a balance. With DeepSeek, you get flexibility and adaptability in one model that does a lot, especially when you have multiple experts that you&#8217;ve trained specifically. But it gets difficult to maintain. And as you grow the number of experts, the amount of computing you use and the traffic controlling gets confusing as well. But overall, what they did is an absolute plus, in my opinion.<\/p><h2>How do you think DeepSeek&#8217;s cost efficiency could impact Generative AI spending levels, model API revenue generation, and chip demand?<\/h2><p>I\u2019m very skeptical of the cost they&#8217;ve mentioned, which I don\u2019t think is equivalent to the CapEx of a company. They&#8217;re just saying, &#8220;This is the computing cost.&#8221; It&#8217;s only what they&#8217;ve done to organize their training materials, pre-train the materials into their model, and do some fine-tuning.<\/p><p>They haven\u2019t told us the costs regarding where they got the data from. That data is expensive. There&#8217;s no disclosed cost in terms of everything around it . Here&#8217;s the point: Even if the overall cost is 4-5 times as expensive \u2013 which includes the cost of data, overhead, etc. \u2013 OpenAI is charging for Operator at $200 per user per month, the pricing absolutely will change, because you&#8217;re not going to be able to charge that moving forward.<\/p><p>You can write an operator at a much cheaper price than what Open AI\u2019s Operator is. To be fair, Open AI\u2019s Operator is a little different because their Operator can see your screen, you can browse and do things like that. It\u2019s a little more advanced. But you watch: Six months down the road, you&#8217;re going to have a DeepSeek Operator, I think, and they&#8217;re charging you $5 a month per user. Or a DeepSeek mathematician for $5 a month or $2 a month.<\/p><p>Look at Copilot. It was $30 per user per month. It kind of still is for the enterprise users, because it&#8217;s protecting your privacy and your data behind the firewall. But everyone else, it&#8217;s only an extra $3 a month now. If you have Office 365, the price goes up by $3 per month and you get the entire suite of Office 365 Copilot. You&#8217;re going to start seeing this shift downstream, where it becomes a lot more affordable.<\/p><p>The other question that might be coming up in terms of the pricing is, what about these GPUs? They didn&#8217;t do the frontier research. They simply copied, I think. Actually, they didn&#8217;t just simply copy \u2013 they used a lot of the techniques that the hyper-scalers used, that Llama 3 has, and then organized really efficiently the way they trained it. Whether they used H800s or GPU as a service, these H100s, who knows? It doesn&#8217;t matter. The fact is that they were able to efficiently optimize the training.<\/p><p>In the future, you may not need an H100. You might simply use an older GPU to do the same job, it just may take a little bit longer to do it. Or you may not need to spend $50,000 for a Blackwell chip \u2013 you just buy the NVIDIA DIGITS device, which is $3,000 and has a Grace Blackwell 10 inside it, and you stack these together. Jensen just announced this. Is there a need for everyone to have access to H100? In my opinion, no. Moving forward, assuming what they publish is right, accurate, and fully transparent, you can do a lot with a lot less using the techniques that DeepSeek published in their white paper.<\/p><h2>How quickly might we see other models catch up to DeepSeek or surpass OpenAI&#8217;s o1 model?<\/h2><p>Very quickly. It&#8217;s not because they already have it \u2013 they don&#8217;t. The fact is that DeepSeek, the actual foundation model itself, is different, because it&#8217;s been built on a mixture of experts. Not one large foundation model, but multiple. The foundation itself has been modified, and they modified Llama to do that. Any other company, especially the closed-source ones, can do that.<\/p><p>I have a feeling that before you know it, o3, o4, Gemini 2, 2.1, all of that is going to have these features built into it. Why would you go to a frontier model if it doesn\u2019t even offer a mixture of experts? If you don&#8217;t offer precision optimization for computing, or multi-head latent attention, all these technical things \u2013 if you don&#8217;t have that in your closed models \u2013 you are cutting yourself out of a big chunk of business. I suspect they will have it in their updates very soon \u2013 if not tomorrow, next week, or next month.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fca9f2a elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"fca9f2a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f6f2afd elementor-widget elementor-widget-text-editor\" data-id=\"f6f2afd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><strong>About William Fong<br \/><\/strong><br \/>Dr. William Fong, PhD is the former Global Solution Strategy Director for AI Digital Transformation at Microsoft, which he left in September 2022. In this role, Dr. Fong spearheaded the innovation and incubation of products across cloud solutions, AI-driven customer solutions, and go-to-market strategies for Modern Workplace supporting Microsoft\u2019s enterprise clients in their AI and Digital Transformation workflows. Prior to assuming this role, Dr. Fong held multiple senior positioning at Microsoft throughout his 26 years with the company. He currently serves as an independent consultant on AI and Digital Transformation.<\/p><p>This article is adapted from the GLG Teleconference \u201cDeepSeek\u2019s Challenge to the Generative AI Model Market and Hyperscale Investments,\u201d hosted on January 28, 2025. If you would like access to the full transcript or would like to speak with Dr. William Fong, or any of our industry experts, please contact us below.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"author":2,"featured_media":70662,"template":"","insight-type":[1735],"insights-tag":[1624,1670],"insights-category":[1959],"class_list":["post-95191","insight-article","type-insight-article","status-publish","has-post-thumbnail","hentry","insight-type-articles","insights-tag-ai-ja","insights-tag-chatgpt-ja","insights-category-ai"],"_links":{"self":[{"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/insight-article\/95191","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/insight-article"}],"about":[{"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/types\/insight-article"}],"author":[{"embeddable":true,"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/users\/2"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/media\/70662"}],"wp:attachment":[{"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/media?parent=95191"}],"wp:term":[{"taxonomy":"insight-type","embeddable":true,"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/insight-type?post=95191"},{"taxonomy":"insights-tag","embeddable":true,"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/insights-tag?post=95191"},{"taxonomy":"insights-category","embeddable":true,"href":"https:\/\/glginsights.com\/ja\/wp-json\/wp\/v2\/insights-category?post=95191"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}