[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-blackwell-mlperf-training-6-0-sweep-en":3,"article-related-blackwell-mlperf-training-6-0-sweep-en":33,"series-industry-50270b29-033a-468f-a188-9163b77e0d0c":76},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":25,"views":29,"created_at":30,"published_at":31,"topic_cluster_id":32},"50270b29-033a-468f-a188-9163b77e0d0c","blackwell-mlperf-training-6-0-sweep-en","Blackwell’s MLPerf sweep shows why training speeds up","\u003Cp data-speakable=\"summary\">Blackwell led MLPerf Training 6.0 with faster training, larger scale, and stronger reliability.\u003C\u002Fp>\u003Cp>In MLPerf Training 6.0, \u003Ca href=\"\u002Ftag\u002Fnvidia\">NVIDIA\u003C\u002Fa> Blackwell posted the fastest time to train on all seven benchmarks and scaled to 8,192 GPUs.\u003C\u002Fp>\u003Ctable>\u003Cthead>\u003Ctr>\u003Cth>Item\u003C\u002Fth>\u003Cth>Scale\u003C\u002Fth>\u003Cth>Reported result\u003C\u002Fth>\u003C\u002Ftr>\u003C\u002Fthead>\u003Ctbody>\u003Ctr>\u003Ctd>GB300 NVL72\u003C\u002Ftd>\u003Ctd>Rack-scale\u003C\u002Ftd>\u003Ctd>Up to 1.6x faster than GB200 NVL72\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>DeepSeek-V3 671B\u003C\u002Ftd>\u003Ctd>8,192 GPUs\u003C\u002Ftd>\u003Ctd>Fastest time to train at the largest scale\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Llama 3.1 405B on Azure\u003C\u002Ftd>\u003Ctd>8,192 GPUs\u003C\u002Ftd>\u003Ctd>Reference quality in 7.07 minutes\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>DeepSeek-V3 671B on CoreWeave\u003C\u002Ftd>\u003Ctd>8,192 GPUs\u003C\u002Ftd>\u003Ctd>Reference quality in 2.02 minutes\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Higgsfield on Nebius\u003C\u002Ftd>\u003Ctd>Cloud deployment\u003C\u002Ftd>\u003Ctd>30% shorter training time\u003C\u002Ftd>\u003C\u002Ftr>\u003C\u002Ftbody>\u003C\u002Ftable>\u003Ch2>1. Fastest training across all seven benchmarks\u003C\u002Fh2>\u003Cp>The headline result is simple: NVIDIA was the only platform submitted across every \u003Ca href=\"\u002Ftag\u002Fbenchmark\">benchmark\u003C\u002Fa> in MLPerf Training 6.0, and it delivered the fastest time to train in all seven. That matters because MLPerf is a peer-reviewed benchmark suite, so the results are meant to compare real systems, not marketing claims.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782181966500-yj09.png\" alt=\"Blackwell’s MLPerf sweep shows why training speeds up\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>For teams choosing training infrastructure, this is the clearest signal in the batch. It says Blackwell is not tuned for one model family or one lab setup. It is being pushed across dense \u003Ca href=\"\u002Ftag\u002Fllms\">LLMs\u003C\u002Fa>, mixture-of-experts workloads, and fine-tuning cases with the same goal: finish training sooner.\u003C\u002Fp>\u003Cul>\u003Cli>Seven-for-seven fastest time to train\u003C\u002Fli>\u003Cli>Submitted on both GB200 NVL72 and GB300 NVL72\u003C\u002Fli>\u003Cli>Included new MoE workloads: DeepSeek-V3 671B and GPT-OSS-20B\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>2. GB300 NVL72’s speed jump over GB200 NVL72\u003C\u002Fh2>\u003Cp>Blackwell Ultra matters because it raises the ceiling inside the same rack-scale design. NVIDIA reported that GB300 NVL72 delivered up to 1.6x faster training than GB200 NVL72 at the same scale, driven by higher compute density with NVFP4, more memory, and a higher power ceiling.\u003C\u002Fp>\u003Cp>That mix is useful when a model is already large enough that small gains in throughput compound into real schedule savings. If you are running long pretraining jobs or repeated fine-tunes, a 1.6x gain can change how many experiments fit into a week.\u003C\u002Fp>\u003Ccode>Key drivers of the GB300 NVL72 gain:\n- Higher compute density with NVFP4\n- Expanded memory capacity\n- Higher power ceiling for sustained performance\u003C\u002Fcode>\u003Ch2>3. 8,192-GPU scale for MoE and dense models\u003C\u002Fh2>\u003Cp>Scale is the other half of the story. NVIDIA scaled DeepSeek-V3 671B to 8,192 GPUs on GB200 NVL72 systems, which is the largest Blackwell-based submission in MLPerf Training to date. It also submitted Llama 3.1 405B at 5,120 GPUs, showing that the platform is not only about peak single-job speed but also about how far the cluster can stretch.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782181967228-igb9.png\" alt=\"Blackwell’s MLPerf sweep shows why training speeds up\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The networking piece is what makes that scale practical. Within each rack, fifth-generation NVLink Switches connect all 72 GPUs into a shared pool of compute and memory. For distributed clusters, NVIDIA pairs that with Quantum InfiniBand or Spectrum-X Ethernet, depending on the data center design.\u003C\u002Fp>\u003Cul>\u003Cli>DeepSeek-V3 671B: 8,192 GPUs\u003C\u002Fli>\u003Cli>Llama 3.1 405B: 5,120 GPUs\u003C\u002Fli>\u003Cli>Rack-scale NVLink Switch fabric across 72 GPUs\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>4. Partner results that show the platform in production\u003C\u002Fh2>\u003Cp>The most useful part of the blog may be the partner examples, because they show Blackwell outside NVIDIA’s own test cases. Cohere reported 3x faster training on GB200 NVL72 for its North \u003Ca href=\"\u002Ftag\u002Fagentic-ai\">agentic AI\u003C\u002Fa> platform. Midjourney trained v8 on a Blackwell cluster and is now scaling a large fleet of Blackwell Ultra GPUs on CoreWeave for upcoming image and video models.\u003C\u002Fp>\u003Cp>There are more signs that the platform is already in production use. \u003Ca href=\"\u002Ftag\u002Fmicrosoft\">Microsoft\u003C\u002Fa> Azure reached reference quality on Llama 3.1 405B in 7.07 minutes, CoreWeave hit 2.02 minutes on DeepSeek-V3 671B with GB300 NVL72, and Nebius said Higgsfield cut training time by 30% while serving 22 million users and generating over 6 million AI outputs per day.\u003C\u002Fp>\u003Cul>\u003Cli>Cohere: 3x faster training on GB200 NVL72\u003C\u002Fli>\u003Cli>Midjourney: training and scaling on Blackwell Ultra GPUs\u003C\u002Fli>\u003Cli>Thinking Machines Lab on Google Cloud: 2x faster training and serving\u003C\u002Fli>\u003Cli>Nebius and Higgsfield: 30% shorter training time\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>5. Reliability features for long training runs\u003C\u002Fh2>\u003Cp>Performance only matters if a job survives long enough to finish. NVIDIA frames Blackwell’s reliability story around fewer interruptions and faster recovery. Before a GPU reaches a data center, it goes through 30+ manufacturing test stages. In operation, the Reliability, Availability and Serviceability Engine watches nearly the entire chip, while self-healing logic can route around faults without stopping the workload.\u003C\u002Fp>\u003Cp>At the cluster level, Spectrum-X Ethernet can reroute around failed links in milliseconds. If a fault does interrupt a job, NVIDIA Resiliency Extension, or NVRx, helps resume from a recent checkpoint instead of restarting from zero. That is especially relevant for runs that span weeks or months across hundreds of thousands of GPUs.\u003C\u002Fp>\u003Ccode>Reliability stack:\n- 30+ manufacturing test stages\n- RAS Engine monitoring\n- Self-healing fault routing\n- Spectrum-X link rerouting\n- NVRx checkpoint recovery\u003C\u002Fcode>\u003Ch2>How to decide\u003C\u002Fh2>\u003Cp>If you want the fastest benchmark story, look at the seven-for-seven MLPerf sweep and the GB300 NVL72 result. If your priority is cluster size, the 8,192-GPU DeepSeek-V3 671B run is the clearest proof point. If you care about real-world adoption, the partner wins from Cohere, Midjourney, Azure, CoreWeave, and Nebius are the strongest signals.\u003C\u002Fp>\u003Cp>For most AI teams, the practical takeaway is that Blackwell is being positioned as a full training platform, not just a fast GPU. It combines speed, scale, and recovery features in a way that fits frontier model work, where every lost hour and every failed run has a cost.\u003C\u002Fp>","5 Blackwell MLPerf 6.0 results show faster training, bigger scale, and better reliability for frontier AI teams.","blogs.nvidia.com","https:\u002F\u002Fblogs.nvidia.com\u002Fblog\u002Fblackwell-mlperf-training-6-0\u002F",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782181966500-yj09.png","industry","en","2e953e03-6c74-468e-ab2f-9b698a3b2d39",[17,18,19,20,21,22,23,24],"NVIDIA Blackwell","MLPerf Training 6.0","GB300 NVL72","GB200 NVL72","AI training","Blackwell Ultra","NVLink Switch","Spectrum-X Ethernet",[26,27,28],"Blackwell led all seven MLPerf Training 6.0 benchmarks and scaled to 8,192 GPUs.","GB300 NVL72 delivered up to 1.6x faster training than GB200 NVL72 at the same scale.","Partner results from Cohere, Midjourney, Azure, CoreWeave, and Nebius show production use.",0,"2026-06-23T02:32:26.093865+00:00","2026-06-23T02:32:26.087+00:00","d19fc184-5852-4c4d-9ec0-db0c4841ac17",{"tags":34,"relatedLang":35,"relatedPosts":39},[],{"id":15,"slug":36,"title":37,"language":38},"blackwell-mlperf-training-6-0-sweep-zh","Blackwell 6.0 讓訓練速度、規模、穩定性一起升級","zh",[40,46,52,58,64,70],{"id":41,"slug":42,"title":43,"cover_image":44,"image_url":44,"created_at":45,"category":13},"61775efa-14fe-427d-9c1e-5e321959e777","baya-openchip-bet-ai-silicon-data-movement-en","Baya and Openchip are betting the future of AI silicon on data moveme…","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782193668059-ugc5.png","2026-06-23T05:47:24.076812+00:00",{"id":47,"slug":48,"title":49,"cover_image":50,"image_url":50,"created_at":51,"category":13},"af91ca07-713f-4a59-88da-b375a50701b9","citigroup-sees-tokenized-assets-hitting-8-2t-en","Citigroup Sees Tokenized Assets Hitting $8.2T","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782185576141-ygv9.png","2026-06-23T03:32:33.87259+00:00",{"id":53,"slug":54,"title":55,"cover_image":56,"image_url":56,"created_at":57,"category":13},"e4f5babc-93c4-4b55-a8fb-d9c746d4c873","rwa-tokenization-turns-assets-into-on-chain-rails-en","RWA tokenization turns assets into on-chain rails","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782184698158-j904.png","2026-06-23T03:17:50.637664+00:00",{"id":59,"slug":60,"title":61,"cover_image":62,"image_url":62,"created_at":63,"category":13},"bd34cea9-d4cc-4ccc-8367-25ce036b99d6","ai-companies-should-stop-pretending-midterm-spending-is-neut-en","AI companies should stop pretending midterm spending is neutral","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782181066305-u5lq.png","2026-06-23T02:17:19.41209+00:00",{"id":65,"slug":66,"title":67,"cover_image":68,"image_url":68,"created_at":69,"category":13},"5fa3ff5f-480f-459e-a143-77cbfb5bb7dd","ai-market-map-list-better-signal-than-newsletters-en","This AI market map list is a better signal than most AI newsletters","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782180166415-8oqb.png","2026-06-23T02:02:22.228436+00:00",{"id":71,"slug":72,"title":73,"cover_image":74,"image_url":74,"created_at":75,"category":13},"2bdb0a8e-0ae9-45b2-8399-71f60171168c","worldcoin-rally-credibility-test-not-breakout-en","Worldcoin’s rally is a credibility test, not a breakout to chase","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782179262640-ok5l.png","2026-06-23T01:47:19.421718+00:00",[77,82,87,92,97,102,107,112,117,122],{"id":78,"slug":79,"title":80,"created_at":81},"d35a1bd9-e709-412e-a2df-392df1dc572a","ai-impact-2026-developments-market-en","AI's Impact in 2026: Key Developments and Market Shifts","2026-03-25T16:20:33.205823+00:00",{"id":83,"slug":84,"title":85,"created_at":86},"5ed27921-5fd6-492e-8c59-78393bf37710","trumps-ai-legislative-framework-en","Trump's AI Legislative Framework: What's Inside?","2026-03-25T16:22:20.005325+00:00",{"id":88,"slug":89,"title":90,"created_at":91},"e454a642-f03c-4794-b185-5f651aebbaca","nvidia-gtc-2026-key-highlights-innovations-en","NVIDIA GTC 2026: Key Highlights and Innovations","2026-03-25T16:22:47.882615+00:00",{"id":93,"slug":94,"title":95,"created_at":96},"0ebb5b16-774a-4922-945d-5f2ce1df5a6d","claude-usage-diversifies-learning-curves-en","Claude Usage Diversifies, Learning Curves Emerge","2026-03-25T16:25:50.770376+00:00",{"id":98,"slug":99,"title":100,"created_at":101},"69934e86-2fc5-4280-8223-7b917a48ace8","openclaw-ai-commoditization-concerns-en","OpenClaw's Rise Raises Concerns of AI Model Commoditization","2026-03-25T16:26:30.582047+00:00",{"id":103,"slug":104,"title":105,"created_at":106},"b4b2575b-2ac8-46b2-b90e-ab1d7c060797","google-gemini-ai-rollout-2026-en","Google's Gemini AI Rollout Extended to 2026","2026-03-25T16:28:14.808842+00:00",{"id":108,"slug":109,"title":110,"created_at":111},"6e18bc65-42ae-4ad0-b564-67d7f66b979e","meta-llama4-fabricated-results-scandal-en","Meta's Llama 4 Scandal: Fabricated AI Test Results Unveiled","2026-03-25T16:29:15.482836+00:00",{"id":113,"slug":114,"title":115,"created_at":116},"bf888e9d-08be-4f47-996c-7b24b5ab3500","accenture-mistral-ai-deployment-en","Accenture and Mistral AI Team Up for AI Deployment","2026-03-25T16:31:01.894655+00:00",{"id":118,"slug":119,"title":120,"created_at":121},"5382b536-fad2-49c6-ac85-9eb2bae49f35","mistral-ai-high-stakes-2026-en","Mistral AI: Facing High Stakes in 2026","2026-03-25T16:31:39.941974+00:00",{"id":123,"slug":124,"title":125,"created_at":126},"9da3d2d6-b669-4971-ba1d-17fdb3548ed5","cursors-meteoric-rise-pressures-en","Cursor's Meteoric Rise Faces Industry Pressures","2026-03-25T16:32:21.899217+00:00"]