[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-nvidia-microsoft-agentic-ai-pc-cloud-local-en":3,"article-related-nvidia-microsoft-agentic-ai-pc-cloud-local-en":35,"series-industry-5c8f62ea-907e-41e9-9a91-7596c457c804":89},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":27,"views":31,"created_at":32,"published_at":33,"topic_cluster_id":34},"5c8f62ea-907e-41e9-9a91-7596c457c804","nvidia-microsoft-agentic-ai-pc-cloud-local-en","NVIDIA and Microsoft unify agentic AI from PC to cloud","\u003Cp data-speakable=\"summary\">NVIDIA and \u003Ca href=\"\u002Ftag\u002Fmicrosoft\">Microsoft\u003C\u002Fa> are linking PCs, cloud, local systems, and secure runtimes for \u003Ca href=\"\u002Ftag\u002Fagentic-ai\">agentic AI\u003C\u002Fa>.\u003C\u002Fp>\u003Cp>At Microsoft Build, NVIDIA said the stack now reaches from Windows devices to Azure and local deployment, with one benchmark showing Microsoft Fabric SQL running up to 6x faster than a CPU baseline.\u003C\u002Fp>\u003Ctable>\u003Cthead>\u003Ctr>\u003Cth>Item\u003C\u002Fth>\u003Cth>Where it runs\u003C\u002Fth>\u003Cth>Key spec\u003C\u002Fth>\u003C\u002Ftr>\u003C\u002Fthead>\u003Ctbody>\u003Ctr>\u003Ctd>RTX Spark\u003C\u002Ftd>\u003Ctd>Windows devices\u003C\u002Ftd>\u003Ctd>1 petaflop AI performance, up to 128GB unified memory\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>DGX Station for Windows\u003C\u002Ftd>\u003Ctd>Windows desktops\u003C\u002Ftd>\u003Ctd>Up to 20 petaflops FP4, up to 748GB coherent memory\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Microsoft Fabric Data Warehouse\u003C\u002Ftd>\u003Ctd>Cloud data layer\u003C\u002Ftd>\u003Ctd>Up to 6x faster SQL execution vs CPU baseline\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Azure Local with RTX PRO 6000 Blackwell Server Edition\u003C\u002Ftd>\u003Ctd>Local and sovereign deployments\u003C\u002Ftd>\u003Ctd>Multinode support, vLLM runtime\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Vera Rubin on Azure\u003C\u002Ftd>\u003Ctd>AI factories\u003C\u002Ftd>\u003Ctd>Up to 10x inference throughput per megawatt\u003C\u002Ftd>\u003C\u002Ftr>\u003C\u002Ftbody>\u003C\u002Ftable>\u003Ch2>1. RTX Spark for Windows agents\u003C\u002Fh2>\u003Cp>\u003Ca href=\"https:\u002F\u002Fwww.nvidia.com\u002F\">NVIDIA\u003C\u002Fa> and Microsoft are positioning RTX Spark as a personal AI machine for developers who want agents to run natively on Windows. The pitch is simple: build, tune, and run local agents on a laptop or small desktop without depending on the cloud for every step.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781275697977-l5l0.png\" alt=\"NVIDIA and Microsoft unify agentic AI from PC to cloud\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>RTX Spark targets a practical middle ground between consumer PCs and full datacenter gear. NVIDIA says it delivers 1 petaflop of AI performance, up to 128GB of unified memory, and all-day battery life, while keeping full AI and graphics performance unplugged.\u003C\u002Fp>\u003Cul>\u003Cli>Purpose-built Windows PCs for personal agents\u003C\u002Fli>\u003Cli>Ships from Microsoft Surface, ASUS, Dell, HP, Lenovo, and MSI\u003C\u002Fli>\u003Cli>Includes CUDA, RTX, DLSS, and TensorRT support\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>2. DGX Station for Windows for enterprise workflows\u003C\u002Fh2>\u003Cp>DGX Station for Windows is the bigger sibling for teams that need a deskside AI system for enterprise apps and long-running workflows. NVIDIA says it is built for always-on agents, with enough memory and compute to handle frontier-scale models locally.\u003C\u002Fp>\u003Cp>According to NVIDIA, the system uses the GB300 Grace Blackwell Ultra Desktop Superchip, offers up to 748GB of coherent memory, and reaches 20 petaflops of FP4 performance. That makes it a fit for model development, local \u003Ca href=\"\u002Ftag\u002Finference\">inference\u003C\u002Fa>, and heavier agent pipelines that cannot wait on round trips to the cloud.\u003C\u002Fp>\u003Cul>\u003Cli>Expected from ASUS, Dell, GIGABYTE, HP, MSI, and Supermicro in Q4\u003C\u002Fli>\u003Cli>Runs NVIDIA OpenShell secure runtime\u003C\u002Fli>\u003Cli>Supports models up to 1 trillion parameters\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>3. NVIDIA models in Microsoft Foundry\u003C\u002Fh2>\u003Cp>Microsoft Foundry is becoming the place where enterprises compose agent systems from multiple model families instead of betting on one model for every job. NVIDIA says its open models now sit alongside \u003Ca href=\"\u002Ftag\u002Fanthropic\">Anthropic\u003C\u002Fa> and OpenAI models in Foundry Agent Service, with built-in identity and governance.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781275693342-zr4x.png\" alt=\"NVIDIA and Microsoft unify agentic AI from PC to cloud\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The most notable addition is Nemotron 3 Ultra, an open frontier reasoning model aimed at coding, research, and enterprise workflows. NVIDIA also points to Nemotron 3.5 ASR for speech recognition and Nemotron 3.5 Content Safety, plus Cosmos 3 for physical AI and Earth-2 weather models for forecasting and risk analysis.\u003C\u002Fp>\u003Cul>\u003Cli>Nemotron models available on Foundry managed compute\u003C\u002Fli>\u003Cli>Anthropic Claude runs natively on NVIDIA GB300 Blackwell Ultra systems on Azure\u003C\u002Fli>\u003Cli>Agent Toolkit and NemoClaw blueprints are available for production agents\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>4. Fabric and Azure Local for faster data and local control\u003C\u002Fh2>\u003Cp>Agentic systems need a data layer that can keep up with repeated queries, reasoning loops, and retrieval calls. NVIDIA says its accelerated computing is now built into Microsoft Fabric Data Warehouse, where Microsoft’s internal tests showed SQL execution up to 6x faster than a CPU baseline and up to 7x faster than three other cloud data warehouse providers in high-concurrency workloads.\u003C\u002Fp>\u003Cp>For teams that need to keep data on site or close to the edge, Microsoft is also bringing Foundry Local on Azure Local to the RTX PRO 6000 Blackwell Server Edition platform. Paired with Nemotron models, it supports multinode deployments and the vLLM runtime for manufacturing, energy, sovereign data centers, and other latency-sensitive uses.\u003C\u002Fp>\u003Ccode>Use case fit:\n- Fabric Data Warehouse: cloud analytics and agent queries\n- Azure Local: on-prem, hybrid, and sovereign deployments\n- vLLM: scaled inference where latency matters\u003C\u002Fcode>\u003Ch2>5. OpenShell and Vera Rubin for secure agents and AI factories\u003C\u002Fh2>\u003Cp>As agents move from suggestions to actions, NVIDIA is pushing a security model where each agent runs in its own sandbox and every outbound call is checked against policy before it reaches files, networks, or credentials. That is the job of OpenShell, now integrated into \u003Ca href=\"\u002Ftag\u002Fgithub-copilot\">GitHub Copilot\u003C\u002Fa> and released as \u003Ca href=\"\u002Fnews\u002Fopen-source-llms-run-locally-2026-en\">open source\u003C\u002Fa> under Apache 2.0.\u003C\u002Fp>\u003Cp>The other half of the story is the datacenter. Microsoft says Fairwater Wisconsin is live and validated for NVIDIA Vera Rubin, which can slot into Azure without retrofits. NVIDIA says the platform can deliver up to 10x inference throughput per megawatt and cut cost per agentic token by an order of magnitude, while Confidential Computing protects models and data at scale.\u003C\u002Fp>\u003Cul>\u003Cli>OpenShell is model-agnostic\u003C\u002Fli>\u003Cli>Policies are written as code and versioned in the repo\u003C\u002Fli>\u003Cli>Vera Rubin works alongside Blackwell in Azure data centers\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>How to decide\u003C\u002Fh2>\u003Cp>If you want local development and fast iteration, start with RTX Spark. If your team needs heavier Windows-based model work, DGX Station for Windows is the stronger fit. If your priority is enterprise orchestration across data, governance, and model choice, Microsoft Foundry and Fabric are the center of gravity.\u003C\u002Fp>\u003Cp>For regulated or latency-sensitive deployments, Azure Local with RTX PRO 6000 Blackwell Server Edition is the better path. If your concern is agent safety, OpenShell matters most. If you are planning large-scale inference infrastructure, Vera Rubin and the AI factory stack are the pieces to watch.\u003C\u002Fp>","5 NVIDIA-Microsoft moves show how agentic AI now spans Windows devices, Azure, local deployment, and secure enterprise runtimes.","blogs.nvidia.com","https:\u002F\u002Fblogs.nvidia.com\u002Fblog\u002Fmicrosoft-build-windows-local-cloud-devices\u002F",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781275697977-l5l0.png","industry","en","03788c2e-f066-4e34-bd65-3e7f0c70c3b9",[17,18,19,20,21,22,23,24,25,26],"NVIDIA","Microsoft","agentic AI","Windows","Azure","Microsoft Fabric","Foundry","OpenShell","Vera Rubin","DGX Station",[28,29,30],"RTX Spark and DGX Station bring agentic AI to Windows devices and deskside systems.","Foundry, Fabric, and Azure Local extend the stack from cloud orchestration to sovereign deployments.","OpenShell and Vera Rubin address agent security and high-scale inference economics.",0,"2026-06-12T14:47:45.763708+00:00","2026-06-12T14:47:45.756+00:00","f387c695-5c1b-40a6-9c25-94628cae173d",{"tags":36,"relatedLang":48,"relatedPosts":52},[37,39,41,44,46],{"name":20,"slug":38},"windows",{"name":18,"slug":40},"microsoft",{"name":42,"slug":43},"Nvidia","nvidia",{"name":21,"slug":45},"azure",{"name":19,"slug":47},"agentic-ai",{"id":15,"slug":49,"title":50,"language":51},"nvidia-microsoft-agentic-ai-pc-cloud-local-zh","NVIDIA 與微軟把代理式 AI 串到雲端與 PC","zh",[53,59,65,71,77,83],{"id":54,"slug":55,"title":56,"cover_image":57,"image_url":57,"created_at":58,"category":13},"865212b4-7bd6-4bb3-a1f1-592960b5b7a3","google-gemini-outage-error-1076-june-2026-en","Google Gemini outage hits users with error 1076","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781338673852-kpqi.png","2026-06-13T08:17:27.75214+00:00",{"id":60,"slug":61,"title":62,"cover_image":63,"image_url":63,"created_at":64,"category":13},"a3dc08d5-311b-4d76-990f-4f3add2133c9","nvidia-hugging-face-ai-pipelines-en","NVIDIA’s Hugging Face hub is built for AI pipelines","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781337773588-31s6.png","2026-06-13T08:02:23.733668+00:00",{"id":66,"slug":67,"title":68,"cover_image":69,"image_url":69,"created_at":70,"category":13},"d96ff33a-47a4-421f-b7d4-ded157b345b6","anthropic-public-record-ai-anxiety-policy-en","Anthropic’s survey turns AI anxiety into policy","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781327893716-5hv3.png","2026-06-13T05:17:42.92009+00:00",{"id":72,"slug":73,"title":74,"cover_image":75,"image_url":75,"created_at":76,"category":13},"07f6818a-6612-4e79-a0b6-7b5014fadafc","chatgpt-grew-from-chatbot-to-platform-en","ChatGPT grew from chatbot to platform","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781325174493-j6tn.png","2026-06-13T04:32:28.006595+00:00",{"id":78,"slug":79,"title":80,"cover_image":81,"image_url":81,"created_at":82,"category":13},"c750890e-4ddf-4e1c-85d5-a5bd4433620f","openai-files-confidential-ipo-after-122b-round-en","OpenAI Files Confidential IPO After $122B Round","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781323367848-n0ns.png","2026-06-13T04:02:24.359675+00:00",{"id":84,"slug":85,"title":86,"cover_image":87,"image_url":87,"created_at":88,"category":13},"b0cb27e2-ca71-40a2-a012-73627f1c995c","government-access-orders-frontier-model-access-en","Government access orders should govern frontier model access","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781319762267-0x3b.png","2026-06-13T03:02:19.503078+00:00",[90,95,100,105,110,115,120,125,130,135],{"id":91,"slug":92,"title":93,"created_at":94},"d35a1bd9-e709-412e-a2df-392df1dc572a","ai-impact-2026-developments-market-en","AI's Impact in 2026: Key Developments and Market Shifts","2026-03-25T16:20:33.205823+00:00",{"id":96,"slug":97,"title":98,"created_at":99},"5ed27921-5fd6-492e-8c59-78393bf37710","trumps-ai-legislative-framework-en","Trump's AI Legislative Framework: What's Inside?","2026-03-25T16:22:20.005325+00:00",{"id":101,"slug":102,"title":103,"created_at":104},"e454a642-f03c-4794-b185-5f651aebbaca","nvidia-gtc-2026-key-highlights-innovations-en","NVIDIA GTC 2026: Key Highlights and Innovations","2026-03-25T16:22:47.882615+00:00",{"id":106,"slug":107,"title":108,"created_at":109},"0ebb5b16-774a-4922-945d-5f2ce1df5a6d","claude-usage-diversifies-learning-curves-en","Claude Usage Diversifies, Learning Curves Emerge","2026-03-25T16:25:50.770376+00:00",{"id":111,"slug":112,"title":113,"created_at":114},"69934e86-2fc5-4280-8223-7b917a48ace8","openclaw-ai-commoditization-concerns-en","OpenClaw's Rise Raises Concerns of AI Model Commoditization","2026-03-25T16:26:30.582047+00:00",{"id":116,"slug":117,"title":118,"created_at":119},"b4b2575b-2ac8-46b2-b90e-ab1d7c060797","google-gemini-ai-rollout-2026-en","Google's Gemini AI Rollout Extended to 2026","2026-03-25T16:28:14.808842+00:00",{"id":121,"slug":122,"title":123,"created_at":124},"6e18bc65-42ae-4ad0-b564-67d7f66b979e","meta-llama4-fabricated-results-scandal-en","Meta's Llama 4 Scandal: Fabricated AI Test Results Unveiled","2026-03-25T16:29:15.482836+00:00",{"id":126,"slug":127,"title":128,"created_at":129},"bf888e9d-08be-4f47-996c-7b24b5ab3500","accenture-mistral-ai-deployment-en","Accenture and Mistral AI Team Up for AI Deployment","2026-03-25T16:31:01.894655+00:00",{"id":131,"slug":132,"title":133,"created_at":134},"5382b536-fad2-49c6-ac85-9eb2bae49f35","mistral-ai-high-stakes-2026-en","Mistral AI: Facing High Stakes in 2026","2026-03-25T16:31:39.941974+00:00",{"id":136,"slug":137,"title":138,"created_at":139},"9da3d2d6-b669-4971-ba1d-17fdb3548ed5","cursors-meteoric-rise-pressures-en","Cursor's Meteoric Rise Faces Industry Pressures","2026-03-25T16:32:21.899217+00:00"]