[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-anthropic-robodog-test-physical-agentic-ai-en":3,"article-related-anthropic-robodog-test-physical-agentic-ai-en":30,"series-research-8047afc9-35a3-4ad1-8e62-2a8881027bc3":76},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":22,"views":26,"created_at":27,"published_at":28,"topic_cluster_id":29},"8047afc9-35a3-4ad1-8e62-2a8881027bc3","anthropic-robodog-test-physical-agentic-ai-en","Anthropic’s robodog test shows physical agentic AI is arriving","\u003Cp data-speakable=\"summary\">\u003Ca href=\"\u002Ftag\u002Fanthropic\">Anthropic\u003C\u002Fa>’s Project Fetch Phase Two shows \u003Ca href=\"\u002Ftag\u002Fclaude\">Claude\u003C\u002Fa> can already outperform humans on limited robot tasks without help.\u003C\u002Fp>\u003Cp>Anthropic is right: the real story is not that robotics is solved, but that general-purpose models have crossed into useful physical action faster than most teams expected.\u003C\u002Fp>\u003Ch2>Models are now beating humans on narrow robot workflows\u003C\u002Fh2>\u003Cp>In Phase Two of Project Fetch, Claude \u003Ca href=\"\u002Ftag\u002Fopus-47\">Opus 4.7\u003C\u002Fa> completed every task that at least one human team had finished in the earlier experiment, and it did so at least ten times faster. On the four tasks both human teams completed, Anthropic says the model was more than 37 times faster than Team Claude-less and more than 18 times faster than Team Claude.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782136968972-srd9.png\" alt=\"Anthropic’s robodog test shows physical agentic AI is arriving\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>That is not a laboratory curiosity. It is a signal that the bottleneck has shifted from “can the model understand the task?” to “can the model execute the task efficiently enough to matter?” When a model can move from prompt to action faster than a trained human team can reason through the same setup, the old assumption that humans must remain the primary operators starts to break.\u003C\u002Fp>\u003Ch2>The speedup matters because robotics work is workflow work\u003C\u002Fh2>\u003Cp>Anthropic’s description makes the core point plain: the model excelled at choosing an interface path, writing effective code on the first try, and generating far less code than the human teams while still succeeding. That is the classic pattern of agentic software work, now showing up in the physical world through an off-the-shelf robot.\u003C\u002Fp>\u003Cp>This matters because most real deployments are not full autonomy fantasies. They are multi-step workflows where the value comes from reducing setup time, debugging time, and decision time. If Claude can connect to sensors, write a controller, and get to a working result with less thrash than a human team, then the economic unit is no longer the robot alone. It is the robot-plus-model stack, and that stack is already becoming cheaper to operate.\u003C\u002Fp>\u003Ch2>The result is stronger than the robotics skeptics want to admit\u003C\u002Fh2>\u003Cp>Anthropic is careful to note that the model still failed at the hardest part: precise closed-loop fetching of the beach ball. That caveat is real, but it does not erase the main conclusion. The model already handled the surrounding tasks well enough to make the remaining gap look like a bounded engineering problem, not a categorical wall.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782136971033-vxui.png\" alt=\"Anthropic’s robodog test shows physical agentic AI is arriving\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The company also says this progress did not come from a robotics-specific push, but from general scaling. That is the uncomfortable part for skeptics. If capability gains in physical control are emerging as a side effect of broader model progress, then waiting for a special robotics breakthrough is the wrong bet. The general model will keep getting better, and with it the ability to use ordinary tools in the physical world.\u003C\u002Fp>\u003Ch2>The counter-argument\u003C\u002Fh2>\u003Cp>The strongest objection is that this is still a toy \u003Ca href=\"\u002Ftag\u002Fbenchmark\">benchmark\u003C\u002Fa>. A robodog in a warehouse is not a warehouse robot, and a beach ball is not a real-world object with safety constraints, compliance requirements, or high-stakes failure modes. Anthropic itself admits the model did not solve low-level actuation policy or the hardest closed-loop control problem.\u003C\u002Fp>\u003Cp>That critique is valid as far as it goes. A demo of limited autonomy is not proof of broad physical competence. But it misses the trend line Anthropic is documenting across domains: first models help humans, then humans help models, and then models do the work themselves. The important question is not whether Claude can replace a robotics engineer today. It is whether the remaining gaps are shrinking fast enough that off-the-shelf tools become model-native before companies have finished building around human-in-the-loop assumptions. On this evidence, they are.\u003C\u002Fp>\u003Ch2>What to do with this\u003C\u002Fh2>\u003Cp>Engineers, PMs, and founders should stop treating robotics as a separate universe from agentic software. The right move is to design systems where models can inspect, plan, call tools, recover from errors, and hand off cleanly when precision is required. Build for supervised autonomy now, because the gap between “assisted” and “independent” is closing faster than the last year of robotics folklore suggests.\u003C\u002Fp>","Anthropic’s Project Fetch Phase Two shows Claude can already outperform humans on limited robot tasks without help.","www.anthropic.com","https:\u002F\u002Fwww.anthropic.com\u002Fresearch\u002Fproject-fetch-phase-two",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782136968972-srd9.png","research","en","f5561869-1184-42a7-a2f6-f952340e9742",[17,18,19,20,21],"Anthropic","Claude Opus 4.7","Project Fetch","robotics","physical agentic AI",[23,24,25],"Claude Opus 4.7 completed limited robot tasks far faster than human teams.","The main value is in workflow reduction, not full robotics replacement.","General model scaling is now pushing into physical tool use.",0,"2026-06-22T14:02:22.977364+00:00","2026-06-22T14:02:22.957+00:00","3103988e-c4fe-45e3-98ab-846500c9d507",{"tags":31,"relatedLang":35,"relatedPosts":39},[32,34],{"name":17,"slug":33},"anthropic",{"name":20,"slug":20},{"id":15,"slug":36,"title":37,"language":38},"anthropic-robodog-test-physical-agentic-ai-zh","Anthropic 的 robodog 測試證明：物理型 agentic AI…","zh",[40,46,52,58,64,70],{"id":41,"slug":42,"title":43,"cover_image":44,"image_url":44,"created_at":45,"category":13},"f05d7971-4858-4384-81d8-00299b99ed17","ethereum-wikipedia-dev-cheat-sheet-en","Ethereum turns Wikipedia into a dev cheat sheet","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782152297559-pocz.png","2026-06-22T18:17:50.367827+00:00",{"id":47,"slug":48,"title":49,"cover_image":50,"image_url":50,"created_at":51,"category":13},"354441d5-652c-4658-a446-14f101f5e084","rootly-benchmark-llama-4-trails-coding-models-en","Rootly benchmark: Llama 4 trails coding models","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782086567786-wz4t.png","2026-06-22T00:02:22.751682+00:00",{"id":53,"slug":54,"title":55,"cover_image":56,"image_url":56,"created_at":57,"category":13},"569999a1-0afb-46a6-929a-2c9089682668","8tai-jiqiren-ziji-zuo-shiyan-en","8台机器人怎么自己做实验","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782073087231-rcfn.png","2026-06-21T20:17:41.340146+00:00",{"id":59,"slug":60,"title":61,"cover_image":62,"image_url":62,"created_at":63,"category":13},"8cdb1cdd-1014-4c4c-9ea3-63dc78301524","xtragpt-paper-revision-human-ai-collaboration-en","XtraGPT lets you revise papers with control","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782066795284-78ju.png","2026-06-21T18:32:49.655317+00:00",{"id":65,"slug":66,"title":67,"cover_image":68,"image_url":68,"created_at":69,"category":13},"40d7a637-c770-47b5-8813-fd56a798b332","skill-to-lora-cuts-agent-token-overhead-en","Skill-to-LoRA cuts agent token overhead","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781993875444-u3le.png","2026-06-20T22:17:31.1477+00:00",{"id":71,"slug":72,"title":73,"cover_image":74,"image_url":74,"created_at":75,"category":13},"405de39d-cfc5-43bf-b47b-ff9ce7be96a9","turboquant-does-not-hurt-search-quality-equal-bytes-en","TurboQuant does not hurt search quality at equal byte budgets","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781857967113-2xax.png","2026-06-19T08:32:22.235692+00:00",[77,82,87,92,97,102,107,112,117,122],{"id":78,"slug":79,"title":80,"created_at":81},"a2715e72-1fe8-41b3-abb1-d0cf1f710189","ai-predictions-2026-big-changes-en","AI Predictions for 2026: Brace for Big Changes","2026-03-26T01:25:07.788356+00:00",{"id":83,"slug":84,"title":85,"created_at":86},"8404bd7b-4c2f-4109-9ec4-baf29d88af2b","ml-papers-of-the-week-github-research-desk-en","ML Papers of the Week Turns GitHub Into a Research Desk","2026-03-27T01:11:39.480259+00:00",{"id":88,"slug":89,"title":90,"created_at":91},"87897a94-8065-4464-a016-1f23e89e17cc","ai-ml-conferences-to-watch-in-2026-en","AI\u002FML Conferences to Watch in 2026","2026-03-27T01:51:54.184108+00:00",{"id":93,"slug":94,"title":95,"created_at":96},"6f1987cf-25f3-47a4-b3e6-db0997695be8","openclaw-agents-manipulated-self-sabotage-en","OpenClaw Agents Can Be Manipulated Into Failure","2026-03-28T03:03:18.899465+00:00",{"id":98,"slug":99,"title":100,"created_at":101},"a53571ad-735a-4178-9f93-cb09b699d99c","vega-driving-language-instructions-en","Vega: Driving with Natural Language Instructions","2026-03-28T14:54:04.698882+00:00",{"id":103,"slug":104,"title":105,"created_at":106},"a34581d6-f36e-46da-88bb-582fb3e7425c","personalizing-autonomous-driving-styles-en","Drive My Way: Personalizing Autonomous Driving Styles","2026-03-28T14:54:26.148181+00:00",{"id":108,"slug":109,"title":110,"created_at":111},"2bc1ad7f-26ce-4f02-9885-803b35fd229d","training-knowledge-bases-writeback-rag-en","Training Knowledge Bases with WriteBack-RAG","2026-03-28T14:54:45.643433+00:00",{"id":113,"slug":114,"title":115,"created_at":116},"71adc507-3c54-4605-bbe2-c966acd6187e","packforcing-long-video-generation-en","PackForcing: Efficient Long-Video Generation Method","2026-03-28T14:55:02.646943+00:00",{"id":118,"slug":119,"title":120,"created_at":121},"675942ef-b9ec-4c5f-a997-381250b6eacb","pixelsmile-facial-expression-editing-en","PixelSmile Framework Enhances Facial Expression Editing","2026-03-28T14:55:20.633463+00:00",{"id":123,"slug":124,"title":125,"created_at":126},"6954fa2b-8b66-4839-884b-e46f89fa1bc3","adaptive-block-scaled-data-types-en","IF4: Smarter 4-Bit Quantization That Adapts to Your Data","2026-03-31T06:00:36.65963+00:00"]