Tag
AI coding agents
AI coding agents are agentic tools that work in terminals, editors, or CI to write code, edit files, run tests, and open PRs. They matter because they can speed up delivery, but also raise questions about permissions, token cost, regressions, and real-world reliability.
20 articles

Best AI Coding Agent 2026, Ranked by Benchmarks
Codex CLI leads Terminal-Bench 2.1, while Claude Code wins on depth and opencode leads open source by stars.

Nx Polygraph targets AI agent bottlenecks
4 ways Nx Polygraph exposes why AI coding agents stall in monorepos and what teams can verify before scaling them up.

Libghostty is becoming the terminal substrate for agent workflows
Libghostty is turning into the default base layer for terminals and AI agent workspaces.

design.md turns brand tokens into agent-ready UI specs
4 ways design.md gives coding agents persistent design-system context and checks tokens, contrast, and regressions.

Claude Code, Cursor, and Copilot set the 2026 bar
6 AI coding agents dominate 2026 reviews, with developers weighing cost, repo context, privacy, and real productivity gains.

Anthropic’s own data says AI is already building AI
Anthropic’s data shows AI is already accelerating AI development, and that should alarm every serious builder.

Open-source AI tools beat Claude’s paid tiers on value
Open-source alternatives now cover most of Claude’s paid features and often do it with less lock-in.

AI Coding Agents in 2026: What Changes Next
AI coding in 2026 shifts from autocomplete to delegated agents, with cloud workflows, MCP, and stricter security controls.

Physicist Supervision Beat a Coding Agent
A physicist-supervised coding agent built scientific software, but human oversight caught failures tests missed.

How to Read Cognition AI's $26B Round
Cognition AI raised over $1 billion at a $26 billion valuation, resetting AI coding-agent benchmarks.

Grok Build turns xAI into a coding agent
Grok Build is xAI’s first coding agent, and I break down how to use the idea without the usual agent fluff.

Microsoft open-sources 174 AI coding skills
Microsoft’s new GitHub repo packages 174 skills, MCP configs, and custom agents for coding assistants working with Azure SDKs.

AWS ships Agent Toolkit for coding agents
AWS launched Agent Toolkit for AWS, adding MCP access, curated skills, and audit controls for Claude Code, Cursor, and other agents.

Why OpenAI Is Right to Put Codex on Phones
OpenAI’s move to mobile makes Codex more useful, not less, by turning coding agents into always-available workflow tools.

Why AI-agent CLIs are the new supply-chain attack surface
AI-agent CLIs create a new supply-chain backdoor class that scanners do not catch.

Why AI coding agents need an architecture compiler
Atomadic Forge proves AI code needs structural enforcement, not just tests and linting.

From Prompting to Harness Engineering
OpenAI says one team shipped a 1M-line product with 3 engineers and Codex, merging about 1,500 PRs in 5 months.

I Tested Devin on 10 Tasks. It Finished 3.
Devin scored 13.86% on SWE-bench and finished 3 of 10 real tasks in one test, showing where AI coding agents still fall short.

Claude Code Setup Guide for Research Workflows
A practical setup guide for Claude Code in research workflows, with terminal tips, context-window advice, and pricing details.

RTK cuts Claude Code token spend fast
RTK claims it can cut Claude Code token use by up to 80% by routing work through local shell commands and agents.