Back to home

Tag

AI coding agents

AI coding agents are agentic tools that work in terminals, editors, or CI to write code, edit files, run tests, and open PRs. They matter because they can speed up delivery, but also raise questions about permissions, token cost, regressions, and real-world reliability.

20 articles

Best AI Coding Agent 2026, Ranked by Benchmarks
Tools & Apps/Jun 29

Best AI Coding Agent 2026, Ranked by Benchmarks

Codex CLI leads Terminal-Bench 2.1, while Claude Code wins on depth and opencode leads open source by stars.

Nx Polygraph targets AI agent bottlenecks
Industry News/Jun 26

Nx Polygraph targets AI agent bottlenecks

4 ways Nx Polygraph exposes why AI coding agents stall in monorepos and what teams can verify before scaling them up.

Libghostty is becoming the terminal substrate for agent workflows
Tools & Apps/Jun 25

Libghostty is becoming the terminal substrate for agent workflows

Libghostty is turning into the default base layer for terminals and AI agent workspaces.

design.md turns brand tokens into agent-ready UI specs
Industry News/Jun 20

design.md turns brand tokens into agent-ready UI specs

4 ways design.md gives coding agents persistent design-system context and checks tokens, contrast, and regressions.

Claude Code, Cursor, and Copilot set the 2026 bar
Industry News/Jun 13

Claude Code, Cursor, and Copilot set the 2026 bar

6 AI coding agents dominate 2026 reviews, with developers weighing cost, repo context, privacy, and real productivity gains.

Anthropic’s own data says AI is already building AI
Research/Jun 12

Anthropic’s own data says AI is already building AI

Anthropic’s data shows AI is already accelerating AI development, and that should alarm every serious builder.

Open-source AI tools beat Claude’s paid tiers on value
Tools & Apps/Jun 10

Open-source AI tools beat Claude’s paid tiers on value

Open-source alternatives now cover most of Claude’s paid features and often do it with less lock-in.

AI Coding Agents in 2026: What Changes Next
Tools & Apps/Jun 6

AI Coding Agents in 2026: What Changes Next

AI coding in 2026 shifts from autocomplete to delegated agents, with cloud workflows, MCP, and stricter security controls.

Physicist Supervision Beat a Coding Agent
Research/May 29

Physicist Supervision Beat a Coding Agent

A physicist-supervised coding agent built scientific software, but human oversight caught failures tests missed.

How to Read Cognition AI's $26B Round
AI Agent/May 29

How to Read Cognition AI's $26B Round

Cognition AI raised over $1 billion at a $26 billion valuation, resetting AI coding-agent benchmarks.

Grok Build turns xAI into a coding agent
AI Agent/May 19

Grok Build turns xAI into a coding agent

Grok Build is xAI’s first coding agent, and I break down how to use the idea without the usual agent fluff.

Microsoft open-sources 174 AI coding skills
Tools & Apps/May 18

Microsoft open-sources 174 AI coding skills

Microsoft’s new GitHub repo packages 174 skills, MCP configs, and custom agents for coding assistants working with Azure SDKs.

AWS ships Agent Toolkit for coding agents
Tools & Apps/May 18

AWS ships Agent Toolkit for coding agents

AWS launched Agent Toolkit for AWS, adding MCP access, curated skills, and audit controls for Claude Code, Cursor, and other agents.

Why OpenAI Is Right to Put Codex on Phones
AI Agent/May 17

Why OpenAI Is Right to Put Codex on Phones

OpenAI’s move to mobile makes Codex more useful, not less, by turning coding agents into always-available workflow tools.

Why AI-agent CLIs are the new supply-chain attack surface
Industry News/May 12

Why AI-agent CLIs are the new supply-chain attack surface

AI-agent CLIs create a new supply-chain backdoor class that scanners do not catch.

Why AI coding agents need an architecture compiler
Tools & Apps/May 6

Why AI coding agents need an architecture compiler

Atomadic Forge proves AI code needs structural enforcement, not just tests and linting.

From Prompting to Harness Engineering
Industry News/Apr 8

From Prompting to Harness Engineering

OpenAI says one team shipped a 1M-line product with 3 engineers and Codex, merging about 1,500 PRs in 5 months.

I Tested Devin on 10 Tasks. It Finished 3.
AI Agent/Apr 3

I Tested Devin on 10 Tasks. It Finished 3.

Devin scored 13.86% on SWE-bench and finished 3 of 10 real tasks in one test, showing where AI coding agents still fall short.

Claude Code Setup Guide for Research Workflows
Tools & Apps/Apr 1

Claude Code Setup Guide for Research Workflows

A practical setup guide for Claude Code in research workflows, with terminal tips, context-window advice, and pricing details.

RTK cuts Claude Code token spend fast
Blockchain & Web3/Apr 1

RTK cuts Claude Code token spend fast

RTK claims it can cut Claude Code token use by up to 80% by routing work through local shell commands and agents.