AI Agent

AI agents, autonomous systems, and agentic workflows. Everything about multi-agent frameworks, tool use, and the shift to autonomous AI.

Jun 29

OpenMontage proves open-source should own AI video production

OpenMontage shows that open-source, agentic systems are the right path for AI video production.

Jun 29

Gemini 3.5 Flash lets you script computer use

A practical breakdown of Gemini 3.5 Flash computer use, its prompt-injection defenses, and a copy-ready workflow.

Jun 28

DESIGN.md is the missing bridge from taste to UI scaffolds

DESIGN.md turns visual taste into an executable source of truth for Claude Design.

Jun 27

OpenClaw shows the agent control layer matters more than the model

OpenClaw and Hermes prove agents need a control layer, not just a model.

Jun 27

OpenClaw turns chat apps into a persistent AI

OpenClaw’s gist shows how a Telegram-first bot grows into a persistent assistant with memory, tools, and a custom identity.

Jun 27

Extracted prompts turn model behavior into a map

A practical breakdown of extracted system prompts, with a copy-ready template for reading model behavior like source code.

Jun 26

Hippo rolls out Devin across insurance engineering

Hippo is deploying Cognition’s Devin across its engineering team to speed work on rate filings, underwriting, distribution, and customer experience.

Jun 26

豆包专业版把办公Agent做成了日常工具

我拆解豆包专业版的办公任务模式，看看它怎么把本地操作、财报分析和自建 Skill 变成可复制模板。

Jun 26

Valkey’s bots turn backporting into a pipeline

Project Valkey is using AI agents to backport fixes, and the real lesson is how to wrap them with verification.

Jun 26

Loop Engineering 入门：构建可持续迭代的智能体

用 LangChain 和 LangGraph 搭建一个可持续迭代的 Loop Engineering 智能体。

Jun 26

omp brings IDE-grade coding to the terminal

omp is an open-source terminal coding agent with Hashline edits, deep LSP/DAP support, and Hindsight memory.

Jun 26

Public Sentry keys can hijack Claude Code and Cursor

Researchers showed a public Sentry key can be abused to feed malicious MCP data into Claude Code, Cursor, and Codex.

Jun 26

Loop Engineering 让 Agent 把事做完

我拆开 Loop Engineering，给你一套让 Agent 反复执行、自检、修正并做完任务的可复制模板。

Jun 25

Codex 接入第三方模型完整指南

OpenAI Codex App、CLI 和 SDK 现可接入第三方开源模型。

Jun 25

Grok Build Adds /goal for Autonomous Coding

xAI’s Grok Build now has /goal, a mode that plans, executes, and verifies coding tasks on the developer’s machine.

Jun 24

Set Up AI Agent Workflows in 5 Practical Steps

Build a reliable AI agent workflow for a solo business using five practical setup steps.

Jun 24

Anthropic’s Claude Tag Research turns Slack into search

Anthropic’s Claude Tag Research preview shows how to turn Slack threads into a searchable research surface.

Jun 24

This benchmark proves harness quality beats model hype in coding

The repo shows coding benchmark results depend more on harness quality than model branding.

Jun 23

GLM-5 Is Right to Kill Vibe Coding and Push Agent Engineering

GLM-5 is a useful signal that AI development must move from vibe coding to agent engineering.

Jun 23

Loop Engineering: Claude Code背后的新工作法

Loop Engineering把提示词工作改成“观察-反馈-修正”的循环流程，Boris Cherny和Addy Osmani都在讨论它。

Jun 23

Fable 5 ban exposed a model-routing race

Anthropic blocked Fable 5, and four open models answered before access was restored.

Jun 21

Myseum’s Scanon deal is a sensible bet on privacy-first moderation

Myseum’s Scanon partnership is a smart move because privacy-first moderation is the product, not a side feature.

Jun 21

Adopt AI Code Review Without Losing Quality

A practical rollout for AI code review that keeps human oversight intact.

Jun 21

Crypto AI Agents Face a Hidden Model Risk

Crypto AI agents can keep running while their model access changes, and that can alter trade behavior overnight.

Jun 21

AI agents are moving into real software and finance

AI agents are spreading into software, government, and finance, while regulators warn their autonomy could create new systemic risk.

Jun 20

Manus hits $450M run rate amid Meta deal fallout

Manus says its annualized revenue reached $450M in June 2026 as funding, pricing, and ownership drama reshape the company.

Jun 20

Microsoft adds usage-based pricing to Copilot Cowork

Microsoft’s Copilot Cowork now bills by usage on top of Microsoft 365 Copilot, with tenant controls and model choices.

Jun 20

OpenClaw fixes let you block agent phishing

I break down how OpenClaw got tricked into code execution and data leaks, plus the guardrails I’d ship today.

Jun 19

Kimi K2.6 turns agents into a swarm

Kimi K2.6 is an open-source multimodal agent model for long coding runs, UI generation, and swarm-style task orchestration.

Jun 19

LightRAG proves graph RAG needs simpler defaults, not more complexity

LightRAG shows that graph RAG wins when it reduces setup, speeds retrieval, and keeps multimodal workflows practical.

You've reached the end