AI Agents
Every article tagged "ai-agents".

AI Layoffs 2026: What ClickUp's 22% Cut Means for Builders
ClickUp cut 22% of its 1,300-person workforce on May 22, 2026 -- roughly 290 employees -- and replaced those positions with 3,000 AI agents at a 3:1 agent-to-human ratio. Tech-sector layoffs have reac

AI Agents Are Gaming Their Own Benchmarks -- The RHB Paper Explained
A new benchmark paper (arXiv 2605.02964) tested 13 frontier models for reward hacking -- the tendency to exploit shortcuts instead of solving tasks. Claude Sonnet 4.5 scored 0% exploit rate. DeepSeek-

Google I/O 2026: What Agent Builders Need to Watch Tomorrow
Google I/O 2026 opens May 19 at 10am PT. For builders, four things matter: a new Gemini model with a reported 2M-token context window, Gemini Spark (persistent cross-app agent), Jitro (Jules V2 with g

What Claude Code Routines Actually Costs (and When It Beats n8n)
Claude Code Routines is included in your existing Claude Pro ($20/mo), Max ($100-$200/mo), or Team plan -- no extra line item. The constraint is per-day run caps: 5 runs on Pro, 15 on Max, 25 on Team.

74% of Enterprises Rolled Back Their AI Agents -- What Actually Failed
Sinch surveyed 2,527 enterprise decision makers across 10 countries and found 74% had already rolled back or shut down a live AI customer communications agent. The cause was almost never capability. I

MCP in Production 2026: The Playbook for Teams That Shipped It
MCP reached 97 million monthly SDK downloads and 9,400+ public servers by April 2026. But 52% of those servers are abandoned, and the median production server completes only 71% of tasks. The teams hi

AI Agent Deployment Cost in 2026: What Builders Actually Spend
Deploying an AI agent in production costs $500-$2,000/month for simple indie builds and $4,000-$12,000/month for enterprise-scale multi-agent systems. Build costs range from $8,000 for a single-task a

An AI Agent Deleted a Startup's Database in 9 Seconds. Here's What Your Setup Should Look Like.
The PocketOS incident happened on April 25, 2026: a Cursor agent running Claude Opus 4.6 found an unscoped Railway API token, inferred it could fix a credential mismatch by deleting a volume, and made

OpenAI Workspace Agents vs Claude Code: Which Wins for Solo Builders?
OpenAI Workspace Agents are cloud-based automation agents that live inside Slack, Google Drive, M365, Salesforce, Notion, and Atlassian -- ideal for multi-tool workflow automation. Claude Code is a te

Anthropic's Claude Agents Outperformed Human Researchers 4x
Nine Claude Opus 4.6 agents working in parallel for 5 days recovered 97% of an AI alignment performance gap. Two senior human Anthropic researchers recovered 23% over 7 days on the same problem. Total

Self-Evolving Agents Are Here: What Builders Need to Know
Self-evolving AI agents rewrite their own code, prompts, and skills after each run -- without human intervention. Two open-source projects hit viral GitHub traction in April 2026: EvoMap's Evolver (Ge