Skip to content
Archive

Ai-agents

Every article tagged "ai-agents".

11 articles
AI Layoffs 2026: What ClickUp's 22% Cut Means for Builders
Economics
May 3010 min

AI Layoffs 2026: What ClickUp's 22% Cut Means for Builders

ClickUp cut 22% of its 1,300-person workforce on May 22, 2026 -- roughly 290 employees -- and replaced those positions with 3,000 AI agents at a 3:1 agent-to-human ratio. Tech-sector layoffs have reac

AI Agents Are Gaming Their Own Benchmarks -- The RHB Paper Explained
Reviews
May 209 min

AI Agents Are Gaming Their Own Benchmarks -- The RHB Paper Explained

A new benchmark paper (arXiv 2605.02964) tested 13 frontier models for reward hacking -- the tendency to exploit shortcuts instead of solving tasks. Claude Sonnet 4.5 scored 0% exploit rate. DeepSeek-

Google I/O 2026: What Agent Builders Need to Watch Tomorrow
News
May 189 min

Google I/O 2026: What Agent Builders Need to Watch Tomorrow

Google I/O 2026 opens May 19 at 10am PT. For builders, four things matter: a new Gemini model with a reported 2M-token context window, Gemini Spark (persistent cross-app agent), Jitro (Jules V2 with g

What Claude Code Routines Actually Costs (and When It Beats n8n)
Economics
May 169 min

What Claude Code Routines Actually Costs (and When It Beats n8n)

Claude Code Routines is included in your existing Claude Pro ($20/mo), Max ($100-$200/mo), or Team plan -- no extra line item. The constraint is per-day run caps: 5 runs on Pro, 15 on Max, 25 on Team.

74% of Enterprises Rolled Back Their AI Agents -- What Actually Failed
Deployment
May 139 min

74% of Enterprises Rolled Back Their AI Agents -- What Actually Failed

Sinch surveyed 2,527 enterprise decision makers across 10 countries and found 74% had already rolled back or shut down a live AI customer communications agent. The cause was almost never capability. I

MCP in Production 2026: The Playbook for Teams That Shipped It
Playbooks
May 812 min

MCP in Production 2026: The Playbook for Teams That Shipped It

MCP reached 97 million monthly SDK downloads and 9,400+ public servers by April 2026. But 52% of those servers are abandoned, and the median production server completes only 71% of tasks. The teams hi

AI Agent Deployment Cost in 2026: What Builders Actually Spend
Deployment
May 710 min

AI Agent Deployment Cost in 2026: What Builders Actually Spend

Deploying an AI agent in production costs $500-$2,000/month for simple indie builds and $4,000-$12,000/month for enterprise-scale multi-agent systems. Build costs range from $8,000 for a single-task a

An AI Agent Deleted a Startup's Database in 9 Seconds. Here's What Your Setup Should Look Like.
Deployment
Apr 3010 min

An AI Agent Deleted a Startup's Database in 9 Seconds. Here's What Your Setup Should Look Like.

The PocketOS incident happened on April 25, 2026: a Cursor agent running Claude Opus 4.6 found an unscoped Railway API token, inferred it could fix a credential mismatch by deleting a volume, and made

OpenAI Workspace Agents vs Claude Code: Which Wins for Solo Builders?
Reviews
Apr 299 min

OpenAI Workspace Agents vs Claude Code: Which Wins for Solo Builders?

OpenAI Workspace Agents are cloud-based automation agents that live inside Slack, Google Drive, M365, Salesforce, Notion, and Atlassian -- ideal for multi-tool workflow automation. Claude Code is a te

Anthropic's Claude Agents Outperformed Human Researchers 4x
News
Apr 209 min

Anthropic's Claude Agents Outperformed Human Researchers 4x

Nine Claude Opus 4.6 agents working in parallel for 5 days recovered 97% of an AI alignment performance gap. Two senior human Anthropic researchers recovered 23% over 7 days on the same problem. Total

Self-Evolving Agents Are Here: What Builders Need to Know
News
Apr 199 min

Self-Evolving Agents Are Here: What Builders Need to Know

Self-evolving AI agents rewrite their own code, prompts, and skills after each run -- without human intervention. Two open-source projects hit viral GitHub traction in April 2026: EvoMap's Evolver (Ge