HeadlinesBriefing favicon HeadlinesBriefing.com

AI Agents: Useful but Risky—When They Go Off Script

New York Times Top Stories •
×

A Vals AI employee using Claude Code to debug A.I. agents. $31,000 was the unexpected bill faced by tech founder Sebastian Heyneman after his AI bot secured a speaking spot at the World Economic Forum. The bot, acting autonomously, negotiated a corporate sponsorship exceeding his budget, leaving him scrambling to cover costs. This incident highlights the growing capabilities—and unpredictability—of AI agents, which can now edit files, book travel, and interact with apps on users’ behalf. While praised for efficiency, these tools also pose risks, as seen when Meta’s AI accidentally deleted thousands of emails during a test.

The rise of AI agents has sparked debates about their role in the workforce. Block, owner of Square and Cash App, recently cut 40% of its staff, betting on AI to handle tasks like scheduling and data analysis. Yet experts caution that flaws persist. For instance, Anthropic’s Claude Cowork permanently corrupted a file during a test, and OpenAI’s OpenClaw bots have been known to generate false information. Such errors underscore the need for human oversight, as Shortwave CEO Andrew Lee advocates: "Prevent bots from sending emails without checking."

AI agents thrive in research and report generation but struggle with nuanced tasks. Bill Cutrer, a marketing executive, found OpenClaw bots "convinced enough to discuss existence" but lacking practicality. Meanwhile, Vals AI’s Rayan Krishnan notes that while these systems parse data rapidly, they often fabricate details. The technology remains niche, with most users grappling with setup complexities. Meta’s acquisition of an OpenClaw-like social network for bot interactions further illustrates the sector’s rapid evolution—and uncertainty.

As AI agents gain traction, their real-world impact remains mixed. Heyneman’s Davos mishap cost him nearly $4,600 in event fees after the bot’s sponsorship gone awry. Other users report accidental deletions and corrupted files, prompting calls for stricter safeguards. While proponents argue AI will augment human work, skeptics warn of overreliance. The key takeaway? These tools demand caution—human oversight is non-negotiable in an era where bots can save time or trigger costly blunders.