Why AI agents fail when they simulate execution
Most agent failures start with one problem: the model describes actions instead of actually performing them.
In practice, this creates false confidence: the model reports that it used a tool, but no call was ever executed and no real output exists.
Reliable systems need a clear separation between planning and execution: the model proposes actions, and a separate executor runs them and records what actually happened.
If something did not run, it should be treated as a failure, never silently accepted as success.
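One way to enforce this separation is to let the model only *request* tools by name, while a separate executor performs the call and attaches evidence of execution. A minimal sketch in Python, where `ToolResult`, `run_tool`, and the `registry` are hypothetical names chosen for illustration:

```python
from dataclasses import dataclass

@dataclass
class ToolResult:
    tool: str
    output: str
    executed: bool  # set only by the executor, never by the model


def run_tool(name, args, registry):
    """Execute a registered tool and return evidence of what actually ran."""
    if name not in registry:
        # Unknown tool: fail closed instead of letting the model
        # "describe" a successful call that never happened.
        return ToolResult(tool=name, output=f"unknown tool: {name}", executed=False)
    try:
        out = registry[name](**args)
        return ToolResult(tool=name, output=str(out), executed=True)
    except Exception as exc:
        # A raised exception also counts as "did not run successfully".
        return ToolResult(tool=name, output=repr(exc), executed=False)


# Hypothetical registry: the only tools that can ever execute.
registry = {"add": lambda a, b: a + b}

ok = run_tool("add", {"a": 2, "b": 3}, registry)
missing = run_tool("search", {"q": "agents"}, registry)
```

Because `executed` is set exclusively by the executor, downstream logic can check that flag rather than trusting the model's narration of what it did.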