Predicting agent failures from trajectory shape: trajectory-pattern retrieval outperforms basic RAG
📰 Medium · RAG
A trajectory-pattern retrieval engine reaches AUC 0.71 (95% CI [0.61, 0.78]) for per-step failure prediction on held-out coding-agent…
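To make the headline metric concrete, here is a minimal sketch of how an AUC with a 95% confidence interval can be computed for per-step failure scores. The teaser does not say how the article derives its interval, so the percentile bootstrap below is an assumption, and the names `roc_auc` and `bootstrap_auc_ci` are hypothetical helpers, not code from the article. The toy labels and scores are illustrative only, not the article's data.

```python
import random


def roc_auc(labels, scores):
    """AUC as the probability that a random failing step (label 1)
    scores higher than a random non-failing step (label 0); ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=2000, alpha=0.05, seed=0):
    """Point estimate plus a percentile-bootstrap interval (assumed method).

    Resamples individual steps with replacement; resamples that contain
    only one class are redrawn because AUC is undefined for them.
    """
    auc = roc_auc(labels, scores)  # also validates that both classes exist
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue
        stats.append(roc_auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lo = stats[int((alpha / 2) * n_boot)]
    hi = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return auc, lo, hi


if __name__ == "__main__":
    # Toy data: 1 = the step preceded a failure, 0 = it did not.
    labels = [0, 0, 1, 0, 1, 1, 0, 1]
    scores = [0.1, 0.4, 0.35, 0.8, 0.7, 0.9, 0.2, 0.6]
    auc, lo, hi = bootstrap_auc_ci(labels, scores)
    print(f"AUC {auc:.2f} (95% CI [{lo:.2f}, {hi:.2f}])")
```

One design caveat: steps within the same agent trajectory are correlated, so resampling individual steps, as this sketch does, tends to give intervals that are too narrow. Resampling whole trajectories is the usual remedy when the evaluation unit is a trajectory.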
DeepCamp AI