AI agent benchmarks are misleading, study warns

2024-07-06 19:37:49

VentureBeat

A study by Princeton University shows that benchmarks made for AI agents don't account for costs and are prone to overfitting.