Tag: agents
All the articles with the tag "agents".
- Evals for skills: are you doing it?
Tests are to code what evals are to skills. Same job, different artifact. A short field guide to evaluating Claude skills with cassettes, graders, and a CI gate that doesn't lie.