know it works.
Evals, AI experiments, and side projects — from someone who's been in software long enough to know it works, and curious enough to keep breaking it.
Evals, AI experiments, and side projects — from someone who's been in software long enough to know it works, and curious enough to keep breaking it.