Insights on AI agent testing, evals, and building reliable AI systems.
Current testing infrastructure is structurally broken. We pivoted in week 3 of YC to fix it. Here's why AI agent testing doesn't match the speed of development.