Building an internal agent: Evals to validate workflows

llm (27), agents (16), internal-agent (10)