Building an internal agent: Eval support and integration

llm (15), agents (6), internal-agent (2)