Skip to main content

Runner Preview

PreviewVerified
Preview surface

The runner is a preview surface for grey-box AI agent evaluation. It is not part of the current alpha package surface.

The runner is intended for capturing agent traces, evaluating behavior against rules, and supporting future deploy/eval workflows.

Use it as an experimental integration surface until the release surface is expanded.