- Whether the overall workflow or task was completed successfully
- The quality and correctness of the end-to-end process
- Aggregate metrics such as latency, errors, and evaluation labels for the entire trace
- The success of multi-step workflows (e.g., agentic reasoning, RAG pipelines)
- Trace-Level Evaluations via UI
- Trace-Level Evaluations via Code
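
To make the idea of aggregate, trace-level metrics concrete, here is a minimal sketch in plain Python. It does not use any platform's API. The `Span` record, its field names, and the `summarize_traces` helper are hypothetical, introduced only for illustration. The sketch assumes each span carries a trace ID, start and end timestamps, and an error flag. It groups spans by trace and rolls them up into one summary per trace.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Span:
    # Hypothetical span record; field names are assumptions for this sketch.
    trace_id: str
    name: str
    start_ms: int
    end_ms: int
    error: bool = False


def summarize_traces(spans):
    """Group spans by trace_id and compute one aggregate summary per trace."""
    grouped = defaultdict(list)
    for span in spans:
        grouped[span.trace_id].append(span)

    summaries = {}
    for trace_id, trace_spans in grouped.items():
        # Trace latency spans from the earliest start to the latest end.
        start = min(s.start_ms for s in trace_spans)
        end = max(s.end_ms for s in trace_spans)
        error_count = sum(1 for s in trace_spans if s.error)
        summaries[trace_id] = {
            "latency_ms": end - start,
            "span_count": len(trace_spans),
            "error_count": error_count,
            # Placeholder label: a real evaluator would judge the whole trace.
            "label": "failed" if error_count else "completed",
        }
    return summaries


spans = [
    Span("t1", "retrieve", 0, 120),
    Span("t1", "generate", 120, 900),
    Span("t2", "retrieve", 0, 80),
    Span("t2", "generate", 80, 400, error=True),
]
summaries = summarize_traces(spans)
```

The `label` here is derived only from error flags, which is a stand-in. In practice the label for a trace would usually come from an evaluator (for example, an LLM judge or a rule-based check) that looks at the end-to-end inputs, intermediate steps, and final output together.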