When you open Alyx on the Eval Builder or Task Builder page, it has the context of your current eval or task, so it can build custom evals, configure tasks, and help you choose columns and data sources.

Where to find Alyx

Open the Alyx chat from the Eval Builder (creating/editing an eval) or Task Builder (configuring a task). Alyx has context of your current eval or task configuration.

Key skills

Skill | Description
Build eval | Write an eval based on your goals and data structure
Create / update eval form | Configure eval parameters and columns
Configure dataset task | Set dataset, filters, and sampling for a dataset-based task
Configure project task | Set project, filters, and sampling for a trace-based task
Propose task name | Suggest a descriptive name for the task
List datasets / experiments | List datasets and projects for task targeting
Dataset preview | Inspect structure and sample rows to choose columns
Choose evals | Select or attach evals
…and more. Alyx can preview trace data.

Example prompts

  • “Build an eval that checks if the response answers the question”
  • “Change this task to use the customer-support dataset”
  • “Point this task at the production traces project”
  • “What columns should I use for input and output?”
  • “Update the sampling rate to 10%”