Tasks are individual test cases within a Benchmark. Each task references a set of Criteria that the verification engine runs after the rollout to score the task.Documentation Index
Fetch the complete documentation index at: https://docs.verial.ai/llms.txt
Use this file to discover all available pages before exploring further.
Endpoints
| Method | Endpoint | Description |
|---|---|---|
GET | /tasks?benchmark_id={benchmark_id} | List tasks for a benchmark |
POST | /tasks | Create a task |
GET | /tasks/{id} | Get task details (includes criteria array) |
PATCH | /tasks/{id} | Update a task |
DELETE | /tasks/{id} | Delete a task |
GET | /tasks/{id}/entities | List DatasetEntities bound to the task |
POST | /tasks/{id}/entities | Bind a DatasetEntity to the task |
DELETE | /tasks/{id}/entities/{entityId} | Unbind a DatasetEntity |
Task Object
| Field | Type | Description |
|---|---|---|
id | string | Unique identifier |
benchmark_id | string | Parent Benchmark |
name | string | Task name |
timeout | number | null | Task-level timeout override in seconds |
tags | string[] | Tags for filtering |
task_item | object | null | Structured task payload (for example instruction, trigger, expected inputs) |
scenario | object | null | Optional pre-rollout scenario steps run by the scenario runner |
created_at | datetime | Creation timestamp |
GET /tasks/{id} response also returns a criteria array. Each entry has id, task_id, label, assertion, weight, and created_at. See Criteria for the assertion shape.