User Prompt
“Set up an environment with a clearinghouse and test whether my agent can verify patient eligibility.”
Tool Call Sequence
Create clearinghouse simulator
Call: Response:What the agent learns: The clearinghouse simulator is created. It will accept 270/271 eligibility transactions once provisioned in a playground.
simulatorsCreate environment and link simulator
Call 1: Response:Call 2: Response:What the agent learns: The environment is ready with the clearinghouse attached.
environmentsenvironmentsCreate dataset with patient data
The agent creates a dataset containing a FHIR patient bundle with insurance coverage information.Call: Response:What the agent learns: The dataset includes a patient (Maria Chen) with active BlueCross PPO Gold coverage. This data will be loaded into the sandbox so the clearinghouse returns realistic eligibility responses.
datasetsCreate playground and load data
The agent provisions the environment, finds the sandbox, and loads the patient dataset.Call 1: Response:Call 2: Response:Call 3: Response:What the agent learns: The playground is live and the clearinghouse sandbox now has the patient data loaded. When the agent-under-test queries eligibility for Maria Chen (subscriber BCB-998877), the clearinghouse will return active PPO Gold coverage.
playgroundssandboxessandboxesCreate benchmark with eligibility eval
The agent defines the benchmark, task, and eval in sequence.Call 1: Response:Call 2: Response:Call 3: Response:What the agent learns: The benchmark is fully defined with one task and one eval. The eval checks that the agent-under-test successfully verified eligibility and received an active coverage response.
benchmarkstasksevalsAgent Response
Your eligibility check environment and benchmark are ready:Point your agent at the clearinghouse endpoint and run the benchmark with
Resource ID Detail Environment env_elig_01Clearinghouse simulator attached Dataset ds_elig_01Maria Chen, BCB PPO Gold, subscriber BCB-998877 Playground pg_elig_01Active at https://ch.verial.dev/sbx_ch_01Benchmark bm_elig_011 task, 1 eval, 120s timeout benchmark_runscreate to start testing.
Summary
| Metric | Value |
|---|---|
| Tool calls | 9 |
| Tools used | simulators, environments, datasets, playgrounds, sandboxes, benchmarks, tasks, evals |
| Key technique | Load patient data into the sandbox before running benchmarks so the clearinghouse returns realistic eligibility responses. The create-then-link pattern (simulator, environment, addSimulator, playground, addDataset) builds up the simulation layer by layer. |