Files
shivammittal274 29056226bb feat: add eval framework and coordinate-based input tools (#453)
- Add hover_at, type_at, drag_at coordinate tools to server
- Add hoverAt, typeAt, dragAt methods to Browser class
- Export server internals (browser, tool-loop, registry) for eval imports
- Copy eval app from enterprise repo with agents, graders, runner, dashboard
- Nest eval-targets inside apps/eval
- Adapt sessionExecutionDir → workingDir for current server API
- Add biome ignore for dashboard HTML to prevent lint breaking onclick handlers
2026-03-16 23:12:23 +05:30

2 lines
378 B
JSON
Vendored

{"query_id": "HN-1", "dataset": "webvoyager", "query": "go to HN best and click the comments section of 2nd post", "graders": ["webvoyager_grader"], "start_url": "https://www.amazon.com/", "metadata": {"original_task_id": "Amazon--0", "website": "Amazon", "category": "Amazon", "additional": {"ground_truth": "Sensodyne toothpaste ordered on Amazon", "answer_type": "golden"}}}