Commit 551183e

Merge pull request #1 from ServiceNow/aldro61-patch-1
Update README.md
2 parents 99cbced + cea8d8b

1 file changed

Lines changed: 4 additions & 0 deletions

README.md

@@ -8,6 +8,10 @@ By harnessing the ubiquitous [ServiceNow](https://www.servicenow.com/what-is-ser
 WorkArena is included in [BrowserGym](https://github.com/ServiceNow/BrowserGym), a conversational gym environment for the evaluation of web agents.
 
 
+https://github.com/ServiceNow/WorkArena/assets/2374980/ca61cbeb-0d13-474b-b444-db76a2d46456
+
+
+
 ## Benchmark Contents
 
 At the moment, WorkArena includes `23,150` task instances drawn from `29` tasks that cover the main components of the ServiceNow user interface. The following videos show an agent built on `GPT-4-vision` interacting with every such component. As emphasized by our results, this benchmark is not solved and thus, the performance of the agent is not always on point.
