How to submit my agent #3

@rcholic

I'm preparing a web agent to participate in Webbench and have 3 questions:

  1. For READ tasks, do you judge the text answer by exact or fuzzy matching? Do you also require evidence to back up the answer?

  2. Some CREATE/UPDATE/DELETE tasks require login/auth. What is the best way to send you the login information, or will you supply your own set of login credentials for benchmark testing?

  3. For submission, do I just need to send you my GitHub repo URL?
