Workflow runs · sbintuitions/flexeval

Actions

All workflows
Workflows
- Copilot code review Copilot code review
- pages-build-deployment pages-build-deployment
- Publish PyPI Publish PyPI
- Run batch-api tests Run batch-api tests
- Run tests Run tests
- Update Documentation Update Documentation
Management
- Caches
- Deployments

All workflows

Actions

Loading...
Loading

Showing runs from all workflows

653 workflow runs

Merge pull request #288 from sbintuitions/add_prefix_conditioned_decode Run tests #1008: Commit 37b85c1 pushed by junya-takayama

22m 3s main

main

22m 3s

Added a feature to VLLM and HuggingFaceLM that forces a response prefix Run batch-api tests #156: Pull request #288 synchronize by junya-takayama

2m 14s add_prefix_conditioned_decode

add_prefix_conditioned_decode

2m 14s

Added a feature to VLLM and HuggingFaceLM that forces a response prefix Run tests #1007: Pull request #288 synchronize by junya-takayama

23m 14s add_prefix_conditioned_decode

add_prefix_conditioned_decode

23m 14s

Added a feature to VLLM and HuggingFaceLM that forces a response prefix Run tests #1006: Pull request #288 synchronize by junya-takayama

21m 41s add_prefix_conditioned_decode

add_prefix_conditioned_decode

21m 41s

Added a feature to VLLM and HuggingFaceLM that forces a response prefix Run batch-api tests #155: Pull request #288 synchronize by junya-takayama

2m 12s add_prefix_conditioned_decode

add_prefix_conditioned_decode

2m 12s

Added a feature to VLLM and HuggingFaceLM that forces a response prefix Run tests #1005: Pull request #288 opened by junya-takayama

21m 15s add_prefix_conditioned_decode

add_prefix_conditioned_decode

21m 15s

Added a feature to VLLM and HuggingFaceLM that forces a response prefix Run batch-api tests #154: Pull request #288 opened by junya-takayama

4m 26s add_prefix_conditioned_decode

add_prefix_conditioned_decode

4m 26s

Add ReasoningParser Run tests #1004: Pull request #287 synchronize by junya-takayama

22m 3s add_reasoning_parser

add_reasoning_parser

22m 3s

Add ReasoningParser Run batch-api tests #153: Pull request #287 synchronize by junya-takayama

2m 1s add_reasoning_parser

add_reasoning_parser

2m 1s

Add ReasoningParser Run batch-api tests #152: Pull request #287 synchronize by junya-takayama

4m 46s add_reasoning_parser

add_reasoning_parser

4m 46s

Add ReasoningParser Run tests #1003: Pull request #287 synchronize by junya-takayama

21m 32s add_reasoning_parser

add_reasoning_parser

21m 32s

Add ReasoningParser Run tests #1002: Pull request #287 synchronize by junya-takayama

21m 37s add_reasoning_parser

add_reasoning_parser

21m 37s

Add ReasoningParser Run batch-api tests #151: Pull request #287 synchronize by junya-takayama

4m 44s add_reasoning_parser

add_reasoning_parser

4m 44s

Add ReasoningParser Run tests #1001: Pull request #287 opened by junya-takayama

21m 32s add_reasoning_parser

add_reasoning_parser

21m 32s

Add ReasoningParser Run batch-api tests #150: Pull request #287 opened by junya-takayama

4m 38s add_reasoning_parser

add_reasoning_parser

4m 38s

Merge pull request #286 from sbintuitions/load_lmoutput Run tests #1000: Commit ae7a278 pushed by junya-takayama

23m 6s main

main

23m 6s

hotfix: "lm_output" is not output by flexeval_lm Run tests #999: Pull request #286 synchronize by junya-takayama

22m 7s load_lmoutput

load_lmoutput

22m 7s

hotfix: "lm_output" is not output by flexeval_lm Run tests #998: Pull request #286 opened by junya-takayama

21m 0s load_lmoutput

load_lmoutput

21m 0s

Merge pull request #285 from sbintuitions/load_lmoutput Run tests #997: Commit 4ed5363 pushed by junya-takayama

23m 24s main

main

23m 24s

Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run tests #996: Pull request #285 synchronize by junya-takayama

21m 55s load_lmoutput

load_lmoutput

21m 55s

Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run batch-api tests #149: Pull request #285 synchronize by junya-takayama

2m 24s load_lmoutput

load_lmoutput

2m 24s

Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run tests #995: Pull request #285 synchronize by junya-takayama

21m 36s load_lmoutput

load_lmoutput

21m 36s

Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run batch-api tests #148: Pull request #285 synchronize by junya-takayama

2m 19s load_lmoutput

load_lmoutput

2m 19s

Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run tests #994: Pull request #285 synchronize by junya-takayama

22m 12s load_lmoutput

load_lmoutput

22m 12s

Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run batch-api tests #147: Pull request #285 synchronize by junya-takayama

5m 20s load_lmoutput

load_lmoutput

5m 20s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

All workflows

Actions

Loading...
Loading

All workflows

Uh oh!

Filter by Workflow

Sorry, something went wrong.

Sorry, something went wrong.

No matching workflows.

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: sbintuitions/flexeval

Actions

All workflows All workflows Actions Loading... Loading Sorry, something went wrong. Uh oh! There was an error while loading. Please reload this page.

All workflows

All workflows

Actions

Loading...
Loading