Populate performance summary in VLM runner / engine

### What are you trying to build?

Currently only the text-only models populate the performance summary (token counts and timings); the VLM path should populate it as well. In `runVLMInference`, call `setPromptTokenCount(vlmTokens.count)` and wrap the prefill + generation loop so that its timings are recorded.
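A rough sketch of the runner-level approach, to make the idea concrete. Only `setPromptTokenCount` and `vlmTokens` come from this issue; `PerformanceSummary`, `measure`, `runVLMInferenceSketch`, and the `prefill`/`decodeNext` closures are hypothetical stand-ins, and the real runner and summary types will likely look different.

```swift
import Foundation

// Hypothetical stand-in for the project's performance summary type.
// Only `setPromptTokenCount` is named in this issue; the rest is assumed.
final class PerformanceSummary {
    var promptTokenCount = 0
    var generatedTokenCount = 0
    var prefillSeconds: Double = 0
    var generationSeconds: Double = 0

    func setPromptTokenCount(_ count: Int) { promptTokenCount = count }
}

// Runs `body` and returns its result together with the elapsed seconds.
func measure<T>(_ body: () throws -> T) rethrows -> (value: T, seconds: Double) {
    let start = DispatchTime.now().uptimeNanoseconds
    let value = try body()
    let end = DispatchTime.now().uptimeNanoseconds
    return (value, Double(end - start) / 1_000_000_000)
}

// Sketch of the instrumented runner. `prefill` and `decodeNext` are
// placeholders for the real engine calls; `decodeNext` returns nil at end of sequence.
func runVLMInferenceSketch(
    vlmTokens: [Int],
    maxNewTokens: Int,
    summary: PerformanceSummary,
    prefill: ([Int]) -> Int,
    decodeNext: (Int) -> Int?
) -> [Int] {
    guard maxNewTokens > 0 else { return [] }

    // Record the prompt size before any work starts.
    summary.setPromptTokenCount(vlmTokens.count)

    // Time the prefill pass, which also yields the first generated token.
    let prefillResult = measure { prefill(vlmTokens) }
    summary.prefillSeconds = prefillResult.seconds

    // Time the token-by-token generation loop.
    let generation = measure { () -> [Int] in
        var output = [prefillResult.value]
        while output.count < maxNewTokens,
              let next = decodeNext(output[output.count - 1]) {
            output.append(next)
        }
        return output
    }
    summary.generationSeconds = generation.seconds
    summary.generatedTokenCount = generation.value.count
    return generation.value
}
```

With something like this in place, the runner could print the same summary the text path prints, though the metrics would only be populated for callers that go through `runVLMInference`.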

An alternative design is to instrument `CoreAISequentialVLMEngine` to record `.prompt`/`.extend` spans the way the text engines do, so that metrics work for any caller.
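A minimal sketch of what engine-level instrumentation might look like. The `.prompt` and `.extend` span names come from this issue; `SpanKind`, `Span`, `SpanRecorder`, and `SequentialVLMEngineSketch` are hypothetical and do not reflect the actual `CoreAISequentialVLMEngine` API or how the text engines record spans.

```swift
import Foundation

// Span kinds named in this issue; the enum itself is an assumption.
enum SpanKind { case prompt, extend }

struct Span {
    let kind: SpanKind
    let tokenCount: Int
    let seconds: Double
}

// Hypothetical recorder that times a block of work and stores one span per call.
final class SpanRecorder {
    private(set) var spans: [Span] = []

    func record<T>(_ kind: SpanKind, tokenCount: Int, _ body: () throws -> T) rethrows -> T {
        let start = DispatchTime.now().uptimeNanoseconds
        // The span is appended on exit, including when `body` throws.
        defer {
            let elapsed = Double(DispatchTime.now().uptimeNanoseconds - start) / 1_000_000_000
            spans.append(Span(kind: kind, tokenCount: tokenCount, seconds: elapsed))
        }
        return try body()
    }

    // Total number of tokens processed under the given span kind.
    func totalTokens(_ kind: SpanKind) -> Int {
        return spans.filter { $0.kind == kind }.reduce(0) { $0 + $1.tokenCount }
    }
}

// Toy engine: the model calls are replaced by trivial arithmetic so the
// instrumentation pattern stays visible.
final class SequentialVLMEngineSketch {
    let recorder = SpanRecorder()
    private var position = 0

    // Prefill over the full multimodal prompt, recorded as one `.prompt` span.
    func prompt(_ tokens: [Int]) -> Int {
        return recorder.record(.prompt, tokenCount: tokens.count) { () -> Int in
            self.position = tokens.count
            return tokens.last ?? 0
        }
    }

    // One decode step, recorded as one `.extend` span.
    func extend(_ token: Int) -> Int {
        return recorder.record(.extend, tokenCount: 1) { () -> Int in
            self.position += 1
            return token + 1
        }
    }
}
```

Because the spans are recorded inside the engine, any caller would get token counts and timings without adding its own instrumentation, which is the main advantage over wrapping the loop in the runner.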

### Where are the current docs or utilities unclear?

N/A

### Expected improvement

More readable and informative outputs

### Additional context

_No response_

Populate performance summary in VLM runner / engine #70
