Respect adapter endpoint routing in CLI stream retries by lyonsno · Pull Request #12516 · continuedev/continue

lyonsno · 2026-05-29T04:33:13Z

Summary

The CLI retry helper was routing Responses-capable model names directly to llmApi.responsesStream() whenever that optional method existed. That bypassed the OpenAI adapter's apiBase guard, so OpenAI-compatible providers and proxies could be sent to /v1/responses even when they only support /v1/chat/completions.

This change keeps endpoint selection inside the adapter by having chatCompletionStreamWithBackoff() delegate streaming calls to the required llmApi.chatCompletionStream() contract. The OpenAI adapter still uses the Responses API for official OpenAI endpoints when appropriate, but custom apiBase values stay on the chat-completions path.

Changes

Remove the CLI helper's direct responsesStream() routing branch.
Add a regression test proving a Responses-capable request with both adapter methods present still goes through chatCompletionStream().
Update the streaming/tool-call preservation test so it fails if the CLI calls responsesStream() directly.

Verification

npm --prefix extensions/cli test
npm --prefix packages/openai-adapters test -- --run
npm --prefix extensions/cli run build
npm --prefix extensions/cli run test:e2e
npm --prefix extensions/cli run test:smoke
npm --prefix extensions/cli run typecheck
git diff --check

I also ran two manual regression smokes:

a local OpenAI-compatible proxy with a Responses-capable model name and custom apiBase, which hit /v1/chat/completions once and /v1/responses zero times;
the official OpenAI path with a Responses-capable model, to confirm adapter-owned Responses routing still streams successfully.

github-actions · 2026-05-29T04:33:24Z

All contributors have signed the CLA ✍️ ✅
_{Posted by the CLA Assistant Lite bot.}

cubic-dev-ai

No issues found across 3 files

_{Re-trigger cubic}

chatgpt-codex-connector · 2026-05-29T04:36:04Z

💡 Codex Review

continue/gui/src/forms/AddModelForm.tsx

Lines 75 to 77 in b3af874

    
           setFetchedModelsList((prev) => 
        
             selectedProvider.provider === providerAtFetchTime ? models : prev, 
        
           );

Guard fetched models with current provider state

If the user clicks the refresh icon and then switches providers before the request completes, this closure still compares against the provider value captured when the request started, so the condition is always true and the old provider's fetched models are inserted into the newly selected provider's model list. This can make the form offer models with the wrong providerOptions until another selection clears the list.

continue/core/llm/fetchModels.ts

Lines 180 to 181 in b3af874

    
           const base = apiBase || "https://generativelanguage.googleapis.com/v1beta/"; 
        
           const url = new URL("models", base);

Normalize Gemini API base before appending models

When apiBase is provided without a trailing slash, new URL("models", base) replaces the final path segment instead of appending to it; for example https://generativelanguage.googleapis.com/v1beta becomes https://generativelanguage.googleapis.com/models?... rather than /v1beta/models. That makes model fetching fail for the common custom-base spelling without a trailing slash.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

lyonsno · 2026-05-29T04:37:58Z

I have read the CLA Document and I hereby sign the CLA

Recheck

Respect adapter endpoint routing in CLI stream retries

b3af874

lyonsno requested a review from a team as a code owner May 29, 2026 04:33

github-project-automation Bot added this to Issues and PRs May 29, 2026

github-project-automation Bot moved this to Todo in Issues and PRs May 29, 2026

dosubot Bot added the size:S This PR changes 10-29 lines, ignoring generated files. label May 29, 2026

cubic-dev-ai Bot reviewed May 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Respect adapter endpoint routing in CLI stream retries#12516

Respect adapter endpoint routing in CLI stream retries#12516
lyonsno wants to merge 1 commit into
continuedev:mainfrom
lyonsno:fix-cli-stream-routing-10474

lyonsno commented May 29, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 29, 2026 •

edited

Loading

Uh oh!

cubic-dev-ai Bot left a comment

Uh oh!

chatgpt-codex-connector Bot commented May 29, 2026

Uh oh!

lyonsno commented May 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

lyonsno commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Verification

Uh oh!

github-actions Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cubic-dev-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot commented May 29, 2026

💡 Codex Review

Uh oh!

lyonsno commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

lyonsno commented May 29, 2026 •

edited

Loading

github-actions Bot commented May 29, 2026 •

edited

Loading

lyonsno commented May 29, 2026 •

edited

Loading