Fix: AttributeError: 'Stream' object has no attribute 'choices' and Real-Time Streaming#2145

Open
ramikhafagi96 wants to merge 4 commits into 567-labs:main from ramikhafagi96:fix-tokens-streaming

Conversation

@ramikhafagi96

Fix streaming behavior in create() and create_partial()

Summary

This PR fixes two issues in the streaming path:

  1. create(stream=True) crashes with
    AttributeError: 'Stream' object has no attribute 'choices'.
  2. create_partial() buffers the entire stream into a list, so partial
    models are not streamed to the caller in real time.

The changes enable true streaming for partial models and ensure
create(stream=True) works correctly.


Problems

1. create(stream=True) crash

When stream=True is used without Partial, the OpenAI Stream object
is passed directly to process_response().

Execution falls through to from_response(), which expects a
ChatCompletion object and accesses completion.choices. Since
Stream is an iterator, this results in:

AttributeError: 'Stream' object has no attribute 'choices'
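The failure mode can be reproduced without the SDK: a Stream is just an iterator of chunks, so attribute access on it fails the same way from_response()'s completion.choices does. (fake_stream below is a stand-in, not the real openai.Stream.)

```python
def fake_stream():
    """Stands in for openai.Stream: yields chunks, has no .choices."""
    yield {"delta": "Hello"}
    yield {"delta": " world"}

stream = fake_stream()
try:
    stream.choices  # what from_response() effectively does
except AttributeError as err:
    print(err)  # 'generator' object has no attribute 'choices'
```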

2. create_partial() disables streaming

create_partial() wraps the model with Partial and sets
stream=True, but process_response() consumes the generator:

return list(
    response_model.from_streaming_response(response, mode=mode)
)

Because list() eagerly consumes the generator, the entire stream is
buffered before returning. As a result, partial models are not yielded
in real time.
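The difference is easy to see with a simulated stream (partial_models stands in for from_streaming_response and is not the real method):

```python
def partial_models():
    """Stands in for from_streaming_response(): one partial per chunk."""
    for i in range(3):
        yield {"partial": i}

buffered = list(partial_models())  # blocks until every chunk has arrived
lazy = partial_models()            # returns immediately; nothing consumed yet
first = next(lazy)                 # only the first chunk is pulled
```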


Fix

Generator passthrough for partial streaming

Return the generator directly instead of wrapping it in list().

Before:

return list(
    response_model.from_streaming_response(response, mode=mode)
)

After:

return response_model.from_streaming_response(response, mode=mode)

This allows callers to iterate over partial models as they arrive.
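With the passthrough in place, a caller-side loop sees each partial as soon as it is yielded. A minimal sketch with a simulated stream (the field values are made up for the demo):

```python
def from_streaming_response():
    """Simulated partial-model stream; values are illustrative only."""
    for name in (None, "Ja", "Jason"):
        yield {"name": name}

# Each partial is usable the moment it arrives, not after the stream ends:
seen = []
for partial in from_streaming_response():
    seen.append(partial["name"])
```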


Fallback for create(stream=True)

Added _accumulate_stream() to consume a raw Stream and construct a
synthetic ChatCompletion before passing it to from_response().

This prevents the 'Stream' object has no attribute 'choices' crash.
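A hedged sketch of what such a helper might do; the name _accumulate_stream matches the PR, but the chunk shape and return structure below are assumptions, not the PR's exact code:

```python
def _accumulate_stream(chunks):
    """Concatenate streamed deltas into one synthetic completion (sketch)."""
    content = []
    for chunk in chunks:
        delta = chunk.get("delta")  # assumed chunk shape for this demo
        if delta:
            content.append(delta)
    # Shaped like the ChatCompletion structure from_response() reads:
    return {"choices": [{"message": {"content": "".join(content)}}]}

completion = _accumulate_stream(iter([{"delta": "Hel"}, {"delta": "lo"}]))
```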


Retry behavior with streaming

Retry logic is fundamentally incompatible with streaming partial
responses.

When create_partial() streams results, partial models may already be
yielded to the caller. If validation fails later in the stream, retrying
the request would require retracting previously yielded results, which
is not possible.

For this reason, when process_response() returns a generator
(streaming Partial case), retry_sync now returns it directly instead
of attempting retry logic.

This preserves existing retry behavior for non-streaming calls,
while allowing streaming generators to pass through unchanged.
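The pass-through check can be sketched as follows; this is a simplified stand-in (the real retry_sync takes more arguments and wraps validation in proper retry logic):

```python
from collections.abc import Generator

def retry_sync(process, max_retries=3):
    """Simplified: generators bypass retries; plain results keep them."""
    for _ in range(max_retries):
        result = process()
        if isinstance(result, Generator):
            # Streaming Partial case: partials may already have been
            # yielded to the caller, so a retry cannot retract them.
            return result
        if result is not None:  # stand-in for "validation passed"
            return result
    raise RuntimeError("all retries failed")
```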


Changes

response.py
  • Return the generator for PartialBase + stream
  • Add _accumulate_stream() helper
  • Add fallback handling for create(stream=True)

retry.py
  • Add Generator import
  • Return generators directly in retry_sync


Impact

  • create_partial() now streams partial models in real time
  • create(stream=True) no longer crashes
  • Async and non-streaming paths remain unchanged

Tested with instructor==1.14.1.

@ramikhafagi96 ramikhafagi96 changed the title Fix Streaming Fix: AttributeError: 'Stream' object has no attribute 'choices' and Real-Time Streaming Mar 17, 2026
@jxnl
Collaborator

jxnl commented Mar 18, 2026

I don’t have time to take this one over right now. I’m not going to merge it as-is, but I’ll revisit the streaming design later when I can review the retry/reask semantics and add the missing coverage.
