Qwen3 embed by DrJesseGlass · Pull Request #17 · DrJesseGlass/candle

DrJesseGlass · 2026-04-06T21:56:13Z

No description provided.

* Implement the new Google model * Fix model

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 647657e777

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-06T22:01:19Z

+        // Pad or truncate embeds to seq_len
+        let embed_len = embeds.dim(0)?;
+        if embed_len >= seq_len {
+            return embeds.narrow(0, 0, seq_len)?.unsqueeze(0);


Place modality embeddings at masked token indices

broadcast_embed_to_mask currently pads/truncates embeds from position 0 and returns it as a full sequence, but forward_multimodal later applies it with a sparse token mask (mask_expanded * image_embeds_broadcast). When image/audio placeholder tokens are not a prefix starting at index 0 (e.g., BOS/text before modality tokens), the selected positions pull zeros or the wrong embedding rows, so modality tokens are replaced incorrectly.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-06T22:01:19Z

+        return padded.unsqueeze(0);
+    }
+
+    Ok(zeros)


Support batched multimodal embedding broadcast

For b_sz > 1, broadcast_embed_to_mask unconditionally returns an all-zero tensor, which means every masked image/audio token in batched requests gets zeroed instead of receiving encoder features. This effectively drops all multimodal information whenever batch size is greater than 1.

Useful? React with 👍 / 👎.

…text.rs)

EricLBuehler and others added 3 commits April 2, 2026 11:57

Implement the new Google model (huggingface#3443)

097655a

* Implement the new Google model * Fix model

qwen3 embed

90a30b2

handle NaN

647657e

chatgpt-codex-connector Bot reviewed Apr 6, 2026

View reviewed changes

DrJesseGlass added 4 commits April 9, 2026 10:56

broadcast bug; position bug; clippy

6d4501c

Box Multimodal to address size inconsistency

1db8443

Empty image batch guard (vision.rs); Flash attention sliding window (…

26ae0ad

…text.rs)

remove gemma4 edits; clean doc

8d8a34a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen3 embed#17

Qwen3 embed#17
DrJesseGlass wants to merge 7 commits into
downstreamfrom
qwen3-embed

DrJesseGlass commented Apr 6, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Apr 6, 2026

Uh oh!

chatgpt-codex-connector Bot Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

DrJesseGlass commented Apr 6, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants