Add TranslateGemma support and reorganize Gemma module structure CAN-34 by DrJesseGlass · Pull Request #10 · DrJesseGlass/candle

DrJesseGlass · 2026-01-21T17:07:21Z

This PR adds support for Google's TranslateGemma translation models and reorganizes the Gemma model family into a unified module structure.
Changes
Module reorganization:

Moved gemma.rs → gemma/gemma1.rs
Consolidated gemma2.rs, gemma3.rs, quantized_gemma3.rs under gemma/
Added gemma/mod.rs with re-exports for backward compatibility

New: TranslateGemma support:

Added gemma/translate_gemma.rs with prompt formatting utilities and ISO 639-1 language codes (55 languages)
Added examples/translate-gemma.rs supporting both full precision and quantized inference

Bug fixes:

gemma3.rs: Make KV tensors contiguous before cache append. This fixes slice-set only supports contiguous tensors error that surfaces with certain GQA ratios (e.g., TranslateGemma 4B)
quantized_gemma3.rs: Added clear_kv_cache() method for multi-turn inference

Key and value states become non-contiguous after transpose but KvCache::append() requires contiguous tensors for slice_set. This worked for some model dimensions but failed for others (e.g., TranslateGemma 4B with different GQA ratios).

DrJesseGlass · 2026-01-21T17:18:41Z

Full precision models auto-download from HuggingFace. Quantized inference requires a local GGUF file via --model-path, as there are no official GGUF conversions from Google. Community conversions are available on HuggingFace.

DrJesseGlass · 2026-01-21T17:20:54Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b83bcc5add

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-21T17:24:51Z

+        let prompt = format_translate_prompt(text, source.code(), target.code());
+
+        let encoding = self
+            .tokenizer
+            .encode(prompt.as_str(), true)
+            .map_err(E::msg)?;


Avoid double-adding special tokens in TranslateGemma

format_translate_prompt already embeds <bos>/turn markers, so calling encode(..., true) will run the tokenizer post-processor again. For Gemma tokenizers that add BOS/EOS, this yields a duplicated <bos> and a trailing <eos> in the input, which can cause empty/early-terminated translations or otherwise shift the generation context. Consider encode(..., false) here (or remove <bos> from the prompt) so the prompt is not post-processed twice.

Useful? React with 👍 / 👎.

…ls; however this is because quantized_gemma3 and gemma3 have different activation functions

DrJesseGlass added 8 commits January 20, 2026 16:06

move gemma,gemma2,gemma3,q_gemma3 to gemma mod with new translate_gemma

2b29147

redesign mod; add translate-gemma example

8e2a242

leverage chat template

f8234f9

unused config

3ddce64

include all langs

26ae63e

quantized version

4fc3905

parse not from_str

b61ff5d

DrJesseGlass changed the title ~~Add TranslateGemma support and reorganize Gemma module structure~~ Add TranslateGemma support and reorganize Gemma module structure CAN-34 Jan 21, 2026

too many args for translate; suppress clippy

b83bcc5

chatgpt-codex-connector Bot reviewed Jan 21, 2026

View reviewed changes

DrJesseGlass added 2 commits January 23, 2026 08:24

removed double bos; quantized still works well but full precision fai…

e046f42

…ls; however this is because quantized_gemma3 and gemma3 have different activation functions

corrected prompt format

a66d466

DrJesseGlass commented Jan 27, 2026

View reviewed changes

Comment thread candle-transformers/src/models/gemma/translate_gemma.rs Outdated

Remove bos in testing Apply suggestion from @DrJesseGlass

7d43890

DrJesseGlass commented Jan 27, 2026

View reviewed changes

Comment thread candle-transformers/src/models/gemma/translate_gemma.rs Outdated

DrJesseGlass and others added 2 commits January 26, 2026 22:16

Test prompt complete

8bd698e

cargo fmt transgemma

109212a

DrJesseGlass closed this Apr 2, 2026

DrJesseGlass reopened this Apr 2, 2026

simp doc

97a3fe3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add TranslateGemma support and reorganize Gemma module structure CAN-34#10

Add TranslateGemma support and reorganize Gemma module structure CAN-34#10
DrJesseGlass wants to merge 15 commits into
downstreamfrom
translate-gemma

DrJesseGlass commented Jan 21, 2026

Uh oh!

DrJesseGlass commented Jan 21, 2026 •

edited

Loading

Uh oh!

DrJesseGlass commented Jan 21, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jan 21, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

DrJesseGlass commented Jan 21, 2026

Uh oh!

DrJesseGlass commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DrJesseGlass commented Jan 21, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

DrJesseGlass commented Jan 21, 2026 •

edited

Loading