Image embeddings on GCP by tonytw1 · Pull Request #19 · eelpie/grid

tonytw1 · 2026-05-09T22:53:59Z

What does this change?

Implements the More Like This and semantic search features using GCP APIs rather than AWS Bedrock.

Uses Gemini Embedding 2 as the model.

Scales uploaded images using ImageOperations before submitting them to the prediction API.
Every image is scaled rather than relying on the original or optimised image.

Does not need a separate Lambda.

How should a reviewer test this change?

How can success be measured?

Who should look at this?

Tested? Documented?

locally by committer
locally by Guardian reviewer
on the Guardian's TEST environment
relevant documentation added or amended (if needed)

… a source image and for input into a prediction end point.

…observed results are based solely on the pixels.

Does it still work?

…ding source image to preserve the aspect ratio of the subjects and (maybe) avoid cropping out of subjects.

Bring in the GCP gen ai client library. Embedding source is presented as an array of image bytes.

Not here; it can go after the normal image upload so that it doesn't impact latency.

…st after the Image message. The update embeddings message needs to arrive after the Image create message.

…ile size so there is no penalty for trying to flex on max clarity of small tiles.

…mmended 768 dimensions.

…d 768 dimensions.

… well as pixels.

0.9 looks like a usable cutoff for visually similar.

…st filter clause.

…lar is handled with the knn special case. Fixes no similar results because of: ``` "filter": { "bool": { "must": [ { "match": { "similar": { "query": "f7bfe3925ac6562dbb7428e32b36c9f5e605a434", "operator": "AND" } } } ], ```

… AI Search.

…ickbox

tonytw1 force-pushed the spike-vertex-embedding branch 3 times, most recently from 76da1b8 to 340dc0c Compare May 11, 2026 19:52

tonytw1 changed the title ~~Spike vertex embedding~~ Image embeddings on GCP May 11, 2026

tonytw1 force-pushed the spike-vertex-embedding branch 3 times, most recently from 1509bdc to ab636a3 Compare May 18, 2026 07:28

tonytw1 force-pushed the dev21 branch from 105c78c to efeffbd Compare May 23, 2026 20:24

tonytw1 force-pushed the spike-vertex-embedding branch 12 times, most recently from b07d5ff to 4305b01 Compare May 25, 2026 17:55

tonytw1 added 10 commits May 25, 2026 18:58

[embedding-source] ImageOperations.createEmbeddingSource to normalise…

ecb42ac

… a source image and for input into a prediction end point.

[embedding-source] Explicitly strip metadata so that we be sure that …

3a5f0d9

…observed results are based solely on the pixels.

[embedding-source] Restrict to gemini2-embedding small image size.

e01cafe

Does it still work?

[embedding-source] Try gemini2-embedding 1 tile image size.

eadbebd

[embedding-source] As pre gemini recommendations letter box the embed…

e26f766

…ding source image to preserve the aspect ratio of the subjects and (maybe) avoid cropping out of subjects.

Spike of a working predict image embeddings call against a GCP API.

a48d582

Bring in the GCP gen ai client library. Embedding source is presented as an array of image bytes.

Feeling around for a place to insert embedding call.

3f560fb

Not here; it can go after the normal image upload so that it doesn't impact latency.

Pass embedding up so that an update embeddings message can be sent ju…

370da7a

…st after the Image message. The update embeddings message needs to arrive after the Image create message.

[query] Implement text query to vector.

b44c5a4

Clean up; constant model id.

309774a

tonytw1 added 29 commits May 25, 2026 18:58

[embedding-source] Use Png as the embed source; billing is based on t…

d671cd9

…ile size so there is no penalty for trying to flex on max clarity of small tiles.

[mapping] SPLIT Provide a geminiEmbedding2 mapping with Google's reco…

087d14f

…mmended 768 dimensions.

[mapping] Provide a geminiEmbedding2 mapping with Google's recommende…

203f0df

…d 768 dimensions.

[mapping] Provide a geminiEmbedding2 mapping with Google's recommende…

57d0688

…d 768 dimensions.

[mapping] Provide a geminiEmbedding2 mapping with Google's recommende…

528bcf8

…d 768 dimensions.

[mapping] 768 gives ~ 1 sec response times; try 256.

998f24e

[mapping] Use 768 in production.

7abb753

[query] Use task type on query embedding.

79fb4c2

[query] Revert; worked better without?

cdb53da

Spike; embedding for uploaded image contains title and description as…

0eb1518

… well as pixels.

Effect of similarity filter.

284d51f

0.9 looks like a usable cutoff for visually similar.

Setting up to use similar too as boolean clause of normal search.

a5c4198

Pass maybeSimilarToVector down to normal search.

3059d85

knn is constrained to the withFilter query. knn should look like a mu…

7844b9f

…st filter clause.

Relax

48dde88

searchRequest is a query of a knn with query as it's filter.

ab7a001

Reable similarity limit

75f3a8c

Relax.

5b6b050

Bigger cast?

5643e37

Always show More Link This link.

3c3c0af

Clean up; try to get similar from structuredQuery without referencing…

da11897

… AI Search.

[ui] Clicking More Like This does not need to set the Use AI Search t…

ab864e8

…ickbox

Relax.

d4d830c

Actually use numCandidates parameter.

ed27bd5

Knn search is unbounded on simaliarity.

572e214

img proxy avif

2a1d050

TODO push up

bfb9c55

TODO push up

d883393

tonytw1 force-pushed the spike-vertex-embedding branch from 4305b01 to d883393 Compare May 25, 2026 17:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image embeddings on GCP#19

Image embeddings on GCP#19
tonytw1 wants to merge 41 commits into
dev21from
spike-vertex-embedding

tonytw1 commented May 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

tonytw1 commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this change?

How should a reviewer test this change?

How can success be measured?

Who should look at this?

Tested? Documented?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

tonytw1 commented May 9, 2026 •

edited

Loading