Sai prefixv2 by pranavshenoy · Pull Request #4909 · apache/cassandra

pranavshenoy · 2026-06-28T18:09:08Z

pranavshenoy · 2026-06-28T19:08:09Z

+     * Depth is 1-based on the key bytes (the first byte is depth 1); the empty prefix (depth 0, the root) is never
+     * accumulated. A null policy is equivalent to {@link #putSingleton(ByteComparable, Object, UpsertTransformer, boolean)}.
+     */
+    public <R> void putSingleton(ByteComparable key,


TODO: avoid code duplication. We could probably pass a new argument to putSingleton() and/or putRecursive().

We might need to support both flows as well.

pranavshenoy · 2026-06-28T19:52:16Z

+            totalPostings++;
+        }
+
+        if (count > 0)


If this writes to disk, we are technically doing two separate I/Os (one for exact and one for prefix). This can be avoided.

pranavshenoy · 2026-06-28T19:56:56Z

+            }
+
            this.blockSize = input.readVInt();
            //TODO This should need to change because we can potentially end up with postings of more than Integer.MAX_VALUE?


Should we change this to a long, since we will have more postings with prefix support?
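
To make the concern concrete, here is a rough back-of-the-envelope sketch. It assumes, hypothetically, that each indexed term contributes one exact posting plus one posting per selected prefix depth (`depth % skip == 0`). The class name, the helper, and the row and term-length figures are all illustrative and not taken from the patch.

```java
public class PostingsCountSketch
{
    /**
     * Hypothetical model: one exact posting per term, plus one posting for
     * every prefix depth that is a multiple of skip.
     */
    public static long postingsPerRow(int termLength, int skip)
    {
        return 1L + termLength / skip;
    }

    public static void main(String[] args)
    {
        long rows = 1_000_000_000L;                 // illustrative row count
        long total = rows * postingsPerRow(16, 4);  // 5 postings per row in this model

        System.out.println(total);                     // 5000000000
        System.out.println(total > Integer.MAX_VALUE); // true

        try
        {
            Math.toIntExact(total); // throws when the value does not fit in an int
        }
        catch (ArithmeticException e)
        {
            System.out.println("does not fit in an int");
        }
    }
}
```

Under this model the count outgrows an int quickly, which is the case a long counter (or an explicit overflow check such as `Math.toIntExact`) would guard against. Whether real segments can reach that size depends on segment limits not shown in this hunk.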

pranavshenoy · 2026-06-30T04:36:50Z

+        {
+            ByteBuffer prefixValue = expression.lower().value.encoded;
+            ByteComparable start = v -> index.termType().asComparableBytes(prefixValue, v);
+            ByteBuffer successor = prefixSuccessor(prefixValue);


The successor is passed to the in-memory subtrie() API. Do we actually need the successor here?
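
For context on the question, here is a minimal sketch of what a prefix successor usually means: the smallest byte sequence that sorts after every sequence starting with the prefix, so that `[prefix, successor)` covers exactly the prefixed keys. The body of `prefixSuccessor` is not visible in this hunk, so the version below is an assumption about its intent, operating on raw unsigned bytes rather than on Cassandra's byte-comparable encoding.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

public class PrefixSuccessorSketch
{
    /**
     * Returns the smallest byte sequence that sorts (unsigned, lexicographic)
     * after every sequence starting with prefix, or null when no such bound
     * exists (empty prefix, or every byte is 0xFF).
     */
    public static ByteBuffer prefixSuccessor(ByteBuffer prefix)
    {
        byte[] bytes = new byte[prefix.remaining()];
        prefix.duplicate().get(bytes); // duplicate() leaves the caller's position untouched

        // Trailing 0xFF bytes cannot be incremented without a carry, so drop them.
        int end = bytes.length;
        while (end > 0 && (bytes[end - 1] & 0xFF) == 0xFF)
            end--;

        if (end == 0)
            return null; // the range is unbounded above

        byte[] successor = Arrays.copyOf(bytes, end);
        successor[end - 1]++; // safe: this byte is below 0xFF
        return ByteBuffer.wrap(successor);
    }

    public static void main(String[] args)
    {
        ByteBuffer s = prefixSuccessor(ByteBuffer.wrap(new byte[]{ 'a', 'b' }));
        System.out.println(Arrays.toString(s.array())); // [97, 99]
    }
}
```

If the trie API can instead return the whole branch rooted at the prefix, the upper bound becomes unnecessary, which is what the question above is probing.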

pranavshenoy · 2026-06-30T05:20:15Z

+        if (indexTermType.isLiteral() && literalPrefixEnabled)
+        {
+            int skip = CassandraRelevantProperties.SAI_POSTINGS_SKIP.getInt();
+            SegmentTrieBuffer buffer = new SegmentTrieBuffer(depth -> depth % skip == 0);


TODO: avoid using SegmentTrieBuffer; instead, try to do this in place for MemtableIndexWriter.
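
As an illustration of the depth policy passed above (`depth -> depth % skip == 0`), the sketch below lists which prefixes of a key such a policy would select, using 1-based depths and never selecting the empty prefix. `PrefixDepthPolicySketch` and `selectedPrefixes` are hypothetical names for this example only. It works on ASCII strings instead of byte-comparable keys and does not reproduce what SegmentTrieBuffer does internally.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

public class PrefixDepthPolicySketch
{
    /**
     * Returns the prefixes of key whose depth satisfies the policy.
     * Depth is 1-based (the first byte is depth 1); depth 0, the empty
     * prefix, is never considered.
     */
    public static List<String> selectedPrefixes(String key, IntPredicate policy)
    {
        byte[] bytes = key.getBytes(StandardCharsets.US_ASCII);
        List<String> selected = new ArrayList<>();
        for (int depth = 1; depth <= bytes.length; depth++)
        {
            if (policy.test(depth))
                selected.add(new String(bytes, 0, depth, StandardCharsets.US_ASCII));
        }
        return selected;
    }

    public static void main(String[] args)
    {
        int skip = 2; // illustrative value, not a project default
        System.out.println(selectedPrefixes("apple", depth -> depth % skip == 0)); // [ap, appl]
    }
}
```

With `skip = 2`, only the prefixes at depths 2 and 4 are selected for "apple". A larger skip accumulates fewer prefix postings per key, presumably at the cost of more work at query time for prefixes that fall between selected depths.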

pranavshenoy · 2026-06-30T05:21:10Z

+    {
+        ByteBuffer prefixBuffer = expression.lower().value.encoded;
+        ByteComparable lowerBound = asComparableBytes(prefixBuffer);
+        ByteBuffer successor = prefixSuccessor(prefixBuffer);


Revisit whether we really need prefixSuccessor.

pranavshenoy added 2 commits June 15, 2026 23:32

supporting prefix search in SAI

08b97e0

changing the traversal logic

7d1a785

support test and mechanism to verify traversal

3a5152e

pranavshenoy wants to merge 3 commits into apache:trunk from pranavshenoy:sai_prefixv2
