fix: restore simple FTS tokenizer default #7006
Important: This PR touches the Lance format specification.
Not really
Codecov Report: All modified and coverable lines are covered by tests.
This restores the native FTS default tokenizer to `simple` after ICU showed behavior differences that are too large for the default path. ICU remains available through explicit `base_tokenizer="icu"`, while docs and tests now describe the default as `simple` again.
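The default-versus-explicit behavior described above can be pictured with a small, self-contained sketch. This is not Lance's implementation: `DEFAULT_BASE_TOKENIZER` and `resolve_base_tokenizer` are hypothetical names used only for illustration, and the only details taken from this PR are the default value `simple` and the explicit option `base_tokenizer="icu"`.

```python
# Hypothetical sketch of how an FTS index config might resolve its tokenizer.
# None of these names come from the Lance codebase.

DEFAULT_BASE_TOKENIZER = "simple"  # the default this PR restores


def resolve_base_tokenizer(params: dict) -> str:
    """Return the explicitly requested tokenizer, or fall back to the default."""
    return params.get("base_tokenizer", DEFAULT_BASE_TOKENIZER)


# No explicit choice: the default path uses the simple tokenizer.
default_choice = resolve_base_tokenizer({})

# Explicit opt-in: ICU is still selectable when requested by name.
icu_choice = resolve_base_tokenizer({"base_tokenizer": "icu"})

print(default_choice, icu_choice)
```

In practice, callers who depend on ICU behavior would presumably need to pass `base_tokenizer="icu"` when creating the index rather than rely on the default. How that option is passed depends on the Lance API in use and is an assumption here.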