Fix content resizing "Shorten" action for CJK languages by i-anubhav-anand · Pull Request #715 · WordPress/ai

i-anubhav-anand · 2026-06-12T11:43:03Z

Problem

Fixes #578

Japanese, Chinese, and Korean text doesn't use spaces as word separators. @wordpress/wordcount's count( text, 'words', {} ) returns a near-zero count for CJK content, so the "Shorten" action always failed with the error "Text is too short to shorten further." even for long paragraphs.

Solution

Detect CJK content via a Unicode range regex and switch to 'characters_excluding_spaces' counting with a character-based minimum threshold (SHORTEN_MIN_CHARS = 10). The same locale-aware counting is applied to the word-diff display so the +/− indicator remains meaningful for CJK text.

Non-CJK content is unaffected — the existing SHORTEN_MIN_WORDS = 5 path runs as before.
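
A minimal sketch of the threshold check described above, not the PR's actual code. The constant names and values come from this description and the regex ranges from the second commit message. `countWords`, `countCharsExcludingSpaces`, and `canShorten` are hypothetical stand-ins (the real component calls `count()` from @wordpress/wordcount), and treating the thresholds as inclusive minimums is an assumption.

```typescript
// Names and values as stated in the PR description.
const CJK_REGEX = /[\u3000-\u9FFF\uAC00-\uD7FF\uFF01-\uFF60]/;
const SHORTEN_MIN_WORDS = 5;
const SHORTEN_MIN_CHARS = 10;

// Hypothetical stand-in for count( text, 'words', {} ): splits on whitespace only.
function countWords(text: string): number {
  const trimmed = text.trim();
  return trimmed === '' ? 0 : trimmed.split(/\s+/).length;
}

// Hypothetical stand-in for count( text, 'characters_excluding_spaces', {} ).
// Counts UTF-16 code units after removing whitespace (including U+3000).
function countCharsExcludingSpaces(text: string): number {
  return text.replace(/\s/g, '').length;
}

// Assumed check: CJK content is measured in characters, everything else in words.
function canShorten(text: string): boolean {
  if (CJK_REGEX.test(text)) {
    return countCharsExcludingSpaces(text) >= SHORTEN_MIN_CHARS;
  }
  return countWords(text) >= SHORTEN_MIN_WORDS;
}
```

With the sample sentence from the testing steps, the whitespace-based word count stays below `SHORTEN_MIN_WORDS`, while the character count clears `SHORTEN_MIN_CHARS`, so the check passes.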

Changes

src/experiments/content-resizing/components/ContentResizingToolbar.tsx
- Add CJK_REGEX and SHORTEN_MIN_CHARS constants
- handleAction('shorten'): use characters_excluding_spaces count for CJK content
- wordDiff memo: use locale-aware count for accurate +/− display
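
The wordDiff change could look roughly like the sketch below. It assumes the unit is chosen once, from whether either text contains CJK characters, so both sides are measured the same way. `measure`, `lengthDiff`, and the `LengthDiff` shape are hypothetical, and the counters are simple stand-ins for @wordpress/wordcount rather than the component's real memo.

```typescript
const CJK_REGEX = /[\u3000-\u9FFF\uAC00-\uD7FF\uFF01-\uFF60]/;

type CountUnit = 'words' | 'characters';

interface LengthDiff {
  unit: CountUnit;
  delta: number; // negative when the resized text is shorter
  label: string; // e.g. "+4" or "−3"
}

// Stand-in counters; the real code would call count() from @wordpress/wordcount.
function measure(text: string, unit: CountUnit): number {
  if (unit === 'characters') {
    return text.replace(/\s/g, '').length;
  }
  const trimmed = text.trim();
  return trimmed === '' ? 0 : trimmed.split(/\s+/).length;
}

function lengthDiff(original: string, resized: string): LengthDiff {
  // Assumption: use characters if either side contains CJK text.
  const unit: CountUnit =
    CJK_REGEX.test(original) || CJK_REGEX.test(resized) ? 'characters' : 'words';
  const delta = measure(resized, unit) - measure(original, unit);
  const sign = delta > 0 ? '+' : delta < 0 ? '\u2212' : '';
  return { unit, delta, label: `${sign}${Math.abs(delta)}` };
}
```

Measuring both texts in the same unit keeps the badge from comparing a word count against a character count when only one side contains CJK characters.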

Testing

1. Create a post with Japanese/Chinese/Korean paragraph text (e.g. これはテストです。日本語のコンテンツをテストしています。)
2. Select a text block and open the AI resize menu
3. Click Shorten; it should proceed without the "too short" error
4. Verify the word-diff badge shows a reasonable character delta
5. With English text, verify the existing behaviour is unchanged

For languages like Japanese, Chinese, and Korean that don't use spaces as word separators, `count( text, 'words', {} )` returns a very small number (often 0 or 1), causing the "Text is too short to shorten further." error even for long paragraphs. Detect CJK content and use `characters_excluding_spaces` count with a character-based minimum threshold instead. Apply the same locale-aware counting in the word-diff display so the +/- indicator remains meaningful for CJK text.

The regex literal contained a raw ideographic space (U+3000) as the start of its first character range, which ESLint's no-irregular-whitespace rule rejects. Escaped ranges are equivalent and easier to review: \u3000-\u9FFF, \uAC00-\uD7FF, \uFF01-\uFF60.
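
As an illustration of the escaped form, the sketch below builds a character class from the three ranges listed in this commit message and checks a few sample strings against it. The sample list is made up for this example, and the exact literal in ContentResizingToolbar.tsx may differ.

```typescript
// Escaped ranges as listed in the commit message; no raw U+3000 in the source text.
const CJK_REGEX = /[\u3000-\u9FFF\uAC00-\uD7FF\uFF01-\uFF60]/;

// Illustrative samples: [ text, whether the regex is expected to match ].
const samples: Array<[string, boolean]> = [
  ['\u3000', true],            // ideographic space, the start of the first range
  ['日本語', true],             // Japanese kanji
  ['한국어', true],             // Korean Hangul syllables
  ['\uFF01', true],            // fullwidth exclamation mark
  ['plain ASCII text', false], // no CJK characters
];

const results: boolean[] = samples.map(([text]) => CJK_REGEX.test(text));
```

Because the regex has no `g` flag, repeated `test()` calls carry no `lastIndex` state between samples.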

github-actions · 2026-06-12T20:02:26Z

The following accounts have interacted with this PR and/or linked issues. I will continue to update these lists as activity occurs. You can also manually ask me to refresh this list by adding the props-bot label.

If you're merging code through a pull request on GitHub, copy and paste the following into the bottom of the merge commit message.

Co-authored-by: i-anubhav-anand <anubhav24@git.wordpress.org>
Co-authored-by: dkotter <dkotter@git.wordpress.org>
Co-authored-by: t-hamano <wildworks@git.wordpress.org>

To understand the WordPress project's expectations around crediting contributors, please review the Contributor Attribution page in the Core Handbook.

dkotter · 2026-06-12T20:09:24Z

@i-anubhav-anand Thanks for the PR but we do already have an open PR that resolves this same thing (see #581). I'd suggest reviewing that PR and if you have comments or concerns with the approach, best to leave those there instead of opening this duplicate PR.

i-anubhav-anand · 2026-06-15T14:20:53Z

Thanks @dkotter — you're right, this overlaps the existing PR you linked. Closing in favor of it to keep the work consolidated; happy to help review or iterate there instead. Apologies for the duplicated effort!

i-anubhav-anand and others added 2 commits June 12, 2026 17:10

i-anubhav-anand marked this pull request as ready for review June 12, 2026 20:02

dkotter assigned i-anubhav-anand Jun 12, 2026

i-anubhav-anand closed this Jun 15, 2026

dkotter mentioned this pull request Jun 15, 2026

Content Resizing: Fix "Shorten" action for CJK languages #729

Closed

i-anubhav-anand wants to merge 2 commits into WordPress:develop from i-anubhav-anand:fix/content-resizing-cjk-word-count
