I call count-words-region (M-=) on a string that mixes Latin, Cyrillic, and IPA characters:
HelloПривheləʊ
The following message is printed:
Region has 1 line, 4 words, and 14 characters.
All of the characters have word syntax (w), but they differ in script:
(char-syntax ?H) ; ?w
(char-syntax ?П) ; ?w
(char-syntax ?ʊ) ; ?w
(aref char-script-table ?H) ; script: latin
(aref char-script-table ?П) ; script: cyrillic
(aref char-script-table ?ʊ) ; script: phonetic
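To make the observation easier to reproduce, here is a minimal sketch that records every position where forward-word stops in a string. The helper name my/word-stops is hypothetical and used only for illustration. It assumes a multibyte temporary buffer with default settings and no minor mode that changes word motion.

```elisp
(defun my/word-stops (string)
  "Return the list of positions where `forward-word' stops in STRING.
Hypothetical helper, for illustration only."
  (with-temp-buffer
    (insert string)
    (goto-char (point-min))
    (let (stops)
      ;; `forward-word' returns nil once it can no longer move over a word.
      (while (forward-word 1)
        (push (point) stops))
      (nreverse stops))))

;; One stop per word that Emacs sees in the sample string.
(my/word-stops "HelloПривheləʊ")
```

If the boundaries fall where the message above suggests, the returned list should contain four positions, even though the string has no whitespace or punctuation.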
Does that mean that a word boundary is determined not only by character syntax but also by character script?
I would like to disable this behavior in selected modes, so that word motion stops only at word boundaries and not at script changes. How can this be achieved?