| Summary: | VIEWING: U+3000 IDEOGRAPHIC SPACE (CJK full width space) and other spaces not rendered as non-printing characters in Writer | ||
|---|---|---|---|
| Product: | LibreOffice | Reporter: | Matthew Francis <fdbugs> |
| Component: | Writer | Assignee: | Not Assigned <libreoffice-bugs> |
| Status: | NEEDINFO --- | QA Contact: | |
| Severity: | normal | ||
| Priority: | medium | CC: | fdbugs |
| Version: | 4.3.0.4 release | ||
| Hardware: | Other | ||
| OS: | All | ||
| Whiteboard: | |||
| i915 platform: | i915 features: | ||
| Bug Depends on: | |||
| Bug Blocks: | 83066 | ||
| Attachments: |
Sample document with spaces
Document rendered without non-printing characters enabled Document rendered with non-printing characters enabled |
||
|
Description
Matthew Francis
2014-08-16 05:32:36 UTC
Created attachment 104700 [details]
Sample document with spaces
Created attachment 104701 [details]
Document rendered without non-printing characters enabled
Created attachment 104702 [details]
Document rendered with non-printing characters enabled
(In reply to comment #0) > U+3000 IDEOGRAPHIC SPACE, which is a wide space used in CJK text, does not > show visibly as a non-printing character when View -> Non-printing > Characters is enabled in Writer. There is certainly no Interpunct character displayed over the Ideographic Space (U+3000) when Non-printing characters are displayed. There are possibly cultural reasons for this, given that the Middle Dot (U+00B7), which is used for Space (U+0020) and No-break Space (U+00A0), is in the Basic Latin block and some Asian scripts use a centralised dot for a full stop. According to http://en.wikipedia.org/wiki/Interpunct these are the main Asian language preferences: Chinese: "In Taiwan the Unicode code point U+2027, Hyphenation Point, is recommended by government as a fullwidth punctuation to separate the given name and the family name of non-Chinese." and "In Chinese, the middle dot is also fullwidth in printed matter, but the regular middle dot (·) is used in computer input, which is then rendered as fullwidth in Chinese-language fonts." Japanese: "Interpuncts are often used to separate transcribed foreign words written in katakana. [...] the Japanese writing system usually does not use space or punctuation to separate words." and "U+30FB ・ katakana middle dot" and "U+FF65 ・ halfwidth katakana middle dot." Korean: "Interpuncts are used in written Korean to denote a list of two or more words, more or less in the same way a slash (/) is used to juxtapose words in many other languages." and "The use of interpuncts has declined in years of digital typography and especially in place of slashes, but, in the strictest sense, a slash cannot replace a middle dot in Korean typography." and "U+318D ㆍ hangul letter araea (아래아) is used more than a middle dot when a interpunct is to be used in Korean typography." In accordance with this I am setting the status to NEEDINFO as Asian language (l10n) experts are required to comment further on what would be considered acceptable practice. > U+FEFF ZERO WIDTH NO-BREAK SPACE Please note that use of U+FEFF as ZWNBSP is deprecated since 2002 (Unicode v3.2) and the Word Joiner (U+2060) is recommended to be used in its place. Thanks for the above comment. Note that one mitigating factor to the other uses for • in CJK text is that, as of current master (4.4), the non-printing characters are displayed in blue text, rather than black, so there is some contrast there by default. For comparison, Word for Mac 2011 appears to use a rectangle the width of the ideographic space for this case. This might be a reasonable model to follow. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.