Summary: | Word Count Wrong for ZWSP delimited text in SEA langauges (Thai, Lao, Khmer, and Burmese) | ||
---|---|---|---|
Product: | LibreOffice | Reporter: | Robert M Campbell <robert.rcampbell> |
Component: | Linguistic | Assignee: | Not Assigned <libreoffice-bugs> |
Status: | NEW --- | QA Contact: | |
Severity: | normal | ||
Priority: | medium | CC: | qubit, robert.rcampbell, timar |
Version: | unspecified | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: |
Test document including ZWSP and non-ZWSP Thai, Lao, Khmer, and Burmese text
Test document including ZWSP and non-ZWSP Thai, Lao, Khmer, and Burmese text Mittaphap Mittaphap Book |
CONFIRMED in LO Version: 4.2.0.0.beta1 + Ubuntu 12.04.3 (In reply to comment #0) > When working with text that uses ZWSPs (zero width spaces) to delimit text, > LibreOffice does not count each word. When the ZWSPs are removed, the word > count acts fine. Per instructions in Test document: REPRO STEPS: - Open test document in LibreOffice - Highlight first 4 paragraphs As noted in the document, the bottom bar shows "202 words" - Highlight the next set of 4 paragraphs As noted in the document, the bottom bar shows "2 words" > But, word selection (double click) and line breaking work fine with or > without ZWSPs. Well, at least there's that! > > Testing document attached. Thanks for the test document. Some of the fonts are not present on my system -- would it be possible to change the test document to use fonts included in LO that exercise the same bug? (if not, perhaps point to where the fonts might be downloaded) Status -> NEW Andras - Is this behavior a bug? Paragraphs 1 & 5 (Thai) - No LibreOffice fonts that I can tell Droid Sans https://www.google.com/fonts/specimen/Droid+Sans Paragraphs 2 & 6 (Khmer) - No LibreOffice fonts that I can tell Khmer OS http://sourceforge.net/projects/khmer/files/Fonts%20-%20KhmerOS/KhmerOS%20Fonts%204.0-%20LGPL%20License/ Paragraphs 3 & 7 (Lao) - No LibreOffice fonts that I can tell Mittaphap http://hg.palaso.org/font-lao2/file/d0764b11848f Padauk (included in LibreOffice) is the Burmese Font I'll adjust the document to the fonts listed. Mittaphap in particular is fairly new and only available as source, not ttf yet, but I have generated some fonts and can attach them here if that would be helpful? Created attachment 89726 [details]
Test document including ZWSP and non-ZWSP Thai, Lao, Khmer, and Burmese text
(In reply to comment #3) > [...various font things ..] > I'll adjust the document to the fonts listed. thanks > Mittaphap in particular is > fairly new and only available as source, not ttf yet, but I have generated > some fonts and can attach them here if that would be helpful? As long as the links are stable and fonts under some FOSS license so we may test against them, then it's generally fine to link to external font files. Created attachment 89727 [details]
Mittaphap
Created attachment 89728 [details]
Mittaphap Book
Mittaphap is licensed OFL Any news on this bug? Anything I can do to help? (In reply to Robert M Campbell from comment #9) > Any news on this bug? Anything I can do to help? Hi Robert, Good question -- sorry for the late reply here! As you can see, we have a large number of open bug reports filed against LibreOffice, so it's often a matter of finding the right resource to help address a particular bug or set of bugs. This bug appears to affect a number of different languages including Thai, so I'd suggest that you check with the Thai mailing list and see if others are experiencing the same problem: https://wiki.documentfoundation.org/Local_Mailing_Lists#Thai If the problem is affecting many people, then we can try to identify someone who'd be interested in working on a fix. This could be a great opportunity for a university CS student or someone else familiar with programming to learn more about LibreOffice. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.
Created attachment 89590 [details] Test document including ZWSP and non-ZWSP Thai, Lao, Khmer, and Burmese text When working with text that uses ZWSPs (zero width spaces) to delimit text, LibreOffice does not count each word. When the ZWSPs are removed, the word count acts fine. But, word selection (double click) and line breaking work fine with or without ZWSPs. Testing document attached.