Summary: | Functionality request: option for removing BOM from beginning of saved text files | ||
---|---|---|---|
Product: | LibreOffice | Reporter: | Bruce Fowler <brf> |
Component: | Writer | Assignee: | Not Assigned <libreoffice-bugs> |
Status: | NEW --- | QA Contact: | |
Severity: | enhancement | ||
Priority: | medium | CC: | brf, iplaw67, leighmanthegreat, sasha.libreoffice |
Version: | 3.5.0 Beta2 | ||
Hardware: | x86 (IA32) | ||
OS: | Linux (All) | ||
Whiteboard: | |||
i915 platform: | i915 features: |
Description
Bruce Fowler
2011-12-29 13:29:58 UTC
Further experimentation reveled that this problem is not related to "base" but shows up simply by saving a "writer" file as "plain text". So I am changing the component from base to writer. To show it, one only need start with a short ".odt" file and follow steps 6-8 in the original bug report. Thanks for bugreport Explanations of these 3 bytes is here: http://en.wikipedia.org/wiki/Byte_order_mark Please, tell: which program has problem with it? Thanks for the reference. I have read the Wikipedia article. It appears to relate entirely to Unicode encoding. In relation to UTF-8 it says, "The Unicode Standard does permit the BOM in UTF-8, but does not require or recommend its use." It further states, "the need for a BOM arises in the context of text interchange, rather than in normal text processing within a closed environment" In any case, I don't want my data saved in UTF-8 for this particular application, but rather in plain ASCII. I tried setting the Tools/Options/Load save->HTML compatibility/Character set to Western Europe (ASCII/US), but the BOM is still there. I can appreciate the utility of the BOM for information interchange, but not for local work with Postscript programs and shell scripts. Perhaps the appropriate fix is to have an option in "load/save" that says, "I really want plain ASCII." I wish I were knowledgeable enough to send you a patch, but the LibreOffice code is a bit formidable! Thanks for your interest and help. > Perhaps the appropriate fix is to have an option in "load/save" that says, "I > really want plain ASCII." I agree with this. But currently we have very few developers. This may take several years. Sorry for such situation. > but not for local work with Postscript programs and shell scripts. But may be will more faster add to script removing this BOM and to ask Postscript programs authors to fix their programs It's easy enough to stop the BOM being written but I presume we want to preserve it in existing documents. Adding self to CC if not already on Glad to see that this bug is still alive. I fixed my immediate problem with a simple "tr" command in my shell script, but I am still not happy with extraneous stuff being inserted in my text data. The easy fix would seem to be to have "Save Text as UTF-8" and "Save Text as ASCII" options available as a preference I can set. Thanks for your continued interest. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.