Summary: | Mojibake when converting a pdf created with miktex on win | ||
---|---|---|---|
Product: | poppler | Reporter: | scriabin <seinsvergessen> |
Component: | utils | Assignee: | poppler-bugs <poppler-bugs> |
Status: | RESOLVED DUPLICATE | QA Contact: | |
Severity: | normal | ||
Priority: | medium | CC: | jason |
Version: | unspecified | ||
Hardware: | x86 (IA32) | ||
OS: | Linux (All) | ||
Whiteboard: | |||
Attachments: | pdf that results in Mojibake |
Description
scriabin
2013-11-28 15:35:30 UTC
> Strangely this problem does not occur if I create the pdf on Linux.
And why did you open against poppler and not against what creates the pdf file?
(In reply to comment #1)
> > Strangely this problem does not occur if I create the pdf on Linux.
>
> And why did you open against poppler and not against what creates the pdf
> file?

Analysis of the pdf did not reveal anything weird. Adobe stuff and pdf2txt.py have no troubles converting it to text, so I deduced it to be rather a poppler issue.

Ok, if you have a pdf that Adobe Reader can extract the text and poppler can't, please attach it.

Created attachment 89960
pdf that results in Mojibake

This is pdf created by miktex on windows. I have no rights to its content and had most of it removed.
bug #60243 again with ZapfDingbats character names :)

These producers really need to start including CMaps.

(In reply to comment #5)
> These producers really need to start including CMaps.

And stop using Type 3 fonts.

(In reply to comment #5)
> bug #60243 again with ZapfDingbats character names :)
>
> These producers really need to start including CMaps.

Thanks! I applied your patch and recompiled poppler 0.24.4. Works like a charm. f ligatures don't seem to be displayed correctly (like fl in flying, I cannot paste the resulting char here), but I can live with that, I only need the pdf's content for language detection + indexing and strip all non-printable chars before passing the content to the indexer.
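
For anyone doing the same kind of cleanup before indexing, here is a minimal Python sketch of the "strip all non-printable chars" step, with ligature folding added. It assumes the extracted ligatures arrive as Unicode presentation forms (U+FB00 to U+FB06), which this thread does not confirm; the function name `clean_extracted_text` is hypothetical and not part of poppler or any indexer.

```python
import unicodedata


def clean_extracted_text(text: str) -> str:
    """Fold compatibility characters and drop non-printable ones.

    Assumption: ligatures show up as presentation forms such as
    U+FB02 (the "fl" ligature), which NFKC decomposes to plain letters.
    """
    # NFKC replaces compatibility characters with their plain equivalents.
    normalized = unicodedata.normalize("NFKC", text)
    # Unicode categories starting with "C" cover control, format,
    # private-use, surrogate and unassigned code points. Newlines and
    # tabs are kept so that line structure survives.
    return "".join(
        ch
        for ch in normalized
        if ch in "\n\t" or not unicodedata.category(ch).startswith("C")
    )


if __name__ == "__main__":
    sample = "\ufb02ying\x00 \ufb01sh\n"
    print(repr(clean_extracted_text(sample)))
```

NFKC also rewrites other compatibility characters (superscripts, full-width forms and so on), which is usually acceptable for language detection and indexing but would not be for faithful text reproduction. If the ligature glyph was extracted as a private-use or control code point instead, it is simply dropped rather than expanded.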