Bug 19154 - pdftotext outputs special (danish) characters æøå wrong
Status: RESOLVED NOTABUG
Alias: None
Product: poppler
Classification: Unclassified
Component: general
Version: unspecified
Hardware: Other All
Importance: medium normal
Assignee: poppler-bugs
Reported: 2008-12-18 02:02 UTC by Rune Schjellerup Philosof
Modified: 2008-12-19 12:16 UTC (History)

Attachments
pdf file containing some æ, ø and å chars (33.45 KB, application/pdf)
2008-12-18 02:02 UTC, Rune Schjellerup Philosof

Description Rune Schjellerup Philosof 2008-12-18 02:02:53 UTC
Created attachment 21268
pdf file containing some æ, ø and å chars

The attached pdf file does not correctly convert to text.
The æ, ø and å characters are mangled, which is unfortunate because I was converting the file to text in order to spell-check the document :(

Similar to bug #18213 but not quite the same.
That bug concerns a large I in the pdf that is output as ^L,
while this bug concerns perfectly legal characters in Latin1 and UTF-8.
Comment 1 Albert Astals Cid 2008-12-18 11:31:36 UTC
Not a bug. The only character I see mangled is å; both æ and ø come out fine here. å is also extracted incorrectly by acroread, so it is again LaTeX being too smart and emitting two characters instead of one.
Comment 2 Rune Schjellerup Philosof 2008-12-19 01:20:37 UTC
So I should report a bug in latex?
It sounds like you know some background about this issue. Could you point me to some information about it?
Comment 3 Albert Astals Cid 2008-12-19 12:16:53 UTC
Well, I don't know much LaTeX at all, but from past reports it seems there is a way of telling LaTeX to create a single character for "composed" characters instead of two. I don't really know what the option is; search the internet for it. You might want to ask the LaTeX developers to make that option the default.
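
To illustrate the "two characters instead of one" point, here is a minimal Python sketch of repairing such text after extraction. It assumes the extracted text contains a plain "a" followed by U+030A COMBINING RING ABOVE; that is an assumption for illustration, not something verified against the attached PDF. If the accent is extracted as a separate spacing character, or placed before the letter, this normalization will not help. The helper name compose and the sample word are hypothetical.

```python
import unicodedata


def compose(text: str) -> str:
    """Merge base letter + combining mark pairs into single code points (NFC)."""
    return unicodedata.normalize("NFC", text)


# Assumed extractor output: "a" + U+030A (combining ring) where "å" was meant.
# æ (U+00E6) and ø (U+00F8) are already single code points and stay unchanged.
decomposed = "bla\u030ab\u00e6rgr\u00f8d"
composed = compose(decomposed)

print(len(decomposed), len(composed))  # the composed form is one code point shorter
print(composed)
```

Running the extracted text through a step like this before spell checking would only be a workaround; it does not change what is stored in the PDF itself.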