Summary: | wrong spaces in text | ||
---|---|---|---|
Product: | poppler | Reporter: | Pablo Rodríguez <freedesktop> |
Component: | general | Assignee: | poppler-bugs <poppler-bugs> |
Status: | RESOLVED MOVED | QA Contact: | |
Severity: | normal | ||
Priority: | medium | ||
Version: | unspecified | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: |
wrong text extraction with letterspace
pdf test case |
Description
Pablo Rodríguez
2010-06-02 14:29:37 UTC
pdf is not attached Created attachment 36216 [details]
wrong text extraction with letterspace
Sorry, I totally forgot that.
Attached you have the file.
Thanks,
Pablo
Created attachment 139972 [details] pdf test case A simpler example is (from https://gitlab.gnome.org/GNOME/evince/issues/111) is: \documentclass{article} \begin{document} TODO $TODO$ \end{document} which pdftotext extracts as: ------- TODO T ODO 1 ------- A consequence is that searching for 'TODO' only finds the first line, but there is not match for the second one. Even more, Poppler-glib (cairo backend), the second TODO is rendered with a slight space between T and O. Acroread and Foxit renders the text as expected, and find both lines when searching for 'TODO'. -- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/poppler/poppler/issues/137. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.