Summary: | pdftohtml -xml fails to extract text that is extracted in pdftotext | ||
---|---|---|---|
Product: | poppler | Reporter: | Petter Reinholdtsen <pere> |
Component: | pdftohtml | Assignee: | poppler-bugs <poppler-bugs> |
Status: | RESOLVED MOVED | QA Contact: | |
Severity: | normal | ||
Priority: | medium | ||
Version: | unspecified | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
Description
Petter Reinholdtsen
2012-06-05 09:21:14 UTC
This bug was reported against poppler 0.12.4 (old), but it can also be reproduced with the newer poppler 0.18.4. I have not tried 0.20.x, though. Note that also adding -hidden to the arguments makes the text show up in the XML output.

-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed to further activity.

You can subscribe and participate further in the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/poppler/poppler/issues/417.
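The comparison in the description (pdftotext versus pdftohtml -xml, with and without -hidden) could be scripted roughly as follows. This is only a sketch, not taken from the report: the helper names are made up for illustration, the PDF path is supplied by the caller because the report does not name the file, and the extra `-stdout` and `-i` options are assumptions added so that the XML goes to standard output without writing image files.

```python
import subprocess
import sys


def pdftotext_cmd(pdf_path):
    # "-" as the output file asks pdftotext to write to standard output.
    return ["pdftotext", pdf_path, "-"]


def pdftohtml_xml_cmd(pdf_path, hidden=False):
    # -xml selects XML output; -stdout and -i are assumptions of this sketch
    # (print to standard output, skip image extraction).
    cmd = ["pdftohtml", "-xml", "-stdout", "-i"]
    if hidden:
        # The report notes that adding -hidden makes the missing text appear.
        cmd.append("-hidden")
    cmd.append(pdf_path)
    return cmd


def run(cmd):
    # Run a poppler tool and return what it wrote to standard output.
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout


if __name__ == "__main__":
    if len(sys.argv) != 2:
        sys.exit("usage: compare.py FILE.pdf")
    pdf = sys.argv[1]
    plain = run(pdftotext_cmd(pdf))
    xml_default = run(pdftohtml_xml_cmd(pdf))
    xml_hidden = run(pdftohtml_xml_cmd(pdf, hidden=True))
    print("pdftotext characters:        ", len(plain))
    print("pdftohtml -xml characters:   ", len(xml_default))
    print("pdftohtml -xml -hidden chars:", len(xml_hidden))
```

If the XML output grows noticeably once -hidden is added while pdftotext already shows the text, that would match the behaviour described in this report.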