Summary: | pdfimages does not extract inline jpeg images as jpeg | ||
---|---|---|---|
Product: | poppler | Reporter: | Laur <laur.olb1> |
Component: | general | Assignee: | poppler-bugs <poppler-bugs> |
Status: | RESOLVED FIXED | QA Contact: | |
Severity: | normal | ||
Priority: | medium | CC: | laur.olb1, marti |
Version: | unspecified | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: |
example pdf
extra inline images extract inline images v2 |
Description
Laur
2009-12-13 15:11:34 UTC
Please provide a pdf with that problem. Created attachment 32076 [details]
example pdf
Attached is a sample pdf with an inline image created with sam2p.
FYI, I also reported this bug to the author of xpdf, here was his response: "The issue is that there is no good way to find the end of an inline image data stream without parsing the data. So I would need code that read the JPEG data without decompressing it - just doing enough parsing to find the end of the JPEG stream. That's probably not too hard to do, but it's not high on my priority list." Thanks. Same answer, if anyone is interested to code a patch, you need to start looking at ImageOutputDev.cc the problem lies in that the drawImg call is "inlineImg" so you can't know the length of the stream, and you need to work a bit more. Patches welcome since this is also not high priority for us either. Created attachment 84828 [details] [review] extra inline images Patch to extract inline images. The patch looks a bit complicated but i guess there's nothing more you can actually do. Have you run this over all the files with and without the reusableA bit set? (In reply to comment #6) > Have you run this over all the files with and without the reusableA bit set? yes Created attachment 133545 [details] [review] extract inline images v2 Rebased to master good i guess Pushed |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.