Bug 17429 - Provide a "pdftopdf" utility
Summary: Provide a "pdftopdf" utility
Status: RESOLVED MOVED
Alias: None
Product: poppler
Classification: Unclassified
Component: general (show other bugs)
Version: unspecified
Hardware: Other All
: medium enhancement
Assignee: poppler-bugs
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-09-04 09:08 UTC by Michael R Head
Modified: 2018-08-20 22:28 UTC (History)
2 users (show)

See Also:
i915 platform:
i915 features:


Attachments

Description Michael R Head 2008-09-04 09:08:29 UTC
I would like to see a complement to the pdftops and other pdfto* utilities that can output to pdf. I'd like to see options like -embedfonts (which would embed all non-embedded fonts in the output), -paper (for changing paper size) along with -expand and -nocrop, -createopw/-createupw (for creating user/owner passwords for the output). I'm sure there are many other useful options that could be added.

ghostscript provides a tool which can do some of this called pdf2pdf. Unfortunately, its pdf interpreter doesn't appear to be as correct as poppler's and the resulting pdf that ghostscript generates doesn't look that great. So, to properly embed fonts in an existing PDF (which is at times necessary for various publications), I've found I can only get acceptable output by using poppler's pdftops utility and then using ghostscript's ps2pdf utility with -dPDFSETTINGS=/prepress (or -dPDFSETTINGS=/printer) to embed even the standard postscript fonts.

I'd prefer just to be able to use a single poppler tool to get the job done.
Comment 1 Michael R Head 2008-09-04 09:10:49 UTC
Also, the process of pdf -> ps -> pdf throws away the PDF metadata, which I would be possible to keep with a single pdftopdf utility
Comment 2 dynamotwain 2009-01-28 23:15:57 UTC
It would also be useful if it had an option to write the PDF streams out in the uncompressed format to aid in debugging of flaky PDFs. It's a little easier to figure out what is going on when you have a plaintext PDF rather than one with Flate-encoded streams.
Comment 3 William Bader 2010-04-18 20:27:38 UTC
For modifying pdfs, you could look at http://code.google.com/p/pdfsizeopt/
It can subset fonts and perform lossless compression of images.

The program below uncompresses pdfs and writes the objects to a text file.
http://read.pudn.com/downloads174/sourcecode/windows/activex/806479/ExtractPDFText_src/cp/pdf.cpp__.htm
http://google.com/codesearch?q=%22This+file+contains+extremely+crude+C+source+code+to+extract+plain+text%22

For browsing objects in pdfs, I have used http://sourceforge.net/projects/pdfedit/
Comment 4 Petr Pisar 2013-11-03 16:13:38 UTC
CUPS filters contain pdftopdf tool with this functionality (see <http://en.sourceforge.jp/projects/opfc/scm/svn/tree/head/pdftopdf/>).

I would like to see another option to strip (or add) the user/owner password protection. I sometimes get password-protected PDF files and for archiving purposes, it's easier to strip the protection (and then encrypt with my PGP key ).
Comment 5 Pino Toscano 2014-07-06 09:55:58 UTC
(In reply to comment #4)
> I would like to see another option to strip (or add) the user/owner password
> protection. I sometimes get password-protected PDF files and for archiving
> purposes, it's easier to strip the protection (and then encrypt with my PGP
> key ).

This has been asked as bug #18440.
Comment 6 GitLab Migration User 2018-08-20 22:28:24 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/poppler/poppler/issues/239.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.