Bug 50646 - pdftotext -htmlmeta outputs incomplete metadata
Status: RESOLVED MOVED
Alias: None
Product: poppler
Classification: Unclassified
Component: utils
Version: unspecified
Hardware: Other Linux (All)
Importance: medium normal
Assignee: poppler-bugs
 
Reported: 2012-06-03 11:15 UTC by madbiologist
Modified: 2018-08-20 21:58 UTC


Description madbiologist 2012-06-03 11:15:33 UTC
This bug was originally reported at https://bugs.launchpad.net/ubuntu/+source/poppler/+bug/993292

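The output of pdftotext -htmlmeta is missing metadata that pdfinfo reports for the same PDF: CreationDate is written with empty content and ModDate is not written at all. pdfinfo does output all known values.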

For example, the pdfinfo output:

Title: Titel
Author: Word
Creator: WordToPDF 2.4 build 127
Producer: AFPL Ghostscript 8.54
CreationDate: Fri Jul 2 09:14:02 2007
ModDate: Fri Jul 2 09:14:02 2007
Tagged: no
Pages: 6
Encrypted: no
Page size: 595 x 842 pts (A4)
File size: 104664 bytes
Optimized: no
PDF version: 1.3

In contrast, the meta section of the pdftotext -htmlmeta output:

<head>
<title>Titel</title>
<meta name="Author" content="Word"/>
<meta name="Creator" content="WordToPDF 2.4 build 127"/>
<meta name="Producer" content="AFPL Ghostscript 8.54"/>
<meta name="CreationDate" content=""/>
</head>
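
To make the discrepancy easy to check mechanically, here is a minimal Python sketch that compares the two outputs shown above. It works on pasted text only and does not invoke pdfinfo or pdftotext; the helper names (parse_pdfinfo, parse_htmlmeta_head, compare_metadata) are hypothetical and not part of poppler.

```python
from html.parser import HTMLParser

# Info-style fields copied from the pdfinfo output in this report.
PDFINFO_OUTPUT = """\
Title: Titel
Author: Word
Creator: WordToPDF 2.4 build 127
Producer: AFPL Ghostscript 8.54
CreationDate: Fri Jul 2 09:14:02 2007
ModDate: Fri Jul 2 09:14:02 2007
"""

# Head section copied from the pdftotext -htmlmeta output in this report.
HTMLMETA_HEAD = """\
<head>
<title>Titel</title>
<meta name="Author" content="Word"/>
<meta name="Creator" content="WordToPDF 2.4 build 127"/>
<meta name="Producer" content="AFPL Ghostscript 8.54"/>
<meta name="CreationDate" content=""/>
</head>
"""


def parse_pdfinfo(text):
    """Split 'Key: value' lines at the first colon (hypothetical helper)."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


class _HeadParser(HTMLParser):
    """Collect <title> text and <meta name=... content=...> pairs."""

    def __init__(self):
        super().__init__()
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # Self-closing <meta .../> tags also arrive here via the default
        # handle_startendtag implementation.
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attributes = dict(attrs)
            if "name" in attributes:
                self.meta[attributes["name"]] = attributes.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.meta["Title"] = self.meta.get("Title", "") + data


def parse_htmlmeta_head(text):
    parser = _HeadParser()
    parser.feed(text)
    parser.close()
    return parser.meta


def compare_metadata(info, meta):
    """Return {key: 'missing' | 'empty'} for fields pdfinfo has but the head lacks."""
    problems = {}
    for key, value in info.items():
        if key not in meta:
            problems[key] = "missing"
        elif value and not meta[key]:
            problems[key] = "empty"
    return problems


report = compare_metadata(parse_pdfinfo(PDFINFO_OUTPUT),
                          parse_htmlmeta_head(HTMLMETA_HEAD))
for key, problem in report.items():
    print(f"{key}: {problem}")
```

For the two samples above, this flags CreationDate as empty and ModDate as missing, which matches the behaviour described in this report. To check another file, paste its own pdfinfo and pdftotext -htmlmeta output into the two constants.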
Comment 1 GitLab Migration User 2018-08-20 21:58:00 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe to and participate further in the new bug via this link to our GitLab instance: https://gitlab.freedesktop.org/poppler/poppler/issues/136.

