Bug 103724 - pdftohtml: Spaces are stripped in output (PDF generated with PhantomJS/Node.js)
Summary: pdftohtml: Spaces are stripped in output (PDF generated with PhantomJS/Node.js)
Status: RESOLVED MOVED
Alias: None
Product: poppler
Classification: Unclassified
Component: utils (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: poppler-bugs
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-11-13 19:13 UTC by clark
Modified: 2017-11-15 22:13 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments
test pdf (26.22 KB, application/pdf)
2017-11-13 19:13 UTC, clark
Details
attachment-29259-0.html (1.71 KB, text/html)
2017-11-15 22:13 UTC, clark
Details

Description clark 2017-11-13 19:13:47 UTC
Created attachment 135440 [details]
test pdf

This PDF genereated with PhantomJS are all spaces stripped in the output

pdftohtml -s -i $file $htm_output

poppler-0.61.0

The output for one text box is
<p>FrederikThomsen<br/>Att.:FrederikSpangThomsen</p>

But should be
<p>Frederik Thomsen<br/>Att. :Frederik Spang Thomsen</p>
Comment 1 clark 2017-11-15 21:42:22 UTC
I accidentally opened this in the wrong group.. So created a new one in pdftohtml :)
Comment 2 clark 2017-11-15 22:13:50 UTC
Created attachment 135503 [details]
attachment-29259-0.html

is it fixed?! :D


Den 15-11-2017 kl. 22:55 skrev bugzilla-daemon@freedesktop.org:
> Albert Astals Cid <mailto:aacid@kde.org> changed bug 103724 
> <https://bugs.freedesktop.org/show_bug.cgi?id=103724>
> What 	Removed 	Added
> Resolution 	--- 	MOVED
> Status 	NEW 	RESOLVED
>
> ------------------------------------------------------------------------
> You are receiving this mail because:
>
>   * You reported the bug.
>


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.