Bug 54436 - pdftohtml - Issues when converting PDF to HTML
Status: RESOLVED INVALID
Alias: None
Product: poppler
Classification: Unclassified
Component: pdftohtml
Version: unspecified
Hardware: All Windows (All)
Importance: medium critical
Assignee: poppler-bugs
 
Reported: 2012-09-03 12:37 UTC by Nitesh G.
Modified: 2013-04-23 21:54 UTC


Attachments
input PDF (1.32 MB, application/pdf)
2012-09-03 12:37 UTC, Nitesh G.

Description Nitesh G. 2012-09-03 12:37:38 UTC
Created attachment 66540
input PDF

Hi,

I tried to convert the attached PDF to HTML using pdftohtml.exe and found several issues:
Page 1 -> All bullets are replaced by the letter 'n'.
Page 2 -> The text "Media Services" is rendered horizontally instead of vertically.
Page 3 -> There is extra spacing between a word and the hyperlink, and the underline extends too far.
Page 4 -> Japanese characters inside the table are garbled.
Page 5 -> Bullets are lost.
Page 6 -> The style and font differ from those in the original PDF.

I am attaching the reference PDF.

Thanks,
Nitesh
Comment 1 Albert Astals Cid 2012-09-03 21:37:37 UTC
Please open a separate bug for each of the problems. Otherwise it is very hard for us to keep track of what has been fixed and what has not.

