Bug 74660

Summary: Text is broken 2/3rds of the way down the page for this PDF
Product: poppler Reporter: Peter Waller <p>
Component: glib frontendAssignee: poppler-bugs <poppler-bugs>
Status: RESOLVED MOVED QA Contact:
Severity: normal    
Priority: medium CC: carlosgc
Version: unspecified   
Hardware: x86-64 (AMD64)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments: Broken page taken from the pdf using pdftk
png file showing text selection is broken

Description Peter Waller 2014-02-07 10:35:34 UTC
Created attachment 93596 [details]
Broken page taken from the pdf using pdftk

The glib API returning text and rectangles actually stops about two thirds of the way down the page. If you try and select them in evince it picks out one character for each line at the bottom of the page. I've attached a small png screenshot.

Chrome is correctly able to select all of the glyphs right the way across the page.

Tested with master on Jan 14, and evince "3.4.0, Using poppler/cairo (0.18.4)".

Perhaps the page is interesting because it contains a large number of glyphs (55,642).
Comment 1 Peter Waller 2014-02-07 10:36:04 UTC
Created attachment 93597 [details]
png file showing text selection is broken
Comment 2 Peter Waller 2014-02-07 10:36:54 UTC
To clarify, this is a problem for me because I can't get access to the glyphs through the API at all.
Comment 3 GitLab Migration User 2018-08-20 21:48:39 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/poppler/poppler/issues/84.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.