Bug 40279 - Poppler::Page::textList() wrong order
Summary: Poppler::Page::textList() wrong order
Status: RESOLVED MOVED
Alias: None
Product: poppler
Classification: Unclassified
Component: qt4 frontend
Version: unspecified
Hardware: All All
Importance: medium normal
Assignee: poppler-bugs
 
Reported: 2011-08-21 20:09 UTC by xiangxw
Modified: 2018-08-21 10:44 UTC

Attachments
Page 1 is right, but page 2 is wrong. (713.09 KB, application/pdf)
2011-08-21 20:09 UTC, xiangxw

Description xiangxw 2011-08-21 20:09:52 UTC
Created attachment 50442
Page 1 is right, but page 2 is wrong.

When I call Poppler::Page::textList(), the QList<TextBox *> it returns is in the wrong order.
The attachment is an example: page 1 comes out right, but page 2 comes out wrong.
Comment 1 Albert Astals Cid 2011-08-22 02:53:16 UTC
Can you be a bit more precise with what you mean that the order is wrong?
Comment 2 xiangxw 2011-08-22 04:10:33 UTC
(In reply to comment #1)
> Can you be a bit more precise with what you mean that the order is wrong?

I printed the text of each TextBox in the QList<TextBox *> for page 2 of the attached PDF.

The output should begin "as manpower, materials and  technology  in  the early stages of development by  relying ...".
But it begins "regional disparity from the perspective of the  lock-in efect ...".
The boxes should come out in left-to-right reading order.
Comment 3 Albert Astals Cid 2011-08-23 11:41:11 UTC
I see, there is the problem that the documentation says that this should return the text in order, but it is not true :D It can be fixed to do what pdftotext does, but it is not on my priorities at the moment, maybe you want to help? If so say so and i will give you some pointers.
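
Until the library itself returns the boxes in order, a caller could reorder them by position. The following is only a minimal sketch of that idea, not what pdftotext does and not Poppler code: Word is a hypothetical stand-in for a TextBox (its text plus the top-left corner of its bounding box, with y growing downward), and readingOrder, joinText and the lineTolerance default are made-up names and values for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for Poppler::TextBox: the word and the top-left
// corner of its bounding box in page coordinates (y grows downward).
struct Word {
    std::string text;
    double x;
    double y;
};

// Naive reading order: top-to-bottom by line, then left-to-right within a line.
// Words whose y differs from the first word of a line by at most
// lineTolerance are treated as belonging to that line.
std::vector<Word> readingOrder(std::vector<Word> words, double lineTolerance = 2.0) {
    assert(lineTolerance >= 0.0);

    // Pass 1: order by vertical position so words on the same line become adjacent.
    std::stable_sort(words.begin(), words.end(),
                     [](const Word &a, const Word &b) { return a.y < b.y; });

    // Pass 2: cut the sequence into lines and order each line by x.
    auto lineStart = words.begin();
    while (lineStart != words.end()) {
        auto lineEnd = lineStart + 1;
        while (lineEnd != words.end() && lineEnd->y - lineStart->y <= lineTolerance)
            ++lineEnd;
        std::sort(lineStart, lineEnd,
                  [](const Word &a, const Word &b) { return a.x < b.x; });
        lineStart = lineEnd;
    }
    return words;
}

// Join the words with single spaces, e.g. to print and inspect the order.
std::string joinText(const std::vector<Word> &words) {
    std::string out;
    for (const Word &w : words) {
        if (!out.empty())
            out += ' ';
        out += w.text;
    }
    return out;
}
```

With the Qt4 frontend, x and y would presumably be taken from TextBox::boundingBox() and the text from TextBox::text(). A plain position sort like this ignores columns, so on a multi-column page it would interleave the columns line by line; handling that needs real layout analysis.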
Comment 4 xiangxw 2011-08-24 03:44:37 UTC
(In reply to comment #3)
> I see, there is the problem that the documentation says that this should return
> the text in order, but it is not true :D It can be fixed to do what pdftotext
> does, but it is not on my priorities at the moment, maybe you want to help? If
> so say so and i will give you some pointers.
Yes, I'd like to. Please give me some pointers.
Comment 5 GitLab Migration User 2018-08-21 10:44:06 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/poppler/poppler/issues/349.