Bug 57299 - Add a way to render a page without getting the text
Summary: Add a way to render a page without getting the text
Status: RESOLVED NOTABUG
Alias: None
Product: poppler
Classification: Unclassified
Component: glib frontend (show other bugs)
Version: unspecified
Hardware: Other All
: medium normal
Assignee: poppler-bugs
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-11-20 00:05 UTC by Jose Aliste
Modified: 2012-11-21 00:15 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments
test file (1.26 MB, text/plain)
2012-11-20 00:05 UTC, Jose Aliste
Details

Description Jose Aliste 2012-11-20 00:05:02 UTC
Created attachment 70290 [details]
test file

While looking at large map files like the one of the RATP for the metro of Paris (will upload) these have a lot of text, but this text is in all shapes... Currently the glib frontend will ALWAYS get the text of the page when rendering. A sysprof of the sample shows that most of the time is spent in coalescing the text, which of course is doing nothing as the text on this page is scattered all the way. 

I just quickly removed the setTextPage call in poppler_page_render, and this files render in 3 sec in my machine (i5 with ssd) while normally it takes about 30 sec... 

So, please allow for a way of rendering the file, and getting the text in different calls. I know I know, this will reparse the file twice... but in normal texts, rendering and coalescing is not that slow, so we probably won't notice it too much in applications, while going from 30 sec to 3 sec makes my "tiled manager"  evince finally renders such a file at 100% possible. :)
Comment 1 Jose Aliste 2012-11-21 00:15:14 UTC
Actually, the fact that poppler_page_render gets the text while rendering is an implementation detail, this makes this bug invalid so closing


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.