Bug 57299

Summary: Add a way to render a page without getting the text
Product: poppler Reporter: Jose Aliste <jose.aliste>
Component: glib frontendAssignee: poppler-bugs <poppler-bugs>
Status: RESOLVED NOTABUG QA Contact:
Severity: normal    
Priority: medium    
Version: unspecified   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: i915 features:
Attachments: test file

Description Jose Aliste 2012-11-20 00:05:02 UTC
Created attachment 70290 [details]
test file

While looking at large map files like the one of the RATP for the metro of Paris (will upload) these have a lot of text, but this text is in all shapes... Currently the glib frontend will ALWAYS get the text of the page when rendering. A sysprof of the sample shows that most of the time is spent in coalescing the text, which of course is doing nothing as the text on this page is scattered all the way. 

I just quickly removed the setTextPage call in poppler_page_render, and this files render in 3 sec in my machine (i5 with ssd) while normally it takes about 30 sec... 

So, please allow for a way of rendering the file, and getting the text in different calls. I know I know, this will reparse the file twice... but in normal texts, rendering and coalescing is not that slow, so we probably won't notice it too much in applications, while going from 30 sec to 3 sec makes my "tiled manager"  evince finally renders such a file at 100% possible. :)
Comment 1 Jose Aliste 2012-11-21 00:15:14 UTC
Actually, the fact that poppler_page_render gets the text while rendering is an implementation detail, this makes this bug invalid so closing

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.