Bug 7772

Summary: Evince Find misses certain words in PDF documents
Product: poppler Reporter: Johan Brannlund <freedesktop-bugs>
Component: generalAssignee: poppler-bugs <poppler-bugs>
Status: RESOLVED FIXED QA Contact:
Severity: normal    
Priority: high CC: nshmyrev
Version: unspecified   
Hardware: x86 (IA32)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:

Description Johan Brannlund 2006-08-04 17:07:41 UTC
This bug is from http://bugzilla.gnome.org/show_bug.cgi?id=346333 , which I was
asked by Nickolay Shmyrev to file here instead.

Please describe the problem:
The Find facility misses certain words in PDF documents. My guess is that this
is related to ligatures.


Steps to reproduce:
1. Download the LaTeX PDF document http://arxiv.org/pdf/gr-qc/0512076
2. Search for the string "indefinite" on the first page.
3. 


Actual results:
Evince says "0 found on this page"

Expected results:
That Evince actually finds the string "indefinite", which occurs several times
on the first page.


Does this happen every time?
Yes.

Other information:
As I stated above, I'm guessing that the bug is related to the "f-i" ligature
in "indefinite". Evince correctly finds the substring "inde", but not "indef"
or "indefi".

This is with evince 0.5.4, poppler 0.5.3 on Ubuntu Edgy.
Comment 1 Johan Brannlund 2006-08-04 17:15:26 UTC
I didn't test this with the newer evince/poppler that had appeared since I filed
the gnome bug - this actually works now. Sorry for the noise.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.