Bug 7772 - Evince Find misses certain words in PDF documents
Summary: Evince Find misses certain words in PDF documents
Status: RESOLVED FIXED
Alias: None
Product: poppler
Classification: Unclassified
Component: general (show other bugs)
Version: unspecified
Hardware: x86 (IA32) Linux (All)
: high normal
Assignee: poppler-bugs
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-08-04 17:07 UTC by Johan Brannlund
Modified: 2006-08-04 17:15 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments

Description Johan Brannlund 2006-08-04 17:07:41 UTC
This bug is from http://bugzilla.gnome.org/show_bug.cgi?id=346333 , which I was
asked by Nickolay Shmyrev to file here instead.

Please describe the problem:
The Find facility misses certain words in PDF documents. My guess is that this
is related to ligatures.


Steps to reproduce:
1. Download the LaTeX PDF document http://arxiv.org/pdf/gr-qc/0512076
2. Search for the string "indefinite" on the first page.
3. 


Actual results:
Evince says "0 found on this page"

Expected results:
That Evince actually finds the string "indefinite", which occurs several times
on the first page.


Does this happen every time?
Yes.

Other information:
As I stated above, I'm guessing that the bug is related to the "f-i" ligature
in "indefinite". Evince correctly finds the substring "inde", but not "indef"
or "indefi".

This is with evince 0.5.4, poppler 0.5.3 on Ubuntu Edgy.
Comment 1 Johan Brannlund 2006-08-04 17:15:26 UTC
I didn't test this with the newer evince/poppler that had appeared since I filed
the gnome bug - this actually works now. Sorry for the noise.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.