Bug 48012

Summary: cannot extract text
Product: poppler Reporter: Jakub Wilk <jwilk>
Component: generalAssignee: poppler-bugs <poppler-bugs>
Status: RESOLVED DUPLICATE QA Contact:
Severity: normal    
Priority: medium    
Version: unspecified   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: i915 features:
Attachments: the test case
pdftotext output

Description Jakub Wilk 2012-03-28 13:38:00 UTC
Created attachment 59174 [details]
the test case

pdftotext correctly extracts Cyrillic part of the attached PDF; however, it outputs garbage instead of the Latin part.

I can search through the Latin text in Adobe Reader, so the PDF itself is OK (or at least not helplessly bad).

$ pdftotext -v
pdftotext version 0.18.4
Copyright 2005-2011 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2004 Glyph & Cog, LLC
Comment 1 Jakub Wilk 2012-03-28 13:38:31 UTC
Created attachment 59175 [details]
pdftotext output
Comment 2 Jason Crain 2015-03-04 08:53:48 UTC

*** This bug has been marked as a duplicate of bug 78145 ***

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.