Bug 48012

Summary:	cannot extract text
Product:	poppler	Reporter:	Jakub Wilk <jwilk>
Component:	general	Assignee:	poppler-bugs <poppler-bugs>
Status:	RESOLVED DUPLICATE	QA Contact:
Severity:	normal
Priority:	medium
Version:	unspecified
Hardware:	Other
OS:	All
Whiteboard:
Attachments:	the test case; pdftotext output

Description Jakub Wilk 2012-03-28 13:38:00 UTC

Created attachment 59174 [details]
the test case

pdftotext correctly extracts the Cyrillic part of the attached PDF; however, it outputs garbage instead of the Latin part.

I can search through the Latin text in Adobe Reader, so the PDF itself is OK (or at least not hopelessly bad).

$ pdftotext -v
pdftotext version 0.18.4
Copyright 2005-2011 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2004 Glyph & Cog, LLC

Comment 1 Jakub Wilk 2012-03-28 13:38:31 UTC

Created attachment 59175 [details]
pdftotext output

Comment 2 Jason Crain 2015-03-04 08:53:48 UTC


*** This bug has been marked as a duplicate of bug 78145 ***
