Bug 29055

Summary: Weird behaviour when pasting words with acute accents
Product: poppler Reporter: Mateo Matachana <mat30.mail>
Component: general Assignee: poppler-bugs <poppler-bugs>
Status: RESOLVED INVALID QA Contact:
Severity: normal    
Priority: medium    
Version: unspecified   
Hardware: Other   
OS: All   
Whiteboard:
Attachments: PDF testcase where c&p is not properly done

Description Mateo Matachana 2010-07-14 02:18:33 UTC
Created attachment 37033
PDF testcase where c&p is not properly done

I have noticed a small problem with copy-and-paste on a UTF-8 encoded PDF file.

I generate a PDF using LaTeX with these headers:

\documentclass[a4paper]{article}
\usepackage[utf8]{inputenc}
\usepackage[spanish]{babel}



The text is written in Spanish using UTF-8 encoding. It contains some words with acute accents, like comunicación. The PDF is displayed correctly in Evince, but copying those words behaves strangely.

Pasting a word with an accent into another application, for example Gedit, does not work properly. Instead of the expected word, the accented vowel is replaced by a space and the acute accent is placed on the next letter. For example, trying to paste comunicación I get comunicaci ń.
Comment 1 Albert Astals Cid 2010-07-14 11:10:39 UTC
The PDF is badly formed: if you open the file in Adobe Reader and try to copy and paste the text, you'll see that you don't get comunicación either.
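
As a hedged illustration of how such a PDF can come about: with LaTeX's default OT1 font encoding, an accented letter is typically typeset as a separate accent glyph placed over a base letter, so text extraction may not see a single "ó". The sketch below assumes that this is the cause here, which this report does not confirm. It reuses the reporter's preamble and adds the T1 font encoding, which provides precomposed accented glyphs.

```latex
% Minimal sketch. The fontenc line is an assumed workaround,
% not something confirmed in this bug report.
\documentclass[a4paper]{article}
\usepackage[utf8]{inputenc}
\usepackage[T1]{fontenc}   % use precomposed accented glyphs instead of accent + base letter
\usepackage[spanish]{babel}

\begin{document}
comunicación
\end{document}
```

If the assumption holds, copying the word from the regenerated PDF should give comunicación in one piece. Loading a Type 1 font package such as lmodern alongside T1 is a common companion choice, but it is optional for this test.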
