[poppler] Characters with accents not correctly handled

Pino Toscano toscano.pino at tiscali.it
Sun Aug 19 14:23:31 PDT 2007


Alle domenica 19 agosto 2007, Laurent Aguerreche ha scritto:
> Accents are correctly handled, that's right (but spaces are all replaced
> with "unbreakable" spaces!).
>
> > You can them open your latex pdf in acrobat reader and
> > see it can neither handle the accents correctly.
>
> Hum... That's wrong. My latex-generated PDF is perfectly opened with
> acroread, evince, kpdf and xpdf. Why?!

Nope, Albert was actually correct, pdflatex _can_, and actually _do_, generate 
PDFs with unreadable text. Example: 
http://bugs.kde.org/show_bug.cgi?id=131564 (note that also acroread fails).

Remember: visual representaton of characters and their "textual" information 
are totally separate - you can have documents with lots of pages, with not a 
single character that can be extracted from it.
That's why you can perfectly open and _read_ it, but not _extract_ its text.

> > So blame latex, not poppler.
>
> Ok but if you know the problem, are latex developers aware too? Do you
> know whether it is fixable?

Bug them, not poppler or any PDF reader.

-- 
Pino Toscano
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part.
Url : http://lists.freedesktop.org/archives/poppler/attachments/20070819/bc259f07/attachment.pgp 


More information about the poppler mailing list