[poppler] Characters with accents not correctly handled
Laurent Aguerreche
laurent.aguerreche at free.fr
Sun Aug 19 13:46:16 PDT 2007
On Sunday 19 August 2007 at 22:34 +0200, Martin Schröder wrote:
> 2007/8/19, Laurent Aguerreche <laurent.aguerreche at free.fr>:
> > It partially corrects the problems. All the "ff" are replaced by the
> > character "ﬀ" (a sort of two "f" in one character).
>
> It's a ligature. It's a feature. :-)
:-/
So with the DejaVu fonts the "ﬀ" character looks rather ugly, and
emacs22 does not display it at all (just an empty rectangle). \o/
But the real problem is that it becomes impossible to recognize:
- "ﬁ" as "fi"
- "ﬀ" as "ff"
Would it be possible to add a new parameter to pdftotext to make it
ignore ligatures but still export in UTF-8?
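
In the meantime, a possible workaround is to post-process the UTF-8
output. The following is only a minimal sketch in Python, assuming the
ligatures arrive as the Unicode code points U+FB00 to U+FB06; the name
expand_ligatures is hypothetical and is not part of poppler:

```python
import sys

# Hypothetical post-processing helper; not part of poppler or pdftotext.
# Maps the Latin ligature code points U+FB00..U+FB06 to plain letters.
LIGATURES = {
    "\ufb00": "ff",
    "\ufb01": "fi",
    "\ufb02": "fl",
    "\ufb03": "ffi",
    "\ufb04": "ffl",
    "\ufb05": "st",  # long s + t ligature, expanded here as "st"
    "\ufb06": "st",
}

# Translation table built once; keys are single characters.
_TABLE = str.maketrans(LIGATURES)


def expand_ligatures(text: str) -> str:
    """Replace Latin ligature characters with their component letters."""
    return text.translate(_TABLE)


if __name__ == "__main__":
    # Reads text on stdin and writes the expanded text to stdout.
    # Assumes the locale encoding is UTF-8.
    sys.stdout.write(expand_ligatures(sys.stdin.read()))
```

Saved as expand_ligatures.py (an assumed file name), it could be used
as a filter, for example:
pdftotext -enc UTF-8 file.pdf - | python3 expand_ligatures.py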
Laurent.
>
> Best
> Martin