[poppler] Characters with accents not correctly handled

Laurent Aguerreche laurent.aguerreche at free.fr
Sun Aug 19 13:46:16 PDT 2007


Le dimanche 19 août 2007 à 22:34 +0200, Martin Schröder a écrit :
> 2007/8/19, Laurent Aguerreche <laurent.aguerreche at free.fr>:
> > It partially correct problems. All the "ff" are replaced by character
> > "ff" (a sort of two "f" in one character).
> 
> It's a ligature. It's a feature. :-)

:-/

So with DéjàVu fonts and "ff" character, it looks rather ugly and this
character is not displayed by emacs22 (just an empty rectangle).  \o/

But the real problem is that it is impossible to recognize :
- "fi" as "fi" too
- "ff" as "ff" too
Would it be possible to add a new parameter to pdftotext to make it
ignore ligatures but still export in UTF-8?


Laurent.

> 
> Best
>    Martin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 827 bytes
Desc: Ceci est une partie de message
	=?ISO-8859-1?Q?num=E9riquement?= =?ISO-8859-1?Q?_sign=E9e?=
Url : http://lists.freedesktop.org/archives/poppler/attachments/20070819/b7472320/attachment.pgp 


More information about the poppler mailing list