[poppler] [PATCH] Fixup LaTeX composed characters

Tim Brody tdb2 at ecs.soton.ac.uk
Mon May 16 04:10:51 PDT 2011


On Sat, 2011-05-14 at 13:01 +0100, Albert Astals Cid wrote:

> Can you give a look at [bug143479.pdf]? pdftotext in one sentence changes
> 
> En konstant vandtemperatur ved brygningen på mellem 92° – 96° er optimal,
> 92
> 96
> idet de velsmagende komponenter frigøres ved denne temperatur.
> 
> to
> 
> En konstant vandtemperatur ved brygningen på mellem 92v̊arme
> 92 – 96° er optimal,
> 96
> idet de velsmagende komponenter frigøres ved denne temperatur.
> 
> It seems like you are composing the degree symbol with the v (actually no idea 
> where that extra varme comes from either)?

There's something wrong in the calling code. The stream of chars coming
into addChar() is this (varme is from the line below):
9-2-°-v-a-r-m-e

(Note '92 96' also get repeated)

I didn't add vertical overlap-checking because AFAIK that should happen
in ActualText.

/Tim.



More information about the poppler mailing list