[poppler] pdftohtml: extract non-jpeg mask as a PNG

Ihar `Philips` Filipau thephilips at gmail.com
Sat Mar 10 08:16:15 PST 2012


On 3/10/12, Albert Astals Cid <aacid at kde.org> wrote:
>> I have made a patch to popper to extract image masks as PNGs.
>>
>> Confusingly, being in slightly wrong frame of mind, I have attached it
>> to this bug - https://bugs.freedesktop.org/show_bug.cgi?id=32340
>>
>> What is the official process submitting patches to poppler?
>
> You attach the patch to a bug where it makes sense, i.e. not where you
> attached it.
>
> Please open a new bug and attach it there.

Done: https://bugs.freedesktop.org/show_bug.cgi?id=47186

> Thanks for the patch :-)

poppler is welcome to accept it, so that I do not need to struggle
with other barely maintained pdf2xml tools. ;)


BTW, does the pdftohtml has an official maintainer?
I want to try to work on my word breaking problem:
https://bugs.freedesktop.org/show_bug.cgi?id=47022
but the code in question has pretty scary TODO attached:

      // TODO: the following line assumes that text is flowing left to
      // right, which will not necessarily be the case, e.g. if rotated;
      // It assesses whether or not two characters are close enough to
      // be part of the same string

and I want to know whether there are objections against me simply
turning the constant used to control word break into a variable.


More information about the poppler mailing list