[poppler] images in pdftohtml -xml mode
Albert Astals Cid
aacid at kde.org
Tue Nov 15 12:54:35 PST 2011
A Dilluns, 14 de novembre de 2011, Igor Slepchin vàreu escriure:
> On 11/14/2011 07:38 PM, Albert Astals Cid wrote:
> > A Dilluns, 14 de novembre de 2011, Igor Slepchin vàreu escriure:
> >> <...>
> >> The change is small enough that I hope it won't be very
> >> controversial
> >> but comments are certainly appreciated.
> >
> > I'm a bit confused you add encoding="US-ASCII" to the first line
>
> pdf2xml.dtd
>
> > and then you remove it altogether?
>
> Oops, thanks for noticing - removing it was a typo. I added it back now
> - xmllint doesn't like the DTD without the encoding and it does no harm
> to have it there (encoding is theoretically required in external text
> entities that have the text declaration). I also changed the encoding
> there to UTF-8 just in case it matters to anyone (all XML processors are
> required to understand UTF-8).
>
> > I'm wondering if why you did not add make GfxState *state a parameter
>
> of the
>
> > constructor. Seems to be mandatory to call the transform method.
>
> Yeah, could be done that way as well - I sorta had the idea that
> (0,0)-(1,1) user space coordinates could somehow be useful on their own
> but they are clearly not at the moment.
>
> > I'd prefer if you make HtmlImage a class.
>
> Sure, I'll change that - I used struct since I wanted everything there
> to be public anyway.
>
> > It'd be cool if next time you attach the patches instead of making me
>
> go and
>
> > lose time trying to navigate github ;-)
>
> Here you go, with your suggested changes - sorry, I assumed you would
> prefer github :p Let me know if you want me to rebase the branch there
> so that you could pull it without intermediate commits.
Pushed to master, will be in poppler 0.20
Albert
>
> Igor
More information about the poppler
mailing list