[poppler] pdftohtml patch: new reflow option (fixed)

Albert Astals Cid aacid at kde.org
Mon Jan 5 16:40:24 PST 2009


A Dilluns 05 Gener 2009, Ross Moore va escriure:
> Hi Albert, and others,

You forgot to CC the list :D

>
> On 06/01/2009, at 9:17 AM, Albert Astals Cid wrote:
> > @@ -997,7 +998,7 @@
> >
> >        dumpMetaVars(page);
> >        fprintf(page,"</HEAD>\n");
>
> Is there an advantage in following XHTML recommendations,
> at least partially, by converting all tag-names
> and attribute names to lowercase?
>
>   i.e.  "</head>\n"   here, and  "<body    below, ...
>
> > -      fprintf(page,"<BODY bgcolor=\"#A0A0A0\" vlink=\"blue\"
> > link=\"blue\">\n");
> > +      fprintf(page,"<BODY vlink=\"blue\" link=\"blue\">\n");
> >      }
> >    }
> >
> > Also i see you do
> >
> > fputs(reFlow ? "<p>\n" : "<br>\n",f);
>
>    ... and using  "<br />\n"  here.
> And of course making similar changes throughout.
>
> > But i never see you closing <p> is that correct?
>
> It's OK in early versions of HTML, but not correct
> in XHTML, where every tag needs to have a matching
> close tag -- except when there is no content,
> as in <br /> and  <meta .... />  and <img .... />
> and a few others.

I agree than making XHTML compilant would be an idea, and almost mandatory if 
xml switch is used :D

But i've never used pdftohtml and poppler has lots of bugs to fix so patches 
welcome :-)

Albert

>
> > Albert
>
> Hope this helps,
>
> 	Ross
>
> ------------------------------------------------------------------------
> Ross Moore                                       ross at maths.mq.edu.au
> Mathematics Department                           office: E7A-419
> Macquarie University                             tel: +61 (0)2 9850 8955
> Sydney, Australia  2109                          fax: +61 (0)2 9850 8114
> ------------------------------------------------------------------------




More information about the poppler mailing list