Hello Albert ,<br><br>Thanks for committing the patch. Hope fully this will help to improve the accessibility and usability of pdftohtml option.<br><br><br>With regards<br>Leena Chourey<br>For Accessibility Team<br>CDAC Mumbai<br>
<br><br><div class="gmail_quote">On Fri, Sep 17, 2010 at 3:54 AM, Albert Astals Cid <span dir="ltr"><<a href="mailto:aacid@kde.org">aacid@kde.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
A Dijous, 16 de setembre de 2010, vàreu escriure:<br>
<div class="im">> Dear Poppler Developers,<br>
><br>
> Thanks for response.<br>
><br>
> For copyright information:<br>
><br>
> *Copyright OSSD CDAC Mumbai<br>
><br>
> Changed 2010 by Leena Chourey (<a href="mailto:leenac@cdacmumbai.in">leenac@cdacmumbai.in</a>) & Onkar Potdar (<br>
> <a href="mailto:onkar@cdacmumbai.in">onkar@cdacmumbai.in</a>)<br>
</div>> *<br>
<br>
I've commited your code, it'll be in poppler >= 0.15.1<br>
<font color="#888888"><br>
Albert<br>
</font><div><div></div><div class="h5"><br>
><br>
> With regards<br>
> Leena Chourey<br>
> For Accessibility Team<br>
> CDAC Mumbai<br>
><br>
> On Thu, Sep 16, 2010 at 12:55 AM, Albert Astals Cid <<a href="mailto:aacid@kde.org">aacid@kde.org</a>> wrote:<br>
> > A Dimarts, 7 de setembre de 2010, leena chourey va escriure:<br>
> > > Thanks Albert to review this patch again.<br>
> ><br>
> > The patch seems ok.<br>
> ><br>
> > Before commiting i need the names and e-mail addresses of the people the<br>
> > copyright of this patch belongs to.<br>
> ><br>
> > Albert<br>
> ><br>
> > > Comments are inline:<br>
> > ><br>
> > > On Sun, Jul 25, 2010 at 9:27 PM, Albert Astals Cid <<a href="mailto:aacid@kde.org">aacid@kde.org</a>><br>
> ><br>
> > wrote:<br>
> > > > A Dimarts, 6 de juliol de 2010, leena chourey va escriure:<br>
> > > > > Dear Albert,<br>
> > > ><br>
> > > > Hi<br>
> > > ><br>
> > > > > Thanks for your response.<br>
> > > > ><br>
> > > > > As discussed in the last mail, we have modified the patch so that:<br>
> > > > > - There is no behavioural change in pdftohtml -c <filename><br>
> > > > > means<br>
> ><br>
> > it<br>
> ><br>
> > > > > produces exactly the same output it did before.<br>
> > > > > - Defined new option as pdftohtml -s <filename> to generate a<br>
> ><br>
> > single<br>
> ><br>
> > > > > html file corresponding to a pdf file.<br>
> > > > ><br>
> > > > > Please check and give your feedback if any further change is<br>
> ><br>
> > required.<br>
> ><br>
> > > > You are using a variable you deleted (tmp) in this chunk of code<br>
> > > ><br>
> > > > ***********************<br>
> > > ><br>
> > > > delete tmp;<br>
> > ><br>
> > > This (delete tmp) was from the original development code only, we<br>
> > > didn't made changes regarding tmp. I have checked, it is in the recent<br>
> ><br>
> > development<br>
> ><br>
> > > version also.<br>
> > ><br>
> > > > - fprintf(pageFile,"%s\n<HTML>\n<HEAD>\n<TITLE>Page<br>
> > > > %d</TITLE>\n\n",<br>
> > > ><br>
> > > > - DOCTYPE, page);<br>
> > > > + if (!singleHtml)<br>
> > > > + fprintf(pageFile,"%s\n<HTML>\n<HEAD>\n<TITLE>Page<br>
> > > > %d</TITLE>\n\n",<br>
> > > > DOCTYPE, page);<br>
> > > > + else<br>
> > > > + fprintf(pageFile,"%s\n<HTML>\n<HEAD>\n<TITLE><br>
> ><br>
> > %s</TITLE>\n\n",<br>
> ><br>
> > > > DOCTYPE, tmp->getCString());////file name<br>
> > > > ***********************<br>
> > > ><br>
> > > > I'm also concerned about you adding various <HTML> to the same .html<br>
> > > > page, my<br>
> > > > limited HTML knowledge says you can only have one of those.<br>
> > ><br>
> > > For the above: I would like to say that every page has different<br>
> > > heading details as well as title. This should not be changed for<br>
> > > pages.<br>
> > ><br>
> > > > Also it would be necessary that you update the pdftohtml.1 file (the<br>
> ><br>
> > man<br>
> ><br>
> > > > page)<br>
> > > > adding the new option.<br>
> > ><br>
> > > pdftohtml.1 is updated.<br>
> > ><br>
> > > Please find the latest patch for "pdftohtml -s <file.pdf> " and give<br>
> > > feedback.<br>
> > ><br>
> > > > Albert<br>
> > > ><br>
> > > > > With best regard<br>
> > > > > Leena C<br>
> > > > ><br>
> > > > > On Wed, Jun 23, 2010 at 1:19 AM, Albert Astals Cid <<a href="mailto:aacid@kde.org">aacid@kde.org</a>><br>
> > > ><br>
> > > > wrote:<br>
> > > > > > A Dimarts, 22 de juny de 2010, leena chourey va escriure:<br>
> > > > > > > Dear Albert,<br>
> > > > > > ><br>
> > > > > > > Thanks for giving detail comment to patch.<br>
> > > > > ><br>
> > > > > > > Please check updates given inline:<br>
> > > > > > Please do not forget to CC the poppler mailing list.<br>
> > > > > ><br>
> > > > > > > On Thu, Jun 17, 2010 at 4:14 AM, Albert Astals Cid <<br>
> ><br>
> > <a href="mailto:aacid@kde.org">aacid@kde.org</a>><br>
> ><br>
> > > > > > wrote:<br>
> > > > > > > > A Dimecres, 16 de juny de 2010, omkar va escriure:<br>
> > > > > > > > > Dear Albert,<br>
> > > > > > > > ><br>
> > > > > > > > > Please find the corrected patch for "accessibility of pdf<br>
> > > ><br>
> > > > document<br>
> > > ><br>
> > > > > > > > > " and give your feedback.<br>
> > > > > > > ><br>
> > > > > > > > Hi, some comments:<br>
> > > > > > > > * The comments like<br>
> > > > > > > > // One more parameter(int j) is added in the getCSStyle<br>
> ><br>
> > function<br>
> ><br>
> > > > by<br>
> > > ><br>
> > > > > > CDAC<br>
> > > > > ><br>
> > > > > > > > developer Team<br>
> > > > > > > ><br>
> > > > > > > > need to be removed, if each line had near it who coded it,<br>
> ><br>
> > the<br>
> ><br>
> > > > code<br>
> > > ><br>
> > > > > > > > will<br>
> > > > > > > ><br>
> > > > > > > > be<br>
> > > > > > > > twice as big and much more unreadable<br>
> > > > > > ><br>
> > > > > > > Done, deleted all unwanted comments<br>
> > > > > > ><br>
> > > > > > > > * The spacing of your patches could be better, that is<br>
> > > > > > > ><br>
> > > > > > > > GooString* HtmlFontAccu::getCSStyle(int i, GooString* content<br>
> > > > > > > > ,int j){ should be<br>
> > > > > > > > +GooString* HtmlFontAccu::getCSStyle(int i, GooString*<br>
> > > > > > > > content, int j){ but that's nothing huge, i can fix it<br>
> > > > > > ><br>
> > > > > > > Updated accordingly.<br>
> > > > > > ><br>
> > > > > > > > * You are leaking (i.e. not deleting) jStr in both<br>
> > > > > > > ><br>
> > > > > > > > HtmlFontAccu::getCSStyle<br>
> > > > > > > > and HtmlFontAccu::CSStyle<br>
> > > > > > ><br>
> > > > > > > Deleted jStr<br>
> > > > > > ><br>
> > > > > > > > * I see that the new HtmlPage::complexHtml and the old<br>
> > > > > > > ><br>
> > > > > > > > HtmlPage::dumpComplex<br>
> > > > > > > > are very simple, i if you reused the code instead of copying<br>
> > > > > > > > it<br>
> > > > > > > ><br>
> > > > > > > > * This introduces a behavioural change that is unaccetable,<br>
> > > > > > > > i<br>
> > > > > ><br>
> > > > > > understand<br>
> > > > > ><br>
> > > > > > > > you<br>
> > > > > > > > want pdftohtml to produce a different (in your opinion<br>
> > > > > > > > better) output, for that you'll have to introduce a new<br>
> > > > > > > > comandline<br>
> ><br>
> > option<br>
> ><br>
> > > > to<br>
> > > ><br>
> > > > > > > > pdftohtml (something<br>
> > > > > > > > like --singlehtml) or something like that<br>
> > > > > > ><br>
> > > > > > > For last 2 point we want some clarification.<br>
> > > > > > > As you said behavioural change is unacceptable and also<br>
> > > > > > > suggested to introduce a new command line option to generate<br>
> > > > > > > single html.<br>
> ><br>
> > So<br>
> ><br>
> > > > > > > if we do<br>
> > > > > ><br>
> > > > > > as<br>
> > > > > ><br>
> > > > > > > following, will it be acceptable?<br>
> > > > > > ><br>
> > > > > > > - *Existing is:*<br>
> > > > > > > Command line option: pdftohtml -c <filename><br>
> > > > > > ><br>
> > > > > > > Function called:<br>
> > > > > > > dumpComplex<br>
> > > > > > ><br>
> > > > > > > ()<br>
> > > > > > > {<br>
> > > > > > ><br>
> > > > > > > Read from input file<br>
> > > > > > > Write into file to Generates pagewise html format<br>
> > > > > > ><br>
> > > > > > > }<br>
> > > > > > ><br>
> > > > > > ><br>
> > > > > > > - *Proposed changes:*<br>
> > > > > > > New Command line option : pdftohtml -s <filename><br>
> > > ><br>
> > > > //Checked,<br>
> > > ><br>
> > > > > > > nothing is already defined for -s (pdftohtml -c<br>
> > > > > ><br>
> > > > > > <filename><br>
> > > > > ><br>
> > > > > > > will exists as it is)<br>
> > > > > > ><br>
> > > > > > > - Function called:<br>
> > > > > > > dumpSingle() //new function similar<br>
> > > > > > > to<br>
> > > > > > ><br>
> > > > > > > dumpComplex {<br>
> > > > > > ><br>
> > > > > > > Read from input file<br>
> > > > > > > Write into file to append single html format<br>
> > > > > > ><br>
> > > > > > > }<br>
> > > > > > ><br>
> > > > > > > - A function to “Read from input file” can be defined and<br>
> > > > > > > call it<br>
> > > ><br>
> > > > in<br>
> > > ><br>
> > > > > > > both dumpComplex() and dumpSingle(), So that code duplication<br>
> > > > > > > can be removed (for second last point of your mail).<br>
> > > > > > ><br>
> > > > > > > - And with -s option (for --single Html) behavioural change<br>
> ><br>
> > will<br>
> ><br>
> > > > be<br>
> > > ><br>
> > > > > > > defined separately. (-c will not be affected)<br>
> > > > > ><br>
> > > > > > To be clear, pdftohtml -c should produce exactly the same output<br>
> > > > > > it did before<br>
> > > > > > your patch, pdftohtml -s you can output your version.<br>
> > > > > ><br>
> > > > > > So yes, i think i kind of agree with your proposal.<br>
> > > > > ><br>
> > > > > > Albert<br>
> > > > > ><br>
> > > > > > > For your opinion<br>
> > > > > > ><br>
> > > > > > > With Regards<br>
> > > > > > > Leena C & Onkar P<br>
> > > > > > > (for CDAC Accessibility Team)<br>
> > ><br>
> > > With best regards<br>
> > > Leena C<br>
> ><br>
> > _______________________________________________<br>
> > poppler mailing list<br>
> > <a href="mailto:poppler@lists.freedesktop.org">poppler@lists.freedesktop.org</a><br>
> > <a href="http://lists.freedesktop.org/mailman/listinfo/poppler" target="_blank">http://lists.freedesktop.org/mailman/listinfo/poppler</a><br>
_______________________________________________<br>
poppler mailing list<br>
<a href="mailto:poppler@lists.freedesktop.org">poppler@lists.freedesktop.org</a><br>
<a href="http://lists.freedesktop.org/mailman/listinfo/poppler" target="_blank">http://lists.freedesktop.org/mailman/listinfo/poppler</a><br>
</div></div></blockquote></div><br>