[Bug 89290] HTML export: replace tabulator characters with double spaces or ellipses (three dot leader) in Table of Contents

bugzilla-daemon at bugs.documentfoundation.org
Wed Feb 3 17:14:18 UTC 2016


https://bugs.documentfoundation.org/show_bug.cgi?id=89290

--- Comment #6 from Martin Nathansen <marsianer at gmail.com> ---
I would like to take this Easy Hack and have a few questions:

I tested the HTML export with two different *.odt documents and found that the
HTML output for the “Table of contents” depends on how the document was
originally created:

1) When the document was created with OpenOffice 4.2, the HTML export of
LibreOffice 5.2 for the “Table of Contents” looks as expected (poorly
formatted, but with nothing missing):

Table of Contents
Heading1  1
Heading2  1
  Heading21  1
  Heading22  2


2) When the document was created with LibreOffice 5.2, the “Table of
Contents” is exported without page numbers:

Table of Contents
Heading1
Heading2
  Heading21
  Heading22

There are also differences in the generated HTML source:
in 1) HTML paragraphs <p>....</p> are used, and
in 2) an HTML table is used: <table>..<tbody> <tr> <td>...</td> …..

So do the missing page numbers in 2) need to be fixed as well?
Perhaps the “Table of Contents” is not detected correctly?


3) I also tested the HTML export via the “Save as” dialog. This
worked for all documents without any problems:

Table of Contents
Heading1.......................................1
Heading2.......................................1
  Heading21....................................1
  Heading22....................................2

Here, for the formatting, HTML spans are added around the headings and the
page numbers respectively:
<p style="margin-bottom: 0in" class="leaders"><span><a
href="#__RefHeading___Toc136_1696943280">Heading1</a></span><span>1</span></p>
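
To make the structure concrete, here is a minimal Python sketch (not
LibreOffice code; the helper name toc_entry_to_html and the tab-separated
input format are my own assumptions) that builds the same kind of markup
from a "heading<TAB>page number" entry:

```python
from html import escape


def toc_entry_to_html(entry: str, anchor: str) -> str:
    """Hypothetical helper: turn 'Heading<TAB>page' into leaders-style markup."""
    # Split at the last tab: heading on the left, page number on the right.
    heading, sep, page = entry.rpartition("\t")
    if not sep:
        # No tab found: treat the whole entry as the heading, with no page number.
        heading, page = entry, ""
    link = f'<a href="#{escape(anchor, quote=True)}">{escape(heading.strip())}</a>'
    return (
        '<p style="margin-bottom: 0in" class="leaders">'
        f"<span>{link}</span><span>{escape(page.strip())}</span></p>"
    )


print(toc_entry_to_html("Heading1\t1", "__RefHeading___Toc136_1696943280"))
```

The two spans keep the heading and the page number separate, so a stylesheet
rule for the "leaders" class can decide how to fill the gap between them.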

As I understand it, there are two tasks to be done:
1. Fix the missing page numbers bug
2. Implement better “Table of Contents” formatting for the “Export” dialog;
parts of the “Save as” HTML export might be reused for this.
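
For the plain-text variant named in the bug title (double spaces or a
three-dot leader instead of the tabulator), the replacement itself could look
roughly like this. Again this is only a Python sketch: replace_toc_tab is a
hypothetical helper, and it assumes the page number follows the last tab of
the entry:

```python
def replace_toc_tab(line: str, leader: str = "  ") -> str:
    """Hypothetical helper: swap the tab before the page number for a leader."""
    # Only the last tab separates heading and page number; leading
    # indentation of nested entries is left untouched.
    head, sep, page = line.rpartition("\t")
    if not sep:
        # Entry without a tab (e.g. no page number): return it unchanged.
        return line
    return f"{head.rstrip()}{leader}{page.strip()}"


print(replace_toc_tab("Heading1\t1"))           # double-space variant
print(replace_toc_tab("  Heading21\t1", "..."))  # three-dot leader variant
```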

Am I on the right track?
