[Libreoffice-bugs] [Bug 135282] New: A showcase of HTML import, editing and export bugs

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Wed Jul 29 15:12:42 UTC 2020


https://bugs.documentfoundation.org/show_bug.cgi?id=135282

            Bug ID: 135282
           Summary: A showcase of HTML import, editing and export bugs
           Product: LibreOffice
           Version: 7.0.0.0.beta1+
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Writer
          Assignee: libreoffice-bugs at lists.freedesktop.org
          Reporter: eyalroz at technion.ac.il

This is filed as a single bug, because it involves a single document, but in
fact likely involves many bugs, some dupes and some probably not.

Instructions:

1. Have the Firefox web browser installed (I use v78.0.2 though it shouldn't
matter much)
2. Install the "Save Page WE" extension.
3. Visit this page:
https://medium.com/datadriveninvestor/what-are-time-series-databases-a3e847608f91
4. Press the "Save Page WE" button
5. Choose a path for the saved copy of the document; let's say you choose
/path/to/doc.html  (you should get a 1.2 MB file, unfortunately)
6. Open /path/to/doc.html in LibreOffice Writer

Now let's enumerate the issues.

HTML meta tags and comments
------------------------------

(See screenshot 01)

1. Many HTML meta tags show up as comments. Most/all of them shouldn't; they
should either go into Document Properties - if nothing else than as custom
properties. Some tags _do_ show up there, but not nearly all.

2. Many HTML meta tags have their name stripped for some reason. Examples:
"twitter:app:name:iphone", "al:ios:app_name".

3. The comment-overflow UI is quite inadequate: an idiosyncratic pair of
buttons, which don't really look like buttons, and their up-triangle and
down-triangle are also incredibly small. which stands in stark contrast to

4. Choice of a relatively-large comment font, and layout of comment balloons,
so that only about 25% of the comment balloons is actual used for comment text.
That's not just an issue with this document or with HTML; it's something I've
noticed elsewhere as well.

5. Comments exhibit entry time, although the comments are _not_ timed. They
exhibit an author even though they have no identified author. Now, if _some_
comments_ had an author and some didn't, that might perhaps make sense, but not
when no comments have authors. This also relates to issue (4.), since a lot of
balloon real-estate is used for the dummy time and author listing.

6. If you want to make HTML meta tags into comments - already a questionable
idea - why have the comment text be HTML _code_? Especially since all you have
about these meta tags is a name and a content string, or perhaps even _just_ a
comment string (issue (2.) )? Shouldn't the comments be just: 

    somenamehere: The tag content here

  and that's that?

Viewing the document:

(Screenshots 02, 03)

7. The document is opened for editing in Normal view rather than Web view. This
doesn't make sense to me - it's an HTML document, saved from the web.

8. The rendered document looks nothing like its rendering in a browser; and its
rendering in different browsers is almost-identical. Now, ok, LO Writer is an
editor, not a browser, so one can expect a few inaccuracies here and there.
But:

  8.1 The basic block/framek layout is very different, up to and including
vertical-vs-horizontal
  8.2 Font sizes and colors are different
  8.3 Areas which are supposed to be demarcated, aren't (e.g. the "Why data
will transform investment management" block)
  8.3 Navigation bar background is missing
  8.4 The centralized, width-limited style for rendering the content is not
respected.
  8.5. The size of (all copies of) each chart  is larger than the size of the
chart when displayed in a browser.

9. Instead of a single crisp chart, we're seeing three (!) copies of the chart,
two of which are blurry to different degrees. 

10. As you get to the end of the document (after the line saying "Original
Source"), LO Writer viewport repainting gets messed up, and scrolling
up-and-down results in the same parts of the content being repeated several
times, while other parts disappear (despite some of them having been visible
before).

Saving/exporting:

11. Saving the document - with no changes - to another HTML file results in 22
files in addition to the new HTML file. That should not happen: LO Writer
should be able to maintain data within "data:" URIs inside the HTML document.

12. Opening the newly-exported HTML document in a browser shows a document very
similar to what we saw in LO - not similar to the original document. That means
the many/most of changes we saw weren't just cosmetic and carried forward to
the output.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20200729/abcde887/attachment-0001.htm>


More information about the Libreoffice-bugs mailing list