[Libreoffice-bugs] [Bug 108806] New: DOCX IMPORT: line break appears in a specific document that is absent in Word
bugzilla-daemon at bugs.documentfoundation.org
bugzilla-daemon at bugs.documentfoundation.org
Tue Jun 27 05:40:22 UTC 2017
https://bugs.documentfoundation.org/show_bug.cgi?id=108806
Bug ID: 108806
Summary: DOCX IMPORT: line break appears in a specific document
that is absent in Word
Product: LibreOffice
Version: unspecified
Hardware: All
OS: All
Status: UNCONFIRMED
Severity: normal
Priority: medium
Component: Writer
Assignee: libreoffice-bugs at lists.freedesktop.org
Reporter: mikekaganski at hotmail.com
Created attachment 134305
--> https://bugs.documentfoundation.org/attachment.cgi?id=134305&action=edit
A sanitized DOCX that has no line breaks in Word
The attachment does not have a line break in Word. Its single paragraph reads
> First part of a line (before CRLF). Second part of the same line (after CRLF).
When open in LO, it is split into two lines separated by a line break:
> First part of a line (before CRLF).
> Second part of the same line (after CRLF).
This happens because there is a line break (CRLF) in the markup (document.xml)
that is converted into space by Word, but treated as line break by LibreOffice.
Word behavior should be considered correct, as described at
https://msdn.microsoft.com/en-us/library/ms256097 and ECMA-376-1:2016 17.3.3.31
(although I didn't find documentation specifically discussing CRLF in
WordprocessingML). This is consistent with xml:space="preserve" attribute used
in the file, as documented at https://www.w3.org/TR/xml/#sec-white-space.
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20170627/a060786b/attachment.html>
More information about the Libreoffice-bugs
mailing list