[Libreoffice-bugs] [Bug 79845] Mac OS X mdimporter for .odt files with tabulations doesn't import all text

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Fri Sep 14 22:44:07 UTC 2018


https://bugs.documentfoundation.org/show_bug.cgi?id=79845

--- Comment #7 from Dave Huang <khym at azeotrope.org> ---
I'm still seeing the bug in:

Version: 6.1.1.2
Build ID: 5d19a1bfa650b796764388cd8b33a5af1f5baa1b
CPU threads: 4; OS: Mac OS X 10.13.6; UI render: default; 
Locale: en-US (en_US.UTF-8); Calc: group threaded

So I don't know Objective C, but I do know regular C... looking at the source,
from what I can tell, it parses the XML content of the document and when it
finds a <text:p> start tag, it starts collecting the text in the element until
it finds any end tag.

The relevant part of the attached .odt is basically: <text:p><text:span>Column
one text<text:tab />Column two text</text:span></text:p>

So I think the problem is that it sees the <text:p> and starts collecting text,
but when it gets to the <text:tab />, it stops and hence ignores "Column two
text". What if only only stops collecting text when it finds the ending
</text:p>, rather than any end tag?

I'll attach a proposed patch, but as I said, I don't know Objective C. And I
don't have an environment where I can try to build and test the change.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20180914/d3b89add/attachment.html>


More information about the Libreoffice-bugs mailing list