<html>
<head>
<base href="https://bugs.documentfoundation.org/">
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW - Mac OS X mdimporter for .odt files with tabulations doesn't import all text"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=79845#c7">Comment # 7</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW - Mac OS X mdimporter for .odt files with tabulations doesn't import all text"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=79845">bug 79845</a>
from <span class="vcard"><a class="email" href="mailto:khym@azeotrope.org" title="Dave Huang <khym@azeotrope.org>"> <span class="fn">Dave Huang</span></a>
</span></b>
<pre>I'm still seeing the bug in:
Version: 6.1.1.2
Build ID: 5d19a1bfa650b796764388cd8b33a5af1f5baa1b
CPU threads: 4; OS: Mac OS X 10.13.6; UI render: default;
Locale: en-US (en_US.UTF-8); Calc: group threaded
I don't know Objective-C, but I do know regular C. Looking at the source,
from what I can tell, the importer parses the XML content of the document and,
when it finds a <text:p> start tag, starts collecting the text in the element
until it finds any end tag.
The relevant part of the attached .odt is basically: <text:p><text:span>Column
one text<text:tab />Column two text</text:span></text:p>
So I think the problem is that it sees the <text:p> and starts collecting text,
but when it gets to the <text:tab />, it stops and hence ignores "Column two
text". What if it only stops collecting text when it finds the ending
</text:p>, rather than any end tag?
I'll attach a proposed patch, but as I said, I don't know Objective-C, and I
don't have an environment where I can build and test the change.</pre>
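<pre>The suspected behaviour can be reproduced outside the importer. The following
is a minimal sketch in Python, not the Objective-C importer code: it assumes a
SAX-style parser that reports an end-element event for the self-closing
<text:tab />, and the names ParagraphText, extract, SAMPLE and
stop_on_any_end_tag are hypothetical, chosen only for this illustration. The
sample fragment is modelled on the one quoted in the comment, with a namespace
declaration added so it parses on its own.

```python
import xml.sax

# Hypothetical sample modelled on the fragment quoted in the comment.
SAMPLE = (
    '<text:p xmlns:text="urn:oasis:names:tc:opendocument:xmlns:text:1.0">'
    "<text:span>Column one text<text:tab />Column two text</text:span>"
    "</text:p>"
)


class ParagraphText(xml.sax.ContentHandler):
    """Collect text inside <text:p>; names and flags here are illustrative."""

    def __init__(self, stop_on_any_end_tag):
        super().__init__()
        self.stop_on_any_end_tag = stop_on_any_end_tag
        self.collecting = False
        self.chunks = []

    def startElement(self, name, attrs):
        # Start collecting when a paragraph opens.
        if name == "text:p":
            self.collecting = True

    def endElement(self, name):
        # Suspected behaviour: any end tag stops collection, including the
        # end event the parser reports for the self-closing <text:tab />.
        # Proposed behaviour: only the closing </text:p> stops it.
        if self.stop_on_any_end_tag or name == "text:p":
            self.collecting = False

    def characters(self, content):
        if self.collecting:
            self.chunks.append(content)


def extract(xml_text, stop_on_any_end_tag):
    """Return the paragraph text gathered under the chosen stop rule."""
    handler = ParagraphText(stop_on_any_end_tag)
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return "".join(handler.chunks)
```

With stop_on_any_end_tag set to True, extract(SAMPLE, True) yields only
"Column one text"; with it set to False, the text after the tab is kept as
well. The sketch does not insert whitespace for the tab itself, so the two
columns come back joined, and it does not try to handle nested paragraphs.</pre>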
</div>
</p>
</body>
</html>