Hi all,<div><br></div><div>what's the supposed regression, exactly? Do we have only sentences as segmented by LO? This would be a serious drawback as ICU methods are less than perfect, and our results are much more reliable (the BreakIterator simply uses a static list of abbreviations which is a vast simplification that cannot really capture a lot of ambiguous dots, so it's broken by design).</div>
<div><br></div><div>Best,</div><div>Marcin</div><div><br><div class="gmail_quote">On Mon, Mar 4, 2013 at 9:58 PM, Németh László <span dir="ltr"><<a href="mailto:nemeth@numbertext.org" target="_blank">nemeth@numbertext.org</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br>
<br>
If I right know, that was an intended change from the original author,<br>
Thomas Lange, supported by the contributors, eg. Marcin Miłkowski and<br>
Daniel Naber, for the real needs, better sentence boundary<br>
disambiguation and grammar checking by LanguageTool and other grammar<br>
checker components. So the recent state is a drawback. I suggest to<br>
revert it (maybe it would be fine to add some comments to the<br>
ProofreadingResult.idl to prevent from similar changes, too).<br>
<br>
Best regards,<br>
László<br>
<br>
2013/3/4 Olivier R. <<a href="mailto:olivier.noreply@gmail.com">olivier.noreply@gmail.com</a>>:<br>
> Caolán McNamara wrote<br>
>> do you get the pre LO 4 behaviour ?<br>
><br>
> Probably.<br>
> With LO 3, in doProofreading:<br>
> - nStartOfSentencePos was always the beginning of the paragraph (=0)<br>
> - nSuggestedSentenceEndPos was always the end of the paragraph (=length of<br>
> rText)<br>
><br>
> And each paragraph was passed once to the GC.<br>
><br>
><br>
><br>
>> Assuming that you do, then it appears to me that the current LO4<br>
>> behaviour is the original programmer intent and that the intermediate<br>
>> behaviour was a bug (from the programmer intent perspective anyway) in<br>
>> whatever versions got released between<br>
>> 9f2fde7ab5de20926bb25a6b298b4e5dffb66eb2 and LO4<br>
><br>
> Yes, we can assume that was the original programmer intent.<br>
> But it worked another way for 3 years and nobody complained about it. :)<br>
> I prefer the unintended behavior, as LO does not assume wrongly what is the<br>
> end of sentences.<br>
><br>
> So what LO will do?<br>
><br>
> Olivier<br>
><br>
><br>
<span class="HOEnZb"><font color="#888888">><br>
> --<br>
> View this message in context: <a href="http://nabble.documentfoundation.org/Grammar-checker-Undocumented-change-in-the-API-for-LO-4-tp4030639p4041580.html" target="_blank">http://nabble.documentfoundation.org/Grammar-checker-Undocumented-change-in-the-API-for-LO-4-tp4030639p4041580.html</a><br>
> Sent from the Dev mailing list archive at Nabble.com.<br>
> _______________________________________________<br>
> LibreOffice mailing list<br>
> <a href="mailto:LibreOffice@lists.freedesktop.org">LibreOffice@lists.freedesktop.org</a><br>
> <a href="http://lists.freedesktop.org/mailman/listinfo/libreoffice" target="_blank">http://lists.freedesktop.org/mailman/listinfo/libreoffice</a><br>
</font></span></blockquote></div><br></div>