Hi all,<div><br></div><div>What is the supposed regression, exactly? Do we now get only sentences as segmented by LO? That would be a serious drawback: the ICU methods are less than perfect, and our results are much more reliable. The BreakIterator simply uses a static list of abbreviations, a vast simplification that cannot capture many ambiguous dots, so it is broken by design.</div> <div><br></div><div>Best,</div><div>Marcin</div><div><br><div class="gmail_quote">On Mon, Mar 4, 2013 at 9:58 PM, Németh László <span dir="ltr">&lt;<a href="mailto:nemeth@numbertext.org" target="_blank">nemeth@numbertext.org</a>&gt;</span> wrote:<br> <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br> <br> If I understand correctly, that was an intended change by the original author,<br> Thomas Lange, supported by contributors such as Marcin Miłkowski and<br> Daniel Naber, to meet real needs: better sentence boundary<br> disambiguation and grammar checking by LanguageTool and other grammar<br> checker components. So the current state is a drawback. I suggest<br> reverting it (it might also help to add some comments to<br> ProofreadingResult.idl to prevent similar changes).<br> <br> Best regards,<br> László<br> <br> 2013/3/4 Olivier R. 
&lt;<a href="mailto:olivier.noreply@gmail.com">olivier.noreply@gmail.com</a>&gt;:<br> > Caolán McNamara wrote<br> >> do you get the pre-LO 4 behaviour?<br> ><br> > Probably.<br> > With LO 3, in doProofreading:<br> > - nStartOfSentencePos was always the beginning of the paragraph (=0)<br> > - nSuggestedSentenceEndPos was always the end of the paragraph (=length of<br> > rText)<br> ><br> > And each paragraph was passed once to the GC.<br> ><br> ><br> ><br> >> Assuming that you do, then it appears to me that the current LO4<br> >> behaviour is the original programmer intent and that the intermediate<br> >> behaviour was a bug (from the programmer intent perspective anyway) in<br> >> whatever versions got released between<br> >> 9f2fde7ab5de20926bb25a6b298b4e5dffb66eb2 and LO4<br> ><br> > Yes, we can assume that was the original programmer intent.<br> > But it worked another way for 3 years and nobody complained about it. :)<br> > I prefer the unintended behavior, as LO does not wrongly assume where<br> > sentences end.<br> ><br> > So what will LO do?<br> ><br> > Olivier<br> ><br> ><br> <span class="HOEnZb"><font color="#888888">><br> > --<br> > View this message in context: <a href="http://nabble.documentfoundation.org/Grammar-checker-Undocumented-change-in-the-API-for-LO-4-tp4030639p4041580.html" target="_blank">http://nabble.documentfoundation.org/Grammar-checker-Undocumented-change-in-the-API-for-LO-4-tp4030639p4041580.html</a><br> > Sent from the Dev mailing list archive at Nabble.com.<br> > _______________________________________________<br> > LibreOffice mailing list<br> > <a href="mailto:LibreOffice@lists.freedesktop.org">LibreOffice@lists.freedesktop.org</a><br> > <a href="http://lists.freedesktop.org/mailman/listinfo/libreoffice" target="_blank">http://lists.freedesktop.org/mailman/listinfo/libreoffice</a><br> </font></span></blockquote></div><br></div>