2007/1/17, Jean-Francois Dockes <<a href="mailto:jean-francois.dockes@wanadoo.fr">jean-francois.dockes@wanadoo.fr</a>>:<div><span class="gmail_quote"></span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Mikkel Kamstrup Erlandsen writes:<br> > 2007/1/17, Jean-Francois Dockes <<a href="mailto:jean-francois.dockes@wanadoo.fr">jean-francois.dockes@wanadoo.fr</a>>:<br> > > The user-level language needs to have a way to force stemming off.
<br> ><br> > I still think it should be an optional thing - as it is now. The language<br> > supports stemming control, but it is not required. As discussed a bit<br> > earlier such extended features should probably be introspectable with a a
<br> > getExtendedFeatures() method (both in simple and live api).<br><br>Not sure we understand each other here. I was referring to the simple<br>search language from <a href="http://wiki.freedesktop.org/wiki/WasabiDraft">
http://wiki.freedesktop.org/wiki/WasabiDraft</a>, and I<br>meant that there should be a way for the user to express her wish for an<br>exact match, with no transformation of the search term. The backend will<br>then do its best. I don't see how the *language* feature could be optional,
<br>not having it would unnecessarily cripple the capabilities of engines which<br>can switch stemming on or off. Maybe we could simply say that terms<br>enclosed in double-quotes are not to be stemmed if possible ?</blockquote>
<div><br>Ah, sorry. I was still thinking about the xml language, I see now that you wrote *user-level* language :-)<br><br>Regarding the user-level language now. Perhaps "flying Dutchman" would mean the unstemmed phrase (if supported), and 'flying Dutchman' (single quotes) could allow for stemming. This is not completely standard, but it doesn't break user expectation in horrible ways (as far as I can see).
<br></div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"> > Do we all agree that we need to define a user level language as well? +1
<br> > from me at least.<br><br>Yes for me too.<br><br> > > I would be much in favour of using the raw user entry here.<br> ><br> > I would be much in favor of allowing userQuery elements embedded in the xml.
<br> > It does introduce a bit of extra coding work for servers...<br><br>I think that I'd prefer to keep xml out of the simple API, but I could also<br>work with your approach.</blockquote><div><br>We have a tie :-) What do you other guys think? I see two arguments for the xml language in both simple and live:
<br><br> - Common api between live and simple interfaces<br> - Both apis can have the powers of both the user-level and the full xml languages, while still having an easy way of doing direct user queries without much parsing.
<br><br>Cheers,<br>Mikkel <br></div><br></div><br>