hibernate - Stemming Search Using SnowballPorterFilterFactory Seems to Return Less Results -


i have 4 documents containing following 4 texts respectively.

  • xxx xxx
  • xxx xxx
  • xxx did xxx
  • xxx doing xxx

now perform search text "do" using snowballporterfilterfactory filter, , expect search above 4 documents out. following documents searched out.

  • xxx xxx
  • xxx doing xxx

but when try search text "refactor", documents containing texts "refactor", "refactors", "refactored" or "refactoring" searched out.

why search text "do" cannot return documents containing "does" , "did"? word "do" special , shouldn't using snowballporterfilterfactory filter?

thanks.

i'm not surprised: forms of declared stop words in stop words list provided snowballfilter in lucene didn't care case.

so it's more or less consistent intents.

you can either: * use stopfilterfactory provided english_stop.txt (read comment @ top of file format use); note searching on term do/doing/did won't work (but don't think it's use case per se) ignored if part of larger search, solving issue. * declare them synonyms synonymfilterfactory.

either way, welcome wonderful world of stemming: it's not easy subject.


Comments

Popular posts from this blog

mysql - Dreamhost PyCharm Django Python 3 Launching a Site -

java - Sending SMS with SMSLib and Web Services -

java - How to resolve The method toString() in the type Object is not applicable for the arguments (InputStream) -