hibernate - Stemming Search Using SnowballPorterFilterFactory Seems to Return Fewer Results
I have 4 documents containing the following 4 texts, respectively:
- xxx do xxx
- xxx does xxx
- xxx did xxx
- xxx doing xxx
Now I search for the text "do" using the SnowballPorterFilterFactory filter, and I expect the search to return all 4 documents above. However, only the following documents are returned:
- xxx do xxx
- xxx doing xxx
But when I search for the text "refactor", the documents containing "refactor", "refactors", "refactored" or "refactoring" are all returned.
Why can't a search for "do" return the documents containing "does" and "did"? Is the word "do" special, so that I shouldn't be using the SnowballPorterFilterFactory filter for it?
Thanks.
I'm not surprised: the forms of "do" are declared as stop words in the stop word list provided for the Snowball filter in Lucene, so the stemmer presumably didn't care about this case.
So it's more or less consistent with the intent.
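To make the difference between "refactor" and "do" concrete, here is a toy suffix stripper in plain Java. It is not the Snowball algorithm and not Lucene code; the class name `ToySuffixStemmer` and its three rules are made up for illustration. It only shows the general idea of a rule-based stemmer: regular endings get stripped, while an irregular form such as "did" has no suffix to strip and so never collapses to "do".

```java
// Toy illustration only: NOT the Snowball/Porter algorithm and not Lucene code.
final class ToySuffixStemmer {

    // Hypothetical rule set: a few regular English suffixes, tried in order.
    private static final String[] SUFFIXES = {"ing", "ed", "s"};

    // Strip at most one suffix, keeping at least two characters of stem.
    static String stem(String word) {
        String w = word.toLowerCase();
        for (String suffix : SUFFIXES) {
            if (w.endsWith(suffix) && w.length() - suffix.length() >= 2) {
                return w.substring(0, w.length() - suffix.length());
            }
        }
        return w; // no rule applies, e.g. irregular forms such as "did"
    }

    public static void main(String[] args) {
        String[] words = {
            "refactor", "refactors", "refactored", "refactoring",
            "do", "does", "did", "doing"
        };
        for (String w : words) {
            System.out.println(w + " -> " + stem(w));
        }
    }
}
```

With these toy rules all four "refactor" forms share one stem, "doing" reduces to "do", but "does" and "did" end up as different terms, which matches the kind of result described in the question.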
You can either:
* use StopFilterFactory with the provided english_stop.txt (read the comment at the top of the file for the format to use); note that searching on the terms do/doing/did alone won't work (but I don't think that's a use case per se), and they will be ignored if they are part of a larger search, which solves the issue.
* declare them as synonyms with SynonymFilterFactory.
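As a rough sketch of what the two options do to the token stream, here is a plain-Java stand-in. It does not use Lucene or Hibernate Search; the class `IrregularFormsSketch`, its methods, and the word lists are hypothetical, and in a real setup the equivalent steps would be filters in the analyzer chain applied at both index and query time.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Plain-Java stand-in for the two options; not Lucene code.
final class IrregularFormsSketch {

    // Option 1 (assumed list): treat every form of "do" as a stop word.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("do", "does", "did", "doing"));

    // Option 2 (assumed mapping): rewrite the other forms to "do".
    static final Map<String, String> SYNONYMS = new HashMap<>();
    static {
        SYNONYMS.put("does", "do");
        SYNONYMS.put("did", "do");
        SYNONYMS.put("doing", "do");
    }

    // Lowercase and split on whitespace.
    static List<String> tokenize(String text) {
        return Arrays.asList(text.toLowerCase().trim().split("\\s+"));
    }

    // Drop stop words from the token stream.
    static List<String> withStopWords(String text) {
        List<String> out = new ArrayList<>();
        for (String t : tokenize(text)) {
            if (!STOP_WORDS.contains(t)) out.add(t);
        }
        return out;
    }

    // Replace each token by its canonical synonym, if it has one.
    static List<String> withSynonyms(String text) {
        List<String> out = new ArrayList<>();
        for (String t : tokenize(text)) {
            out.add(SYNONYMS.getOrDefault(t, t));
        }
        return out;
    }

    // A document matches when it contains every (non-empty) query token.
    static boolean matches(List<String> docTokens, List<String> queryTokens) {
        return !queryTokens.isEmpty() && docTokens.containsAll(queryTokens);
    }

    public static void main(String[] args) {
        String[] docs = {"xxx do xxx", "xxx does xxx", "xxx did xxx", "xxx doing xxx"};
        for (String d : docs) {
            System.out.println(d + " matches 'do' via synonyms: "
                    + matches(withSynonyms(d), withSynonyms("do")));
        }
        System.out.println("'do' after stop filter: " + withStopWords("do"));
        System.out.println("'refactor doing' after stop filter: "
                + withStopWords("refactor doing"));
    }
}
```

The trade-off is visible in the sketch: with the stop-word option a query consisting only of "do" has no terms left to search on, while the synonym option keeps the word searchable at the cost of maintaining the mapping yourself.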
Either way, welcome to the wonderful world of stemming: it's not an easy subject.