Russian analyzers for Lucene and Lucene.Net
Question
Lucene has quite poor support for the Russian language.
RussianAnalyzer (part of lucene-contrib) is of very low quality.
The RussianStemmer module for Snowball is even worse. It does not recognize Russian text in Unicode strings, apparently assuming that some bizarre mix of Unicode and KOI8-R must be used instead.
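To make the encoding mismatch concrete, here is a minimal Python sketch. It is not Snowball's or Lucene's code; the sample word, the `is_cyrillic_unicode` helper, and the `mixed` string are assumptions made up for illustration. It only shows that the same Russian word has completely different numeric values as Unicode code points and as KOI8-R bytes, so code that expects one will not recognize the other.

```python
# Illustration only: this is NOT Snowball's code. It shows why text checks
# written for KOI8-R values fail on ordinary Unicode strings and vice versa.

word = "книга"  # Russian for "book", as a normal Unicode string

# Unicode code points of the letters (basic Cyrillic block, U+0400..U+04FF).
code_points = [ord(ch) for ch in word]

# The same word encoded as KOI8-R: one byte per letter.
koi8_bytes = list(word.encode("koi8_r"))


def is_cyrillic_unicode(ch: str) -> bool:
    """Hypothetical helper: True if ch lies in the basic Cyrillic block."""
    return 0x0400 <= ord(ch) <= 0x04FF


# Hypothetical "mixed" string: KOI8-R byte values stored as if they were
# Unicode characters, which is the kind of input the stemmer seems to expect.
mixed = "".join(chr(b) for b in koi8_bytes)

print("Unicode code points:", [hex(cp) for cp in code_points])
print("KOI8-R byte values: ", [hex(b) for b in koi8_bytes])
print("word looks Cyrillic: ", all(is_cyrillic_unicode(ch) for ch in word))
print("mixed looks Cyrillic:", any(is_cyrillic_unicode(ch) for ch in mixed))
```

If the stemmer really does compare characters against KOI8-R values, one possible workaround would be converting text to that representation before stemming and back afterwards, but that is a guess based on the behavior described above, not something verified against the Snowball source.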
Do you know of any better solutions?
Recommended answer
My answer is probably too late, but for the record, I've found the analyzers from the AOT project to be much better than those shipped with Lucene.