问题描述
词干是多数民众赞成在需要标签系统的东西。我用好吃的,我没有时间来管理和修剪我的标签。我更小心,我的博客了一点,但它并非十全十美。我写软件的嵌入式系统,这将是更加的功能(有助于用户),如果他们列入制止。
Stemming is something that's needed in tagging systems. I use delicious, and I don't have time to manage and prune my tags. I'm a bit more careful with my blog, but it isn't perfect. I write software for embedded systems that would be much more functional (helpful to the user) if they included stemming.
例如:
解析
分析器
解析
For instance:
Parse
Parser
Parsing
应该都意味着同样的事情,无论系统我把他们入。
Should all mean the same thing to whatever system I'm putting them into.
在理想情况下有一个BSD许可的词干的地方,但如果没有,我在哪里看学习常用算法和技术呢?
Ideally there's a BSD licensed stemmer somewhere, but if not, where do I look to learn the common algorithms and techniques for this?
除了BSD词干,还有什么其他的开源许可的词干是那里?
Aside from BSD stemmers, what other open source licensed stemmers are out there?
推荐答案
雪球词干(C&放大器;的Java )我用它的Python绑定, PyStemmer
Snowball stemmer (C & Java)I've used it's Python binding, PyStemmer
这篇关于词干 - code例子或开源项目?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!