Problem description
I am working on a natural language processing project in Java. I have a requirement to identify words that belong to similar semantic groups. For example, if words such as study, university, graduate, and attend are found, I want them to be categorized as being related to education. If words such as golfer, batsman, and athlete are found, they should all be categorized under a parent like sportsperson. Is there a way I can achieve this without using any training approach? Is there some tool like WordNet that can be used directly? Any pointers would be greatly appreciated! Thanks, cheers!! :-)
Recommended answer
Yes, you can use WordNet. For example, you can search among the hypernyms of the current word (e.g. study) for your category word (e.g. education or sport). There are JAWS, JWNL, and other libraries; see the related question.
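The hypernym search could look roughly like the sketch below. To keep it runnable without a WordNet installation, it walks a small hand-written hypernym table instead of calling JAWS or JWNL; the class name HypernymCategorizer and every entry in the table are hypothetical illustration data, not actual WordNet content. With a real library, the lookup of a word's parent concepts would come from WordNet instead, and a word may have several senses and several hypernyms rather than a single chain.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

// Toy stand-in for WordNet: each word maps to one direct hypernym (parent concept).
// The entries are hypothetical illustration data, not actual WordNet content.
final class HypernymCategorizer {
    private static final Map<String, String> HYPERNYM = Map.of(
        "golfer", "athlete",
        "batsman", "athlete",
        "athlete", "sportsperson",
        "university", "educational institution",
        "educational institution", "education",
        "graduate", "scholar",
        "scholar", "education");

    /** Walks up the hypernym chain and returns the first category word reached, if any. */
    static Optional<String> categorize(String word, Set<String> categories) {
        Set<String> seen = new HashSet<>(); // guards against cycles in the table
        String current = word;
        while (current != null && seen.add(current)) {
            if (categories.contains(current)) {
                return Optional.of(current);
            }
            current = HYPERNYM.get(current); // null once the chain ends
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        Set<String> categories = Set.of("education", "sportsperson");
        for (String w : List.of("golfer", "batsman", "university", "graduate", "banana")) {
            System.out.println(w + " -> " + categorize(w, categories).orElse("(uncategorized)"));
        }
    }
}
```

Words whose chain never reaches a category word come back empty, so the caller decides how to treat uncategorized words.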
Alternatively, you can compute the similarity between candidate words and category words, e.g. by using ws4j or Semilar.
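To make the similarity idea concrete, here is a minimal sketch that scores a candidate word against each category word and picks the best match. It does not call ws4j or Semilar. Instead it computes the Wu-Palmer measure (one of the WordNet-based measures such libraries offer) by hand over a tiny taxonomy. The class name ToyWuPalmer and the taxonomy entries are hypothetical and stand in for WordNet.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Wu-Palmer similarity on a toy taxonomy: 2 * depth(lcs) / (depth(a) + depth(b)),
// where lcs is the deepest concept that is an ancestor of both words.
final class ToyWuPalmer {
    // child -> parent; "entity" is the root. Hypothetical data, not WordNet content.
    private static final Map<String, String> PARENT = Map.of(
        "person", "entity",
        "sportsperson", "person",
        "golfer", "sportsperson",
        "batsman", "sportsperson",
        "scholar", "person",
        "graduate", "scholar");

    /** Path from the word up to the top of its chain, word first; its size is the depth. */
    static List<String> pathToRoot(String word) {
        List<String> path = new ArrayList<>();
        for (String w = word; w != null; w = PARENT.get(w)) {
            path.add(w);
        }
        return path;
    }

    static double similarity(String a, String b) {
        List<String> pathA = pathToRoot(a);
        List<String> pathB = pathToRoot(b);
        // The first node on a's path that also lies on b's path is the deepest shared ancestor.
        for (String node : pathA) {
            if (pathB.contains(node)) {
                int lcsDepth = pathToRoot(node).size(); // the root has depth 1
                return 2.0 * lcsDepth / (pathA.size() + pathB.size());
            }
        }
        return 0.0; // no shared ancestor, e.g. a word missing from the taxonomy
    }

    /** Returns the category word with the highest similarity to the candidate word. */
    static String bestCategory(String word, List<String> categories) {
        String best = null;
        double bestScore = -1.0;
        for (String category : categories) {
            double score = similarity(word, category);
            if (score > bestScore) {
                bestScore = score;
                best = category;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<String> categories = List.of("scholar", "sportsperson");
        for (String w : List.of("golfer", "batsman", "graduate")) {
            System.out.println(w + " -> " + bestCategory(w, categories));
        }
    }
}
```

In practice you would probably also apply a minimum-score threshold, so that words unrelated to every category are left uncategorized rather than forced into the nearest one.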