Speeding up text comparison (using sparse matrices)

I have a function that takes two strings and returns their cosine similarity, a value that indicates how closely the two texts are related.

If I want to compare 75 texts with each other, I need to make 75 × 75 = 5,625 individual comparisons so that every text is compared with every other text.

Is there a way to reduce the number of comparisons, for example with sparse matrices or k-means?

I don't want to talk about my function or about ways to compare texts. Just about reducing the number of comparisons.
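
To make the arithmetic concrete, here is a minimal sketch of one way the count could drop without touching the comparison function itself. It relies on cosine similarity being symmetric, so each unordered pair only needs to be scored once, and it skips comparing a text with itself. The names `pairwise_similarities` and `toy_similarity` are hypothetical; `toy_similarity` is only a stand-in for the real cosine-similarity function, which is not shown in the post.

```python
from itertools import combinations


def pairwise_similarities(texts, similarity):
    """Call `similarity` once per unordered pair of distinct texts.

    Assumes similarity(a, b) == similarity(b, a), which holds for
    cosine similarity.
    """
    scores = {}
    for i, j in combinations(range(len(texts)), 2):
        # i < j always, so (j, i) is never computed separately.
        scores[(i, j)] = similarity(texts[i], texts[j])
    return scores


def toy_similarity(a, b):
    # Hypothetical placeholder for the real cosine-similarity function:
    # word-set overlap (Jaccard), used here only so the sketch runs.
    words_a, words_b = set(a.split()), set(b.split())
    union = words_a | words_b
    return len(words_a & words_b) / len(union) if union else 0.0


texts = [f"text number {k}" for k in range(75)]
scores = pairwise_similarities(texts, toy_similarity)
print(len(scores))  # 75 * 74 / 2 = 2775 comparisons instead of 5625
```

The score for texts `i` and `j` can then be looked up as `scores[(min(i, j), max(i, j))]`. This only halves the work; whether clustering or a sparse-matrix formulation would reduce it further depends on details not given here.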