python - How to get n-grams from multiple separate sentences with NLTK

I have a list of English sentences (each sentence is itself a list of tokens), and I want to get their n-grams.
For example:

sentences = [['this', 'is', 'sentence', 'one'], ['hello','again']]

To run

nltk.util.ngrams

I need to flatten the list to:

sentences = ['this','is','sentence','one','hello','again']

But then I get a wrong bigram that spans the sentence boundary:

('one', 'hello')

What is the best way to solve this?

Thanks!

Best answer

Try this:

from itertools import chain

sentences = list(chain(*sentences))

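chain returns a chain object whose .__next__() method returns elements from the first iterable until it is exhausted, then elements from the next iterable, and so on until all of the iterables are exhausted.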
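To illustrate that behavior, here is a small sketch using the example list from the question. The variable names flat_iter, first, rest and flat are only illustrative. chain.from_iterable is the standard-library variant that takes the outer list directly instead of unpacking it with *.

```python
from itertools import chain

sentences = [['this', 'is', 'sentence', 'one'], ['hello', 'again']]

# chain.from_iterable walks the inner lists one after another, lazily.
flat_iter = chain.from_iterable(sentences)

# Elements are produced on demand, starting with the first inner list.
first = next(flat_iter)

# Consuming the rest continues through the first list, then the second.
rest = list(flat_iter)

# chain(*sentences) yields the same flattened sequence.
flat = list(chain(*sentences))
```

Either form works here. from_iterable avoids unpacking a long outer list into function arguments.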

Or you can do the following:

sentences = [i for s in sentences for i in s]
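Note that both snippets above flatten the list, so a bigram pass over the result would still contain ('one', 'hello'), because the sentence boundary is lost. One possible way around this, shown as a minimal sketch rather than the answer's own method, is to build the n-grams per sentence and chain the results instead of the tokens. The helper ngrams_of below is hypothetical and written in plain Python so the sketch runs without NLTK. Assuming NLTK is installed, nltk.util.ngrams(s, 2) could be used in its place.

```python
from itertools import chain

def ngrams_of(tokens, n):
    """Hypothetical helper: yield the n-grams of one token list as tuples."""
    # zip over n shifted views of the list; stops at the shortest one,
    # so no n-gram runs past the end of the sentence.
    return zip(*(tokens[i:] for i in range(n)))

sentences = [['this', 'is', 'sentence', 'one'], ['hello', 'again']]

# Build bigrams inside each sentence, then concatenate the per-sentence results.
bigrams = list(chain.from_iterable(ngrams_of(s, 2) for s in sentences))
```

Because each sentence is processed on its own, no n-gram can cross from the end of one sentence into the start of the next.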

Regarding python - How to get n-grams from multiple separate sentences with NLTK, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/52606753/