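I want to configure config.toml in meta-toolkit to use the default filter chain, but with each filter spelled out explicitly. I plan to modify some of the filters, so I want a baseline filter chain to start from.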

I tried the following configuration:

[[analyzers]]
method = "ngram-word"
ngram = 1
    [[analyzers.filter]]
    type = "icu-tokenizer"

    [[analyzers.filter]]
    type = "lowercase"

    [[analyzers.filter]]
    type = "alpha"

    [[analyzers.filter]]
    type = "length"
    min = 2
    max = 35

    [[analyzers.filter]]
    type = "list"
    filename = "../data/lemur-stopwords.txt"

    [[analyzers.filter]]
    type = "porter2-stemmer"

    [[analyzers.filter]]
    type = "empty-sentence"


I am getting:


  token_stream_exception: what(): file required for list_filter config

Best answer

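The key for the list filter should be "file", not "filename". The error says exactly that: list_filter requires a "file" entry and did not find one.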

[[analyzers.filter]]
type = "list"
file = "../data/lemur-stopwords.txt"
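
For reference, applying that one change to the chain from the question gives roughly the following. This is a sketch that assumes the rest of the question's configuration was already valid and that the stopword list really lives at the relative path shown there; only the key on the list filter differs from the original attempt.

```toml
[[analyzers]]
method = "ngram-word"
ngram = 1

    # Tokenize, then normalize the tokens
    [[analyzers.filter]]
    type = "icu-tokenizer"

    [[analyzers.filter]]
    type = "lowercase"

    [[analyzers.filter]]
    type = "alpha"

    # Keep tokens whose length is between 2 and 35
    [[analyzers.filter]]
    type = "length"
    min = 2
    max = 35

    # Stopword list: the key must be "file", not "filename"
    [[analyzers.filter]]
    type = "list"
    file = "../data/lemur-stopwords.txt"

    [[analyzers.filter]]
    type = "porter2-stemmer"

    [[analyzers.filter]]
    type = "empty-sentence"
```

With the chain written out like this, individual filters can be edited, reordered, or removed one at a time while the rest stays as the baseline.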

Source: "c++ - Configuring default chain in config.toml in meta-toolkit" on Stack Overflow: https://stackoverflow.com/questions/29624601/
