python - Pyparsing:将infixnotation与setResultsName组合

我正在尝试使用infixNotation（以前是operatorPrecedence）编写语法，但是我不知道该如何使用setResultsName。

我尝试执行此操作的原因是，我在searchparser之上构建了布尔搜索查询的语法，但对于很长的表达式却遇到了RecursionError: maximum recursion depth exceeded in comparison。

因此，似乎通过使用infixNotation（searchparser不会使用），我可以避免遇到此错误。
因此，我试图使语法适应infixNotation，但我的评估很大程度上依赖于结构化解析结果中每个运算符的名称，尤其是易于访问运算符的参数。

我从pyparsing书中给出的示例开始：

and_ = CaselessLiteral("and")
or_  = CaselessLiteral("or")
not_ = CaselessLiteral("not")
searchTerm = Word(alphanums) | quotedString.setParseAction( removeQuotes )
searchExpr = infixNotation( searchTerm,
      [
      (not_, 1, opAssoc.RIGHT),
      (and_, 2, opAssoc.LEFT),
      (or_, 2, opAssoc.LEFT),
      ])

那么，如何在这里设置ParseResultName？

如果我尝试将其设置为运算符：

or_ = CaselessLiteral("or").setResultsName("OR")

此字符串（“ term1 OR term2 OR term3”）的结果parseResult将如下所示：

<ITEM>
  <word>
    <word>
      <ITEM>term1</ITEM>
    </word>
    <OR>or</OR>
    <word>
      <ITEM>term2</ITEM>
    </word>
    <OR>or</OR>
    <word>
      <ITEM>term3</ITEM>
    </word>
  </word>
</ITEM>

这意味着所有术语和运算符都处于同一级别，而我想要这样的东西，其中术语作为运算符的参数进行排列：

<OR>
  <OR>
    <word>
      <ITEM>term1</ITEM>
    </word>
    <OR>
      <word>
        <ITEM>term2</ITEM>
      </word>
      <word>
        <ITEM>term3</ITEM>
      </word>
    </OR>
  </OR>
</OR>

我以前在以前的语法中是通过以下方式实现此目的的：

operatorOr << (Group(
            operatorAnd + Suppress(Keyword("OR", caseless=True)) + operatorOr
        ).setResultsName("OR") | operatorAnd)

但我不知道如何将结果名称设置为由运算符及其两个参数组成的组？

最佳答案

我鼓励您考虑使用类作为解析动作，而不是使用结果名称来构建操作节点树。

在下面的代码中，我将UnOp和BinOp类附加到每个infixNotation运算符级别，这将为这些类的实例提供正确分配的operator和operands属性：

class OpNode:
    def __repr__(self):
        return "{}({}):{!r}".format(self.__class__.__name__,
                                    self.operator, self.operands)
class UnOp(OpNode):
    def __init__(self, tokens):
        self.operator = tokens[0][0]
        self.operands = [tokens[0][1]]

class BinOp(OpNode):
    def __init__(self, tokens):
        self.operator = tokens[0][1]
        self.operands = tokens[0][::2]

and_ = CaselessLiteral("and")
or_  = CaselessLiteral("or")
not_ = CaselessLiteral("not")
searchTerm = Word(alphanums) | quotedString.setParseAction(removeQuotes)
searchExpr = infixNotation(searchTerm,
      [
      (not_, 1, opAssoc.RIGHT, UnOp),
      (and_, 2, opAssoc.LEFT, BinOp),
      (or_, 2, opAssoc.LEFT, BinOp),
      ])

这是一个示例字符串，显示了如何返回这些节点：

test = "term1 or term2 or term3 and term4 and not term5"
print(searchExpr.parseString(test))

给出：

[BinOp(or):['term1', 'term2', BinOp(and):['term3', 'term4', UnOp(not):['term5']]]]

您可以浏览此解析的树并根据节点的类型和运算符评估不同的节点。

同样，asXML()并不是转储已解析数据的最佳工具，最好使用dump()方法。

关于python - Pyparsing:将infixnotation与setResultsName组合，我们在Stack Overflow上找到一个类似的问题：https://stackoverflow.com/questions/47184677/