c++ - 换位表导致象棋引擎丢失

序言:我知道对于Stack Overflow之类的问题，这个问题可能有点过于广泛。虽然，我尝试添加尽可能多的信息。

我正在从头开始用C++编写象棋引擎的代码，而且大部分都已完成(除了它的评估功能很弱)。但是，很奇怪的是，引擎每次启用换位表时都会因一个惊人的明显错误(丢掉一个皇后，有时是一个皇后和另外一块)而损失。禁用换位表探测后，引擎将轻松击败Nero和TSCP。

我使用了一个非常简单的换位表实现方案，该方案具有始终替换的方案，该方案取自Bruce Morland的站点here。

这是探针实现:

bool probe_table(TranspositionTable& t_table, unsigned int ply,
uint64 hash_key, unsigned int depth, unsigned int& pv_move, int& score,
int alpha, int beta)
{
    unsigned int index = hash_key % t_table.num_entries;

    assert(index < t_table.num_entries);

    if(t_table.t_entry[index].hash_key == hash_key)
    {
        pv_move = t_table.t_entry[index].move;

        if(t_table.t_entry[index].depth >= depth)
        {
            score = t_table.t_entry[index].score;

            if(score > IS_MATE) score -= ply;
            else if(score < -IS_MATE) score += ply;

            switch(t_table.t_entry[index].flag)
            {
                case TFALPHA:
                {
                    if(score <= alpha)
                    {
                        score = alpha;
                        return 1;
                    }
                }
                case TFBETA:
                {
                    if(score >= beta)
                    {
                        score = beta;
                        return 1;
                    }
                }
                case TFEXACT:
                {
                    return 1;
                }
                default: assert(false); // At least one flag must be set.
            }
        }
    }

    return 0;
}

在alpha-beta搜索期间，将对该表进行探测(根据截止日期等，条目也将正确存储):

if(probe_table(board.t_table, board.ply, board.hash_key, depth, pv_move,
    score, alpha, beta))
{
    return score;
}

编辑:为了完整起见，以下是在搜索中存储代码的重要片段:

if(score > alpha) // Alpha cutoff.
{
    if(score >= beta) // Beta cutoff.
    {
        ...

        store_entry(board.t_table, board.ply, board.hash_key, best_move,
                    beta, depth, TFBETA);

        return beta;
    }

    alpha = score;

    ...
    }
}

...

assert(alpha >= old_alpha);

if(alpha != old_alpha)
{
    store_entry(board.t_table, board.ply, board.hash_key, best_move,
        best_score, depth, TFEXACT);
}
else
{
    store_entry(board.t_table, board.ply, board.hash_key, best_move,
        alpha, depth, TFALPHA);
}

我已经考虑并探索了引擎其他每个部分的故障，但都没有发生。完全禁用探测(但仍使用表格存储PV线)可以很好地工作。

我还考虑过我天真的思考是否正在影响事情。值得深思的是，只要有深思熟虑的举动，我只需将bestmove xxxx ponder xxxx打印到UCI。如果启用了思考功能，则GUI会让引擎进行思考功能(这只是常规搜索，但稍后会通过转置表使用)。

为了进行测试，我完全禁用了思考功能，但并没有任何改善。在获得胜利分数一段时间后，引擎简单地放弃了它的女王。我相信，这是由于表中的某些错误条目导致的，或者是我无法理解的其他某种形式的不稳定。

这就是我需要有人遇到这个问题的地方，以便向我指出一个总体方向。

编辑:如grek40所问，这是一个刚刚发生的示例(引擎为白色，移动时为白色):

引擎再次发动，基于愚蠢的举动输掉了比赛。请注意，可能是由于换位台本身，发动机到达了这样的不良位置。

用分析这个位置，在游戏过程中填充换位表:
info score cp -425 depth 1 nodes 1 time 0 pv e1d1 info score cp -425 depth 2 nodes 2 time 0 pv e1d1 info score cp -425 depth 3 nodes 3 time 0 pv e1d1 info score cp -425 depth 4 nodes 4 time 0 pv e1d1 info score cp -425 depth 5 nodes 5 time 0 pv e1d1 info score cp -425 depth 6 nodes 6 time 0 pv e1d1 info score cp -425 depth 7 nodes 7 time 0 pv e1d1 info score cp -425 depth 8 nodes 8 time 0 pv e1d1 info score cp -425 depth 9 nodes 9 time 0 pv e1d1 info score cp -425 depth 10 nodes 10 time 0 pv e1d1 info score cp -440 depth 11 nodes 10285162 time 3673 pv f5f8 e7f8 b2b3 b7b5 e1c1 d6d5 a3a4 info score cp -440 depth 12 nodes 29407669 time 10845 pv f5f8 e7f8 e1f1 f8e7 f1f5 d6d5 bestmove f5f8 ponder e7f8

再次分析而不使用换位表(实际上是带有该表，但已清除):
info score cp -415 depth 1 nodes 82 time 0 pv f5f8 info score cp -415 depth 2 nodes 200 time 0 pv f5f8 e7f8 info score cp -405 depth 3 nodes 900 time 0 pv f5f8 e7f8 b2b3 info score cp -425 depth 4 nodes 2936 time 1 pv f5f8 e7f8 b2b3 f8e7 info score cp -415 depth 5 nodes 10988 time 4 pv f5f8 e7f8 b2b3 b7b5 e1d1 info score cp -425 depth 6 nodes 65686 time 25 pv f5f8 e7f8 e1f1 d7e7 f1d1 b7b5 info score cp -420 depth 7 nodes 194124 time 76 pv f5f8 e7f8 b2b3 b7b5 e1f1 f8e7 f1f7 info score cp -425 depth 8 nodes 357753 time 141 pv f5f8 e7f8 b2b3 b7b5 e1f1 f8e7 f1f5 d7c7 info score cp -425 depth 9 nodes 779686 time 292 pv f5f8 e7f8 e1f1 f8e7 f1f5 h4h8 info score cp -425 depth 10 nodes 1484178 time 560 pv f5f8 e7f8 e1f1 f8e7 f1f5 h4h8 info score cp -435 depth 11 nodes 29481132 time 11117 pv f5f8 d6d5 e1e5 info score cp -435 depth 12 nodes 106448053 time 41083 pv f5f8 e7f8 bestmove f5f8 ponder e7f8

值得注意的是，在深度10上，从换位表搜索得出的分数是准确的。

编辑:比赛结束后对这个位置进行了分析。在游戏过程中，引擎没有足够的时间来完成更高深度的搜索，导致引擎播放e1d1，这很荒谬。

为什么会发生这种情况？从深度1开始，引擎发现更好的移动，但是从换位表中发现了不同的移动。我也想知道为什么换位表搜索中没有PV行。

我最大的猜测是布鲁斯·莫兰德(Bruce Morland)的站点中搜索不稳定的报价:Zobrist键未考虑到达节点的路径。并非每条路径都是一样的。如果在树中的其他某个点遇到哈希值，则哈希元素中的分数可能会基于包含重复的路径。重复可能会得出平局分数，或者至少是不同的分数。

编辑:当没有TFEXACT值时，我尝试禁用对表的存储。也就是说，我停止存储和检索TFALPHA / TFBETA值，并且效果很好。有人知道为什么吗？

最佳答案

我确实注意到您在检查是否满足深度标准之前设置了(通过引用)pv_move。这意味着probe_table可以更改pv_move并仍返回0(未命中)(而不设置score)。看起来像坏事吗？
关于c++ - 换位表导致象棋引擎丢失，我们在Stack Overflow上找到一个类似的问题：https://stackoverflow.com/questions/34193290/