Problem description
After profiling my backpropagation algorithm, I have learnt that it is responsible for 60% of my computation time. Before I start looking at parallel alternatives, I would like to see if there is anything further I can do.
The activate(const double input[]) function is profiled to take only ~5% of the time. The gradient(const double input) function is implemented as follows:
inline double gradient(const double input) { return (1 - (input * input)); }
The train function in question:
void train(const vector<double>& data, const vector<double>& desired, const double learn_rate, const double momentum) {
    this->activate(data);
    this->calculate_error(desired);
    // adjust weights for layers
    const auto n_layers = this->config.size();
    const auto adjustment = (1 - momentum) * learn_rate;
    for (size_t i = 1; i < n_layers; ++i) {
        const auto& inputs = i - 1 > 0 ? this->outputs[i - 1] : data;
        const auto n_inputs = this->config[i - 1];
        const auto n_neurons = this->config[i];
        for (size_t j = 0; j < n_neurons; ++j) {
            const auto adjusted_error = adjustment * this->errors[i][j];
            for (size_t k = 0; k < n_inputs; ++k) {
                const auto delta = adjusted_error * inputs[k] + (momentum * this->deltas[i][j][k]);
                this->deltas[i][j][k] = delta;
                this->weights[i][j][k] += delta;
            }
            // bias weight is stored at index n_inputs
            const auto delta = adjusted_error * this->bias + (momentum * this->deltas[i][j][n_inputs]);
            this->deltas[i][j][n_inputs] = delta;
            this->weights[i][j][n_inputs] += delta;
        }
    }
}
This question would probably be a better fit for https://codereview.stackexchange.com/.
Recommended answer
You can't avoid an O(n^2) algorithm if you want to train/use a NN. But it is perfectly suited for vector arithmetic. For example, with clever use of SSE or AVX you could process the neurons in chunks of 4 or 8 and use a multiply-add instead of two separate instructions.
If you use a modern compiler, carefully reformulate the algorithm, and use the right switches, you might even get the compiler to autovectorize the loops for you, but your mileage may vary.
For gcc, autovectorization is activated using -O3 or -ftree-vectorize. You need a vector-capable CPU of course, with something like -march=core2 -msse4.1 or similar, depending on the target CPU. If you use -ftree-vectorizer-verbose=2 you get detailed explanations of why and where loops were not vectorized. Have a look at http://gcc.gnu.org/projects/tree-ssa/vectorization.html.
Better, of course, is using the compiler intrinsics directly.