Problem Description
I am trying to train a network, but rather than using lmdb or leveldb, I am feeding data to my network on the fly. So I am following the procedure outlined below (a minimal sketch of the loop appears after the list):
- My data is loaded through a MemoryData layer.
- I create a mini-batch using a Python script.
- Set the data and labels with
solver.net.set_input_arrays(batch, labels)
- After that I call
solver.step(1)
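A minimal sketch of that loop, assuming pycaffe with a train net whose first layer is a MemoryData layer; make_batch, its shapes, and the solver path are stand-ins of mine, not from the question:

import numpy as np
import caffe

caffe.set_mode_gpu()
solver = caffe.SGDSolver('/path/to/solver.prototxt')

def make_batch(n=32):
    # Stand-in for the real mini-batch builder. set_input_arrays expects
    # C-contiguous float32 arrays: data shaped (N, C, H, W) and labels
    # shaped (N, 1, 1, 1), with N a multiple of the layer's batch_size.
    data = np.random.rand(n, 3, 224, 224).astype(np.float32)
    labels = np.random.randint(0, 10, size=(n, 1, 1, 1)).astype(np.float32)
    return data, labels

for _ in range(100):
    batch, labels = make_batch()
    solver.net.set_input_arrays(batch, labels)
    solver.step(1)  # one forward/backward pass plus a parameter update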
Here solver is of type SGDSolver. Now my question is: what is the difference between solver.solve() and solver.step()?
Secondly, this approach doesn't let me have a MemoryData layer for the test network. Is there any workaround for that?
My solver.prototxt looks like this:
net: "/path/to/train_val.prototxt"
base_lr: 0.01
lr_policy: "step"
gamma: 0.1
stepsize: 100000
display: 20
max_iter: 450000
momentum: 0.9
weight_decay: 0.0005
snapshot: 10000
snapshot_prefix: "/path/to/temporal_net_train"
solver_mode: GPU
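For what it's worth, with lr_policy: "step" Caffe computes the effective rate as base_lr * gamma^floor(iter / stepsize). A quick check of the values above (my own sketch, not part of the question):

def effective_lr(iteration, base_lr=0.01, gamma=0.1, stepsize=100000):
    # Caffe's "step" policy: multiply the rate by gamma every stepsize iters.
    return base_lr * gamma ** (iteration // stepsize)

print(effective_lr(0))       # 0.01
print(effective_lr(150000))  # 0.001
print(effective_lr(450000))  # ~1e-06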
With my approach, the network displays some output (loss etc.) every 20th iteration. Somehow the loss stays constant over some number of iterations; what could be the reason for that?
Answer
- What is the difference between solver.solve and solver.step?
solve does the entire training run, to whatever limits you've set -- usually the iteration limit. step does only the specified number of iterations.
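In pycaffe terms the two calls relate roughly like this (a sketch; the limit just mirrors max_iter from the solver.prototxt above):

solver.solve()              # runs the whole schedule, up to max_iter (450000)

# is roughly equivalent to stepping manually:
while solver.iter < 450000:
    solver.step(1)          # exactly one forward/backward/update per call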
- How do I get memory data in?
If you're not reading from a supported data channel/format, I think you have to write a custom input routine (your own data package).
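One workaround I know of for the test-net question (a sketch of mine, not from the original answer) is to give the test net a layer of type "Python" that builds batches in memory; the class below and its module name are hypothetical:

import numpy as np
import caffe

class TestDataLayer(caffe.Layer):
    # Hypothetical Python data layer serving in-memory batches to the
    # test net, sidestepping the missing MemoryData layer there.

    def setup(self, bottom, top):
        self.batch_size = 32
        top[0].reshape(self.batch_size, 3, 224, 224)  # data
        top[1].reshape(self.batch_size)               # labels

    def reshape(self, bottom, top):
        pass  # output shapes are fixed in setup

    def forward(self, bottom, top):
        # Replace the random arrays with your real evaluation batches.
        top[0].data[...] = np.random.rand(self.batch_size, 3, 224, 224)
        top[1].data[...] = np.random.randint(0, 10, size=self.batch_size)

    def backward(self, top, propagate_down, bottom):
        pass  # a data layer propagates no gradients

It gets wired into train_val.prototxt as a layer with type: "Python", include { phase: TEST }, and python_param { module: "test_data_layer" layer: "TestDataLayer" }, assuming the file above is on PYTHONPATH.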
- The loss stays constant over many iterations; what could be the reason for that?
There are several possibilities, depending on the surrounding symptoms. If the loss only ever shows one value, then your back-propagation is a likely culprit. Perhaps you're not properly connected to the data set and aren't getting the expected classifications fed in.
If the loss has a temporary plateau but then converges decently, don't worry about it; this is likely an effect of training ordering.
If the loss declines decently and then settles at a fixed value, then you're also doing well: the training converged before it ran out of iterations.
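A quick way to tell these cases apart (my sketch, reusing the hypothetical make_batch from the question above; the blob name 'loss' must match whatever your train_val.prototxt calls its loss output):

losses = []
for _ in range(1000):
    batch, labels = make_batch()
    solver.net.set_input_arrays(batch, labels)
    solver.step(1)
    losses.append(float(solver.net.blobs['loss'].data))

# Flat from iteration 0: suspect back-propagation or constant inputs.
# A plateau that later drops: normal, likely training order.
# A decline that settles: converged before max_iter.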