Problem Description
I am fine-tuning a network. In one case I want to use it for regression, which works. In another case, I want to use it for classification.
For both cases I have an HDF5 file with a label. For regression, this is just a 1-by-1 numpy array containing a float. I thought I could use the same label for classification after changing my EuclideanLoss layer to SoftmaxLoss. However, I then get a negative loss:
Iteration 19200, loss = -118232
Train net output #0: loss = 39.3188 (* 1 = 39.3188 loss)
Can you explain what, if anything, is going wrong? I do see that the training loss is about 40 (which is still terrible), but does the network still train? The negative loss just keeps getting more negative.
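For reference, a minimal h5py sketch of a regression-style HDF5 file like the one described (the data shape, sample count, and file name are placeholders, not taken from the question):

import h5py
import numpy

# Hypothetical data blob: 40 images of 3x64x64. Caffe's HDF5DataLayer expects
# float32 arrays whose first axis is the number of samples.
data = numpy.random.randn(40, 3, 64, 64).astype(numpy.float32)
# Regression target: one float per sample, shape (40, 1).
label = numpy.random.randn(40, 1).astype(numpy.float32)

with h5py.File('train.h5', 'w') as f:
    f['data'] = data
    f['label'] = label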
UPDATE
After reading Shai's comment and answer, I have made the following changes:
- I set the num_output of my last fully connected layer to 6, as I have 6 labels (it used to be 1).
- I now create a one-hot vector and pass that as the label in my HDF5 dataset, as follows:
f['label'] = numpy.array([1, 0, 0, 0, 0, 0])
Trying to run my network now returns:
Check failed: hdf_blobs_[i]->shape(0) == num (6 vs. 1)
After some research online, I reshaped the vector to a 1x6 vector. This led to the following error:
Check failed: outer_num_ * inner_num_ == bottom[1]->count() (40 vs. 240)
Number of labels must match number of predictions; e.g., if softmax axis == 1
and prediction shape is (N, C, H, W), label count (number of labels)
must be N*H*W, with integer values in {0, 1, ..., C-1}.
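For context, the reshape was presumably along these lines (an h5py sketch of my own; the comments explain why each check fires, assuming batch_size: 40 in train.prototxt, which matches the 40 vs. 240 above):

import h5py
import numpy

with h5py.File('train.h5', 'w') as f:
    # A flat 6-vector fails the first check: HDF5DataLayer reads axis 0 as the
    # sample count, so one one-hot label looks like 6 samples (6 vs. 1).
    # Reshaping to 1x6 passes that check but gives each sample 6 label values,
    # so a batch of 40 yields 240 labels where SoftmaxWithLoss expects 40.
    f['label'] = numpy.array([1, 0, 0, 0, 0, 0]).reshape(1, 6)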
My idea is to add one label per data sample (image), and the batches are created in my train.prototxt. Shouldn't this produce the correct batch size?
Recommended Answer
Since you moved from regression to classification, you need to output not a scalar to compare with "label", but rather a probability vector of length num-labels to compare with the discrete class "label". You need to change the num_output parameter of the layer before "SoftmaxWithLoss" from 1 to num-labels.
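As an illustration, a hedged sketch of such a net written with pycaffe's NetSpec (the layer names, HDF5 list file, and batch size are made up for the example):

import caffe
from caffe import layers as L

n = caffe.NetSpec()
# HDF5 data layer reading the 'data' and 'label' datasets;
# 'train_h5_list.txt' and batch_size=40 are hypothetical.
n.data, n.label = L.HDF5Data(source='train_h5_list.txt', batch_size=40, ntop=2)
# Last fully connected layer: num_output must equal the number of classes (6),
# not 1 as in the regression setup.
n.score = L.InnerProduct(n.data, num_output=6)
# SoftmaxWithLoss compares the 6-vector of scores with the scalar class label.
n.loss = L.SoftmaxWithLoss(n.score, n.label)

print(n.to_proto())  # emits the corresponding train.prototxt text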
I believe you are currently accessing uninitialized memory, and I would expect caffe to crash sooner or later in this case.
Update:
You made two changes: num_output 1 --> 6, and you also changed your input label from a scalar to a vector.
The first change was the only one you needed for using "SoftmaxWithLossLayer".
Do not change label from a scalar to a "hot-vector"; a minimal sketch of the scalar layout follows.
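Concretely, a hedged h5py sketch of the label layout "SoftmaxWithLoss" expects (sample count and values are illustrative):

import h5py
import numpy

# One scalar class index per sample, with values in {0, ..., 5} for 6 classes.
# E.g., 40 samples -> label shape (40, 1), stored as float32 as Caffe expects.
label = numpy.random.randint(0, 6, size=(40, 1)).astype(numpy.float32)

with h5py.File('train.h5', 'w') as f:
    f['label'] = label  # no one-hot encoding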
Why?
Because "SoftmaxWithLoss" basically looks at the 6-vector prediction you output, interprets the ground-truth label as an index, and looks at -log(p[label]): the closer p[label] is to 1 (i.e., you predicted a high probability for the expected class), the lower the loss. If you make a prediction p[label] close to zero (i.e., you incorrectly predicted a low probability for the expected class), the loss grows fast.
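A small numeric illustration of that behavior, using a plain numpy re-implementation of the per-sample loss rather than Caffe itself:

import numpy

def softmax_loss(scores, label):
    # Softmax over the raw scores, then -log of the probability at the
    # ground-truth index -- the quantity "SoftmaxWithLoss" computes per sample.
    p = numpy.exp(scores - scores.max())
    p /= p.sum()
    return -numpy.log(p[label])

scores = numpy.array([5.0, 0.0, 0.0, 0.0, 0.0, 0.0])  # confident in class 0
print(softmax_loss(scores, label=0))  # ~0.03: p[0] ~ 0.97, low loss
print(softmax_loss(scores, label=3))  # ~5.03: p[3] ~ 0.007, loss grows fast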
Using a "hot-vector" as the ground-truth input label may be appropriate for multi-category classification (which does not seem to be the task you are trying to solve here). You may find this SO thread relevant to that particular case.