Vowpal Wabbit-如何从测试样本的上下文强盗模型中获得预测概率

本文介绍了Vowpal Wabbit-如何从测试样本的上下文强盗模型中获得预测概率的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

给定一个训练有素的上下文盗贼模型，我该如何检索测试样本上的预测向量?

Given a trained contextual bandit model, how can I retrieve a prediction vector on test samples?

例如，假设我有一个名为"train.dat"的火车，其中包含以下格式的行

For example, let's say I have a train set named "train.dat" containing lines formatted as below

1:-1:0.3 | a b c  # <action:cost:probability | features>
2:2:0.3 | a d d
3:-1:0.3 | a b e
....

我在命令下运行.

vw -d train.dat --cb 30 -f cb.model --save_resume

这将产生一个文件"cb.model".现在，假设我有一个如下的测试数据集

This produces a file, 'cb.model'. Now, let's say I have a test dataset as below

| a d d
| a b e

我希望看到以下概率

0.2 0.7 0.1

这些概率的解释是，应在20％的时间选择动作1，在2-70％的动作中选择动作3-10％的时间.

The interpretation of these probabilities would be that action 1 should be picked 20% of the time, action 2 - 70%, and action 3 - 10% of the time.

有没有办法得到这样的东西?

Is there a way to get something like this?