给定数值目标变量，我是否应该转换目标变量以获得多类分类的指标矩阵?

本文介绍了给定数值目标变量，我是否应该转换目标变量以获得多类分类的指标矩阵?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用 RandomForestClassifier 处理多类分类问题.目标变量 Y 仅包含 3 个值 {-1,0,1} 之一.我了解数字编码是必要的.

I am working on a multiclass classification problem using RandomForestClassifier. The target variable Y only contain one of 3 values {-1,0,1 }. I understand that numerical encoding is necessary.

但是，我想了解是否有必要通过执行 pd.get_dummies(Y) 来转换 Y 以获得如下所示的指标矩阵，然后将此指标矩阵输入 RandomForestClassifier?

However, I would like to understand if it is necessary for me to transform Y to obtain an indicator matrix like below by doing pd.get_dummies(Y) and then feed this indicator matrix into the RandomForestClassifier?

      -1.0   0.0   1.0
0        0     0     1
1        1     0     0
2        0     0     1
3        1     0     0
4        1     0     0
   ...   ...   ...
6516     1     0     0
6517     0     0     1
6518     0     0     1
6519     0     0     1
6520     1     0     0

与将未变换的目标变量 Y(即一维序列)输入 RandomForestClassifier 相比，这将如何影响机器学习算法?结果会不同吗?为什么?

Comparing above to feeding the untransformed target variable Y (i.e. a 1 dimensional series) into RandomForestClassifier, how would this affect the machine learning algorithm ? Would the results be different and why ?

RandomForestClassifier 在这两种不同的情况下做不同的事情吗?推荐哪种方法(指标矩阵与未变换)?

Is the RandomForestClassifier doing different things under these 2 different scenarios ?Which approach is recommended (indicator matrix vs untransformed)?

RandomForestClassifier

给定数值目标变量，我是否应该转换目标变量以获得多类分类的指标矩阵?

问题描述

推荐答案