python - Pandas.get_dummies返回两列(_Y和_N)而不是一列

我试图使用sklearn来训练基于我的数据集的决策树。
当我试图将数据切片到（结果：Y，预测变量：X）时，结果（我的标签）在True/False:

#data slicing
X = df.values[:,3:27] #X are the sets of predicting variable, dropping unique_id and student name here
Y = df.values[:,'OffTask'] #Y is our predicted value (outcome), it is in the 3rd column

我就是这样做的，但我不知道这是否是正确的做法：

#convert the label "OffTask" to dummy

df1 = pd.get_dummies(df,columns=["OffTask"])
df1

我的问题是数据集df1将我的标签Offtask返回到OffTask_N和OffTask_Y
有人知道怎么修吗？

最佳答案

get_dummies用于将标称字符串值转换为整数。它返回尽可能多的列，尽可能多的列中可用的唯一字符串值，例如：

df={'color':['red','green','blue'],'price':[1200,3000,2500]}
my_df=pd.DataFrame(df)
pd.get_dummies(my_df)

在您的情况下，可以删除第一个值，只要值为空，就可以认为它是第一个值