DataFrame中的字符串，但dtype是object

本文介绍了DataFrame中的字符串，但dtype是object的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

为什么Pandas告诉我我有对象，尽管所选列中的每个项目都是一个字符串，即使经过显式转换也是如此.

Why does Pandas tell me that I have objects, although every item in the selected column is a string — even after explicit conversion.

这是我的数据框:

<class 'pandas.core.frame.DataFrame'>
Int64Index: 56992 entries, 0 to 56991
Data columns (total 7 columns):
id            56992  non-null values
attr1         56992  non-null values
attr2         56992  non-null values
attr3         56992  non-null values
attr4         56992  non-null values
attr5         56992  non-null values
attr6         56992  non-null values
dtypes: int64(2), object(5)

其中有五个是dtype object.我将这些对象明确转换为字符串:

Five of them are dtype object. I explicitly convert those objects to strings:

for c in df.columns:
    if df[c].dtype == object:
        print "convert ", df[c].name, " to string"
        df[c] = df[c].astype(str)

然后，df["attr2"]仍然具有dtype object，尽管type(df["attr2"].ix[0]显示str，这是正确的.

Then, df["attr2"] still has dtype object, although type(df["attr2"].ix[0] reveals str, which is correct.

熊猫区分int64和float64以及object.没有dtype str时，其背后的逻辑是什么?为什么str被object覆盖?

Pandas distinguishes between int64 and float64 and object. What is the logic behind it when there is no dtype str? Why is a str covered by object?

DataFrame中的字符串

DataFrame中的字符串，但dtype是object

问题描述

推荐答案