python - Pandas -获取给定列的第一行值

这似乎是一个非常简单的问题……但是我没有看到我期望的简单答案。

那么，如何获得Pandas中给定列的第n行的值？ (我对第一行特别感兴趣，但也对更通用的做法也很感兴趣)。

例如，假设我想将Btime中的1.2值作为变量。

什么是正确的方法？

df_test =

  ATime   X   Y   Z   Btime  C   D   E
0    1.2  2  15   2    1.2  12  25  12
1    1.4  3  12   1    1.3  13  22  11
2    1.5  1  10   6    1.4  11  20  16
3    1.6  2   9  10    1.7  12  29  12
4    1.9  1   1   9    1.9  11  21  19
5    2.0  0   0   0    2.0   8  10  11
6    2.4  0   0   0    2.4  10  12  15

最佳答案

要选择ith行，use iloc :

In [31]: df_test.iloc[0]
Out[31]:
ATime     1.2
X         2.0
Y        15.0
Z         2.0
Btime     1.2
C        12.0
D        25.0
E        12.0
Name: 0, dtype: float64

要在Btime列中选择第i个值，可以使用:

In [30]: df_test['Btime'].iloc[0]
Out[30]: 1.2

df_test['Btime'].iloc[0](推荐)和df_test.iloc[0]['Btime']之间有区别:

DataFrames将数据存储在基于列的块中(其中每个块都有一个
dtype)。如果首先按列选择，则可以返回 View (即
比返回副本要快)，并且保留了原始dtype。相比之下，
如果您先按行选择，并且DataFrame的列不同
dtypes，然后Pandas将数据复制到新的Object dtype系列中。所以
选择列比选择行要快一些。因此，尽管df_test.iloc[0]['Btime']有效，df_test['Btime'].iloc[0]有点
更高效。

在分配方面，两者之间存在很大差异。df_test['Btime'].iloc[0] = x影响df_test，但df_test.iloc[0]['Btime']不得。有关原因的说明，请参见下文。因为在
索引的顺序在行为上有很大的不同，最好使用单个索引分配:

df.iloc[0, df.columns.get_loc('Btime')] = x

df.iloc[0, df.columns.get_loc('Btime')] = x(推荐):

recommended way 将新值分配给
DataFrame是avoid chained indexing，而是使用shown byandrew方法，

df.loc[df.index[n], 'Btime'] = x

或者

df.iloc[n, df.columns.get_loc('Btime')] = x

后一种方法要快一些，因为df.loc必须将行和列标签转换为
位置索引，因此如果使用
改为df.iloc。
df['Btime'].iloc[0] = x有效，但不建议这样做:

尽管这可行，但它利用了当前实现DataFrames的方式。不能保证 Pandas 将来会以这种方式工作。特别是，利用了以下事实:(当前)df['Btime']始终返回一个
View (不是副本)，因此df['Btime'].iloc[n] = x可用于分配新值
在Btime的df列的第n个位置。

由于Pandas无法明确保证索引器何时返回 View 还是副本，因此使用链式索引的分配通常总会引发SettingWithCopyWarning，即使在这种情况下，该分配成功修改了df:

In [22]: df = pd.DataFrame({'foo':list('ABC')}, index=[0,2,1])
In [24]: df['bar'] = 100
In [25]: df['bar'].iloc[0] = 99
/home/unutbu/data/binky/bin/ipython:1: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
  self._setitem_with_indexer(indexer, value)

In [26]: df
Out[26]:
  foo  bar
0   A   99  <-- assignment succeeded
2   B  100
1   C  100

df.iloc[0]['Btime'] = x不起作用:

相比之下，使用df.iloc[0]['bar'] = 123进行分配将不起作用，因为df.iloc[0]返回了一个副本:

In [66]: df.iloc[0]['bar'] = 123
/home/unutbu/data/binky/bin/ipython:1: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy

In [67]: df
Out[67]:
  foo  bar
0   A   99  <-- assignment failed
2   B  100
1   C  100

警告:我之前曾建议df_test.ix[i, 'Btime']。但这不能保证为您提供ith值，因为ix会先尝试按标签进行索引，然后再尝试按位置进行索引。因此，如果DataFrame的整数索引不是从0开始的排序顺序，则使用ix[i]将返回标有i而不是ith的行。例如，

In [1]: df = pd.DataFrame({'foo':list('ABC')}, index=[0,2,1])

In [2]: df
Out[2]:
  foo
0   A
2   B
1   C

In [4]: df.ix[1, 'foo']
Out[4]: 'C'