This post covers how to do a conditional count on each column of a dataframe and then average the results.

Problem description

I have a dataframe with this structure:

v1|v2|v3|
 2| 3| 4|
  | 5| 4|
 5| 1| 4|

I want to count all 4's and 5's in each column and divide that count by the number of non-missing entries in the column. Afterwards I want to calculate the mean of all these values. In this case (1/2+1/3+3/3)/3=0.611

As a formula: [sum over all columns Vx of (count of 4's and 5's in Vx) / (count of non-missing entries in Vx)] / (number of columns)

What I have tried so far is: df[df > 4 and df > 5].count() but I already get a lot of errors here. Can anyone put me on the right track?
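For context on why that attempt errors out: Python's `and` asks for a single True/False from a whole DataFrame, which pandas refuses with a ValueError, and `> 4` / `> 5` also does not express "equal to 4 or 5". The sketch below rebuilds the example frame (assuming the blank cell in v1 is NaN) and shows an element-wise mask instead. The names `raised`, `mask` and `counts` are only illustrative.

```python
import numpy as np
import pandas as pd

# Example frame from the question; the blank v1 cell is assumed to be NaN.
df = pd.DataFrame({"v1": [2, np.nan, 5], "v2": [3, 5, 1], "v3": [4, 4, 4]})

# `and` needs one boolean per operand, so pandas raises ValueError here.
try:
    df[df > 4 and df > 5]
    raised = False
except ValueError:
    raised = True

# Element-wise alternative: parenthesise each comparison and combine with `|`.
mask = (df == 4) | (df == 5)
counts = mask.sum()  # True counts as 1, so this is the per-column number of 4's and 5's

print(raised)
print(counts)
```

The same mask can be written more compactly with `df.isin([4, 5])`.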

Recommended answer

You can try:

df.isin([4,5]).sum().div(df.count()).mean()

Output:

0.611111111111111
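To see where that number comes from, the one-liner can be split into its steps. This is a minimal sketch that rebuilds the example frame, assuming the blank cell in v1 is NaN. The intermediate names `hits`, `valid`, `ratios` and `result` are not part of the answer and are introduced only for illustration.

```python
import numpy as np
import pandas as pd

# Example frame from the question; the blank v1 cell is assumed to be NaN.
df = pd.DataFrame({
    "v1": [2, np.nan, 5],
    "v2": [3, 5, 1],
    "v3": [4, 4, 4],
})

hits = df.isin([4, 5]).sum()   # per-column count of cells equal to 4 or 5 (NaN never matches)
valid = df.count()             # per-column count of non-missing cells
ratios = hits.div(valid)       # per-column share: 1/2, 1/3, 3/3
result = ratios.mean()         # average of the per-column shares

print(ratios)
print(result)
```

Because `count()` skips NaN, the missing cell in v1 shrinks that column's denominator to 2 rather than 3, which matches the (1/2+1/3+3/3)/3 calculation in the question.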
