原文链接:https://www.jianshu.com/p/f773b4b82c66

value_counts()是一种查看表格某列中有多少个不同值的快捷方法,并计算每个不同值有在该列中有多少重复值。
value_counts()是Series拥有的方法,一般在DataFrame中使用时,需要指定对哪一列或行使用,该函数返回的也是Series类型,且index为该列的不同值,values为不同值的个数

 import pandas as pd
import numpy as np
filepath='C:\python\data_src\GFSCOFOG_03-05-2018 03-04-36-54_timeSeries\GFSCOFOG_CHA.csv'
data = pd.read_csv(filepath,encoding='utf-8')

数据样例如下图所示

pandas中.value_counts()的用法-LMLPHP

查看Unit Name中有哪些不同的值,并计算每个值有多少个重复值

data['Unit Name'].value_counts()

 data['Unit Name'].value_counts()
#输出
Percent of GDP 3561
Domestic currency 3561
Percent of total expenditure 470
Name: Unit Name, dtype: int64

查看Sector Name中有哪些不同的值,并计算每个值有多少个重复值

data['Sector Name'].value_counts()

 data['Sector Name'].value_counts()
#输出结果
Extrabudgetary central government 1020
Social security funds 1002
Central government (incl. social security funds) 944
Budgetary central government 944
Local governments 944
General government 944
Central government (excl. social security funds) 944
State governments 850
Name: Sector Name, dtype: int64
05-22 18:24