Question
I have a netCDF file whose time dimension contains hourly data for 2 years. I want to average it to get the hourly average for each hour of the day, for each month. I tried this:
import xarray as xr
ds = xr.open_mfdataset('ecmwf_usa_2015.nc')
ds.groupby(['time.month', 'time.hour']).mean('time')
but I get this error:
*** TypeError: `group` must be an xarray.DataArray or the name of an xarray variable or dimension
How can I fix this? If I do this:
ds.groupby('time.month', 'time.hour').mean('time')
I do not get an error, but the result has a time dimension of 12 (one value for each month), whereas I want an hourly average for each month, i.e. 24 values for each of 12 months. Data is available here: https://www.dropbox.com/s/yqgg80wn8bjdksy/ecmwf_usa_2015.nc?dl=0
Answer
You are getting TypeError: `group` must be an xarray.DataArray or the name of an xarray variable or dimension because ds.groupby() is supposed to take a single xarray dataset variable or array, and you passed a list of variables.
Refer to the group by documentation: convert the dataset into splits or bins first, and then apply groupby('time.hour'). This is needed because applying a groupby over month and then another over hour in one call would aggregate over all the data; if you first split it into monthly groups, you can then apply the hourly mean within each month.
1. You can try this approach as mentioned in the documentation:

xarray supports "group by" operations with the same API as pandas to implement the split-apply-combine strategy:
- Split your data into multiple independent groups. => Split them by month, e.g. using groupby_bins.
- Apply some function to each group. => Apply a groupby within each group.
- Combine your groups back into a single data object. => Apply the aggregate function mean('time').
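The three steps above can be sketched on synthetic data (the variable name t2m and the 2015-2016 hourly range are assumptions standing in for the linked file, which is not reproduced here):

```python
import numpy as np
import pandas as pd
import xarray as xr

# Synthetic stand-in for the 2-year hourly netCDF file; "t2m" is a
# hypothetical variable name, not necessarily the one in the real file.
time = pd.date_range("2015-01-01", "2016-12-31 23:00", freq="h")
ds = xr.Dataset(
    {"t2m": ("time", np.random.rand(time.size))},
    coords={"time": time},
)

# Split by month, apply an hourly groupby-mean inside each month,
# and let xarray combine the pieces back into one object.
climatology = ds.groupby("time.month").map(
    lambda month: month.groupby("time.hour").mean("time")
)
print(dict(climatology["t2m"].sizes))  # 12 months x 24 hours
```

The result has a month dimension of 12 and an hour dimension of 24, i.e. one average per hour of the day for each month, which is what the question asks for.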
2. Convert it into a pandas dataframe and use group by.

Warning: not all netCDF files are convertible to a pandas dataframe, and metadata may be lost in the conversion.
Convert ds into a pandas dataframe with df = ds.to_dataframe() and group as you require using pandas.Grouper, like

df = ds.to_dataframe().reset_index().set_index('time')
df.groupby([pd.Grouper(freq='1M'), df.index.hour]).mean()
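The pandas route can also be sketched end-to-end on synthetic data (again with a hypothetical t2m variable); grouping the DatetimeIndex by month and hour of day yields the 12 x 24 table directly:

```python
import numpy as np
import pandas as pd

# Synthetic hourly series standing in for ds.to_dataframe();
# "t2m" is a hypothetical variable name.
time = pd.date_range("2015-01-01", "2016-12-31 23:00", freq="h")
df = pd.DataFrame({"t2m": np.random.rand(time.size)}, index=time)

# Group by calendar month and hour of day, then average.
hourly_by_month = df.groupby([df.index.month, df.index.hour]).mean()
hourly_by_month.index.names = ["month", "hour"]
print(hourly_by_month.shape)  # (288, 1): 12 months x 24 hours
```

Each row of the result is the mean of all values that fall in a given (month, hour) pair across both years.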
Note: I saw a couple of answers using pandas.TimeGrouper, but it is deprecated and one has to use pandas.Grouper now.
Since your dataset is big, the question does not include a minimal sample of the data, and working on the full file consumes heavy resources, I would suggest looking at these pandas examples:
- group by weekdays
- group by time
- group by date range depending on each row
- group and count rows by month and year