How to split a large pandas time series at NaN values
Problem description
I have a pandas TimeSeries which looks like this:
2007-02-06 15:00:00 0.780
2007-02-06 16:00:00 0.125
2007-02-06 17:00:00 0.875
2007-02-06 18:00:00 NaN
2007-02-06 19:00:00 0.565
2007-02-06 20:00:00 0.875
2007-02-06 21:00:00 0.910
2007-02-06 22:00:00 0.780
2007-02-06 23:00:00 NaN
2007-02-07 00:00:00 NaN
2007-02-07 01:00:00 0.780
2007-02-07 02:00:00 0.580
2007-02-07 03:00:00 0.880
2007-02-07 04:00:00 0.791
2007-02-07 05:00:00 NaN
I would like to split the pandas TimeSeries every time one or more NaN values occur in a row. The goal is to end up with separate events:
Event1:
2007-02-06 15:00:00 0.780
2007-02-06 16:00:00 0.125
2007-02-06 17:00:00 0.875
Event2:
2007-02-06 19:00:00 0.565
2007-02-06 20:00:00 0.875
2007-02-06 21:00:00 0.910
2007-02-06 22:00:00 0.780
I could loop through every row, but is there a smarter way to do this?
Recommended answer
You can use numpy.split and then filter the resulting list. Here is one example, assuming that the data is in a DataFrame df and the column with the values is labeled "value":
import numpy as np

# split df at every row position where "value" is NaN
events = np.split(df, np.where(np.isnan(df.value))[0])
# removing NaN entries
events = [ev[~np.isnan(ev.value)] for ev in events if not isinstance(ev, np.ndarray)]
# removing empty DataFrames
events = [ev for ev in events if not ev.empty]
You will have a list with all the events separated by the NaN values.
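As an alternative to numpy.split, the same result can probably be obtained with pandas alone by labelling each run of valid values and grouping on that label. The following is only a sketch under assumptions: the helper name split_on_nan is hypothetical, the data is taken to be a Series (as in the question) rather than a DataFrame column, and the index is assumed to have no duplicate timestamps.

```python
import numpy as np
import pandas as pd


def split_on_nan(series):
    """Return a list with the consecutive non-NaN runs of `series` (hypothetical helper)."""
    is_nan = series.isna()
    # Every NaN increments the running count, so all valid values
    # between two NaN gaps end up with the same group id.
    group_id = is_nan.cumsum()
    valid = series[~is_nan]
    return [chunk for _, chunk in valid.groupby(group_id[~is_nan])]


# Sample data copied from the question, hourly from 2007-02-06 15:00.
index = pd.date_range("2007-02-06 15:00", periods=15, freq=pd.Timedelta(hours=1))
values = [0.780, 0.125, 0.875, np.nan, 0.565, 0.875, 0.910, 0.780,
          np.nan, np.nan, 0.780, 0.580, 0.880, 0.791, np.nan]
ts = pd.Series(values, index=index)

events = split_on_nan(ts)
for number, event in enumerate(events, start=1):
    print(f"Event{number}:")
    print(event)
```

Because the NaN rows are dropped before grouping, consecutive NaN values never produce empty events, so no separate filtering step is needed.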