我有一个带有日期和数字的数据框(df)。我想在日期上加上数字。如何使用pd.offsets()将df ['additional_days']系列添加到df ['start_date']系列中?有一个更好的方法吗?


  start_date另外_days
  
  2018-03-29 360
  
  2018-07-31 0
  
  2018-11-01 360
  
  2016-11-03 720
  
  2018-12-04 480


尝试时出现错误

df['start_date'] + pd.offsets.Day(df['additional_days'])


这是错误

TypeError                                 Traceback (most recent call last)
pandas/_libs/tslibs/offsets.pyx in pandas._libs.tslibs.offsets._BaseOffset._validate_n()

/opt/conda/lib/python3.6/site-packages/pandas/core/series.py in wrapper(self)
    117         raise TypeError("cannot convert the series to "
--> 118                         "{0}".format(str(converter)))
    119

TypeError: cannot convert the series to <class 'int'>

During handling of the above exception, another exception occurred:

TypeError                                 Traceback (most recent call last)
<ipython-input-76-03920804db29> in <module>
----> 1 df_test['start_date'] + pd.offsets.Day(df_test['additional_days'])

/opt/conda/lib/python3.6/site-packages/pandas/tseries/offsets.py in __init__(self, n, normalize)
   2219     def __init__(self, n=1, normalize=False):
   2220         # TODO: do Tick classes with normalize=True make sense?
-> 2221         self.n = self._validate_n(n)
   2222         self.normalize = normalize
   2223

pandas/_libs/tslibs/offsets.pyx in pandas._libs.tslibs.offsets._BaseOffset._validate_n()

TypeError: `n` argument must be an integer, got <class 'pandas.core.series.Series'>

最佳答案

使用pd.to_timedelta

import pandas as pd
#df['start_date'] = pd.to_datetime(df.start_date)

df['start_date'] + pd.to_timedelta(df.additional_days, unit='d')

#0   2019-03-24
#1   2018-07-31
#2   2019-10-27
#3   2018-10-24
#4   2020-03-28
#dtype: datetime64[ns]

关于python - 将系列而不是整数传递给Pandas偏移量,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/54581339/

10-13 07:13
查看更多