我进行了30年的调查,对植物中的花朵和水果有一个很大的data.frame。我想在某些行中添加零(0),这些行代表植物在特定月份没有flowersfruits(因为它是季节性物种)的个体。

例:

Year Month Flowers Fruits
2004 6      25      2
2004 7      48      4
2005 7      20      1
2005 8      16      1

我想添加不包含零值的月份,所以我在考虑一个识别丢失月份并用0填充的月份的函数。

谢谢。

最佳答案

## x is the data frame you gave in the question

x <- data.frame(
  Year = c(2004, 2004, 2005, 2005),
  Month = c(6, 7, 7, 8),
  Flowers = c(25, 48, 20, 16),
  Fruits = c(2, 4, 1, 1)
)

## y is the data frame that will provide the missing values,
## so you can replace 2004 and 2005 with whatever your desired
## time interval is

y <- expand.grid(Year = 2004:2005, Month = 1:12)

## this final step fills in missing dates and replaces NA's with zeros

library(tidyr)
x <- merge(x, y, all = TRUE) %>%
  replace_na(list(Flowers = 0, Fruits = 0))

## if you don't want to use tidyr, you can alternatively do

x <- merge(x, y, all = TRUE)
x[is.na(x)] <- 0

看起来像这样:
head(x, 10)

#    Year Month Flowers Fruits
# 1  2004     1       0      0
# 2  2004     2       0      0
# 3  2004     3       0      0
# 4  2004     4       0      0
# 5  2004     5       0      0
# 6  2004     6      25      2
# 7  2004     7      48      4
# 8  2004     8       0      0
# 9  2004     9       0      0
# 10 2004    10       0      0

10-08 17:16