Extract or compute hindcasts and forecasts for a fitted mvgam object

Extract or compute hindcasts and forecasts for a fitted mvgam object

Usage

# S3 method for mvgam
forecast(object, newdata, data_test, n_cores = 1, type = "response", ...)

Arguments

object: list object of class mvgam or jsdgam. See mvgam()
newdata: Optional dataframe or list of test data containing the same variables that were included in the original data used to fit the model. If included, the covariate information in newdata will be used to generate forecasts from the fitted model equations. If this same newdata was originally included in the call to mvgam, then forecasts have already been produced by the generative model and these will simply be extracted and plotted. However if no newdata was supplied to the original model call, an assumption is made that the newdata supplied here comes sequentially after the data supplied in the original model (i.e. we assume there is no time gap between the last observation of series 1 in the original data and the first observation for series 1 in newdata)
data_test: Deprecated. Still works in place of newdata but users are recommended to use newdata instead for more seamless integration into R workflows
n_cores: Deprecated. Parallel processing is no longer supported
type: When this has the value link (default) the linear predictor is calculated on the link scale. If expected is used, predictions reflect the expectation of the response (the mean) but ignore uncertainty in the observation process. When response is used, the predictions take uncertainty in the observation process into account to return predictions on the outcome scale. When variance is used, the variance of the response with respect to the mean (mean-variance relationship) is returned. When type = "terms", each component of the linear predictor is returned separately in the form of a list (possibly with standard errors, if summary = TRUE): this includes parametric model components, followed by each smooth component, but excludes any offset and any intercept. Two special cases are also allowed: type latent_N will return the estimated latent abundances from an N-mixture distribution, while type detection will return the estimated detection probability from an N-mixture distribution
...: Ignored

Value

An object of class mvgam_forecast containing hindcast and forecast distributions. See mvgam_forecast-class for details.

Details

Posterior predictions are drawn from the fitted mvgam and used to simulate a forecast distribution

Examples

# \donttest{
  # Simulate data with 3 series and AR trend model
  simdat <- sim_mvgam(n_series = 3, trend_model = AR())

  # Fit mvgam model
  mod <- mvgam(
    y ~ s(season, bs = 'cc', k = 6),
    trend_model = AR(),
    noncentred = TRUE,
    data = simdat$data_train,
    chains = 2,
    silent = 2
  )

  # Hindcasts on response scale
  hc <- hindcast(mod)
  str(hc)
#> List of 15
#>  $ call              :Class 'formula'  language y ~ s(season, bs = "cc", k = 6)
#>   .. ..- attr(*, ".Environment")=<environment: 0x558336aa6b70> 
#>  $ trend_call        : NULL
#>  $ family            : chr "poisson"
#>  $ trend_model       :List of 7
#>   ..$ trend_model: chr "AR1"
#>   ..$ ma         : logi FALSE
#>   ..$ cor        : logi FALSE
#>   ..$ unit       : chr "time"
#>   ..$ gr         : chr "NA"
#>   ..$ subgr      : chr "series"
#>   ..$ label      : language AR()
#>   ..- attr(*, "class")= chr "mvgam_trend"
#>   ..- attr(*, "param_info")=List of 2
#>   .. ..$ param_names: chr [1:8] "trend" "tau" "sigma" "ar1" ...
#>   .. ..$ labels     : chr [1:8] "trend_estimates" "precision_parameter" "standard_deviation" "autoregressive_coef_1" ...
#>  $ drift             : logi FALSE
#>  $ use_lv            : logi FALSE
#>  $ fit_engine        : chr "stan"
#>  $ type              : chr "response"
#>  $ series_names      : chr [1:3] "series_1" "series_2" "series_3"
#>  $ train_observations:List of 3
#>   ..$ series_1: int [1:75] 3 1 2 1 1 0 7 5 5 1 ...
#>   ..$ series_2: int [1:75] 2 0 0 1 3 0 4 8 4 2 ...
#>   ..$ series_3: int [1:75] 0 0 0 0 0 3 0 4 2 3 ...
#>  $ train_times       :List of 3
#>   ..$ series_1: int [1:75] 1 2 3 4 5 6 7 8 9 10 ...
#>   ..$ series_2: int [1:75] 1 2 3 4 5 6 7 8 9 10 ...
#>   ..$ series_3: int [1:75] 1 2 3 4 5 6 7 8 9 10 ...
#>  $ test_observations : NULL
#>  $ test_times        : NULL
#>  $ hindcasts         :List of 3
#>   ..$ series_1: num [1:1000, 1:75] 0 4 3 1 0 1 0 3 2 2 ...
#>   .. ..- attr(*, "dimnames")=List of 2
#>   .. .. ..$ : NULL
#>   .. .. ..$ : chr [1:75] "ypred[1,1]" "ypred[2,1]" "ypred[3,1]" "ypred[4,1]" ...
#>   ..$ series_2: num [1:1000, 1:75] 1 4 1 3 1 3 1 2 1 0 ...
#>   .. ..- attr(*, "dimnames")=List of 2
#>   .. .. ..$ : NULL
#>   .. .. ..$ : chr [1:75] "ypred[1,2]" "ypred[2,2]" "ypred[3,2]" "ypred[4,2]" ...
#>   ..$ series_3: num [1:1000, 1:75] 2 0 0 1 1 0 2 1 1 1 ...
#>   .. ..- attr(*, "dimnames")=List of 2
#>   .. .. ..$ : NULL
#>   .. .. ..$ : chr [1:75] "ypred[1,3]" "ypred[2,3]" "ypred[3,3]" "ypred[4,3]" ...
#>  $ forecasts         : NULL
#>  - attr(*, "class")= chr "mvgam_forecast"

  # Use summary() to extract hindcasts / forecasts for custom plotting
  head(summary(hc), 12)
#> # A tibble: 12 × 7
#>    series    time predQ50 predQ2.5 predQ97.5 truth type    
#>    <fct>    <int>   <dbl>    <dbl>     <dbl> <int> <chr>   
#>  1 series_1     1       1        0         5     3 response
#>  2 series_1     2       1        0         3     1 response
#>  3 series_1     3       0        0         3     2 response
#>  4 series_1     4       0        0         2     1 response
#>  5 series_1     5       1        0         3     1 response
#>  6 series_1     6       1        0         4     0 response
#>  7 series_1     7       4        1        11     7 response
#>  8 series_1     8       5        1        11     5 response
#>  9 series_1     9       4        1        10     5 response
#> 10 series_1    10       2        0         7     1 response
#> 11 series_1    11       2        0         6     3 response
#> 12 series_1    12       1        0         5     4 response

  # Or just use the plot() function for quick plots
  plot(hc, series = 1)
#> No non-missing values in test_observations; cannot calculate forecast score

  plot(hc, series = 2)
#> No non-missing values in test_observations; cannot calculate forecast score

  plot(hc, series = 3)
#> No non-missing values in test_observations; cannot calculate forecast score


  # Forecasts on response scale
  fc <- forecast(
    mod,
    newdata = simdat$data_test
  )
  str(fc)
#> List of 16
#>  $ call              :Class 'formula'  language y ~ s(season, bs = "cc", k = 6)
#>   .. ..- attr(*, ".Environment")=<environment: 0x558336aa6b70> 
#>  $ trend_call        : NULL
#>  $ family            : chr "poisson"
#>  $ family_pars       : NULL
#>  $ trend_model       :List of 7
#>   ..$ trend_model: chr "AR1"
#>   ..$ ma         : logi FALSE
#>   ..$ cor        : logi FALSE
#>   ..$ unit       : chr "time"
#>   ..$ gr         : chr "NA"
#>   ..$ subgr      : chr "series"
#>   ..$ label      : language AR()
#>   ..- attr(*, "class")= chr "mvgam_trend"
#>   ..- attr(*, "param_info")=List of 2
#>   .. ..$ param_names: chr [1:8] "trend" "tau" "sigma" "ar1" ...
#>   .. ..$ labels     : chr [1:8] "trend_estimates" "precision_parameter" "standard_deviation" "autoregressive_coef_1" ...
#>  $ drift             : logi FALSE
#>  $ use_lv            : logi FALSE
#>  $ fit_engine        : chr "stan"
#>  $ type              : chr "response"
#>  $ series_names      : Factor w/ 3 levels "series_1","series_2",..: 1 2 3
#>  $ train_observations:List of 3
#>   ..$ series_1: int [1:75] 3 1 2 1 1 0 7 5 5 1 ...
#>   ..$ series_2: int [1:75] 2 0 0 1 3 0 4 8 4 2 ...
#>   ..$ series_3: int [1:75] 0 0 0 0 0 3 0 4 2 3 ...
#>  $ train_times       :List of 3
#>   ..$ series_1: int [1:75] 1 2 3 4 5 6 7 8 9 10 ...
#>   ..$ series_2: int [1:75] 1 2 3 4 5 6 7 8 9 10 ...
#>   ..$ series_3: int [1:75] 1 2 3 4 5 6 7 8 9 10 ...
#>  $ test_observations :List of 3
#>   ..$ series_1: int [1:25] 0 2 4 4 3 4 3 2 4 1 ...
#>   ..$ series_2: int [1:25] 0 1 4 7 10 2 3 2 3 0 ...
#>   ..$ series_3: int [1:25] 0 1 1 1 2 0 1 1 0 0 ...
#>  $ test_times        :List of 3
#>   ..$ series_1: int [1:25] 76 77 78 79 80 81 82 83 84 85 ...
#>   ..$ series_2: int [1:25] 76 77 78 79 80 81 82 83 84 85 ...
#>   ..$ series_3: int [1:25] 76 77 78 79 80 81 82 83 84 85 ...
#>  $ hindcasts         :List of 3
#>   ..$ series_1: num [1:1000, 1:75] 0 4 3 1 0 1 0 3 2 2 ...
#>   .. ..- attr(*, "dimnames")=List of 2
#>   .. .. ..$ : NULL
#>   .. .. ..$ : chr [1:75] "ypred[1,1]" "ypred[2,1]" "ypred[3,1]" "ypred[4,1]" ...
#>   ..$ series_2: num [1:1000, 1:75] 1 4 1 3 1 3 1 2 1 0 ...
#>   .. ..- attr(*, "dimnames")=List of 2
#>   .. .. ..$ : NULL
#>   .. .. ..$ : chr [1:75] "ypred[1,2]" "ypred[2,2]" "ypred[3,2]" "ypred[4,2]" ...
#>   ..$ series_3: num [1:1000, 1:75] 2 0 0 1 1 0 2 1 1 1 ...
#>   .. ..- attr(*, "dimnames")=List of 2
#>   .. .. ..$ : NULL
#>   .. .. ..$ : chr [1:75] "ypred[1,3]" "ypred[2,3]" "ypred[3,3]" "ypred[4,3]" ...
#>  $ forecasts         :List of 3
#>   ..$ series_1: int [1:1000, 1:25] 0 0 0 2 1 2 1 0 2 1 ...
#>   ..$ series_2: int [1:1000, 1:25] 0 1 2 0 0 0 1 1 1 2 ...
#>   ..$ series_3: int [1:1000, 1:25] 0 0 1 0 5 0 2 1 0 0 ...
#>  - attr(*, "class")= chr "mvgam_forecast"
  head(summary(fc), 12)
#> # A tibble: 12 × 7
#>    series    time predQ50 predQ2.5 predQ97.5 truth type    
#>    <fct>    <int>   <dbl>    <dbl>     <dbl> <int> <chr>   
#>  1 series_1     1       1        0         5     3 response
#>  2 series_1     2       1        0         3     1 response
#>  3 series_1     3       0        0         3     2 response
#>  4 series_1     4       0        0         2     1 response
#>  5 series_1     5       1        0         3     1 response
#>  6 series_1     6       1        0         4     0 response
#>  7 series_1     7       4        1        11     7 response
#>  8 series_1     8       5        1        11     5 response
#>  9 series_1     9       4        1        10     5 response
#> 10 series_1    10       2        0         7     1 response
#> 11 series_1    11       2        0         6     3 response
#> 12 series_1    12       1        0         5     4 response
  plot(fc, series = 1)
#> Out of sample DRPS:
#> 17.506906

  plot(fc, series = 2)
#> Out of sample DRPS:
#> 21.961446

  plot(fc, series = 3)
#> Out of sample DRPS:
#> 28.240195


  # Forecasts as expectations
  fc <- forecast(
    mod,
    newdata = simdat$data_test,
    type = 'expected'
  )
  head(summary(fc), 12)
#> # A tibble: 12 × 6
#>    series    time predQ50 predQ2.5 predQ97.5 type    
#>    <fct>    <int>   <dbl>    <dbl>     <dbl> <chr>   
#>  1 series_1     1   1.39     0.762      2.87 expected
#>  2 series_1     2   0.777    0.343      1.60 expected
#>  3 series_1     3   0.624    0.271      1.47 expected
#>  4 series_1     4   0.521    0.224      1.15 expected
#>  5 series_1     5   0.698    0.307      1.50 expected
#>  6 series_1     6   1.25     0.501      2.32 expected
#>  7 series_1     7   4.48     2.68       7.86 expected
#>  8 series_1     8   5.03     2.68       8.41 expected
#>  9 series_1     9   4.41     2.28       7.80 expected
#> 10 series_1    10   2.26     0.952      4.06 expected
#> 11 series_1    11   1.97     1.06       3.85 expected
#> 12 series_1    12   1.60     0.821      3.46 expected
  plot(fc, series = 1)

  plot(fc, series = 2)

  plot(fc, series = 3)


  # Dynamic trend extrapolations
  fc <- forecast(
    mod,
    newdata = simdat$data_test,
    type = 'trend'
  )
  head(summary(fc), 12)
#> # A tibble: 12 × 6
#>    series    time  predQ50 predQ2.5 predQ97.5 type 
#>    <fct>    <int>    <dbl>    <dbl>     <dbl> <chr>
#>  1 series_1     1  0.118     -0.368     0.923 trend
#>  2 series_1     2  0.00390   -0.740     0.782 trend
#>  3 series_1     3  0.113     -0.536     1.01  trend
#>  4 series_1     4  0.0274    -0.714     0.763 trend
#>  5 series_1     5  0.00646   -0.658     0.688 trend
#>  6 series_1     6 -0.144     -1.01      0.437 trend
#>  7 series_1     7  0.260     -0.247     0.863 trend
#>  8 series_1     8 -0.0150    -0.580     0.553 trend
#>  9 series_1     9  0.0691    -0.530     0.636 trend
#> 10 series_1    10 -0.151     -0.909     0.442 trend
#> 11 series_1    11  0.0954    -0.467     0.792 trend
#> 12 series_1    12  0.262     -0.319     1.07  trend
  plot(fc, series = 1)

  plot(fc, series = 2)

  plot(fc, series = 3)

# }

Extract or compute hindcasts and forecasts for a fitted `mvgam` object

Usage

Arguments

Value

Details

See also

Examples