21 Generalized Additive Models

Generalized Additive Models (GAMs) are a flexible class of statistical models that extend traditional linear regression by allowing non-linear relationships between predictors and the response variable. Instead of assuming straight-line relationships, GAMs use smooth functions called splines that can capture complex patterns in the data.

This flexibility makes GAMs particularly useful for modeling real-world phenomena where relationships may be curved, seasonal, or otherwise non-linear. For example, the relationship between temperature and energy consumption often follows a U-shape that would be poorly captured by linear regression.

While GAMs are powerful and flexible, their estimated parameters can be difficult to interpret directly since they involve complex spline basis functions. Fortunately, the marginaleffects package provides tools to understand and visualize GAM results using the same intuitive workflow as simpler models - through predictions, counterfactual comparisons, and slopes.

21.1 Estimation

We will estimate a GAM model using the mgcv package and the simdat dataset distributed with the itsadug package:

library(marginaleffects)
library(itsadug)
library(mgcv)

simdat$Subject <- as.factor(simdat$Subject)

dim(simdat)
#> [1] 75600     6
head(simdat)
#>    Group      Time Trial Condition Subject         Y
#> 1 Adults   0.00000   -10        -1     a01 0.7554469
#> 2 Adults  20.20202   -10        -1     a01 2.7834759
#> 3 Adults  40.40404   -10        -1     a01 1.9696963
#> 4 Adults  60.60606   -10        -1     a01 0.6814298
#> 5 Adults  80.80808   -10        -1     a01 1.6939195
#> 6 Adults 101.01010   -10        -1     a01 2.3651969

Fit a model with a random effect and group-time smooths:

model <- bam(Y ~ Group + s(Time, by = Group) + s(Subject, bs = "re"),
             data = simdat)

summary(model)
#> 
#> Family: gaussian 
#> Link function: identity 
#> 
#> Formula:
#> Y ~ Group + s(Time, by = Group) + s(Subject, bs = "re")
#> 
#> Parametric coefficients:
#>             Estimate Std. Error t value Pr(>|t|)   
#> (Intercept)   2.0574     0.6903   2.980  0.00288 **
#> GroupAdults   3.1265     0.9763   3.202  0.00136 **
#> ---
#> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#> 
#> Approximate significance of smooth terms:
#>                         edf Ref.df    F p-value    
#> s(Time):GroupChildren  8.26  8.850 3649  <2e-16 ***
#> s(Time):GroupAdults    8.66  8.966 6730  <2e-16 ***
#> s(Subject)            33.94 34.000  569  <2e-16 ***
#> ---
#> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#> 
#> R-sq.(adj) =  0.609   Deviance explained =   61%
#> fREML = 2.3795e+05  Scale est. = 31.601    n = 75600

21.2 Predictions

Compute adjusted predictions for each observed combination of regressor in the dataset used to fit the model. This gives us a dataset with the same number of rows as the original data, but new columns with predicted values and uncertainty estimates:

pred <- predictions(model)
dim(pred)
#> [1] 75600    13
head(pred)
#> 
#>  Estimate Std. Error     z Pr(>|z|)    S   2.5 %  97.5 %
#>    -1.874      0.199 -9.41   <0.001 67.4 -2.2643 -1.4834
#>    -1.346      0.182 -7.41   <0.001 42.8 -1.7025 -0.9901
#>    -0.819      0.167 -4.90   <0.001 20.0 -1.1467 -0.4916
#>    -0.293      0.156 -1.88   0.0605  4.0 -0.5988  0.0129
#>     0.231      0.149  1.55   0.1204  3.1 -0.0606  0.5232
#>     0.753      0.146  5.17   <0.001 22.0  0.4675  1.0379
#> 
#> Type: response

We can easily plot adjusted predictions for different values of a regressor using the plot_predictions() function:

plot_predictions(model, condition = "Time")

21.3 Slopes

Marginal effects are slopes of the prediction equation. They are an observation-level quantity. The slopes() function produces a dataset with the same number of rows as the original data, but with new columns for the slope and uncertainty estimates:

mfx <- slopes(model, variables = "Time")
head(mfx)
#> 
#>  Estimate Std. Error    z Pr(>|z|)     S  2.5 % 97.5 %
#>    0.0261    0.00137 19.1   <0.001 267.9 0.0234 0.0288
#>    0.0261    0.00136 19.2   <0.001 270.3 0.0234 0.0288
#>    0.0261    0.00133 19.5   <0.001 280.1 0.0235 0.0287
#>    0.0260    0.00128 20.3   <0.001 301.0 0.0235 0.0285
#>    0.0259    0.00120 21.6   <0.001 339.9 0.0235 0.0282
#>    0.0257    0.00109 23.5   <0.001 404.5 0.0236 0.0279
#> 
#> Term: Time
#> Type: response
#> Comparison: dY/dX

We can plot marginal effects for different values of a regressor using the plot_slopes() function. This next plot shows the slope of the prediction equation, that is, the slope of the previous plot, at every value of the Time variable.

plot_slopes(model, variables = "Time", condition = "Time")

The marginal effects in this plot can be interpreted as measuring the change in Y that is associated with a small increase in Time, for different baseline values of Time.

21.4 Excluding terms

The predict() method of the mgcv package allows users to “exclude” some smoothing terms, using the exclude argument. You can pass the same argument to any function in the marginaleffects package:

predictions(model, newdata = "mean", exclude = "s(Subject)")
#> 
#>  Estimate Std. Error    z Pr(>|z|)     S 2.5 % 97.5 %
#>      11.7      0.695 16.9   <0.001 210.8  10.4   13.1
#> 
#> Type: response

See the documentation in ?mgcv:::predict.bam for details.

21.5 Warning

It may not always be appropriate to compute aggregated (aka, average or marginal) for models with splines. See Wood (2017) for a technical reference and this Cross Validated post for a discussion.