Normal mixed models

Chapter 2
Normal mixed models

In this chapter, we discuss in more detail the mixed model with normally distributed errors. We will refer to this as the ‘normal mixed model’. Of course, this does not imply that values of the response variables follow normal distributions because they are, in fact, mixtures of effects with different means. In practice, though, if a variable appears to have a normal distribution, the assumption of normal residuals and random effects is often reasonable.

In the examples introduced in Sections 1.1–1.4, we defined several mixed models using a notation chosen to suit each situation. In Section 2.1, we define the mixed model using a general matrix notation, which can be used for all types of mixed model. Matrix notation may at first be unfamiliar to some readers, and it is outwith the scope of this book to teach matrix algebra. A good introductory guide is Matrices for Statistics, Second Edition by Healy (2000). Once grasped, though, matrix notation can make the overall theory underlying mixed models easier to comprehend. Mixed models methods based on classical statistical techniques are described in Section 2.2, and in Section 2.3, the Bayesian approach to fitting mixed models will be introduced. These two sections can be omitted by readers who do not desire a detailed understanding of the more theoretical aspects of mixed models. In Section 2.4, some practical issues related to the use and interpretation of mixed models are considered, and a worked example illustrating several of the points made in Section 2.4 is described in Section 2.5. For those who wish a more in-depth understanding of the theory underlying mixed models, the textbook Mixed Models: Theory and Applications with R, Second Edition by Demidenko (2013) is recommended.

2.1 Model definition

In this section, the mixed model is defined using a general matrix notation that provides a compact means to specify all types of mixed model. We start by defining the fixed effects model, and then extend this notation to encompass the mixed model.

2.1.1 The fixed effects model

All fixed effects models can be specified in the general form

For example, in Section 1.2, Model B was presented as

This model used a subscript i to denote results from the ith patient and a subscript j to denote results on the jth treatment, in the context of a cross-over trial. In the general model notation, however, every observation is denoted separately with a single subscript. Thus, y₁ and y₂ could represent the observations from patient 1, y₃ and y₄ the observations from patient 2, and so on. The α terms in the general model will correspond to p₁, p₂, p₃, p₄, p₅ and p₆ and to t₁ and t₂ and are constants giving the size of the patient and treatment effects. The terms x_i1, x_i2, …, x_i8 are used in this example to indicate the patient and treatment to which the observation y_i belongs, and in this case will take the values one or zero. If y₁ is the observation from patient 1 who receives treatment 1, x₁₁ then will equal one (corresponding to α₁, which represents the first patient effect), x₁₂–x₁₆ will equal zero (as this observation is not from patients 2 to 6), x₁₇ will equal one (corresponding to α₇, representing the first treatment effect) and x₁₈ will equal zero. A further example to follow shortly should clarify this notation further.

The above model fits p + 1 fixed effects parameters, α₁–α_p, and an intercept term, μ. If there are n observations, then these may be written as

These can be expressed more concisely in matrix notation as

where

y = (y₁, y₂, y₃, …, y_n)′ = observed values,
α = (μ, α₁, α₂, …, α_p)′ = fixed effects parameters,
e = (e₁, e₂, e₃, …, e_n)′ = residuals,
σ² = residual variance,
I = n × n identity matrix.

The parameters in α may encompass several variables. In the above example, they covered patient effects and treatment effects. Both of these are qualitative or categorical variables, and we will refer to such effects as categorical effects. They are also sometimes referred to as factor effects. More generally, categorical effects are those where observations will belong to one of several classes. There may also be several covariate effects (such as age or baseline measurement) contained in α. These relate to variables that are measured on a quantitative scale. Several parameters may be required to model a categorical effect, but just one parameter is needed to model a covariate effect.

X is known as the design matrix and has the dimension n × p (i.e. n rows and p columns). It specifies values of fixed effects corresponding to each parameter for each observation. For categorical effects, the values of zero and one are used to denote the absence and presence of effect categories, and for covariate effects, the variable values themselves are used in X.

We will exemplify the notation with the following data, which are the first nine observations in a multi-centre trial of two treatments to lower blood pressure.

Centre	Treatment	Pre-treatment systolic BP	Post-treatment systolic BP
1	A	178	176
1	A	168	194
1	B	196	156
1	B	170	150
2	A	165	150
2	B	190	160
3	A	175	150
3	A	180	160
3	B	175	160

The observation vector y is formed from the values of the post-treatment systolic blood pressure:

If pre-treatment blood pressure and treatment were fitted in the analysis model as fixed effects (ignoring centres for the moment), then the design matrix would be

where the columns of the design matrix correspond to the parameters

We note in this case that the design matrix, X, is overparameterised. This means that there are linear dependencies between the columns, for example, we know that α₃ will be zero if α₂ = 1 and one if α₂ = 0. X could alternatively be specified omitting the α₃ column to correspond with the number of parameters actually modelled. However, the overparameterised form is used here since it is used for specifying contrasts by SAS procedures such as PROC MIXED (this procedure will be used to analyse most of the examples in this book).

V is a matrix containing the variances and covariances of the observations. In the usual fixed effects model, variances for all observations are equal, and no observations are correlated. Thus, V is simply σ²I.

2.1.2 The mixed model

The mixed model extends the fixed effects model by including random effects, random coefficients and/or covariance terms in the residual variance matrix. In this section, the general notation will be given, and in the following three sections, the specific forms of the covariance matrices for each type of mixed model will be specified.

Extending our fixed effects model to incorporate random effects (or coefficients), the mixed model may be specified as

for a model fitting p fixed effects parameters and q random effects (or coefficients) parameters. It will be recalled from Chapter 1 that random effects are assumed to follow a distribution, whereas fixed effects are regarded as fixed constants. The model can be expressed in matrix notation as

where y, X, α and e are as defined in the fixed effects model, and

Z is a second design matrix with dimension n × q giving the values of random effects corresponding to each observation. It is specified in exactly the same way as X was for the fixed effects, except that an intercept term is not included. If centres were fitted as random in the multi-centre example given previously, the β vector would then consist of three parameters, β₁, β₂ and β₃, corresponding to the three centres, and the Z matrix would be

Alternatively, if both the centre and the centre·treatment effects were fitted as random, then the vector of random effects parameters, β, would consist of the three centre parameters, plus six centre·treatment interaction parameters β₄, β₅, β₆, β₇, β₈ and β₉. The Z matrix would then be

Again, note that this matrix is overparameterised due to linear dependencies between the columns. It could alternatively have been written using four columns: 3 − 1 = 2 for the centre effects and (3 − 1) × (2 − 1) = 2 for the centre·treatment effects.

Covariance matrix, V

We saw in the fixed effects model that all observations have equal variances, and the observations are uncorrelated. This leads to the V matrix being diagonal. When random effects are fitted, we saw in Section 1.2 that this results in correlated observations. In the context of the cross-over trial, we saw that observations on the same patient were correlated (with covariance equal to the patient variance component), while those on different patients were uncorrelated. We now generalise this result, using the matrix notation.

The covariance of y, var(y) = V, can be written as

Since we assume that the random effects and the residuals are uncorrelated,

Since α describes the fixed effects parameters, var(Xα) = 0. Also, Z is a matrix of constants. Therefore,

We will let G denote var(β), and since the random effects are assumed to follow normal distributions, we may write β ∼ N(0, G). Similarly, we write var(e) = R, the residual covariance matrix, and e ∼ N(0, R). Hence,

In the following three sections, we will define the structure of the G and R matrices in random effects models, random coefficients models and covariance pattern models.

2.1.3 The random effects model covariance structure

The G matrix

The dimension of G is q × q, where q is equal to the total number of random effects parameters.

In random effects models, G is always diagonal (i.e. random effects are assumed uncorrelated). If just centre effects were fitted as random in the simple multi-centre example with three centres, then G would have the form

where $c02-math-0018$ is the centre variance component. If both centre and centre·treatment effects were fitted as random, then G would have the form

where $c02-math-0020$ is the centre·treatment variance component.

The R matrix

The residuals are uncorrelated in random effects models and R = σ²I:

The V matrix

We showed earlier that the variance matrix, V, has the form V = ZGZ′ + R.

ZGZ′ specifies the covariance due to the random effects. If just centre effects are fitted as random, then we obtain

This matrix could be obtained by the laborious process of matrix multiplication but it always has the same form. It has a block diagonal form with the size of blocks corresponding to the number of observations at each random effects category. The total variance matrix, V = ZGZ′ + R, is then

This also has a block diagonal form with the covariances for observations at the same centre equal to the random effects variance component, $c02-math-0024$ , and variance terms on the diagonal equal to the sum of the centre and residual variance components, $c02-math-0025$ . (We note that this corresponds to the results from the cross-over trial example introduced in Section 1.2, where the random effect was patient rather than centre.) If both centre and centre·treatment effects had been fitted as random, then

and

where $c02-math-0028$ . Thus, V again has a block diagonal form with a slightly more complicated structure. The centre·treatment variance component is added to the covariance terms for observations at the same centre and with the same treatment.

2.1.4 The random coefficients model covariance structure

The statistical properties of random coefficients models were described in the repeated measures example introduced in Section 1.4. We will define their covariance structure in terms of the general matrix notation we have just introduced for mixed models. Random coefficients models will be discussed in more detail in Section 6.5.

The following data will be used to illustrate the covariance structure. They represent measurement times for the first three patients in a repeated measures trial of two treatments.

Patient	Treatment	Time (days)
1	A	t₁₁
1	A	t₁₂
1	A	t₁₃
1	A	t₁₄
2	B	t₂₁
2	B	t₂₂
3	A	t₃₁
3	A	t₃₂
3	A	t₃₃

If patient and patient·time effects were fitted as random coefficients, then there would be six random coefficients. We will change notation from Chapter 1 for ease of reading to define these as β_{p, 1}, β_{pt, 1}, β_{p, 2}, β_{pt, 2}, β_{p, 3} and β_{pt, 3}, allowing an intercept (patient) and slope (patient·time) to be calculated for each of the three patients. The Z matrix would then be

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

Tags: Applied Mixed Models in Medicine

Dec 12, 2016 | Posted by admin in GENERAL & FAMILY MEDICINE | Comments Off

Like this:

Related

Stay updated, free articles. Join our Telegram channel

Basicmedical Key

Fastest Basicmedical Insight Engine

Normal mixed models

2.1 Model definition

2.1.1 The fixed effects model

2.1.2 The mixed model

Covariance matrix, V

2.1.3 The random effects model covariance structure

The G matrix

The R matrix

The V matrix

2.1.4 The random coefficients model covariance structure

Full access? Get Clinical Tree

Basicmedical Key

Fastest Basicmedical Insight Engine

Normal mixed models

2.1 Model definition

2.1.1 The fixed effects model

2.1.2 The mixed model

Covariance matrix, V

2.1.3 The random effects model covariance structure

The G matrix

The R matrix

The V matrix

2.1.4 The random coefficients model covariance structure

Share this:

Like this:

Related

Related posts:

Stay updated, free articles. Join our Telegram channel

Full access? Get Clinical Tree