Software for fitting mixed models

Chapter 9
Software for fitting mixed models

In this chapter, we will look at software for fitting mixed models, with a particular emphasis on the SAS package, which has been used to analyse the majority of our examples. In Section 9.1, we mention briefly some of other packages and programs that are available for fitting mixed models. Basic details on the use of the SAS procedure PROC MIXED for fitting normal mixed models are given in Section 9.2. In Section 9.3, we give details on using PROC GENMOD and PROC GLIMMIX for fitting GLMMs, and in Section 9.4, details on PROC MCMC for fitting models using a Bayesian approach.

9.1 Packages for fitting mixed models

There have been very many different programs and software packages developed to implement mixed models or multi-level models. These range from programs to implement a single type of model to comprehensive statistical packages such as SAS. Some are regularly updated to run on the latest versions of operating systems, while others are only available on relatively old operating systems. New products are constantly being introduced to the market, often for a very specific application. As an example, GEMMA is software ‘implementing the Genome-wide Efficient Mixed Model Association algorithm for a standard linear mixed model and some of its close relatives for genome wide association studies (GWAS)’. Further details can be found at (home.uchicago.edu/xz7/software.html). It is impractical to list all of the software available for fitting mixed models, and we will not attempt to try it. We will, however, say a few words about some of the packages available. Inclusion within the following list is not an endorsement of these packages, and neither should the absence of any package be taken as a criticism.

License fees for many of the more comprehensive statistical software packages can be quite expensive, and so it is a wise precaution to ensure that the capabilities match all of a user’s needs and not just those relating to mixed models. For those unable to afford license fees, it is worth mentioning R software. This is free software that is now used widely, but by no means exclusively, in the academic community. The authors of this book are not themselves R users, but some of our colleagues are enthusiastic, particularly about its graphics capabilities. It requires an initial investment of time to learn R but is well worth considering, especially when funding is an issue, though its mixed models capability is not as good as that provided by SAS.

SAS The MIXED and GLIMMIX procedures provide very versatile software for fitting mixed models. They can be used to fit all types of mixed models (random effects, random coefficients and covariance pattern models). In addition, the MCMC procedure may be used to fit mixed models using a Bayesian approach. More details are provided in Sections 9.2–9.4.

Genstat Mixed models for normal data can be fitted using the VCOMP and REML directives, and GLMMs can be fitted using the GLMM directive. In addition, a set of procedures to fit the GLMM models described by Lee and Nelder (1996) is available. These allow the random effects in GLMMs to assume alternative distributions to the normal distribution. The VSTRUCTURE directive can be used to specify covariance patterns. There is not a directive for analysing ordinal data; however, the Biometris GenStat Procedure Library written by members of the Centre for Biometry Wageningen (CBW) contains a suitable procedure, IRCLASS. This procedure library may be freely downloaded from http://www.wageningenur.nl/en/show/Biometris-GenStat-Procedure-Library-Edition-15.htm.

MLwiN MLwiN is a package originally developed by the Multilevel Modelling Project at the Institute of Education in London. It can be used to fit both normal mixed models, GLMMs, and mixed models for ordinal and categorical data. However, the choice of covariance patterns available is more limited than in PROC MIXED. An R command interface to the MLwiN software package is available via software called R2MLwiN. Further information on MLwiN can be found at http://www.bristol.ac.uk/cmm/software/mlwin/.

R R is a free software environment for statistical computing and graphics. Mixed models for normal data can be fitted in using ‘lme4’ and ‘nlme4’ functions. More details on these functions can be found in the textbook ‘Mixed-effects models in S and S-PLUS’ by Pinheiro and Bates (2000) (R functions were originally developed for use within S and S-PLUS). Covariance patterns can be specified using the ‘correlation’ argument. GLMMs can be fitted using the ‘glmm’ or ‘glmmpql’ functions. The ‘glmmpql’ function is discussed in the fourth edition of Modern applied statistics with S, Venables and Ripley (2002); the book also covers normal theory models; and there is online support for the book at www.stats.ox.ac.uk/pub/MASS4/. Interfaces have been written to allow use of both MlwiN and WinBUGS software to be accessed from within R (R2MLwiN and R2WinBUGS). More information on R is available at www.r-project.org.

SPSS Mixed models for normal data can be fitted using the MIXED procedure. GLMMs may be fitted using the GLMM procedure, which is available within the Advanced Statistics software ad-on.

Stata Random effects models may be fitted using the ‘xtmixed’ command. Non-normal mixed models may be fitted within the GLLAMM add-on software (see www.gllamm.org/). The range of available covariance pattern models appears to be more restricted than is available in SAS.

WinBUGS This is a package dedicated to Bayesian analysis using the Gibbs sampler and has been developed by the Medical Research Council Biostatistics Unit in Cambridge. It can be used to fit random effects models to all types of data. The package can be obtained at www.mrc-bsu.cam.ac.uk/bugs. It is free of charge at the time of writing. WinBUGS software may be accessed from R using R2WinBUGS. This writes a data file, input file, and a script, runs the script in WinBUGS, and returns the output simulations to R.

9.2 `PROC MIXED`

PROC MIXED is a SAS procedure for fitting normal mixed models. It can be used to fit any type of mixed model (random effects, random coefficients, covariance pattern or a combination). It has great flexibility, and there are many options available for defining mixed models and their output. By default, the REML estimation method is applied (see Section 2.2.1). Alternatively, other likelihood-based methods can be applied, and a Bayesian analysis is also available for random effects and random coefficients models (see Section 2.3). Documentation for PROC MIXED is perhaps most easily accessed online during a SAS session. Details are also available from support.sas.com/documentation/onlinedoc. Another excellent text on using SAS to fit mixed models is SAS System for Mixed Models, Second edition by Littell et al. (2006).

Our aim in this section is to give a basic description of the most useful PROC MIXED statements and options to enable those not wishing to learn the procedure in depth to perform mixed models analyses. Details on the many other options available can be found in the SAS documentation. The presentation in this section will assume a working knowledge of SAS.

Syntax

The statements available in PROC MIXED in version 9.3 are as follows: PROC MIXED statement, BY, CLASS, CONTRAST, ESTIMATE, ID, LSMEANS, LSMESTIMATE, MODEL, PARMS, PRIOR, RANDOM, REPEATED, SLICE, STORE, and WEIGHT statements. In addition, ODS statements can be used to control what is printed when running PROC MIXED and what is output in SAS datasets for handling by other procedures. The BY statement is the same as in other SAS procedures and will not be considered further in this chapter. We will mention all of the other statements, at least briefly, while concentrating on those statements we have used in this book. Those with older versions of SAS may find that some of the statements listed here are not implemented in their version.

Simple example

Data from the multi-centre hypertension trial introduced in Section 1.3 will be used to illustrate the use of PROC MIXED to fit a simple random effects model. The following code fits a random effects model with pre-treatment DBP (dbp1) and treatment effects fixed and centre and centre·treatment effects random.

PROC MIXED; CLASS centre treat;

MODEL dbp = dbp1 treat;

RANDOM centre centre*treat;

`The Mixed Procedure`
`Model Information`
`Data Set`	`WORK.A`
`Dependent Variable`	`dbp`
`Covariance Structure`	`Variance Components`
`Estimation Method`	`REML`
`Residual Variance Method`	`Profile`
`Fixed Effects SE Method`	`Model-Based`
`Degrees of Freedom Method`	`Containment`
`Class Level Information`
`Class`	`Levels`	`Values`
`centre`	`29`	`1 2 3 4 5 6 7 8 9 11 12 13 14`
		`15 18 23 24 25 26 27 29 30 31`
		`32 35 36 37 40 41`
`treat`	`3`	`A B C`
`Dimensions`
`Covariance Parameters`	`3`
`Columns in X`	`5`
`Columns in Z`	`108`
`Subjects`	`1`
`Max Obs Per Subject`	`288`
`Number of Observations`
`Number of Observations Read`	`288`
`Number of Observations Used`	`288`
`Number of Observations Not Used`	`0`

`Iteration History`
`Iteration`	`Evaluations`	`−2 Res Log Like`	`Criterion`
`0`	`1`	`2072.30225900`
`1`	`3`	`2055.64188178`	`0.00000322`
`2`	`1`	`2055.63936685`	`0.00000000`
`Convergence criteria met.`

The Model Information gives basic information on the dataset that was used and some methodological information on how the model was fitted (in this case using the default options). The Class Level Information provides us with the values in our dataset for those categorical variables specified in the CLASS statement. Dimensions show the size of the matrices that SAS is working with. It is potentially confusing that the number of subjects is shown as one when we have 288 patients in the trial. This is because our dataset has not been ‘blocked’ (see Section 6.2). The Number of Observations allows us to see if any observations have been excluded from analysis, perhaps, because of missing values. The Iteration History table shows how quickly the algorithm has converged. The criterion is a measure of convergence and should be very close to zero. In this example, convergence has been reached quickly. The table gives the value of minus twice the REML log likelihood. If this is very large, then it is likely that the covariance matrix, V, is singular, leading to an infinite likelihood. In this situation, the results would be invalid and the model would need to be respecified, probably refitting certain random effects as fixed or removing them altogether. The Evaluations column gives the number of evaluations of the objective function carried out at each iteration. Occasionally, a message stating that the G matrix is not positive definite will appear. Usually, this indicates that a negative variance component has been set to zero by PROC MIXED and is not a cause for concern. However, if this is not the case, it is possible that only a local maximum has been reached and a ‘grid search’ for other solutions might be advisable (see the PARMS statement).

`Covariance Parameter`
`Estimates`
`Cov Parm`	`Estimate`
`centre`	`6.4628`
`centre*treat`	`4.0962`
`Residual`	`68.3677`

This gives the variance parameter estimates. The variance component for centres is 6.46 and for the centre·treatment interaction 4.10. The residual estimate is 68.37.

`Fit Statistics`
`−2 Res Log Likelihood`	`2055.6`
`AIC (smaller is better)`	`2061.6`
`AICC (smaller is better)`	`2061.7`
`BIC (smaller is better)`	`2065.7`

This table gives information about the model fit. It includes several statistics based on the likelihood value. AIC is Akaike’s Information Criterion, AICC is a variation on AIC and BIC is Schwartz’s Bayesian Information Criterion. Each of these criteria attempts to make adjustment for the number of parameters fitted in the model so that models can be compared directly. However, note that SAS presents minus twice the usual criteria. They are of greatest value for comparing models using different covariance patterns (see Chapter 6). More detail on their calculation is given in the SAS documentation.

`Type 3 Tests of Fixed Effects`
	`Num`	`Den`
`Effect`	`DF`	`DF`	`F Value`	`Pr > F`
`dbp1`	`1`	`208`	`6.31`	`0.0128`
`treat`	`2`	`48`	`2.28`	`0.1131`

Results from F tests are given for all fixed effects. The ‘Type 3’ tests are the Wald tests described in Section 2.4.4. These tests arise naturally from mixed models and adjust for other effects fitted in the model. The denominator DF for the F tests are given by the residual DF or, if the fixed effect is contained, by the DF of the containing effect. Here dbp1 is tested using the residual DF of 208 and treat using the centre·treatment DF of 48, since it is contained within this effect. Other methods for calculating the denominator DF are available by using the DDFM option in the MODEL statement (see the following section).

We will now give basic details on the use of each statement within the procedure.

The `PROC MIXED` statement

This statement initiates the procedure, and its options are concerned with the general aspects of fitting the procedure and specifying its output.

PROC MIXED options;

Some of the more useful PROC MIXED options are listed in the following table.

The `MODEL` statement

This statement is used to specify the dependent variable and the fixed effects in the model. Options are available for specifying the model, significance tests and for requesting specific output to be printed.

MODEL <dependent> = <fixed effects>/options;

Options relating to the model specification

NOINT Requests no intercept be included in the model.

Options relating to statistical tests

`DDFM =<DF type>`	Selects DF type for F test denominator DF for all F tests.
`RESIDUAL`	Uses residual DF.
`CONTAIN`	If the effect of interest is contained within another effect, then the DF of the containing effect are used. If the effect is not contained, then the residual DF are used. These are the default DF.
`BETWITHIN`	Assigns between- or within-subject DF to effects when the `REPEATED` statement is used. Effects nested within the `SUBJECT` effect take the subject effect DF, others take the residual DF.
`SATTERTH`	Uses Satterthwaite’s approximation to the true DF (see Section 2.4.4).
`KR <(FIRSTORDER)> or KENWARDROGER <(FIRSTORDER)>`	Calculates DF suggested by Kenward and Roger (1997). This substitutes an improved estimate of the covariance matrix of the fixed and random effects into Satterthwaite’s approximation for the DF (see Section 2.4.3). It also causes the improved estimates of fixed and random effects variances to be used throughout the whole procedure (e.g. for calculating standard errors and test statistics). Addition of the `FIRSTORDER` or `LINEAR` sub-option will sometimes lead to an improved adjustment in certain types of covariance pattern model (see Section 6.2.4).

Options relating to output

Options relating to model checking

`RESIDUAL`	Produces residual plots when used with `ODS` graphics.
`INFLUENCE`	Influence diagnostic plots to be produced when used with `ODS` graphics.
`VCIRY`	Standardises the residuals in a covariance pattern model
	(see Section 6.2.4).

Example

PROC MIXED DATA=a; CLASS centre treat;

MODEL dbp = dbp1 treat/ E3 SOLUTION DDFM=KR OUTP=pred

   OUTPM=predm;

RANDOM centre centre*treat;

PROC PRINT NOOBS DATA=pred;

PROC PRINT NOOBS DATA=predm;

Use of the `SOLUTION` and `DDFM=KR` options

`Solution for Fixed Effects`
			`Standard`
`Effect`	`treat`	`Estimate`	`Error`	`DF`	`t Value`	`Pr > \|t\|`
`Intercept`		`61.7638`	`11.2440`	`284`	`5.49`	`<.0001`
`dbp1`		`0.2689`	`0.1084`	`284`	`2.48`	`0.0137`
`treat`	`A`	`2.9274`	`1.4109`	`25.6`	`2.07`	`0.0482`
`treat`	`B`	`1.6415`	`1.4453`	`25.7`	`1.14`	`0.2666`
`treat`	`C`	`0`	··	··	··	··

Note that use of DDFM=KR has caused the standard errors to be based on an improved estimate (as described by Kenward and Roger, 1997), which helps alleviate bias (see Section 2.4.3). These standard errors are also used to calculate the t statistics and their approximate DF. (It is somewhat confusing that the DDFM=KR option in fact relates to the standard error calculation. The DF are still calculated using Satterthwaite’s approximation but based on these improved standard errors.)

Use of the `E3` option

`Type 3 Coefficients` `for dbp1`
`Effect`	`treat`	`Row1`
`Intercept`
`dbp1`		`1`
`treat`	`A`
`treat`	`B`
`treat`	`C`

`Type 3 Coefficients for treat`
`Effect`	`treat`	`Row1`	`Row2`
`Intercept`
`dbp1`
`treat`	`A`	`1`
`treat`	`B`		`1`
`treat`	`C`	−1	−1

This shows that SAS has parameterised treatment effects by using differences from the last treatment (C).

Use of the `DDFM=KR` option

`Type 3 Tests of Fixed Effects`
	`Num`	`Den`
`Effect`	`DF`	`DF`	`F Value`	`Pr > F`
`dbp1`	`1`	`284`	`6.16`	`0.0137`
`treat`	`2`	`25`	`2.16`	`0.1364`

Note that these tests differ from the earlier analysis where the DDFM=KR option was not used.

Use of the `OUTP=` option

S

t

d

p E

a c r

t v e t r A L U R

i i n r d P P l o p e

e s t e d b c r r p w p s

n i r a b p c f e e D h e e i

t t e t p 1 f 1 d d F a r r d

1 6 29 C 86 97 . 1 84.9727 3.16871 18.4533 0.05 78.3272 91.618 1.0273

2 3 29 C 72 109 . . 88.1995 3.15570 17.9756 0.05 81.5690 94.830 −16.1995

3 6 5 B 109 117 . 5 96.4061 3.03187 47.2421 0.05 90.3076 102.505 12.5939

4 6 5 A 87 100 3 1 93.7690 2.78800 16.4832 0.05 87.8727 99.665 −6.7690

5 6 29 A 85 105 3 3 90.6234 3.25199 11.7170 0.05 83.5189 97.728 −5.6234

7 6 3 A 100 114 1 2 99.9119 3.01901 20.6093 0.05 93.6263 106.198 0.0881

8 6

etc.

Use of the `OUTPM=` option

S

t

d

p E

a c r

t v e t r A L U R

i i n r d P p l o p e

e s t e d b c r r p w p s

n i r a b p c f e e D h e e i

t t e t p 1 f 1 d d F a r r d

1 6 29 C 86 97 . 1 87.8475 1.31873 95.212 0.05 85.2296 90.4655 −1.8475

2 3 29 C 72 109 . . 91.0744 1.29205 85.384 0.05 88.5056 93.6432 −19.0744

3 6 5 B 109 117 . 5 94.8671 1.95897 189.699 0.05 91.0029 98.7313 14.1329

4 6 5 A 87 100 3 1 91.5816 1.17842 58.874 0.05 89.2235 93.9397 −4.5816

5 6 29 A 85 105 3 3 92.9262 1.13687 55.620 0.05 90.6484 95.2039 −7.9262

7 6 3 A 100 114 1 2 95.3463 1.61199 153.490 0.05 92.1617 98.5309 4.6537

etc.

Options relating to model checking

Use of these options is illustrated in Section 2.5 and Section 6.3.

The `RANDOM` statement

This statement can be used to specify random effects and/or random coefficients, β, and the form of their variance matrix, G. Several RANDOM statements may be specified, although usually only one will be needed. When no RANDOM statement is included, the results will be the same as those obtained using PROC GLM.

RANDOM <random effects>/options;

Output options

`CL`	Requests t-type confidence intervals for each random effects estimate.
`ALPHA=<p>`	Specifies size of confidence interval (default is 0.05 to give 95% confidence intervals).
`SOLUTION`	Solution for random effects is printed.

Options corresponding to the G and V matrices

`G`	Prints G.
`GCORR`	Prints correlation matrix corresponding to G.
`V`	Prints V.
`VCORR`	Prints correlation matrix corresponding to V.
`GDATA=<dataset>`	Specifies a fixed G matrix.

Example

PROC MIXED DATA=a; CLASS centre treat;

MODEL dbp = dbp1 treat/ DDFM=KR;

RANDOM centre centre*treat/ V SOLUTION;

Use of the `V` option

Estimated V Matrix for Subject 1
Row	Col1	Col2	Col3	Col4	Col5	Col6	Col7
1	78.9267	10.5590			6.4628
2	10.5590	78.9267			6.4628
3			78.9267	6.4628
4			6.4628	78.9267
5	6.4628	6.4628			78.9267
6						78.9267	6.4628

This matrix has a dimension equal to the number of observations in the dataset and is hence very large in this example. (If a SUBJECT option is used in the RANDOM statement, the matrix will relate to only the first subject, otherwise the full variance matrix for all observations is given.) The diagonal terms are equal to the sum of the three variance components. Off-diagonal terms are equal to the sum of the centre and centre·treatment variance components, 10.6, when patients are at the same centre and receive the same treatment; they are equal to the centre variance component, 6.5, when patients are at the same centre but receive different treatments; they and are zero when patients are at different centres (denoted by blank entries).

Use of the `SOLUTION` option

`Solution for Random Effects`
				`Std Err`
`Effect`	`treat`	`centre`	`Estimate`	`Pred`	`DF`	`t Value`	`Pr > \|t\|`
`centre`		`1`	`0.7966`	`1.8325`	`20.7`	`0.43`	`0.6683`
`centre`		`2`	`−2.3441`	`2.1884`	`14.5`	`−1.07`	`0.3016`
`centre`		`3`	`1.9760`	`2.2547`	`12.1`	`0.88`	`0.3978`
`centre`		`4`	`−0.1429`	`2.1410`	`16.4`	`−0.07`	`0.9476`
`centre`		`5`	`1.1779`	`2.1764`	`15.4`	`0.54`	`0.5961`
·
·
`centre`		`31`	`−4.7913`	`1.8470`	`21.5`	`−2.59`	`0.0167`
^·
^·

`centre*treat`	`A`	`1`	`−0.4605`	`2.2025`	`2.36`	`−0.21`	`0.8511`
`centre*treat`	`B`	`1`	`1.6900`	`2.1965`	`2.38`	`0.77`	`0.5106`
`centre*treat`	`C`	`1`	`−0.7246`	`2.2093`	`2.3`	`−0.33`	`0.7705`
`centre*treat`	`A`	`2`	`−0.8330`	`2.2406`	`1.56`	`−0.37`	`0.7543`
^·
^·
^·

The Estimate and Std Err Pred columns give the random effects estimates and their standard errors. The Wald t tests help to determine whether any centre is outlying.

Options for specifying the G matrix structure

However, the statement should be used cautiously when there are many group levels, since a large number of parameters will then need to be estimated.

The G matrix is always diagonal in random effects models, and the following options given for the RANDOM statement are not required. However, a non-diagonal G matrix is necessary to fit a random coefficients model to allow the intercepts and slopes of the random coefficients to be correlated and also for fitting certain covariance structures in hierarchical repeated measures designs (see Section 8.1). The G matrix is blocked by the specified SUBJECT effect, and the covariance pattern is specified by the TYPE options in a very similar way to the REPEATED statement.

RANDOM <random effects or coefficients>/

   SUBJECT=<blocking effect> TYPE=<covariance pattern>;

Covariance patterns available for the TYPE= option will be listed under the REPEATED statement.

Random coefficients models In these models, the random coefficients on the same subject are assumed to be correlated (e.g. intercepts and slope effects are correlated for each subject in a repeated measures trial). This is achieved using a RANDOM statement of the following form:

RANDOM <random coefficients>/SUBJECT=<subject effect> TYPE=UN;

For example, to model random coefficients for the effect of time, the code might be

RANDOM INT time/SUBJECT=patient TYPE=UN;

This fits the random coefficients patient and patient·time. Patient effects are specified by the reserved SAS term INT, which fits an intercept for each patient.

Example

Examples of fitting random coefficients models are given in Section 6.6.

The `LSMEANS` statement

This statement calculates the least squares mean estimates of specified fixed effects.

LSMEANS <fixed effects>/options;

Options

`CL`	Requests t-type confidence intervals for least squares means.
`ALPHA=<p>`	Specifies size of confidence interval (default is 0.05 for a 95% confidence interval).
`ADJUST=<type>`	Requests adjusted p-values for multiple comparisons (see manual for further details).
`DIFF`	Prints differences between each pair of least squares means.
`PDIFF`	Prints p-values for comparisons between each pair of least squares means, by default. `PDIFF=CONTROL` compares difference with a control group (first level of the `LSMEANS` variables, by default).
`AT<variables>= <values>`	Modifies the covariate value(s) at which `LSMEANS` are computed. Otherwise they are evaluated at mean values.

Example

PROC MIXED; CLASS centre treat;

MODEL dbp = dbp1 treat/ DDFM=KR;

RANDOM centre centre*treat;

LSMEANS treat/ CL COV CORR DIFF PDIFF;

`Least Squares Means`
			`Standard`
`Effect`	`treat`	`Estimate`	`Error`	`DF`	`t Value`	`Pr > \|t\|`	`Alpha`	`Lower`	`Upper`
`treat`	`A`	`92.3491`	`1.1233`	`52`	`82.21`	`<.0001`	`0.05`	`90.0951`	`94.6032`
`treat`	`B`	`91.0632`	`1.1695`	`51.4`	`77.86`	`<.0001`	`0.05`	`88.7157`	`93.4107`
`treat`	`C`	`89.4217`	`1.1326`	`58.7`	`78.96`	`<.0001`	`0.05`	`87.1552`	`91.6882`
`Least Squares Means`
`Effect`	`treat`	`Cov1`	`Cov2`	`Cov3`	`Corr1`	`Corr2`	`Corr3`
`treat`	`A`	`1.2619`	`0.2923`	`0.2770`	`1.0000`	`0.2225`	`0.2177`
`treat`	`B`	`0.2923`	`1.3678`	`0.2807`	`0.2225`	`1.0000`	`0.2119`
`treat`	`C`	`0.2770`	`0.2807`	`1.2827`	`0.2177`	`0.2119`	`1.0000`

The last six columns show covariance and correlation matrices for the least squares means.

`Differences of Least Squares Means`
				`Standard`
`Effect`	`treat`	`−treat`	`Estimate`	`Error`	`DF`	`t Value`	`Pr > \|t\|`	`Alpha`
`treat`	`A`	`B`	`1.2859`	`1.4300`	`23.8`	`0.90`	`0.3775`	`0.05`
`treat`	`A`	`C`	`2.9274`	`1.4109`	`25.6`	`2.07`	`0.0482`	`0.05`
`treat`	`B`	`C`	`1.6415`	`1.4453`	`25.7`	`1.14`	`0.2666`	`0.05`

`Differences of Least Squares Means`
`Effect`	`treat`	`−treat`	`Lower`	`Upper`
`treat`	`A`	`B`	`−1.6665`	`4.2384`
`treat`	`A`	`C`	`0.02511`	`5.8297`
`treat`	`B`	`C`	`−1.3314`	`4.6144`

Note that the DF differ markedly between the least squares means and their differences because they are estimated using information from different combinations of error strata.

The `ESTIMATE` statement

This statement calculates a linear function of fixed and/or random effects estimates ( $c09-math-0007$ and $c09-math-0008$ ).



ESTIMATE ‘<label>’ <fixed effect1> <values>

              <fixed effect2><values>

              … |

              <random effect 1> <values>

              <random effect 2> <values>

              … / options;

Options

`CL`	Requests t-type confidence intervals for estimate.
`ALPHA=<p>`	Specifies size of confidence interval (default is 0.05 for a 95% CI).
`DIVISOR=<n>`	Value by which to divide all coefficients so that fractional coefficients can be entered as integer numerators.
`E`	Shows the effect coefficients used in the estimate (useful to check ordering if interaction effects are used).

When the effect of interest is contained within another fixed interaction effect, the coefficients for this effect are automatically included in the estimate by SAS. For example, the following code for a fixed effects analysis of the multi-centre hypertension data

PROC MIXED; CLASS centre treat;

MODEL dbp = dbp1 treat centre centre*treat/ DDFM=KR;

ESTIMATE ‘A−B’ treat 1 −1 0/ E CL;

would cause the following coefficients to be used by SAS for constructing the estimate.

`Coefficients for A-B`
`Effect`	`treat`	`centre`	`Row1`
`centre`		`4`
`centre`		`5`
`centre`		`6`
`centre`		`7`
`centre`		`8`
`centre`		`9`
`centre`		`11`
`.`
`.`
`.`

`centre*treat`	`A`	`1`	`0.0385`
`centre*treat`	`B`	`1`	`-0.04`
`centre*treat`	`C`	`1`
`centre*treat`	`A`	`2`	`0.0385`
`centre*treat`	`B`	`2`	`-0.04`
`centre*treat`	`C`	`2`
`centre*treat`	`A`	`3`	`0.0385`
`.`
`.`
`.`

The centre·treatment coefficients are equal to 1/n_i, where n_i is the number of centres at which treatment i is received (26 for treatment A and 25 for treatment B). Because treatments A and B are not received at every centre, this estimate is in fact ‘non-estimable’, and SAS gives the following output:

`Estimates`
		`Standard`
`Label`	`Estimate`	`Error`	`DF`	`t Value`	`Pr > \|t\|`	`Alpha`	`Lower`	`Upper`
`A-B`	`Non-est`	`.`	`.`	`.`	`.`	`.`	`.`	`.`

However, if the containing effect is fitted as random, SAS does not include any coefficients for this containing effect in the estimate. For example, the following code for a random effects analysis of the multi-centre hypertension data

PROC MIXED; CLASS centre treat;

MODEL dbp = dbp1 treat/DDFM=KR;

RANDOM centre centre*treat;

ESTIMATE ‘A−B’ treat 1 −1 0/ E CL;

would cause the following coefficients to be used by SAS for constructing the estimate:

`Coefficients for A-B`
`Effect`	`treat`	`Row1`
`Intercept`
`dbp1`
`treat`	`A`	`1`
`treat`	`B`	`-1`
`treat`	`C`

`Estimates`
		`Standard`
`Label`	`Estimate`	`Error`	`DF`	`t Value`	`Pr > \|t\|`	`Alpha`	`Lower`	`Upper`
`A-B`	`1.2859`	`1.4300`	`23.8`	`0.90`	`0.3775`	`0.05`	`-1.6665`	`4.2384`

The ESTIMATE statement can also be used to estimate shrunken random effects by specifying the coefficients of the random effects. For example, to estimate the shrunken difference between the first two treatments within the first two centres, the following ESTIMATE statements can be added to the previous code:

ESTIMATE ‘C1, A−B’ treat 1 −1 0| centre*treat 1 − 1 0/ E CL;

ESTIMATE ‘C2, A−B’ treat 1 −1 0| centre*treat

          0 0 0 1 − 1 0/ CL;

These cause the following coefficients to be used for the first estimate

`Coefficients for C1, A-B`
`Effect`	`treat`	`centre`	`Row1`
`Intercept`
`dbp1`
`treat`	`A`		`1`
`treat`	`B`		`-1`
`treat`	`C`
`centre`		`1`
`centre`		`2`
`centre`		`3`
`.`
`.`
`.`
`centre`		`41`
`centre*treat`	`A`	`1`	`1`
`centre*treat`	`B`	`1`	`-1`
`centre*treat`	`C`	`1`
`centre*treat`	`A`	`2`
`centre*treat`	`B`	`2`
`centre*treat`	`C`	`2`
`centre*treat`	`A`	`3`
`.`
`.`
`.`

`centre*treat`	`A`	`41`
`centre*treat`	`B`	`41`
`centre*treat`	`C`	`41`

and give the following estimates:

`Estimates`
	`Standard`
`Label`	`Estimate`	`Error`	`DF`	`t Value`	`Pr > \|t\|`	`Alpha`	`Lower`	`Upper`
`C1, A-B`	`-0.8646`	`2.8933`	`7.35`	`-0.30`	`0.7733`	`0.05`	`-7.6405`	`5.9113`
`C2, A-B`	`1.6405`	`3.4054`	`2.65`	`0.48`	`0.6669`	`0.05`	`-10.0418`	`13.3229`

The `CONTRAST` statement

This statement can be used to define F tests for fixed and random effects. A single or multiple contrast, C, can be defined by

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

Tags: Applied Mixed Models in Medicine

Dec 12, 2016 | Posted by admin in GENERAL & FAMILY MEDICINE | Comments Off

`METHOD = <method>`	Specifies estimation method.
`= REML`	REML (default).
`= ML`	Maximum likelihood.
`= MINQUE`	Minimum variance quadratic variance estimation. This is an estimation method that we have not covered. It is based on equating mean squares to their expected values, and it is less computationally expensive than maximum likelihood. Further information can be found in Rao (1971 1972). Searle et al. (1992) show that the solution is equivalent to that obtained using just one iteration with REML.
`CL`	Prints Wald confidence intervals for covariance parameters. Default is 95% limits unless `ALPHA=<p>` option is used.
`ASYCOV`	Prints asymptotic covariance matrix of covariance parameters.
`EMPIRICAL`	Empirical estimator of variance matrix of fixed effects, $c09-math-0001$ , is used in place of V for all fixed effects variance estimates in covariance pattern models fitted using `REPEATED` statement (see Section 2.4.3).
`IC`	Displays additional information criteria to compare model fit. Criteria due to Akaike–AIC and AICC (1974), Hannan and Quinn–HQIC (1979), Schwarz–BIC (1978) and Bozdogan–CAIC (1987) are displayed.
`NOCLPRINT`	Suppresses printing of class levels. (This is useful when a `CLASS` effect has many categories.)
`NOINFO`	Suppresses printing of Model Information, Dimensions and Number of Observations tables.
`NOITPRINT`	Suppresses printing of Iteration History
`PLOTS options`	A variety of statistical graphics can be specified via the Output Delivery System (in more recent versions of SAS). See online help for details

`SOLUTION`	Solution for fixed effects is printed.
`CL`	Requests t-type confidence intervals for fixed effects given by `SOLUTION` option.
`ALPHA=<p>`	Specifies size of confidence intervals (default 0.05).
`E3`	Prints design matrix for all fixed effects.
`OUTP=<dataset>`	For each observation gives predicted values $c09-math-0002$ and residuals $c09-math-0003$ based on fixed and random effects. Approximate variances, standard errors and 95% confidence intervals for each predicted value are also listed.
`OUTPM=<dataset>`	Predicted values $c09-math-0004$ and residuals $c09-math-0005$ based on fixed effects only are given. Approximate variances, standard errors and 95% confidence intervals for each predicted value are also listed.

Like this:

Related

Stay updated, free articles. Join our Telegram channel

Basicmedical Key

Fastest Basicmedical Insight Engine

Software for fitting mixed models

9.1 Packages for fitting mixed models

9.2 `PROC MIXED`

Syntax

Simple example

The `PROC MIXED` statement

The `MODEL` statement

Options relating to the model specification

Options relating to statistical tests

Options relating to output

Options relating to model checking

Example

Use of the `SOLUTION` and `DDFM=KR` options

Use of the `E3` option

Use of the `DDFM=KR` option

Use of the `OUTP=` option

Use of the `OUTPM=` option

Options relating to model checking

The `RANDOM` statement

Output options

Options corresponding to the G and V matrices

Example

Use of the `V` option

Use of the `SOLUTION` option

Options for specifying the G matrix structure

Example

The `LSMEANS` statement

Options

Example

The `ESTIMATE` statement

Options

The `CONTRAST` statement

Full access? Get Clinical Tree

Basicmedical Key

Fastest Basicmedical Insight Engine

Software for fitting mixed models

9.1 Packages for fitting mixed models

9.2 PROC MIXED

Syntax

Simple example

The PROC MIXED statement

The MODEL statement

Options relating to the model specification

Options relating to statistical tests

Options relating to output

Options relating to model checking

Example

Use of the SOLUTION and DDFM=KR options

Use of the E3 option

Use of the DDFM=KR option

Use of the OUTP= option

Use of the OUTPM= option

Options relating to model checking

The RANDOM statement

Output options

Options corresponding to the G and V matrices

Example

Use of the V option

Use of the SOLUTION option

Options for specifying the G matrix structure

Example

The LSMEANS statement

Options

Example

The ESTIMATE statement

Options

The CONTRAST statement

Share this:

Like this:

Related

Related posts:

Stay updated, free articles. Join our Telegram channel

Full access? Get Clinical Tree

9.2 `PROC MIXED`

The `PROC MIXED` statement

The `MODEL` statement

Use of the `SOLUTION` and `DDFM=KR` options

Use of the `E3` option

Use of the `DDFM=KR` option

Use of the `OUTP=` option

Use of the `OUTPM=` option

The `RANDOM` statement

Use of the `V` option

Use of the `SOLUTION` option

The `LSMEANS` statement

The `ESTIMATE` statement

The `CONTRAST` statement