13 Multivariable Analysis
I Overview of Multivariable Statistics
1. To equalize research groups (i.e., make them as comparable as possible) when studying the effects of medical or public health interventions.
2. To build causal models from observational studies that help investigators understand which factors affect the risk of different diseases in populations (assisting clinical and public health efforts to promote health and prevent disease and injury).
3. To create clinical indices that can suggest the risk of disease in well people or a certain diagnosis, complications, or death in ill people.
Statistical models that have one outcome variable but more than one independent variable are generally called multivariable models (or multivariate models, but many statisticians reserve this term for models with multiple dependent variables).1 Multivariable models are intuitively attractive to investigators because they seem more “true to life” than models with only one independent variable. A bivariate (two-variable) analysis simply indicates whether there is significant movement in Y in tandem with movement in X. Multivariable analysis allows for an assessment of the influence of change in X and change in Y once the effects of other factors (e.g., A, B, and C) are considered.
Multivariable analysis does not enable an investigator to ignore the basic principles of good research design, however, because multivariable analysis also has many limitations. Although the statistical methodology and interpretation of findings from multivariable analysis are difficult for most clinicians, the methods and results are reported routinely in the medical literature.2,3 To be intelligent consumers of the medical literature, health care professionals should at least understand the use and interpretation of the findings of multivariable analysis as usually presented.
II Assumptions Underlying Multivariable Methods
A Conceptual Understanding of Equations for Multivariable Analysis
This statement could be made to look more mathematical simply by making a few slight changes:
Although equation 13-5 looks complex, it really means the same thing as equations 13-1 through 13-4.
B Best Estimates
This equation is true because is only an estimate, which can have error. When equation 13-6 is subtracted from equation 13-5, the following equation for the error term emerges:
C General Linear Model
Numerous procedures for multivariable analysis are based on the general linear model. These include methods with such imposing designations as analysis of variance (ANOVA), analysis of covariance (ANCOVA), multiple linear regression, multiple logistic regression, the log-linear model, and discriminant function analysis. As discussed subsequently and outlined in Table 13-1, the choice of which procedure to use depends primarily on whether the dependent and independent variables are continuous, dichotomous, nominal, or ordinal. Knowing that the procedures listed in Table 13-1 are all variations of the same theme (the general linear model) helps to make them less confusing. Detailing these methods is beyond the scope of this text but readily available both online* and in print.4
Table 13-1 Choice of Appropriate Procedure to Be Used in Multivariable Analysis (Analysis of One Dependent Variable and More than One Independent Variable)

You may also need

Full access? Get Clinical Tree

