The Analysis of Individual Health Trajectories Across the Life Course: Latent Class Growth Models Versus Mixed Models

  

BMI (kg/m−2), SD

BMI (kg/m−2), SD

BMI (kg/m−2), SD

Age

Sample size

Females

Males

Study sample

13

321

17.81, 2.14

16.93, 1.42

17.39, 1.88

14

279

18.66, 2.08

17.68, 1.49

18.23, 1.90

15

272

19.41, 2.14

18.38, 1.66

18.95, 2.01

16

275

20.01, 2.10

19.21, 1.70

19.66, 1.97

21

131

21.65, 2.48

21.20, 1.67

21.45, 2.17

27

133

22.03, 2.36

22.46, 2.17

22.22, 2.28

29

125

22.46, 2.58

22.94, 2.22

23.70, 2.41

32

314

22.85, 3.13

23.79, 2.41

23.28, 2.85

36

300

23.57, 3.47

24.60, 2.68

24.06, 3.16

42

336

24.09, 3.87

25.20, 2.92

24.62, 3.49


SD standard deviation






Analysis of Health Trajectories Over Time: Mixed Models


When analysing health trajectories over time, we should take into account the fact that repeated observations of each individual in the dataset are not independent (Singer and Willett 2003; Twisk 2013). Simple regression analyses or analysis of variance are unable to fully incorporate these correlated measures and therefore, more sophisticated statistical techniques are needed to analyse these data accordingly. Several techniques to do so are available, of which mixed models (MM) are the most well-known techniques.


Model Assumptions


Mixed models have relatively straightforward underlying assumptions. Because they are regression-based models MM rely on the same assumptions (Altman 1991). Specifically, the residuals of the outcome variable should be normally distributed and in addition, the residuals around the (random) intercept and slopes should approximate a normal distribution. This can be easily assessed in any statistical software package. MM are also well suited in the case of unequal repeated measurements and the model assumes data to be missing at random.

The idea behind MM (Twisk 2013) was initially developed in the social sciences, specifically for educational research. Investigating the performance of pupils within schools, researchers realised that the performances of pupils within the same class were not independent (in other words; these performances are correlated). This type of study design is characterised by a hierarchical structure; also referred to as a multilevel structure. Pupils are nested within classes and classes are nested within schools. More or less the same structure is seen in longitudinal datasets where observations are nested within the subject: observations of one subject over time are correlated. MM adjusts for this correlation between repeated observations by modelling the variability among individuals through the inclusion of both fixed and random effects. To understand the general idea behind a mixed model, it is important to realise that with simple regression analyses adjusting for a certain variable (for example gender) means that different intercepts are calculated. In a longitudinal setting (thus adjusting for subject), this then means that for each subject, different intercepts are calculated. The simplest form of a MM takes into account this random intercept only. Figure 9.1 demonstrates that a model with a random intercept only allows the intercepts to vary across subjects. In addition to a random intercept, it is also possible that the development over time varies across individuals (indicated by a random slope). This is demonstrated in Fig. 9.2, where only linear slopes are assumed.

A328113_1_En_9_Fig1_HTML.gif


Fig. 9.1
Visualisation of different intercepts for each subject (random intercept)


A328113_1_En_9_Fig2_HTML.gif


Fig. 9.2
Visualisation of differences in the development over time for each subject (random intercept and slope)


The Modelling of Time


Within life course epidemiology, MM are usually conducted to describe the development of a particular outcome variable over time (Beunen et al. 2002; Twisk 2013). In that case, the only independent variable of interest is a time variable, which can be included either as a continuous variable (indicating a linear development or other polynomial function of the outcome variable over time) or as a categorical variable. The modelling of time as a categorical variable allows for the incorporation of different developmental shapes during different stages in the life course, such as childhood, adolescence, adulthood and the elderly. This closely maps onto one of the focal points of life course research, which offers opportunities to understand the natural history of chronic disorders with a specific focus on health transitions and phases across the life course (Kuh and Ben-Shlomo 2004).


Comparing Models


Model fit of MM can be assessed by comparing the log likelihood values of neighbouring [nested] models. The likelihood ratio test (Bollen 1989) subsequently calculates the difference in -2 log-likelihood between two neighbouring models. This difference is assumed to follow a Chi-squared distribution with the number of degrees of freedom denoted as the difference in the number of parameters estimated with the two models.


Strategy and Results: Mixed Models


First, we will start by modelling a linear development of BMI over time. This model is relatively straightforward and includes the time variable as a continuous variable, which, in the case of the AGHLS would be 1, 2, 3, 4, 9, 15, 17, 20, 24, and 30. These values reflect the actual time between measurement rounds, taking into account the unequally spaced time points. Alternatively, ignoring the unequally spaced time points, this variable could also have values between 1 and 10 (one unit per measurement).

The simplest model includes a random intercept only. Table 9.2 demonstrates the results of this model and indicates that body mass index values are increasing over time. The regression coefficient for time for the linear model (model 1) is 0.240, so for each year, the BMI increases (linearly) with 0.240 kg/m2. The corresponding standard error (0.003; indicative of the preciseness in the estimation of the regression coefficient – the smaller the more precise) can be used to assess the statistical significance of the regression coefficient by calculating the z-statistic (Altman 1991; Twisk 2013). The z-statistic is calculated by dividing the regression coefficient with the corresponding standard error. Values above 1.96 indicate statistical significance.


Table 9.2
Results of the mixed models













































































































































































































































       
Slope (SE)

Slope (SE)
 

Model

Slope (SE)

−2 LL

Model

Males

Females

−2 LL

Linear
   
Linear
     

Model 1

0.240 (0.003)

10,469

Model 1

0.285 (0.005)

0.200 (0.004)

10,288

Model 2

0.242 (0.006)

9,425

Model 2

0.286 (0.009)

0.202 (0.008)

9,367

Quadratic
   
Quadratic
     

Model 1

0.436 (0.013); −0.007a

10,244

Model 1

0.516 (0.018); −0.008a

0.368 (0.017); −0.006a

10,035

Model 2

(0.0004)a

0.443 (0.014); −0.007a (0.0004)a

8,753

Model 2

(0.001)a

0.527 (0.019); −0.008a (0.001)a

(0.001)a

0.368 (0.017); −0.006a (0.001)a

8,688

Life course
   
Life course
     

Model 1
 
10,134

Model 1
   
9,916

Time2

0.936 (0.13)
 
Time2

0.788 (0.181)

6.277 (0.160)
 

Time 3

1.677 (0.13)
 
Time 3

1.520 (0.183)

5.753 (0.165)
 

Time 4

2.334 (0.13)
 
Time 4

2.344 (0.183)

5.088 (0.162)
 

Time 9

4.046 (0.17)
 
Time 9

4.318 (0.237)

4.706 (0.224)
 

Time 15

5.928 (0.17)
 
Time 15

5.633 (0.233)

4.286 (0.213)
 

Time 17

5.400 (0.17)
 
Time 17

6.117 (0.231)

3.749 (0.212)
 

Time 20

5.920 (0.12)
 
Time 20

6.871 (0.173)

2.255 (0.167)
 

Time 24

6.654 (0.13)
 
Time 24

7.636 (0.173)

1.729 (0.167)
 

Time 30

7.221 (0.12)
 
Time 30

8.253 (0.168)

0.982 (0.166)
 

Model 2
 
8,026

Model 2
   
7,958

Time2

0.809 (0.04)
 
Time2

0.895 (0.059)

0.747 (0.065)
 

Time 3

1.563 (0.06)
 
Time 3

1.622 (0.079)

1.487 (0.088)
 

Time 4

2.212 (0.06)
 
Time 4

2.153 (0.087)

2.276 (0.097)
 

Time 9

4.023 (0.13)
 
Time 9

3.711 (0.179)

4.350 (0.200)
 

Time 15

4.893 (0.14)
 
Time 15

4.214 (0.184)

5.641 (0.199)
 

Time 17

5.325 (0.16)
 
Time 17

4.552 (0.205)

6.176 (0.216)
 

Time 20

5.960 (0.14)
 
Time 20

5.079 (0.185)

6.934 (0.195)
 

Time 24

6.630 (0.15)
 
Time 24

5.711 (0.199)

7.642 (0.209)
 

Time 30

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

Nov 20, 2016 | Posted by in PUBLIC HEALTH AND EPIDEMIOLOGY | Comments Off on The Analysis of Individual Health Trajectories Across the Life Course: Latent Class Growth Models Versus Mixed Models

Full access? Get Clinical Tree

Get Clinical Tree app for offline access