Share on
BACKGROUND
Vaccination against COVID-19 was considered the key strategy in limiting the spread of the highly mutable SARS-CoV-2 virus. In this context, observational studies remain crucial to assess and monitor the effectiveness of approved vaccines. However, observational vaccine effectiveness (VE) studies, although crucial for understanding the real-world impact of vaccines, are prone to bias (e.g., due to confounding) [1, 2]. Compared to randomized trials, where all participants have an equal probability to receive the vaccine, in observational studies the choice to get vaccinated or not is not evenly distributed across the population. Indicatively, confounding factors like age, pre-existing health conditions, comorbidities, and socio-economic background influence the individual decision to get vaccinated and the probability of a COVID-19 associated event, thus introducing potential bias in the VE estimates [3]. The risk of bias due to differences between vaccinated and unvaccinated populations could be amplified by the seasonal booster campaigns implemented in Italy and other European countries, targeting the elderly population and those with risk factors.
Several statistical methods have been proposed in the literature to consider the presence of potential confounding factors, aiming to balance the difference between those who received the intervention and those who did not. These methods include covariates adjustment, stratification, matching, regression models that use instrumental variables, and methods based on propensity score (PS), representing the probability of receiving a specific treatment, given a set of observed covariates [4].
The performance of these methods in limiting bias could significantly impact the accuracy of VE estimates, particularly in the context of seasonal COVID-19 vaccine programmes. Nonetheless, there is a lack of studies comparing VE estimates considering potential confounding, especially in the context of large, real-world datasets.
Based on published data recorded in Italy [5], we aimed to compare estimates of relative VE (rVE) using different PS-based methods – i.e., inclusion of the PS as an adjustment variable, inverse probability weight (IPW), stabilized IPW, truncated stabilized inverse probability weights (SIPW) and matching – [6-12] with a reference approach based on a Cox proportional hazards regression analysis.
MATERIALS AND METHODS
Study design
We conducted a retrospective cohort study based on the individual data previously used by our study group to estimate the rVE of a booster dose of the bivalent (Original/Omicron BA.4-5) mRNA vaccine in preventing severe COVID-19 among people ≥60 years during a period with prevalent circulation of the Omicron XBB subvariant (3 April-4 June 2023) [5].
Specifically, we analyzed time to infection leading to severe COVID-19 consequences (i.e., hospitalization or death within 28 days of the swab date for a test that was SARS-CoV-2-positive) comparing individuals who had received only the first booster dose of an mRNA vaccine at least 120 days earlier and those who had received a second or third booster dose of the bivalent Original/Omicron BA.4-5 mRNA vaccine during the study period.
After splitting individual data into multiple records to account for the time-varying vaccination status, we estimated rVE both overall and by time since booster administration (i.e., 15-60 days, 61-120 days, 121-180 days and 181-265 days).
More details about the data sources, the selection of the study population, and the outcome classification have been published elsewhere [5].
Statistical analysis
Reference model
The reference model (M0) used in the original analysis was a multivariable Cox proportional hazards model M0 include the following demographic and clinical variables as covariates to adjust for: sex, age group (5-year age group from 60-64 to 90-94, then grouping individuals ≥95), country of birth (Italian-born or foreign-born), geographical area where the vaccination took place (NUTS 21 regions/autonomous provinces of Italy), and high-risk group (none or individuals residing in long-term care facilities and/or with health risk conditions; see Supplementary Table S1 available online). Additional covariates included the days since 1st booster dose (≤180, 181-365, >365), days since prior infection (≤180, 181-365, >365, no prior infection) and type of 1st booster vaccine (mRNA monovalent, mRNA bivalent Original/Omicron BA.1, mRNA bivalent Original/Omicron BA.4-5).
Alternative models
In order to take into account potential imbalance between the analyzed groups in the observed covariates we estimated adjusted VE against severe COVID-19 using seven alternative PS-based Cox regression models (M1-M7), as reported in Table 1. The alternative models are still Cox proportional hazards, but instead of using covariates for adjustment (as in the reference model), covariates are used for the calculation of PS, weighting and matching.
To calculate the PS we used a logistic regression model based on individual data, estimating the probability of receiving a second or third booster dose of the bivalent Original/Omicron BA.4-5 mRNA vaccine as a function of the covariates used in model M0. The first alternative model (M1) was based on adjusting for the propensity score (PS), used as the only covariate in addition to the exposure variable (vaccination status). The second model (M2) was adjusted including the covariates considered in model M0 plus the PS. In models M3-M6, observations were weighted and only the exposure variable was included. Specifically, in model M3, observations were weighted using the inverse probability weighting approach (IPW) with individual weights defined as 1/PS if the subject received the treatment and 1/(1-PS) otherwise. To account for the different size of the treated/untreated groups and to reduce the variance of the estimated hazard ratios (HRs), we calculated the stabilized inverse probability weights (SIPW) by dividing the IPW by the total sum of the IPWs of the subjects in the same treatment group; and used these weights in model M4. To further reduce the variance due to extreme values of the PS and to study the sensitivity of the model to these values, we calculated the stabilized truncated inverse probability weights (STIPW) including only cases with PS between the 1st and the 99th percentile (M5) or between the 5th and 95th percentile (M6) when calculating SIPW. Finally, we matched 1:1 without replacement the observations according to the PS, before splitting the data, using the nearest neighbour matching with a calliper of 0.05, and then run a univariate Cox regression analysis on split data from the matched sample (M7).
All models based on IPW (M3-M6) were run using a robust variance estimator [6]. To verify the covariates balance after IPW, SIPW, STIPW and PS-matching (M3-M7) we used the Standardized Mean Difference (SMD), assuming a satisfactory balance where SMD values were less than 0.1.
For all models, the HRs and 95% confidence intervals (CI) were used to calculate the adjusted rVE against severe COVID-19 using the formula ((1-HR)x100). In addition, we calculated the BIC and AIC for all models to compare their performance. All the analyses were carried out with RStudio 2023.12.1+402 under R 4.3.2 [13].
RESULTS
We analysed data on 11,879,461 subjects included in the original study [2], of which 1,970,598 (16.6%) received at least the second booster dose. Among all subjects, there were 4,405 (0.04%) severe events, most of which occurred before receiving a second or third booster dose (n=3,565; 0.036%). After receiving a second or third booster doses, the highest event rate was observed in the time interval 121-180 days post administration (n=467; 0.025%) (Table 2).
All the coefficient estimates obtained with the logistic regression model used to calculate the PS were statistically significant, although there was not much heterogeneity between the two treatment groups (Figure 1). The estimated area under the curve (AUC) obtained with the model is 0.72.
After matching (3,941,196 subjects, 33.2% of the total, including all vaccinated individuals, were matched successfully), all the potential confounding variables appeared balanced according to the SMD values (SMD<0.1). The same was observed after weighting data through IPW, SIPW and the STIPW (truncation at level 1% removed 239,155 subjects, 2.01% of the total; truncation at level 5% removed 1,217,007 individuals, 10.24% of the total) (Figure 2 and Supplementary Figure S1 available online).
Model comparisons
The overall analysis shows homogeneous results between all models, with the point estimate going from a minimum of 16.4% with model M7 to 22.1% with model M5. The 95% CI width also does not change much increasing slightly when the observations in the models M3-M6 are weighted (from 12.7 in the models M0-M2 to 15.4 in model M7). Both BIC and AIC, using the “lower is better” rule, indicate as better performing the models based on SIPW, with little difference between using truncation or not (Table 3 and Supplementary Figure S2 available online).
The results from the reference model (M0) show that, compared with a first booster of an mRNA vaccine received at least 120 days earlier, the rVE against severe COVID-19 for a second or a third booster dose of bivalent Original/Omicron BA.4-5 mRNA vaccine in the time interval 15-60 days post-administration was 45.6% (95% CI: 1.6-69.9). Relative VE progressively decrease to 14.3% (95% CI: 1.6-25.3) in the interval 181-265 days post-administration (Table 3 and Figure 3). The same decreasing trend is shown by all the models, except models M5 and M6 where we used the STIPW. In these cases, the rVE obtained in the first time-interval post-administration is lower than that obtained in the second one. The rVE estimates with 95% CIs obtained with models M1, M2 and M7 are similar to those obtained with model M0, although models M1 and M7 showed slightly lower estimates than M0 in the first time-interval post-administration (42.3% and 39.6%, respectively, both with a 95% lower bound of opposite sign than M0). In addition, model M1 shows slightly higher estimates in the other time-intervals. Model M2 shows estimates very close to those from model M0 (45.6% vs 45.8% in the 15-60 days interval, 24.7% vs 25.1% in the 61-120 days interval, 17.0% vs 17.3% in the 121-180 days interval, and 14.3% vs 14.5 in the 181-265 days interval). There are no differences between the rVE estimated with model M3 with respect to those estimated using model M4. The rVE estimates and 95% CIs obtained with the IPW-based models M3-M6 are generally similar to those derived from the reference model M0, except in the first period post-administration (15-60 days). This applies especially to model M6, which estimates a rVE more than 6 times lower and with a much higher variability than that derived from M0 (7.1%, 95% CI: -91.2-54.9). In the other time-intervals post-administration the differences appear much reduced, with slightly higher rVE estimates and slightly larger 95% CIs for models M3-M6 compared to M0. Looking at the AIC and BIC values, conclusions are similar to those made previously for the overall analysis (Table 3).
DISCUSSION
Our study showed homogeneous VE estimates among all the models. In particular, two groups appear, whereby the rVE values estimated by models including PS as a covariate (M1 and M2) and by the model run after PS-matching (M7) are very close to the rVE values estimated by the reference multivariable model (M0). On the other hand, PS models based on IPW (M3), SIPW (M4) and STIPW (M5 and M6) yield estimates that are further away from the reference multivariable model (M0), especially in the early time-interval after vaccination for STIPW models (M5 and M6).
According to several studies [9-11], SIPW (M4) can increase the variance estimate compared to that obtained with IPW (M3). Our results show the same estimated values, probably due to the homogeneity (i.e., absence of excessive skewness) of the PS values between the treated and untreated groups. A factor that influenced the difference of the estimates obtained with models M5 and M6 compared to the other models is related to having removed records with extreme PS values from the sample (truncation at level 1% removed 239,155 individuals, 2.01% of the total; truncation at level 5% removed 1,217,007 individuals, 10.24% of the total). Truncation is used to improve estimation accuracy by truncating extreme values of the PS [12], many of them falling in the first exposure level (15-60 days after vaccine uptake), when the lowest percentage of severe events (study outcome) was observed.
Based on our results, we found no clear advantage in applying PS-based methods instead of multivariable analysis. Previous studies have shown that these methodologies can produce similar results under certain conditions [14, 15], especially when the starting dataset is relatively balanced and the sample size is large [8, 16], as in our case. Moreover, it is important to note that, even using PS methods, a residual bias due to unmeasured confounders might remain, as with all analytic methods based on observational data [17, 18].
Although PS-based methods are sensitive to misspecifications of the model used to calculate it (e.g., omission of interaction effects, or misspecification of functional form of the included covariates [19]), they may still prove advantageous when many covariates are considered as potential confounders in the analysis, in which case reference models might result unstable. Furthermore in addition to the PS-based methods presented in this article, there are other PS-based methods in the literature, aiming to optimise survival data [20] and time-dependent exposure analyses [21-23]. The purpose of this study, however, was to present and compare the most widely used methods in the literature.
CONCLUSIONS
Our study, based on a large study population and a relatively low number of potential confounders, does not show advantages in using PS-based models compared to using a traditional multivariable model.
Other Information
Acknowledgments
Preliminary results of this study were presented as an abstract at the PNRR meeting held in Pavia, Lombardy, Italy, 11-12 September 2024. Daniele Petrone has a temporary PNRR contract since 19 July 2023.
Authors’ contributions
PP, MF and DP designed the paper; DP, CS, AMU and MF ensured quality of COVID-19 surveillance; DP, supported by MF, AMU, CS and EAF, carried out the analysis; DP, MF, MA, AMU, EAF, CS and PP wrote the manuscript which was reviewed and approved by all Authors.
Conflict of interest statement
The Authors declare that they have no competing interests.
Funding source
This research was supported by EU funding within the NextGeneration EU-MUR PNRR Extended Partnership Initiative on Emerging Infectious Diseases (Project n. PE00000007 INF-ACT).
Address for correspondence: Daniele Petrone, Dipartimento di Malattie Infettive, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy. E-mail: daniele.petrone@iss.it
Figures and tables
Figure 1. Density distribution of the propensity score for individuals aged ≥60 years who received either a second or third booster dose of the bivalent Original/Omicron BA.4-5 mRNA vaccine, Italy, 3 April-4 June 2023 (n=11,879,461).
Figure 2. Standardized mean differences for covariates used in the models before and after observation matching (A), and weighting with IPW (B) and weighting with SIPW (C).
Figure 3. Point estimation and 95% confidence interval of rVE against severe COVID-19 of a second or third booster of the bivalent Original/Omicron BA.4-5 mRNA vaccine relative to a first booster of an mRNA vaccine received at least 120 days earlier with truncated lower bounds for visual clarity, Italy, 3 April-4 June 2023.
| Model name | Differences with respect to baseline Cox proportional hazard model | Covariates included |
|---|---|---|
| M0 | - | Exposure level, sex, age group, country of birth, geographical area, high-risk group, days since 1st booster vaccine, days since prior infection, type of 1st booster vaccine |
| M1 | Including (only) the PS | Exposure level, PS |
| M2 | Including (also) the PS | Exposure level, sex, age group, country of birth, geographical area, high-risk group, days since 1st booster vaccine, days since prior infection, type of 1st booster vaccine, PS |
| M3 | Weighting observations with IPW | Exposure level |
| M4 | Weighting observations with SIPW | Exposure level |
| M5 | Weighting observations with STIPW 1% | Exposure level |
| M6 | Weighting observations with STIPW 5% | Exposure level |
| M7 | Matching observations | Exposure level |
| Exposure level: time since administration of a second or third booster dose of the bivalent Original/Omicron BA.4-5 mRNA vaccine: 15-60 days, 61-120 days, 121-180 days and 181-265 days; sex: male, female; age group: 5-year age group from 60-64 to 90-94, then grouping individuals ≥90; country of birth: Italian-born or foreign-born; Geographical area: geographical area where the vaccination took place (NUTS 2, which are the 19 regions and two autonomous provinces of Italy); high-risk group: none or residents in long-term care facilities and individuals with health risk conditions (Supplementary Table S1 available online); days since 1st booster vaccine: ≤180 days, 181-365 days, >365 days; days since prior infection: ≤180 days, 181-365 days, >365 days, no prior infection; type of 1st booster vaccine: mRNA monovalent, mRNA bivalent Original/Omicron BA.1, mRNA bivalent Original/Omicron BA.4-5. | ||
| PS: propensity score; IPW: inverse probability weight; SIPW: stabilized inverse probability weights; STIPW: stabilized truncated inverse probability weights. | ||
| Time since booster | Number of individualsa | Number of events | Percentage (%) |
|---|---|---|---|
| First booster ≥120 days | 9,912,672 | 3,565 | 0.036 |
| Second or third booster | |||
| 15-60 days | 90,807 | 11 | 0.012 |
| 61-120 days | 715,375 | 135 | 0.019 |
| 121-180 days | 1,843,177 | 467 | 0.025 |
| 181-265 days | 1,259,968 | 227 | 0.018 |
| Total | 13,821,999 | 4,405 | 0.032 |
| aThe sum of the number of individuals in each time interval is greater than the total number of individuals included in the analysis because of the possible change in vaccination status during the observation period leading an individual to be counted more than once. | |||
| First booster vaccine: Comirnaty monovalent Original strain mRNA vaccine (BNT162b2 mRNA, BioNTech-Pfizer); Spikevax monovalent Original strain mRNA vaccine (mRNA-1273, Moderna); Comirnaty bivalent Original/Omicron BA.1 mRNA vaccine (BNT162b2 mRNA, BioNTech-Pfizer); Spikevax bivalent Original/Omicron BA.1 mRNA vaccine (mRNA-1273.214, Moderna) or Comirnaty bivalent Original/Omicron BA.4-5 mRNA vaccine (BNT162b2 mRNA, BioNTech-Pfizer). Second or third booster vaccine: Comirnaty bivalent Original/Omicron BA.4-5 mRNA vaccine (BNT162b2 mRNA, BioNTech-Pfizer). | |||
| Time since 2nd or 3rd booster | M0 | M1 | M2 | M3 | M4 | M5 | M6 | M7 | |
|---|---|---|---|---|---|---|---|---|---|
| 15-60 days | Point Es. | 45.6 | 42.3 | 45.8 | 34.5 | 34.5 | 18.6 | 7.1 | 39.6 |
| 95% CI | 1.6-69.9 | -4.5-68.1 | 2.0-70.1 | -32.2-67.5 | -32.2-67.5 | -64.0-59.6 | -91.2-54.9 | -9.5-66.7 | |
| In. Width | 68.3 | 72.6 | 68.1 | 99.7 | 99.7 | >100 | >100 | 76.2 | |
| 61-120 days | Point Es. | 24.7 | 25.8 | 25.1 | 30.7 | 30.7 | 31.6 | 27.9 | 23.8 |
| 95% CI | 10.5-36.7 | 11.7-37.7 | 10.8-37.0 | 16.0-42.9 | 16.0-42.9 | 16.5-44.0 | 10.8-41.7 | 8.7-36.5 | |
| In. Width | 26.2 | 26.0 | 26.2 | 26.9 | 26.9 | 27.5 | 30.9 | 27.8 | |
| 121-180 days | Point Es. | 17.0 | 18.3 | 17.3 | 21.2 | 21.2 | 22.0 | 21.4 | 15.5 |
| 95% CI | 8.4-24.8 | 9.7-26.0 | 8.7-25.1 | 11.9-29.5 | 11.9-29.5 | 12.6-30.4 | 10.9-30.7 | 5.7-24.3 | |
| In. Width | 16.4 | 16.3 | 16.4 | 17.6 | 17.6 | 17.8 | 19.8 | 18.6 | |
| 181-265 days | Point Es. | 14.3 | 17.0 | 14.5 | 15.5 | 15.5 | 16.4 | 17.9 | 11.3 |
| 95% CI | 1.6-25.3 | 4.7-27.7 | 1.9-25.5 | 0.0-28.5 | 0.0-28.5 | 0.7-29.6 | 1.2-31.8 | -2.8-23.4 | |
| In. Width | 23.7 | 23.0 | 23.6 | 28.5 | 28.5 | 28.9 | 30.6 | 26.2 | |
| Overall | Point Es. | 18.2 | 19.7 | 18.5 | 21.6 | 21.6 | 22.1 | 21.3 | 16.4 |
| 95% CI | 11.6-23.3 | 13.1-25.8 | 11.9-24.6 | 14.5-28.2 | 14.5-28.2 | 14.8-28.7 | 13.3-28.6 | 8.3-23.7 | |
| In. Width | 12.7 | 12.7 | 12.7 | 13.7 | 13.7 | 13.9 | 15.3 | 15.4 | |
| AIC (overall) | 138,461.7 | 142,356.6 | 138,461.5 | 278,371.3 | 2.0 | 2.0 | 2.0 | 47,330.3 | |
| BIC (overall) | 138,704.6 | 142,369.3 | 138,710.7 | 278,377.7 | 8.4 | 8.4 | 8.2 | 47,335.9 | |
| Point Es.: Point Estimation; 95% CI: 95% confidence interval; In. Width: Interval Width; AIC: Akaike’s information criterion; BIC: Bayesian information criterion; M0: multivariable model; M1: PS model; M2: PS and M0 covariates; M3: IPW, M4: SIPW; M5: STIPW 1%-99%; M6: STIPW 5%-95%; M7: matching model; rVE: relative vaccine effectiveness; PS: propensity score; IPW: inverse probability weight; SIPW: stabilized inverse probability weights; STIPW: stabilized truncated inverse probability weights. | |||||||||

pdf
