The propensity score is the probability of treatment assignment conditional on observed baseline characteristics. Adjustments to eliminate bias can be made by using the propensity score instead of using all the covariates one by one.[2] To reduce the bias in the estimation of treatment effects, with the help of the propensity score method proposed by Rosenbaum and Rubin, the error is reduced by balancing the units in the two groups in terms of the variables used. On the other hand, it is possible to test whether there is a difference between two groups (the treatment and control) by using the Mann-Whitney U test, t-test, chi-square test or other pairwise comparison tests.
Propensity score, in a n-unit data set with no missing observations, i. the individual's propensity score value is the conditional probability of being assigned a particular treatment (Yi=1) versus the control group (Yi=0) according to the observed covariate vector xi. In other words, propensity scores are defined as the conditional probability of assigning a unit to a particular treatment condition, given the observed variables. The propensity score can be used in both observational studies and experimental studies. Propensity score can be obtained using the logistic regression model, probit model, discriminant analysis, and cluster analysis.[3] The purpose of calculating the propensity score is to be able to conclude that two units, one in the treatment group and the other in the control group, with the same or very close propensity scores, are randomly assigned to each group. If there is no data loss in the independent variables and the distribution of the independent variables fits the multivariate normal distribution, discriminant analysis is used while estimating the propensity score. In addition, the covariances of the independent variables should be equal at each group level. If nominal or ordinal scaled variable(s) are used among the independent variables in the discriminant analysis, these two assumptions cannot be met. In this case, logistic regression analysis is used instead of discriminant analysis. In the logistic regression analysis, it is not necessary to provide these assumptions for the independent variables. Logistic regression analysis provides an alternative to discriminant analysis and crosstabs in case of various assumptions distortions (such as normal distribution and lack of common covariance). In case the dependent variable is binary or discrete variable containing more than two levels, it is an alternative to linear regression analysis, since the assumption of normality is not provided. On the other hand, when the number of groups (clusters) to which the observed units would be assigned is not known exactly, cluster analysis can be used in propensity score estimation. In discriminant and logistic regression analysis, the number of groups to be assigned is known. In practice, the propensity score is most often estimated using a logistic regression model, in which treatment status is regressed on observed baseline characteristics. The estimated propensity score is the predicted probability of treatment derived from the fitted regression model.[4] As the number of covariates in the model increases, it may become difficult to find units with similar characteristics for units in the treatment group. Adjustments to eliminate bias in the sample with the propensity score can be made by using the propensity score instead of using all the covariates one by one. Four different propensity score methods are used for removing the effects of confounding while estimating the effects of treatment on outcomes: propensity score matching, stratification (or subclassification) on the propensity score, inverse probability of treatment weighting using the propensity score, and covariate adjustment using the propensity score.[5] With the propensity score calculated by choosing one of these methods, resampling is performed and the effect of the factor is estimated for the new sample obtained. If there are two units with the same propensity score, one in the case group and one in the control group, then it is thought that the two units are randomly assigned to each group, that is, they are in the case or control groups with equal similarity. Therefore, propensity score can be said to be a balancing score.[6]
In the article entitled "Factors Associated with Advanced Tricuspid Regurgitation After Left-Side Double Valve Replacement in Propensity-Scorematched Analysis," the authors aimed to investigate the association of progression of tricuspid regurgitation (TR) following double valve replacement by comparing the tricuspid valve (TV) repair and no-TV repair groups. They also aimed to analyze the outcomes of repair groups and patients with unrepaired mild-to-moderate TR. A total of 157 patients, 52% of whom were women, participated in the study and the data were evaluated using propensity score analysis. A comparison of baseline clinical and echocardiographic parameters between TV repair and no-TV repair groups was made, and it was observed that AF was significantly lower in the no-TV repair group, while the TV repair group had a significantly longer cardiopulmonary bypass duration. In addition, there was no statistically significant difference between the groups in valvular etiology. On the other hand, the no-TV repair group had significantly decreased pulmonary artery pressure and decreased TR degree. The no-TV repair group had a significantly smaller tricuspid annulus diameter, and a smaller left atrial diameter. It was also observed that neurological complications did not make a significant difference between the groups. However, the TV repair group had a lesser degree of TR and the mechanical ventilation and intensive care unit (ICU) stay were significantly shorter in the no-TV repair group. While group comparisons of continuous variables were made using the Mann-Whitney U test, categorical variables were compared using the chi-square test.
In the next step of the study, two groups were evaluated with the help of propensity score analysis. Baseline characteristics, postoperative ICU and hospital stay, ventilation time, and mortality were examined through propensity score analysis. As previously mentioned, when assumptions such as the measurement type of the independent variable or the normal distribution of the dependent variable are not provided, logistic regression analysis is used to estimate the propensity score. The relationship between eight predictors (age, sex, atrial fibrillation, ejection fraction, diameter of tricuspid annulus, rheumatic etiology, left ventricular end-diastolic diameter, left ventricular end-systolic diameter) and moderate and severe TR was examined in the logistic model. Rheumatic etiology was statistically significant for TR progression in the univariate analysis and it was an independent factor for postoperative progression of TR on multivariable analysis. No-TV repair (reference repair) and rheumatic etiology were independently associated with presence of progression TR in all groups for propensity score matched data in the conditional multivariable logistic regression analysis. In this study, with the help of propensity score analysis, rheumatic etiology was shown to be an independent marker for the progression of TR. On the other hand, it was also independently associated with progression of TR in patients with mild-to-moderate no-TV repair TR. According to the results of TV repair and no TV repair binary group comparisons, the postoperative TR grade decreases with TV repair and the two groups were found to have a similar operative mortality rate and potential complications.
1) Rosenbaum PR. Observational studies (Springer Series in
Statistics). New York: Springer-Verlag; 1995.
2) D'Agostino RB Jr. Propensity score methods for bias reduction
in the comparison of a treatment to a non-randomized control
group. Stat Med 1998;17:2265-81.
3) Rubin DB, Thomas N. Matching using estimated
propensity scores: relating theory to practice. Biometrics
1996;52:249-64.
4) Austin PC. An Introduction to Propensity Score Methods
for Reducing the Effects of Confounding in Observational
Studies. Multivariate Behav Res 2011;46:399-424.