Publication | Open Access
The intermediate endpoint effect in logistic and probit regression
256
Citations
33
References
2007
Year
An intermediate (surrogate or mediating) endpoint lies between an independent variable and the ultimate outcome, and clinical studies often target it to influence the final endpoint. The study aims to expose a limitation of common logistic/probit approaches for binary outcomes and to propose a more accurate product‑of‑coefficients method. The authors model the binary outcome using both true binary and latent continuous dichotomization, then evaluate methods with plots and simulations, and derive a solution grounded in latent variable theory. They find that widely used methods distort intermediate endpoint effect estimates and lead to incorrect conclusions, whereas the product‑of‑coefficients approach based on latent variable modeling yields accurate results.
An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used.The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results.The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods.Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression.More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models.Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect.
| Year | Citations | |
|---|---|---|
Page 1
Page 1