Publication | Closed Access
Semiparametric Regression for Clustered Data Using Generalized Estimating Equations
Citations: 280
References: 30
Year: 2001
Profile-kernel Method; Parameter Estimation; Density Estimation; Engineering; High-dimensional Method; Estimation Statistic; Semiparametric Regression; Equations; Kernel Method; Longitudinal Data; Nonparametric; Biostatistics; Bayesian Methods; Statistical Inference; Semiparametric Efficient Score; Public Health; Estimation Theory; Statistics; Semi-nonparametric Estimation
Abstract

We consider estimation in a semiparametric generalized linear model for clustered data using estimating equations. Our results apply to the case where the number of observations per cluster is finite, whereas the number of clusters is large. The mean of the outcome variable μ is of the form g(μ) = X^T β + θ(T), where g(·) is a link function, X and T are covariates, β is an unknown parameter vector, and θ(t) is an unknown smooth function. Kernel estimating equations proposed previously in the literature are used to estimate the infinite-dimensional nonparametric function θ(t), and a profile-based estimating equation is used to estimate the finite-dimensional parameter vector β. We show that for clustered data, this conventional profile-kernel method often fails to yield a √n-consistent estimator of β along with appropriate inference unless working independence is assumed or θ(t) is artificially undersmoothed, in which case asymptotic inference is possible. To gain insight into these results, we derive the semiparametric efficient score of β, which is found to have a complicated form, and show that, unlike for independent data, the profile-kernel method does not yield a score function asymptotically equivalent to the semiparametric efficient score of β, even when the true correlation is assumed and θ(t) is undersmoothed. We illustrate the methods with an application to infectious disease data and evaluate their finite-sample performance through a simulation study.

KEY WORDS: Asymptotics; Clustered data; Consistency; Efficiency; Generalized estimating equations; Kernel method; Longitudinal data; Nonparametric regression; Partially linear model; Profile method; Sandwich estimator; Semiparametric efficient score; Semiparametric efficiency bound
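As a rough illustration of the profile-kernel idea described in the abstract, the following Python sketch fits a partially linear model under working independence. It is a minimal sketch under several assumptions that do not come from the paper: an identity link g(μ) = μ, a Gaussian kernel with a local-constant (Nadaraya-Watson) smoother, a fixed bandwidth, and simulated data with a shared random intercept per cluster. The function names (`kernel_smoother_matrix`, `profile_kernel_fit`, `simulate_clustered`) are hypothetical, and the code is not the authors' implementation.

```python
import numpy as np


def kernel_smoother_matrix(t, bandwidth):
    """Row-normalized Gaussian kernel weights: (S @ y)[i] estimates E[y | T = t[i]]."""
    diffs = (t[:, None] - t[None, :]) / bandwidth
    weights = np.exp(-0.5 * diffs ** 2)
    return weights / weights.sum(axis=1, keepdims=True)


def profile_kernel_fit(y, X, t, bandwidth):
    """Profile-kernel estimate of (beta, theta) assuming identity link and
    working independence (all observations are smoothed as if uncorrelated)."""
    S = kernel_smoother_matrix(t, bandwidth)
    # Profile out theta: remove the kernel-smoothed part of y and of each column of X.
    y_tilde = y - S @ y
    X_tilde = X - S @ X
    # Least squares on the partial residuals gives the estimate of beta.
    beta_hat, *_ = np.linalg.lstsq(X_tilde, y_tilde, rcond=None)
    # Plug beta_hat back in and smooth the remainder to estimate theta at each t[i].
    theta_hat = S @ (y - X @ beta_hat)
    return beta_hat, theta_hat


def simulate_clustered(n_clusters=300, cluster_size=3, beta=(1.0, -0.5), seed=0):
    """Simulated clustered data (an assumption for illustration, not the paper's design)."""
    rng = np.random.default_rng(seed)
    n = n_clusters * cluster_size
    t = rng.uniform(0.0, 1.0, n)
    X = rng.normal(size=(n, 2)) + t[:, None]  # covariates correlated with T
    # A shared random intercept induces within-cluster correlation.
    b = np.repeat(rng.normal(scale=0.7, size=n_clusters), cluster_size)
    eps = rng.normal(scale=0.5, size=n)
    y = X @ np.asarray(beta) + np.sin(2.0 * np.pi * t) + b + eps
    return y, X, t


if __name__ == "__main__":
    y, X, t = simulate_clustered()
    beta_hat, theta_hat = profile_kernel_fit(y, X, t, bandwidth=0.1)
    print("estimated beta:", beta_hat)
```

The sketch ignores within-cluster correlation when smoothing and when solving for β, which corresponds to the working-independence case the abstract identifies as one in which the profile-kernel approach still permits √n-consistent estimation. Valid standard errors would additionally require a cluster-robust (sandwich) variance estimate, which is omitted here.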