首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
In this paper we consider a semiparametric regression model involving a d-dimensional quantitative explanatory variable X and including a dimension reduction of X via an index βX. In this model, the main goal is to estimate the Euclidean parameter β and to predict the real response variable Y conditionally to X. Our approach is based on sliced inverse regression (SIR) method and optimal quantization in Lp-norm. We obtain the convergence of the proposed estimators of β and of the conditional distribution. Simulation studies show the good numerical behavior of the proposed estimators for finite sample size.  相似文献   

2.
This work focuses on the estimation of distribution functions with incomplete data, where the variable of interest Y has ignorable missingness but the covariate X is always observed. When X is high dimensional, parametric approaches to incorporate X—information is encumbered by the risk of model misspecification and nonparametric approaches by the curse of dimensionality. We propose a semiparametric approach, which is developed under a nonparametric kernel regression framework, but with a parametric working index to condense the high dimensional X—information for reduced dimension. This kernel dimension reduction estimator has double robustness to model misspecification and is most efficient if the working index adequately conveys the X—information about the distribution of Y. Numerical studies indicate better performance of the semiparametric estimator over its parametric and nonparametric counterparts. We apply the kernel dimension reduction estimation to an HIV study for the effect of antiretroviral therapy on HIV virologic suppression.  相似文献   

3.
Abstract.  A semiparametric mixture model is characterized by a non-parametric mixing distribution Q (with respect to a parameter θ ) and a structural parameter β common to all components. Much of the literature on mixture models has focused on fixing β and estimating Q . However, this can lead to inconsistent estimation of both Q and the order of the model m . Creating a framework for consistent estimation remains an open problem and is the focus of this article. We formulate a class of generalized exponential family (GEF) models and establish sufficient conditions for the identifiability of finite mixtures formed from a GEF along with sufficient conditions for a nesting structure. Finite identifiability and nesting structure lead to the central result that semiparametric maximum likelihood estimation of Q and β fails. However, consistent estimation is possible if we restrict the class of mixing distributions and employ an information-theoretic approach. This article provides a foundation for inference in semiparametric mixture models, in which GEFs and their structural properties play an instrumental role.  相似文献   

4.
Abstract. We propose a spline‐based semiparametric maximum likelihood approach to analysing the Cox model with interval‐censored data. With this approach, the baseline cumulative hazard function is approximated by a monotone B‐spline function. We extend the generalized Rosen algorithm to compute the maximum likelihood estimate. We show that the estimator of the regression parameter is asymptotically normal and semiparametrically efficient, although the estimator of the baseline cumulative hazard function converges at a rate slower than root‐n. We also develop an easy‐to‐implement method for consistently estimating the standard error of the estimated regression parameter, which facilitates the proposed inference procedure for the Cox model with interval‐censored data. The proposed method is evaluated by simulation studies regarding its finite sample performance and is illustrated using data from a breast cosmesis study.  相似文献   

5.
This article deals with the issue of using a suitable pseudo-likelihood, instead of an integrated likelihood, when performing Bayesian inference about a scalar parameter of interest in the presence of nuisance parameters. The proposed approach has the advantages of avoiding the elicitation on the nuisance parameters and the computation of multidimensional integrals. Moreover, it is particularly useful when it is difficult, or even impractical, to write the full likelihood function.

We focus on Bayesian inference about a scalar regression coefficient in various regression models. First, in the context of non-normal regression-scale models, we give a theroetical result showing that there is no loss of information about the parameter of interest when using a posterior distribution derived from a pseudo-likelihood instead of the correct posterior distribution. Second, we present non trivial applications with high-dimensional, or even infinite-dimensional, nuisance parameters in the context of nonlinear normal heteroscedastic regression models, and of models for binary outcomes and count data, accounting also for possibile overdispersion. In all these situtations, we show that non Bayesian methods for eliminating nuisance parameters can be usefully incorporated into a one-parameter Bayesian analysis.  相似文献   

6.
Abstract.  We propose and study a class of regression models, in which the mean function is specified parametrically as in the existing regression methods, but the residual distribution is modelled non-parametrically by a kernel estimator, without imposing any assumption on its distribution. This specification is different from the existing semiparametric regression models. The asymptotic properties of such likelihood and the maximum likelihood estimate (MLE) under this semiparametric model are studied. We show that under some regularity conditions, the MLE under this model is consistent (when compared with the possibly pseudo-consistency of the parameter estimation under the existing parametric regression model), is asymptotically normal with rate and efficient. The non-parametric pseudo-likelihood ratio has the Wilks property as the true likelihood ratio does. Simulated examples are presented to evaluate the accuracy of the proposed semiparametric MLE method.  相似文献   

7.
This paper investigates several semiparametric estimators of the dispersion parameter in the analysis of over- or underdispersed count data when there is no likelihood available. In the context of estimating the dispersion parameter, we consider the double-extended quasi-likelihood (DEQL), the pseudo-likelihood and the optimal quadratic estimating (OQE) equations method and compare them with the maximum likelihood method, the method of moments and the extended quasi-likelihood through simulation study. The simulation study shows that the estimator based on the DEQL has superior bias and efficiency property for moderate and large sample size, and for small sample size the estimator based on the OQE equations outperforms the other estimators. Three real-life data sets arising in biostatistical practices are analyzed, and the findings from these analyses are quite similar to what are found from the simulation study.  相似文献   

8.
A formal semiparametric statistical inference framework is proposed for the evaluation of the age-dependent penetrance of a rare genetic mutation, using family data generated under a case-family design, where phenotype and genotype information are collected from first-degree relatives of case probands carrying the targeted mutation. The proposed approach allows for unobserved risk factors that are correlated among family members. Some rigorous large sample properties are established, which show that the proposed estimators were asymptotically semiparametric efficient. A simulation study is conducted to evaluate the performance of the new approach, which shows the robustness of the proposed semiparametric approach and its advantage over the corresponding parametric approach. As an illustration, the proposed approach is applied to estimating the age-dependent cancer risk among carriers of the MSH2 or MLH1 mutation.  相似文献   

9.
Let X1,… Xm be a random sample of m failure times under normal conditions with the underlying distribution F(x) and Y1,…,Yn a random sample of n failure times under accelerated condititons with underlying distribution G(x);G(x)=1?[1?F(x)]θ with θ being the unknown parameter under study.Define:Uij=1 otherwise.The joint distribution of ijdoes not involve the distribution F and thus can be used to estimate the acceleration parameter θ.The second approach for estimating θ is to use the ranks of the Y-observations in the combined X- and Y-samples.In this paper we establish that the rank of the Y-observations in the pooled sample form a sufficient statistic for the information contained in the Uii 's about the parameter θ and that there does not exist an unbiassed estimator for the parameter θ.We also construct several estimators and confidence interavals for the parameter θ.  相似文献   

10.
Let X1,…,Xr?1,Xr,Xr+1,…,Xn be independent, continuous random variables such that Xi, i = 1,…,r, has distribution function F(x), and Xi, i = r+1,…,n, has distribution function F(x?Δ), with -∞ <Δ< ∞. When the integer r is unknown, this is refered to as a change point problem with at most one change. The unknown parameter Δ represents the magnitude of the change and r is called the changepoint. In this paper we present a general review discussion of several nonparametric approaches for making inferences about r and Δ.  相似文献   

11.
In semiparametric inference we distinguish between the parameter of interest which may be a location parameter, and a nuisance parameter that determines the remaining shape of the sampling distribution. As was pointed out by Diaconis and Freedman the main problem in semiparametric Bayesian inference is to obtain a consistent posterior distribution for the parameter of interest. The present paper considers a semiparametric Bayesian method based on a pivotal likelihood function. It is shown that when the parameter of interest is the median, this method produces a consistent posterior distribution and is easily implemented, Numerical comparisons with classical methods and with Bayesian methods based on a Dirichlet prior are provided. It is also shown that in the case of symmetric intervals, the classical confidence coefficients have a Bayesian interpretation as the limiting posterior probability of the interval based on the Dirichlet prior with a parameter that converges to zero.  相似文献   

12.
In this study, we provide the Farlie–Gumbel–Morgenstern bivariate copula of rth and sth order statistics. The main emphasis in this study is on the inference procedure which is based on the maximum pseudo-likelihood estimate for the copula parameter. As for the methodology, goodness-of-fit test statistic for copulas which is based on a Cramér–von Mises functional of the empirical copula process is applied for selecting an appropriate model by bootstrapping. An application of the methodology to simulated data set is also presented.  相似文献   

13.
Kai B  Li R  Zou H 《Annals of statistics》2011,39(1):305-332
The complexity of semiparametric models poses new challenges to statistical inference and model selection that frequently arise from real applications. In this work, we propose new estimation and variable selection procedures for the semiparametric varying-coefficient partially linear model. We first study quantile regression estimates for the nonparametric varying-coefficient functions and the parametric regression coefficients. To achieve nice efficiency properties, we further develop a semiparametric composite quantile regression procedure. We establish the asymptotic normality of proposed estimators for both the parametric and nonparametric parts and show that the estimators achieve the best convergence rate. Moreover, we show that the proposed method is much more efficient than the least-squares-based method for many non-normal errors and that it only loses a small amount of efficiency for normal errors. In addition, it is shown that the loss in efficiency is at most 11.1% for estimating varying coefficient functions and is no greater than 13.6% for estimating parametric components. To achieve sparsity with high-dimensional covariates, we propose adaptive penalization methods for variable selection in the semiparametric varying-coefficient partially linear model and prove that the methods possess the oracle property. Extensive Monte Carlo simulation studies are conducted to examine the finite-sample performance of the proposed procedures. Finally, we apply the new methods to analyze the plasma beta-carotene level data.  相似文献   

14.
Given two independent non-degenerate positive random variables X and Y, Lukacs (1955) proved that X/(X+Y) and X+Y are independent if and only if X and Y are gammally distributed with the same scale parameter.In this work, properties of bivariate gamma distribution are studied. Certain regression version of Lukacs's theorem are given for the bivariate case. Furthermore, characterization of bivariate gamma distribution by the conditions of constancy regression of quadratic statistics is also given.  相似文献   

15.
Abstract.  Multivariate correlated failure time data arise in many medical and scientific settings. In the analysis of such data, it is important to use models where the parameters have simple interpretations. In this paper, we formulate a model for bivariate survival data based on the Plackett distribution. The model is an alternative to the Gamma frailty model proposed by Clayton and Oakes. The parameter in this distribution has a very appealing odds ratio interpretation for dependence between the two failure times; in addition, it allows for negative dependence. We develop novel semiparametric estimation and inference procedures for the model. The asymptotic results of the estimator are developed. The performance of the proposed techniques in finite samples is examined using simulation studies; in addition, the proposed methods are applied to data from an observational study in cancer.  相似文献   

16.
Using local kappa coefficients, we develop a method to assess the agreement between two discrete survival times that are measured on the same subject by different raters or methods. We model the marginal distributions for the two event times and local kappa coefficients in terms of covariates. An estimating equation is used for modeling the marginal distributions and a pseudo-likelihood procedure is used to estimate the parameters in the kappa model. The performance of the estimation procedure is examined through simulations. The proposed method can be extended to multivariate discrete survival distributions.  相似文献   

17.
An estimating equation for a parameter θ, based on an observation ?, is an equation g(x,θ)=0 which can be solved for θ in terms of x. An estimating equation is unbiased if the funaction g has 0 mean for every θ. For the case when the form of the frequency function p(x,θ) is completely specified up to the unknown real parameter θ, the optimality of the m.1 equation ?logp=0 in the class of all unbiased estimating equations was established by Godambe (1960). In this paper we allow the form of the frequency function p to vary assuming that x=(x1,…,xn)?Rn and that under p, E(xi)=θ. x1,…, xn are independent observations on a variate x, it is shown that among all the unbiased estimating equations for θ, x??θ=0 is uniquely optimum up to a constant multiple.  相似文献   

18.
ABSTRACT

In a model of the form Y = h(X1, …, Xd) where the goal is to estimate a parameter of the probability distribution of Y, we define new sensitivity indices which quantify the importance of each variable Xi with respect to this parameter of interest. The aim of this paper is to define goal oriented sensitivity indices and we will show that Sobol indices are sensitivity indices associated to a particular characteristic of the distribution Y. We name the framework we present as Goal Oriented Sensitivity Analysis (GOSA).  相似文献   

19.

There have been many advances in statistical methodology for the analysis of recurrent event data in recent years. Multiplicative semiparametric rate-based models are widely used in clinical trials, as are more general partially conditional rate-based models involving event-based stratification. The partially conditional model provides protection against extra-Poisson variation as well as event-dependent censoring, but conditioning on outcomes post-randomization can induce confounding and compromise causal inference. The purpose of this article is to examine the consequences of model misspecification in semiparametric marginal and partially conditional rate-based analysis through omission of prognostic variables. We do so using estimating function theory and empirical studies.

  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号