Empirical Bayes is a versatile approach to “learn from a lot” in two ways: first, from a large number of variables and, second, from a potentially large amount of prior information, for example, stored in public repositories. We review applications of a variety of empirical Bayes methods to several well‐known model‐based prediction methods, including penalized regression, linear discriminant analysis, and Bayesian models with sparse or dense priors. We discuss “formal” empirical Bayes methods that maximize the marginal likelihood but also more informal approaches based on other data summaries. We contrast empirical Bayes to cross‐validation and full Bayes and discuss hybrid approaches. To study the relation between the quality of an empirical Bayes estimator and p, the number of variables, we consider a simple empirical Bayes estimator in a linear model setting. We argue that empirical Bayes is particularly useful when the prior contains multiple parameters, which model a priori information on variables termed “co‐data”. In particular, we present two novel examples that allow for co‐data: first, a Bayesian spike‐and‐slab setting that facilitates inclusion of multiple co‐data sources and types and, second, a hybrid empirical Bayes–full Bayes ridge regression approach for estimation of the posterior predictive interval. 相似文献
We define the exponentiated power exponential distribution and propose a regression model with different systematic structures based on the new distribution. We show that the new regression model can be applied to dispersion data since it represents a parametric family of models that includes as sub-models some widely-known regression models. It then can be used more effectively in the analysis of real data. We use maximum likelihood estimation and derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes. Some global-influence measurements are also investigated and simulation studies are performed to evaluate the accuracy of the estimates. We provide an application of the regression model with four systematic structures to nursing activities score data in the Unit of the Medical Clinic of University of São Paulo (USP) Hospital. 相似文献
In this article, we introduce tempered Mittag-Leffler Lévy processes (TMLLP). TMLLP is represented as tempered stable subordinator delayed by a gamma process. Its probability density function and Lévy density are obtained in terms of infinite series and Mittag-Leffler function, respectively. Asymptotic forms of the tails and moments are given. A step-by-step procedure of the parameters estimation and simulation of sample paths is given. We also provide main results available for Mittag-Leffler Lévy processes (MLLP) and some extensions which are not available in a collective way in a single article. Our results generalize and complement the results available on Mittag-Leffler distribution and MLLP in several directions. Further, the asymptotic forms of the moments of the first-exit times of the TMLLP are also discussed. 相似文献
In this article, we consider the prediction of future failure times based on Type-I hybrid censored samples. Point predictors and prediction intervals using different procedures are discussed for a general model. The exponential and Rayleigh distributions are used as illustrative examples to show the most simplified forms of the so obtained predictors as well as prediction intervals. Intensive simulation study and a real life dataset are presented to illustrate our findings and results. 相似文献
In this paper, a comparison between the life distribution of a new unit with that of a used unit in the increasing convex order is made leading to a new class of life distributions which we call “new better than used in convex ordering of second order”. This class includes as subclasses the NBU and the NBUC and is a subclass of the NBUCA class. Preservation properties under convolution, random maxima, mixing and formation of coherent structures are established. Stochastic comparisons of the excess lifetime when the inter-arrival times belong to the NBUC(2) class are developed. Some applications of Poisson shock models and a test of exponentiality against NBUC(2) alternative are presented. 相似文献
We analyze a class of linear regression models including interactions of endogenous regressors and exogenous covariates. We show how to generate instrumental variables using the nonlinear functional form of the structural equation when traditional excluded instruments are unknown. We propose to use these instruments with identification robust IV inference. We furthermore show that, whenever functional form identification is not valid, the ordinary least squares (OLS) estimator of the coefficient of the interaction term is consistent and standard OLS inference applies. Using our alternative empirical methods we confirm recent empirical findings on the nonlinear causal relation between financial development and economic growth. 相似文献
Lifetime Data Analysis - CD4-based multi-state back-calculation methods are key for monitoring the HIV epidemic, providing estimates of HIV incidence and diagnosis rates by disentangling their... 相似文献
There have been many advances in statistical methodology for the analysis of recurrent event data in recent years. Multiplicative semiparametric rate-based models are widely used in clinical trials, as are more general partially conditional rate-based models involving event-based stratification. The partially conditional model provides protection against extra-Poisson variation as well as event-dependent censoring, but conditioning on outcomes post-randomization can induce confounding and compromise causal inference. The purpose of this article is to examine the consequences of model misspecification in semiparametric marginal and partially conditional rate-based analysis through omission of prognostic variables. We do so using estimating function theory and empirical studies.
We present an algorithm for learning oblique decision trees, called HHCART(G). Our decision tree combines learning concepts from two classification trees, HHCART and Geometric Decision Tree (GDT). HHCART(G) is a simplified HHCART algorithm that uses linear structure in the training examples, captured by a modified GDT angle bisector, to define splitting directions. At each node, we reflect the training examples with respect to the modified angle bisector to align this linear structure with the coordinate axes. Searching axis parallel splits in this reflected feature space provides an efficient and effective way of finding oblique splits in the original feature space. Our method is much simpler than HHCART because it only considers one reflected feature space for node splitting. HHCART considers multiple reflected feature spaces for node splitting making it more computationally intensive to build. Experimental results show that HHCART(G) is an effective classifier, producing compact trees with similar or better results than several other decision trees, including GDT and HHCART trees. 相似文献