共查询到20条相似文献,搜索用时 31 毫秒
1.
Junyong Park Jayson D. Wilbur Jayanta K. Ghosh Cindy H. Nakatsu Corinne Ackerman 《统计学通讯:模拟与计算》2013,42(4):855-869
We adopt boosting for classification and selection of high-dimensional binary variables for which classical methods based on normality and non singular sample dispersion are inapplicable. Boosting seems particularly well suited for binary variables. We present three methods of which two combine boosting with the relatively classical variable selection methods developed in Wilbur et al. (2002). Our primary interest is variable selection in classification with small misclassification error being used as validation of proposed method for variable selection. Two of the new methods perform uniformly better than Wilbur et al. (2002) in one set of simulated and three real life examples. 相似文献
2.
Shesh N. Rai Jianmin Pan Xiaobin Yuan Jianguo Sun Melissa M. Hudson Deo K. Srivastava 《统计学通讯:理论与方法》2013,42(17):3117-3133
New drug discovery in the pediatrics has dramatically improved survival, but with long- term adverse events. This motivates the examination of adverse outcomes such as long-term toxicity in a phase IV trial. An ideal approach to monitor long-term toxicity is to systematically follow the survivors, which is generally not feasible. Instead, cross-sectional surveys are conducted in Hudson et al. (2007), with one of the objectives to estimate the cumulative incidence rates along with specific interest in fixed-term (5 or 10 year) rates. We present inference procedures based on current status data to our motivating example with very interesting findings. 相似文献
3.
In this article, we directly introduce the continuous version of the general discrete triangular distributions (Kokonendji and Zocchi, 2010). It is bounded and, in general, unimodal with pike. It contains thus a very useful class of two-sided power distributions (van Dorp and Kotz, 2002a,b, 2003). Moments, particular cases, limit distributions, and relations between parameters are straightforwardly derived. 相似文献
4.
A Bottom-Up Dynamic Model of Portfolio Credit Risk with Stochastic Intensities and Random Recoveries
Tomasz R. Bielecki Areski Cousin Stéphane Crépey Alexander Herbertsson 《统计学通讯:理论与方法》2014,43(7):1362-1389
In Bielecki et al. (2014a), the authors introduced a Markov copula model of portfolio credit risk where pricing and hedging can be done in a sound theoretical and practical way. Further theoretical backgrounds and practical details are developed in Bielecki et al. (2014b,c) where numerical illustrations assumed deterministic intensities and constant recoveries. In the present paper, we show how to incorporate stochastic default intensities and random recoveries in the bottom-up modeling framework of Bielecki et al. (2014a) while preserving numerical tractability. These two features are of primary importance for applications like CVA computations on credit derivatives (Assefa et al., 2011; Bielecki et al., 2012), as CVA is sensitive to the stochastic nature of credit spreads and random recoveries allow to achieve satisfactory calibration even for “badly behaved” data sets. This article is thus a complement to Bielecki et al. (2014a), Bielecki et al. (2014b) and Bielecki et al. (2014c). 相似文献
5.
《统计学通讯:理论与方法》2013,42(11):2227-2244
Abstract Monotone failure rate models [Barlow Richard, E., Marshall, A. W., Proschan, Frank. (1963). Properties of probability distributions with monotone failure rate. Annals of Mathematical Statistics 34:375–389, and Barlow Richard, E., Proschan, Frank. (1965). Mathematical Theory of Reliability. New York: John Wiley & Sons, Barlow Richard, E., Proschan, Frank. (1966a). Tolerance and confidence limits for classes of distributions based on failure rate. Annals of Mathematical Statistics 37(6):1593–1601, Barlow Richard, E., Proschan, Frank. (1966b). Inequalities for linear combinations of order statistics from restricted families. Annals of Mathematical Statistics 37(6):1574–1592, Barlow Richard, E., Proschan, Frank. (1975). Statistical Theory of Reliability and Life Testing. New York: Holt, Rinehart and Winston, Inc.] have become one of the most important models of failure time for reliability practitioners to consider and use. The above authors also developed models and bounds for monotone increasing failure rates (IFR) and for monotone decreasing failure rates (DFR). The IFR models and bounds appear to be especially useful for describing and bounding the hazard of aging. This article extends a new model for time to failure based onthe log odds rate [Zimmer William, J., Wang Yao, Pathak, P. K. (1998). Log-odds rate and monotone log-odds rate distributions. Journal of Quality Technology 30(4):376–385.] which is comparable to the model based on the failure rate. It is shown that in the case of increasing log odds rate (ILOR) in terms of log time (ln t), the model is less stringent than the IFR model for aging. The characterization of distributions of failure time by log odds rate is also derived. It is shown that the logistic distribution has the property of constant log odds rate over time and that the log logistic distribution has the property of constant log odds rate with respect to ln t. Some other properties of ILOR distributions are presented and bounds based on the relationship to the log logistic distribution are provided for distributions which are ILOR with respect to ln t. Motivational examples are provided. The ILOR bounds are compared to the more stringent bounds based on the IFR model. Bounds on system reliability are also provided for certain systems. 相似文献
6.
Mike G. Tsionas 《统计学通讯:理论与方法》2018,47(12):3022-3028
The properties of high-dimensional Bingham distributions have been studied by Kume and Walker (2014). Fallaize and Kypraios (2016) propose the Bayesian inference for the Bingham distribution and they use developments in Bayesian computation for distributions with doubly intractable normalizing constants (Møller et al. 2006; Murray, Ghahramani, and MacKay 2006). However, they rely heavily on two Metropolis updates that they need to tune. In this article, we propose instead a model selection with the marginal likelihood. 相似文献
7.
Rodrigo R. Pescim Edwin M. M. Ortega Gauss M. Cordeiro Morad Alizadeh 《Journal of applied statistics》2017,44(2):233-252
We introduce a log-linear regression model based on the odd log-logistic generalized half-normal distribution [7]. Some of its structural properties including explicit expressions for the density function, quantile and generating functions and ordinary moments are derived. We estimate the model parameters by the maximum likelihood method. For different parameter settings, proportion of censoring and sample size, some simulations are performed to investigate the behavior of the estimators. We derive the appropriate matrices for assessing local influence diagnostics on the parameter estimates under different perturbation schemes. We also define the martingale and modified deviance residuals to detect outliers and evaluate the model assumptions. In addition, we demonstrate that the extended regression model can be very useful in the analysis of real data and provide more realistic fits than other special regression models. The potentiality of the new regression model is illustrated by means of a real data set. 相似文献
8.
ABSTRACTThe present article is an attempt to explore the rotation patterns using exponential ratio type estimators for the estimation of finite population median at current occasion in two occasion rotation sampling. Properties of the proposed estimators including the optimum replacement strategies have been elaborated. The proposed estimators have been compared with sample median estimator when there is no matching from previous occasion as well with the ratio type estimator proposed by Singh et al. (2007) for second quantile. The behaviors of the proposed estimators are justified by empirical interpretations and validated by means of simulation study with the help of some natural populations. 相似文献
9.
This article proposes a marginalized model for repeated or otherwise hierarchical, overdispersed time-to-event outcomes, adapting the so-called combined model for time-to-event outcomes of Molenberghs et al. (in press), who combined gamma and normal random effects. The two sets of random effects are used to accommodate simultaneously correlation between repeated measures and overdispersion. The proposed version allows for a direct marginal interpretation of all model parameters. The outcomes are allowed to be censored. Two estimation methods are proposed: full likelihood and pairwise likelihood. The proposed model is applied to data from a so-called comet assay and to data from recurrent asthma attacks in children. Both estimation methods perform very well. From simulation results, it follows that the marginalized combined model behaves similarly to the ordinary combined model in terms of point estimation and precision. It is also observed that the pairwise likelihood required more computation time on the one hand but is less sensitive to starting values and stabler in terms of bias with increasing sample size and censoring percentage than full likelihood, on the other, leaving room for both in practice. 相似文献
10.
This paper is based on the application of a Bayesian model to a clinical trial study to determine a more effective treatment to lower mortality rates and consequently to increase survival times among patients with lung cancer. In this study, Qian et al. [13] strived to determine if a Weibull survival model can be used to decide whether to stop a clinical trial. The traditional Gibbs sampler was used to estimate the model parameters. This paper proposes to use the independent steady-state Gibbs sampling (ISSGS) approach, introduced by Dunbar et al. [3], to improve the original Gibbs sampler in multidimensional problems. It is demonstrated that ISSGS provides accuracy with unbiased estimation and improves the performance and convergence of the Gibbs sampler in this application. 相似文献
11.
Simard et al. [16 17] proposed a transformation distance called “tangent distance” (TD) which can make pattern recognition be efficient. The key idea is to construct a distance measure which is invariant with respect to some chosen transformations. In this research, we provide a method using adaptive TD based on an idea inspired by “discriminant adaptive nearest neighbor” [7]. This method is relatively easy compared with many other complicated ones. A real handwritten recognition data set is used to illustrate our new method. Our results demonstrate that the proposed method gives lower classification error rates than those by standard implementation of neural networks and support vector machines and is as good as several other complicated approaches. 相似文献
12.
Viswanathan Ramakrishnan 《统计学通讯:模拟与计算》2013,42(3):405-418
In many genetic analyses of dichotomous twin data, odds ratios have been used to test hypotheses on heritability and shared common environment effects of a given disease (Lichtenstein et al., 2000; Ahlbom et al., 1997; Ramakrishnan et al., 1992, 4). However, estimates of these two effects have not been dealt with in the literature. In epidemiology, the attributable fraction (AF), a function of the odds ratio and the prevalence of the risk factor has been used to describe the contribution of a risk factor to a disease in a given population (Leviton, 1973). In this article, we adapt the AF to quantify the heritability and the shared common environment. Twin data on cancer, gallstone disease and phobia are used to illustrate the applicability of the AF estimate as a measure of heritability. 相似文献
13.
In this article, another version of the generalized exponential geometric distribution different to that of Silva et al. (2010) is proposed. This new three-parameter lifetime distribution with decreasing, increasing, and bathtub failure rate function is created by compounding the generalized exponential distribution of Gupta and Kundu (1999) with a geometric distribution. Some basic distributional properties, moment-generating function, rth moment, and Rényi entropy of the new distribution are studied. The model parameters are estimated by the maximum likelihood method and the asymptotic distribution of estimators is discussed. Finally, an application of the new distribution is illustrated using the two real data sets. 相似文献
14.
This article extends the correlation methodology developed by Chinchilli et al. (2005) for the 2 × 2 crossover design to more complex crossover designs for clinical trials. We describe how the methodology can be adapted to a general type of two-treatment crossover design which includes either at least two sequences or at least two treatment periods or both. We then derive the asymptotic theory for the corresponding correlation statistics, investigate the statistical accuracy of the estimators via bootstrap analyses, and demonstrate their use with two real data examples. 相似文献
15.
Modeling Electricity Price Using A Threshold Conditional Autoregressive Geometric Process Jump Model
Electricity market prices are highly volatile and often have high spikes. Both government authorities and market participants require sophisticated models and techniques for forecasting future prices and managing relevant financial risks in such a volatile market. This article extends the conditional autoregressive geometric process (CARGP) model (Chan et al., 2012) to the CARGP model with thresholds and jumps, which is abbreviated as CARGP-TJ model in this article. We will demonstrate that the proposed CARGP-TJ model not only captures the unique features of the electricity price but also performs better than other existing models. For robustness consideration, a heavy-tailed error distribution is adopted. Model implementation relies on the powerful Bayesian Markov chain Monte Carlo simulation techniques via WinBUGS software. The analysis of the daily maximum electricity prices of the New South Wales, Australia reveals that the proposed CARGP-TJ model captures the price spikes well for both in-sample estimation and out-of-sample forecast. 相似文献
16.
The inverse Gaussian distribution is often suited for modeling positive and/or positively skewed data (see Chhikara and Folks, 1989) and presents an interesting alternative to the Gaussian model in such cases. We note here that overlap coefficients and their variants are widely studied in the literature for Gaussian populations (see Mulekar and Mishra, 1994, 2000, and references therein for further details). This article studies the properties and addresses point estimation for large samples of commonly used measures of overlap when the populations are described by inverse Gaussian distributions. The bias and mean square error properties of the estimators are studied through a simulation study. 相似文献
17.
This paper addresses a generalization of the bivariate Cauchy distribution discussed by Fang et al. (1990), derived from a trivariate normal distribution with a general correlation matrix. We obtain explicit expressions for the joint distribution function and joint density function, and show that they reduce in a special case to the corresponding expressions of Fang et al. (1990). Finally, we show that this generalized distribution is useful in determining the orthant probability of a bivariate skew-normal distribution of Azzalini and Dalla Valle (1996). 相似文献
18.
Samridhi Mehta 《统计学通讯:理论与方法》2018,47(16):4021-4028
Sihm et al. (2016) proposed an unrelated question binary optional randomized response technique (RRT) model for estimating the proportion of population that possess a sensitive characteristic and the sensitivity level of the question. In our work, decision theoretic approach has been followed to obtain Bayes estimates of the two parameters along with their corresponding minimal Bayes posterior expected losses (BPEL) using beta prior and squared error loss function (SELF). Relative losses are also examined to compare the performances of the Bayes estimates with those of the classical estimates obtained by Sihm et al. (2016). The results obtained are illustrated with the help of real survey data using non informative prior. 相似文献
19.
Pao-Sheng Shen 《统计学通讯:模拟与计算》2013,42(3):603-612
In this article, we consider the M-estimators for the linear regression model when both response and covariate variables are subject to double censoring. The proposed estimators are constructed as some functional of three types of estimators for a bivariate survival distribution. The first two estimators are the generalizations of the Campbell and Földes (1982) and Dabrowska (1988) estimators proposed by Shen (2009). The third estimator is the generalization of the Prentice and Cai (1992) estimator. The consistency of the proposed M-estimators is established. A simulation study is conducted to investigate the performance of the proposed estimators. Furthermore, the simple bootstrap methods are used to estimate standard deviations and construct interval estimators. 相似文献
20.
Abdullah Yilmaz 《统计学通讯:理论与方法》2013,42(23):7053-7059
ABSTRACTSkew-symmetric distributions have been discussed by several research-ers. In this article we construct a skew-symmetric Laplace distribution, which is the generalization of distribution given by Ali et al. (2009) and Nekoukhou and Alamatsaz (2012). This new distribution contains more parameters, and this induces flexibility properties, such as unimodality or bimodality. We study on some properties of this distribution. In the last section we also provide an application with a real data. Concerning example has recently been discussed by Nekoukhou et al. (2013) to apply to their model. We compare the behavior of our distribution to their distribution on this example. 相似文献