Similar literature
20 similar articles found (search time: 15 ms).
1.
A random effects model for analyzing mixed longitudinal count and ordinal data is presented, in which the count response is inflated at two points (k and l) and follows a (k,l)-inflated power series distribution. A full likelihood-based approach is used to obtain maximum likelihood estimates of the model parameters. For data with non-ignorable missing values, models with a probit model for the missingness mechanism are used. The dependence between the longitudinal sequences of responses and the inflation parameters is investigated using a random effects approach. In addition, a shared random effect is used to capture the correlation between the ordinal and count responses of each individual at each time. To assess the performance of the model, a simulation study is performed for the case in which the count response has a (k,l)-inflated binomial distribution. Performance comparisons of the count-ordinal random effects model, the zero-inflated ordinal random effects model and the (k,l)-inflated ordinal random effects model are also given. The model is applied to a real social data set from the first two waves of the National Longitudinal Study of Adolescent to Adult Health (Add Health). In this data set, the joint responses are the number of days in a month on which each individual smoked (the count response) and the general health condition of each individual (the ordinal response). The count response shows an excess of the values 0 and 30.
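
As a rough illustration of the inflation idea in this abstract, the Python sketch below evaluates a (k,l)-inflated binomial probability mass function written as a mixture: extra point masses at k and l plus a scaled binomial. The function names, the mixture weights `pi_k` and `pi_l`, and the example values are assumptions made for illustration. They are not the paper's notation, and the sketch leaves out the random effects and the ordinal response entirely.

```python
from math import comb


def binomial_pmf(y, n, p):
    """Binomial(n, p) probability of observing y successes."""
    if y < 0 or y > n:
        return 0.0
    return comb(n, y) * p**y * (1 - p) ** (n - y)


def kl_inflated_binomial_pmf(y, n, p, k, l, pi_k, pi_l):
    """Mixture form (assumed): mass pi_k at k, mass pi_l at l,
    and the remaining weight 1 - pi_k - pi_l on a Binomial(n, p)."""
    if pi_k < 0 or pi_l < 0 or pi_k + pi_l > 1:
        raise ValueError("inflation weights must be non-negative and sum to at most 1")
    prob = (1 - pi_k - pi_l) * binomial_pmf(y, n, p)
    if y == k:
        prob += pi_k  # extra mass at the first inflation point
    if y == l:
        prob += pi_l  # extra mass at the second inflation point
    return prob
```

With n = 30, k = 0 and l = 30 this mimics the "days smoked in a month" setting, where both never-smokers and daily smokers are over-represented relative to a plain binomial.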

2.
A random effects model for analyzing mixed longitudinal normal and count outcomes, with and without the possibility of non-ignorable missing outcomes, is presented. The count response is inflated at two points (k and l) and the (k, l)-hurdle power series distribution is used to model it. The new distribution contains several important distributions as special submodels, among them the (k, l)-hurdle Poisson, (k, l)-hurdle negative binomial and (k, l)-hurdle binomial distributions, which are discussed. Random effects are used to take into account the correlation between the longitudinal outcomes and the inflation parameters. A full likelihood-based approach is used to obtain maximum likelihood estimates of the model parameters. A simulation study is performed in which the (k, l)-hurdle Poisson, (k, l)-hurdle negative binomial and (k, l)-hurdle binomial distributions are considered for the count outcome. To illustrate the application of such modelling, longitudinal data on body mass index and the number of damaged joints are analyzed.
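
To make the hurdle construction more concrete, here is a minimal Python sketch of a (k, l)-hurdle Poisson probability mass function. It assumes one common two-part form: the probabilities at k and l are set directly to `pi_k` and `pi_l`, and the remaining mass is spread over the other counts in proportion to a Poisson distribution truncated at k and l. The paper's own parameterization may differ, and all names here are hypothetical.

```python
from math import exp, factorial


def poisson_pmf(y, lam):
    """Poisson(lam) probability of the count y."""
    if y < 0:
        return 0.0
    return exp(-lam) * lam**y / factorial(y)


def kl_hurdle_poisson_pmf(y, lam, k, l, pi_k, pi_l):
    """Two-part (hurdle) form, assumed for illustration:
    P(Y = k) = pi_k, P(Y = l) = pi_l, and every other count shares
    the weight 1 - pi_k - pi_l via a Poisson truncated at k and l."""
    if k == l:
        raise ValueError("the two hurdle points must differ")
    if pi_k < 0 or pi_l < 0 or pi_k + pi_l > 1:
        raise ValueError("hurdle probabilities must be non-negative and sum to at most 1")
    if y == k:
        return pi_k
    if y == l:
        return pi_l
    # Poisson mass left after removing the two hurdle points
    remaining = 1.0 - poisson_pmf(k, lam) - poisson_pmf(l, lam)
    return (1.0 - pi_k - pi_l) * poisson_pmf(y, lam) / remaining
```

In this form the probabilities at k and l are exactly `pi_k` and `pi_l`, so they can be smaller as well as larger than the Poisson values.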

3.
Inflated data are prevalent in many situations, and a variety of inflated models with extensions have been derived to fit data with excessive counts of particular responses. The family of information criteria (IC) has been used to compare the fit of models for selection purposes. Yet despite their common use in statistical applications, few studies have evaluated the performance of IC in inflated models. In this study, we examined the performance of IC for dual-inflated data. The new zero- and K-inflated Poisson (ZKIP) regression model and conventional models, including Poisson regression and zero-inflated Poisson (ZIP) regression, were fitted to dual-inflated data and the performance of the IC was compared. The effects of sample size and of the proportion of inflated observations on selection performance were also examined. The results suggest that the Bayesian information criterion (BIC) and consistent Akaike information criterion (CAIC) are more accurate than the Akaike information criterion (AIC) in terms of model selection when the true model is simple (i.e. Poisson regression (POI)). For more complex models, such as ZIP and ZKIP, the AIC was consistently better than the BIC and CAIC, although it did not reach high levels of accuracy when the sample size and the proportion of zero observations were small. The AIC tended to over-fit the data for the POI, whereas the BIC and CAIC tended to under-parameterize the data for ZIP and ZKIP. Therefore, it is desirable to study other model selection criteria for dual-inflated data with small sample sizes.
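
For reference, the three criteria compared in this abstract can be computed from a fitted model's maximized log-likelihood using their standard textbook definitions. The short Python sketch below shows this. The function name and the example numbers are illustrative assumptions and are not taken from the study.

```python
from math import log


def information_criteria(log_likelihood, n_params, n_obs):
    """Return AIC, BIC and CAIC for a fitted model.

    log_likelihood: maximized log-likelihood of the model
    n_params:       number of estimated parameters
    n_obs:          sample size
    """
    deviance = -2.0 * log_likelihood
    return {
        "AIC": deviance + 2.0 * n_params,
        "BIC": deviance + n_params * log(n_obs),
        "CAIC": deviance + n_params * (log(n_obs) + 1.0),
    }


# Hypothetical fit: log-likelihood -100 with 3 parameters on 50 observations.
criteria = information_criteria(-100.0, 3, 50)
```

The per-parameter penalty is 2 for AIC, ln(n) for BIC and ln(n) + 1 for CAIC. Once n is at least 8, BIC and CAIC penalize extra parameters more heavily than AIC, which is consistent with the under-parameterization tendency reported for BIC and CAIC.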

4.
Let $\{x_{ij}\ (1 \le j \le n_i) \mid i = 1, 2, \dots, k\}$ be k independent samples of sizes $n_j$ from respective distribution functions $F_j(x)$ $(1 \le j \le k)$. A classical statistical problem is to test whether these k samples came from a common distribution function $F(x)$, whose form may or may not be known. In this paper, we consider the complementary problem of estimating distribution functions suspected to be homogeneous, in order to improve on the basic estimator known as the "empirical distribution function" (edf), in an asymptotic setup. Accordingly, we consider four additional estimators, namely the restricted estimator (RE), the preliminary test estimator (PTE), the shrinkage estimator (SE) and the positive rule shrinkage estimator (PRSE), and study their characteristic properties based on the mean squared error (MSE) and relative risk efficiency (RRE), with tables and graphs. We observed that for $k \ge 4$ the positive rule SE performs uniformly better than both the shrinkage and the unrestricted estimators, while the PTE works reasonably well for $k < 4$.

5.
Suppose there are $k_1$ ($k_1 \ge 1$) test treatments that we wish to compare with $k_2$ ($k_2 \ge 1$) control treatments. Assume that the observations from the ith test treatment and the jth control treatment follow two-parameter exponential distributions with a common scale parameter $\theta$ and treatment-specific location parameters, $i = 1, \dots, k_1$; $j = 1, \dots, k_2$. In this paper, simultaneous one-sided and two-sided confidence intervals are proposed for all $k_1 k_2$ differences between the test treatment and control treatment location parameters, and the required critical points are provided. Multiple comparisons of all test treatments with the best control treatment and an optimal sample size allocation are discussed. Finally, it is shown that the critical points obtained can be used to construct simultaneous confidence intervals for Pareto distribution location parameters.

6.
The zero-inflated power series distribution is commonly used for modelling count data with extra zeros. Inflation at zero has been investigated and several tests for zero inflation have been examined. Sometimes, however, inflation occurs at a point other than zero. In this case, we say that inflation occurs at an arbitrary point j. The j-inflation has been discussed less than zero inflation. In this paper, inflation at an arbitrary point j is studied in more detail and a Bayesian test for detecting inflation at point j is presented. The Bayesian method is extended to inflation at arbitrary points i and j. The relationship between the distribution for inflation at point j, inflation at points i and j, and missing value imputation is studied. It is shown how to obtain a proper estimate of the population variance if a mean-imputed, missing-at-random data set is used. Some simulation studies are conducted and the proposed Bayesian test is applied to two real data sets.

7.
Dependent multivariate count data occur in several research studies. These data can be modelled by a multivariate Poisson or negative binomial distribution constructed using copulas. However, when some of the counts are inflated, that is, when the number of observations in some cells is much larger than in other cells, the copula-based multivariate Poisson (or negative binomial) distribution may not fit well and is not an appropriate statistical model for the data. The multivariate distribution needs to be modified or adjusted to account for the inflated frequencies. In this article, we consider the situation where the frequencies of two cells are higher than those of the other cells and develop a doubly inflated multivariate Poisson distribution function using a multivariate Gaussian copula. We also discuss procedures for regression on covariates for doubly inflated multivariate count data. To illustrate the proposed methodologies, we present real data containing bivariate count observations with inflation in two cells. Several models and linear predictors with log link functions are considered, and we discuss maximum likelihood estimation of the unknown model parameters.

8.
This paper introduces several forms of the nested bivariate zero-inflated generalized Poisson (BZIGP) regression model, which can be fitted to bivariate and zero-inflated count data. The main advantage of having several forms of the BZIGP regression model is that they are nested, which allows likelihood ratio tests to be performed for choosing the best model. In addition, the BZIGP regression models have flexible forms of the marginal mean–variance relationship, can be fitted to bivariate and zero-inflated count data with positive or negative correlations, and allow additional overdispersion of the two response variables. The BZIGP regression models are fitted to the Australian Health Survey data.

9.
10.
A generalized k-out-of-n system consists of N modules in which the ith module is composed of $n_i$ components in parallel. The system fails when at least f components in the whole system or at least k consecutive modules have failed. In this article, we obtain the mean residual life function of such a generalized k-out-of-n system under different conditions, namely, when the number of components in each module is equal or unequal and when the components of the system are independent or exchangeable.

11.
Data sets with excess zeroes are frequently analyzed in many disciplines. A common framework used to analyze such data is the zero-inflated (ZI) regression model. It mixes a degenerate distribution with point mass at zero with a non-degenerate distribution. The estimates from ZI models quantify the effects of covariates on the means of latent random variables, which are often not the quantities of primary interest. Recently, marginal zero-inflated Poisson (MZIP; Long et al. [A marginalized zero-inflated Poisson regression model with overall exposure effects. Stat. Med. 33 (2014), pp. 5151–5165]) and negative binomial (MZINB; Preisser et al., 2016) models have been introduced that model the mean response directly. These models yield covariate effects that have simple interpretations that are, for many applications, more appealing than those available from ZI regression. This paper outlines a general framework for marginal zero-inflated models where the latent distribution is a member of the exponential dispersion family, focusing on common distributions for count data. In particular, our discussion includes the marginal zero-inflated binomial (MZIB) model, which has not been discussed previously. The details of maximum likelihood estimation via the EM algorithm are presented and the properties of the estimators as well as Wald and likelihood ratio-based inference are examined via simulation. Two examples presented illustrate the advantages of MZIP, MZINB, and MZIB models for practical data analysis.
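
The distinction between the latent mean and the marginal mean can be seen in a few lines of Python. The sketch assumes the usual zero-inflated Poisson mixture, with zero-inflation probability `psi` and latent Poisson mean `mu`, whose overall mean is (1 - psi) * mu. The symbols `psi`, `mu`, `nu` and the function names are illustrative assumptions. The regression structure and the EM algorithm from the abstract are not shown.

```python
from math import exp, factorial


def zip_pmf(y, psi, mu):
    """Zero-inflated Poisson: point mass psi at zero mixed with Poisson(mu)."""
    poisson = exp(-mu) * mu**y / factorial(y)
    return (psi if y == 0 else 0.0) + (1.0 - psi) * poisson


def latent_mean_from_marginal(nu, psi):
    """Recover the latent Poisson mean mu from the marginal mean nu = (1 - psi) * mu."""
    return nu / (1.0 - psi)


# A marginal model specifies nu directly; mu is then implied by nu and psi.
psi, nu = 0.3, 2.1
mu = latent_mean_from_marginal(nu, psi)
marginal_mean = sum(y * zip_pmf(y, psi, mu) for y in range(80))
```

Here `marginal_mean` agrees with `nu` up to truncation of the sum. A covariate effect on `nu` therefore describes the whole population, whereas an effect on `mu` describes only the latent, non-degenerate component.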

12.
Count data have emerged in many applied research areas, and in recent years there has been considerable interest in models for such data. In modelling count data, it is common to face a large frequency of zeroes. The data are regarded as zero-inflated when the frequency of observed zeroes is larger than expected under a theoretical distribution such as the Poisson distribution, the standard model for analysing count data. Analysing such data with the simple Poisson model may lead to over-dispersion. Several classes of mixture models have been proposed for handling zero-inflated data, but they do not apply when inflated counts occur at some other point in addition to zero. For these cases, a doubly-inflated Poisson model has been suggested, but it can only be used for cross-sectional data and cannot account for correlations between observations. However, correlated count data are widely encountered, especially in the health and medical fields. The present study introduces a doubly-inflated Poisson model with random effects for correlated doubly-inflated data. The good performance of the proposed method is then shown in different simulation scenarios. Finally, the proposed model is applied to a dental study.
KEYWORDS: Count data, doubly-inflated, Poisson regression, zero-inflated, correlated data

13.
Let the data from the ith treatment/population follow a distribution with cumulative distribution function (cdf) $F_i(x) = F[(x - \mu_i)/\theta_i]$, $i = 1, \dots, k$ ($k \ge 2$). Here $\mu_i$ ($-\infty < \mu_i < \infty$) is the location parameter, $\theta_i$ ($\theta_i > 0$) is the scale parameter and $F(\cdot)$ is any absolutely continuous cdf, i.e., $F_i(\cdot)$ is a member of the location-scale family, $i = 1, \dots, k$. In this paper, we propose a class of tests of the null hypothesis $H_0: \theta_1 = \cdots = \theta_k$ against the simple ordered alternative $H_A: \theta_1 \le \cdots \le \theta_k$ with at least one strict inequality. In the literature, the sample quasi range has been advocated as a measure of dispersion for small sample sizes or samples contaminated by outliers [see David, H. A. (1981). Order Statistics. 2nd ed. New York: John Wiley, Sec. 7.4]. Let $X_{i1}, \dots, X_{in}$ be a random sample of size n from the population $\pi_i$ and let $R_{ir} = X_{i:n-r} - X_{i:r+1}$, $r = 0, 1, \dots, [n/2] - 1$, be the sample quasi range corresponding to this random sample, where $X_{i:j}$ represents the jth order statistic in the ith sample, $j = 1, \dots, n$; $i = 1, \dots, k$, and $[x]$ is the greatest integer less than or equal to x. The proposed class of tests, for the general location-scale setup, is based on the statistic $W_r = \max_{1 \le i < j \le k} (R_{jr}/R_{ir})$. The test rejects $H_0$ for large values of $W_r$. The construction of a three-decision procedure and of simultaneous one-sided lower confidence bounds for the ratios $\theta_j/\theta_i$, $1 \le i < j \le k$, is also discussed with the help of the critical constants of the test statistic $W_r$. Applications of the proposed class of tests to the two-parameter exponential and uniform probability models are discussed separately, with the necessary tables. Comparisons of some members of our class with the tests of Gill and Dhawan [Gill A. N., Dhawan A. K. (1999). A One-sided test for testing homogeneity of scale parameters against ordered alternative. Commun. Stat. – Theory and Methods 28(10):2417–2439] and Kochar and Gupta [Kochar, S. C., Gupta, R. P. (1985). A class of distribution-free tests for testing homogeneity of variances against ordered alternatives. In: Dykstra, R. et al., ed. Proceedings of the Conference on Advances in Order Restricted Statistical Inference at Iowa city. Springer Verlag, pp. 169–183], in terms of simulated power, are also presented.
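
The quasi-range statistic defined in this abstract is simple to compute. The Python sketch below follows the definitions given there: the quasi range is the (n - r)th order statistic minus the (r + 1)th, and the test statistic is the largest ratio of a later sample's quasi range to an earlier sample's. The function names and the toy data are illustrative assumptions. Critical constants are not included, since they come from the paper's tables.

```python
def quasi_range(sample, r):
    """Sample quasi range: (n - r)th order statistic minus (r + 1)th order statistic."""
    n = len(sample)
    if not 0 <= r <= n // 2 - 1:
        raise ValueError("r must satisfy 0 <= r <= [n/2] - 1")
    ordered = sorted(sample)
    # With 0-based indexing, the (n - r)th order statistic is ordered[n - r - 1]
    # and the (r + 1)th order statistic is ordered[r].
    return ordered[n - r - 1] - ordered[r]


def w_statistic(samples, r):
    """W_r: maximum over pairs i < j of (quasi range of sample j) / (quasi range of sample i)."""
    ranges = [quasi_range(sample, r) for sample in samples]
    # A zero quasi range in the denominator (tied observations) raises ZeroDivisionError.
    return max(
        ranges[j] / ranges[i]
        for i in range(len(ranges))
        for j in range(i + 1, len(ranges))
    )


# Toy data: three samples, listed in the order assumed under the alternative.
samples = [[1, 2, 3, 4], [0, 2, 4, 6], [0, 5, 10, 15]]
w0 = w_statistic(samples, 0)
```

With r = 0 the quasi range reduces to the ordinary sample range. Larger r trims the r most extreme observations from each tail, which is what makes the statistic less sensitive to outliers.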

14.
Suppose that data $\{(x_{l,i,n}, y_{l,i,n}) : l = 1, \dots, k;\ i = 1, \dots, n\}$ are observed from the regression models $Y_{l,i,n} = m_l(x_{l,i,n}) + \varepsilon_{l,i,n}$, $l = 1, \dots, k$, where the regression functions $\{m_l\}_{l=1}^{k}$ are unknown and the random errors $\{\varepsilon_{l,i,n}\}$ are dependent, following an MA(∞) structure. A new test is proposed for testing the hypothesis $H_0: m_1 = \cdots = m_k$, without assuming that $\{m_l\}_{l=1}^{k}$ are in a parametric family. The criterion of the test derives from a Cramér-von Mises-type functional based on different distances between $\hat{m}_l$ and $\hat{m}_s$, $l \ne s$, $l, s = 1, \dots, k$, where $\{\hat{m}_l\}_{l=1}^{k}$ are nonparametric Gasser–Müller estimators of $\{m_l\}_{l=1}^{k}$. A generalization of the test to the case of unequal design points, with different sample sizes $\{n_l\}_{l=1}^{k}$ and different design densities $\{f_l\}_{l=1}^{k}$, is also considered. The asymptotic normality of the test statistic is obtained under general conditions. Finally, a simulation study and an analysis of real data show that the proposed test behaves well.

15.
For count responses, there are situations in biomedical and sociological applications in which extra zeroes occur. Modeling correlated (e.g. repeated-measures and clustered) zero-inflated count data poses special challenges because the correlation between measurements for a subject or a cluster needs to be taken into account. Moreover, zero-inflated count data often exhibit over- or under-dispersion. In this paper, we propose a random effect model for repeated measurements or clustered data with an over- or under-dispersed response, called the random effect zero-inflated exponentiated-exponential geometric regression model. The proposed method is illustrated through real examples. The performance of the model and the asymptotic properties of the estimators are investigated using simulation studies.
KEYWORDS: Count model, under- and over-dispersion, zero-inflation, mixture model, zero-inflated Poisson model

16.
The inverse Gaussian distribution provides a flexible model for analyzing positive, right-skewed data. The generalized variable test for equality of several inverse Gaussian means with unknown and arbitrary variances has a satisfactory Type-I error rate when the number of samples (k) is small (Tian, 2006). However, the Type-I error rate tends to be inflated as k increases. In this article, we propose a parametric bootstrap (PB) approach for this problem. Simulation results show that the proposed test performs very satisfactorily regardless of the number of samples and the sample sizes. The method is illustrated by an example.

17.
Consider $k\ (\ge 2)$ independent exponential populations $\Pi_1, \Pi_2, \dots, \Pi_k$, having the common unknown location parameter $\mu \in (-\infty, \infty)$ (also called the guarantee time) and unknown scale parameters $\sigma_1, \sigma_2, \dots, \sigma_k$, respectively (also called the remaining mean lifetimes after the completion of guarantee times), $\sigma_i > 0$, $i = 1, 2, \dots, k$. Assume that the correct ordering between $\sigma_1, \sigma_2, \dots, \sigma_k$ is not known a priori and let $\sigma_{[i]}$, $i = 1, 2, \dots, k$, denote the ith smallest of the $\sigma_j$'s, so that $\sigma_{[1]} \le \sigma_{[2]} \le \cdots \le \sigma_{[k]}$. Then $\Theta_i = \mu + \sigma_i$ is the mean lifetime of $\Pi_i$, $i = 1, 2, \dots, k$. Let $\Theta_{[1]} \le \Theta_{[2]} \le \cdots \le \Theta_{[k]}$ denote the ranked values of the $\Theta_j$'s, so that $\Theta_{[i]} = \mu + \sigma_{[i]}$, $i = 1, 2, \dots, k$, and let $\Pi_{(i)}$ denote the unknown population associated with the ith smallest mean lifetime $\Theta_{[i]} = \mu + \sigma_{[i]}$, $i = 1, 2, \dots, k$. Based on independent random samples from the k populations, we propose a selection procedure for the goal of selecting the population having the longest mean lifetime $\Theta_{[k]}$ (called the "best" population), under the subset selection formulation. Tables for the implementation of the proposed selection procedure are provided. It is established that the proposed subset selection procedure is monotone for a general $k\ (\ge 2)$. For $k = 2$, we consider the loss measured by the size of the selected subset and establish that the proposed subset selection procedure is minimax among selection procedures that satisfy a certain probability requirement (called the P*-condition) for the inclusion of the best population in the selected subset.

18.
In this paper, we derive Bayesian estimators of the parameters of modified power series distributions inflated at any support point, under the LINEX and general entropy loss functions. We assume that the prior information can be summarized by a uniform, beta, two-sided power, gamma or generalized Pareto distribution. The results are demonstrated on the generalized Poisson and the generalized negative binomial distributions inflated at a given point.

19.
Let $X_1, X_2, \dots$ be a strictly stationary multiple time series with values in $R^d$ and with a common density f. Let $X_1, \dots, X_n$ be n consecutive observations of the series. Let $k = k_n$ be a sequence of positive integers, and let $H_{ni}$ be the distance from $X_i$ to its kth nearest neighbour among $X_j$, $j \ne i$. The multivariate variable-kernel estimate $f_n$ of f is built from the distances $H_{ni}$ and a kernel K, where K is a given density. The complete convergence of $f_n$ to f on compact sets is established for time series satisfying a dependence condition (referred to as the strong mixing condition in the locally transitive sense) weaker than the strong mixing condition. Appropriate choices of k are explicitly given. The results apply to autoregressive processes and bilinear time-series models.
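
The defining formula of the estimate is not reproduced in this abstract. A commonly used form of the variable-kernel estimate averages, over the observations, the kernel centred at each $X_i$ and scaled by that point's own nearest-neighbour distance: $f_n(x) = n^{-1} \sum_{i=1}^{n} H_{ni}^{-d} K((x - X_i)/H_{ni})$. The Python sketch below assumes that form together with a standard Gaussian kernel. Both choices, and all function names, are assumptions for illustration and may not match the paper's definition.

```python
from math import dist, exp, pi


def gaussian_kernel(u):
    """Standard d-dimensional Gaussian density evaluated at the vector u (assumed kernel)."""
    d = len(u)
    return exp(-0.5 * sum(c * c for c in u)) / (2.0 * pi) ** (d / 2.0)


def knn_distance(points, i, k):
    """Euclidean distance from points[i] to its kth nearest neighbour among the other points."""
    distances = sorted(dist(points[i], points[j]) for j in range(len(points)) if j != i)
    return distances[k - 1]


def variable_kernel_estimate(x, points, k):
    """Assumed form: the average over i of K((x - X_i) / H_i) / H_i**d,
    where H_i is the kth nearest-neighbour distance of X_i."""
    n, d = len(points), len(x)
    total = 0.0
    for i, xi in enumerate(points):
        h = knn_distance(points, i, k)  # zero if X_i has k or more exact duplicates
        u = [(a - b) / h for a, b in zip(x, xi)]
        total += gaussian_kernel(u) / h**d
    return total / n
```

Because each observation carries its own bandwidth, the estimate is smoothed more heavily in sparse regions and less in dense ones, unlike a fixed-bandwidth kernel estimate.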

20.
In the study of the reliability of technical systems, k-out-of-n systems play an important role. In the present paper, we consider an $(n - k + 1)$-out-of-$n$ system consisting of n identical components whose lifetimes are independent and have a common distribution function F. It is assumed that the system is monitored l times and that the total number of component failures at time $t_i$ is $m_i$, $i = 1, \dots, l - 1$. In addition, at time $t_l$ ($t_1 < \cdots < t_l$) the system has either failed or is still working. Under these conditions, the mean past lifetime and the mean residual lifetime of the system, together with their properties, are investigated.
