期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Optimality of Equal vs. Unequal Cluster Sizes in Multilevel Intervention Studies: A Monte Carlo Study for Small Sample Sizes

Math J. J. M. Candel Gerard J. P. Van Breukelen Larissa Kotova Martijn P. F. Berger 《统计学通讯:模拟与计算》2013,42(1):222-239

Optimality of equal versus unequal cluster sizes in the context of multilevel intervention studies is examined. A Monte Carlo study is done to examine to what degree asymptotic results on the optimality hold for realistic sample sizes and for different estimation methods. The relative D-criterion, comparing equal versus unequal cluster sizes, almost always exceeded 85%, implying that loss of information due to unequal cluster sizes can be compensated for by increasing the number of clusters by 18%. The simulation results are in line with asymptotic results, showing that, for realistic sample sizes and various estimation methods, the asymptotic results can be used in planning multilevel intervention studies. 相似文献

2.

Factor recovery by principal axis factoring and maximum likelihood factor analysis as a function of factor pattern and sample size

J. C.F. de Winter D. Dodou 《Journal of applied statistics》2012,39(4):695-710

Principal axis factoring (PAF) and maximum likelihood factor analysis (MLFA) are two of the most popular estimation methods in exploratory factor analysis. It is known that PAF is better able to recover weak factors and that the maximum likelihood estimator is asymptotically efficient. However, there is almost no evidence regarding which method should be preferred for different types of factor patterns and sample sizes. Simulations were conducted to investigate factor recovery by PAF and MLFA for distortions of ideal simple structure and sample sizes between 25 and 5000. Results showed that PAF is preferred for population solutions with few indicators per factor and for overextraction. MLFA outperformed PAF in cases of unequal loadings within factors and for underextraction. It was further shown that PAF and MLFA do not always converge with increasing sample size. The simulation findings were confirmed by an empirical study as well as by a classic plasmode, Thurstone's box problem. The present results are of practical value for factor analysts. 相似文献

3.

Efficiency and Validity Analyses of Two-Stage Estimation Procedures and Derived Testing Procedures in Quantitative Linear Models with AR(1) Errors

《统计学通讯:模拟与计算》2013,42(3):799-833

Abstract

In a quantitative linear model with errors following a stationary Gaussian, first-order autoregressive or AR(1) process, Generalized Least Squares (GLS) on raw data and Ordinary Least Squares (OLS) on prewhitened data are efficient methods of estimation of the slope parameters when the autocorrelation parameter of the error AR(1) process, ρ, is known. In practice, ρ is generally unknown. In the so-called two-stage estimation procedures, ρ is then estimated first before using the estimate of ρ to transform the data and estimate the slope parameters by OLS on the transformed data. Different estimators of ρ have been considered in previous studies. In this article, we study nine two-stage estimation procedures for their efficiency in estimating the slope parameters. Six of them (i.e., three noniterative, three iterative) are based on three estimators of ρ that have been considered previously. Two more (i.e., one noniterative, one iterative) are based on a new estimator of ρ that we propose: it is provided by the sample autocorrelation coefficient of the OLS residuals at lag 1, denoted r(1). Lastly, REstricted Maximum Likelihood (REML) represents a different type of two-stage estimation procedure whose efficiency has not been compared to the others yet. We also study the validity of the testing procedures derived from GLS and the nine two-stage estimation procedures. Efficiency and validity are analyzed in a Monte Carlo study. Three types of explanatory variable x in a simple quantitative linear model with AR(1) errors are considered in the time domain: Case 1, x is fixed; Case 2, x is purely random; and Case 3, x follows an AR(1) process with the same autocorrelation parameter value as the error AR(1) process. In a preliminary step, the number of inadmissible estimates and the efficiency of the different estimators of ρ are compared empirically, whereas their approximate expected value in finite samples and their asymptotic variance are derived theoretically. Thereafter, the efficiency of the estimation procedures and the validity of the derived testing procedures are discussed in terms of the sample size and the magnitude and sign of ρ. The noniterative two-stage estimation procedure based on the new estimator of ρ is shown to be more efficient for moderate values of ρ at small sample sizes. With the exception of small sample sizes, REML and its derived F-test perform the best overall. The asymptotic equivalence of two-stage estimation procedures, besides REML, is observed empirically. Differences related to the nature, fixed or random (uncorrelated or autocorrelated), of the explanatory variable are also discussed. 相似文献

4.

Multilevel modelling of complex survey data

Sophia Rabe-Hesketh Anders Skrondal 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2006,169(4):805-827

Summary. Multilevel modelling is sometimes used for data from complex surveys involving multistage sampling, unequal sampling probabilities and stratification. We consider generalized linear mixed models and particularly the case of dichotomous responses. A pseudolikelihood approach for accommodating inverse probability weights in multilevel models with an arbitrary number of levels is implemented by using adaptive quadrature. A sandwich estimator is used to obtain standard errors that account for stratification and clustering. When level 1 weights are used that vary between elementary units in clusters, the scaling of the weights becomes important. We point out that not only variance components but also regression coefficients can be severely biased when the response is dichotomous. The pseudolikelihood methodology is applied to complex survey data on reading proficiency from the American sample of the 'Program for international student assessment' 2000 study, using the Stata program gllamm which can estimate a wide range of multilevel and latent variable models. Performance of pseudo-maximum-likelihood with different methods for handling level 1 weights is investigated in a Monte Carlo experiment. Pseudo-maximum-likelihood estimators of (conditional) regression coefficients perform well for large cluster sizes but are biased for small cluster sizes. In contrast, estimators of marginal effects perform well in both situations. We conclude that caution must be exercised in pseudo-maximum-likelihood estimation for small cluster sizes when level 1 weights are used. 相似文献

5.

Nonlinear semiparametric AR(1) model with skew-symmetric innovations

Arezo Hajrajabi Afshin Fallah 《统计学通讯:模拟与计算》2018,47(5):1453-1462

In this paper, we expand a first-order nonlinear autoregressive (AR) model with skew normal innovations. A semiparametric method is proposed to estimate a nonlinear part of model by using the conditional least squares method for parametric estimation and the nonparametric kernel approach for the AR adjustment estimation. Then computational techniques for parameter estimation are carried out by the maximum likelihood (ML) approach using Expectation-Maximization (EM) type optimization and the explicit iterative form for the ML estimators are obtained. Furthermore, in a simulation study and a real application, the accuracy of the proposed methods is verified. 相似文献

6.

On the estimates of the parameters of the first order autoregressive process

A. Bustos M. Ahsanullah 《统计学通讯:模拟与计算》2013,42(4):489-505

Two methods of estimation for the parameters of an AR(1) process which are based on a non-linear least-squares approach are presented. On the basis of some simulation results they are compared with two maximum likelihood estimates and their relative merits are discussed. 相似文献

7.

Statistical Engineering

Robert C. Tumbleson 《The American statistician》2013,67(1):7-11

This article compares the accuracy of the median unbiased estimator with that of the maximum likelihood estimator for a logistic regression model with two binary covariates. The former estimator is shown to be uniformly more accurate than the latter for small to moderately large sample sizes and a broad range of parameter values. In view of the recently developed efficient algorithms for generating exact distributions of sufficient statistics in binary-data problems, these results call for a serious consideration of median unbiased estimation as an alternative to maximum likelihood estimation, especially when the sample size is not large, or when the data structure is sparse. 相似文献

8.

Inverse Adaptive Cluster Sampling with Unequal Selection Probabilities: Case Studies on Crab Holes and Arsenic Pollution

下载免费PDF全文

Mohammad Salehi Mohammad Moradi Jassim A. Al Khayat Jennifer Brown Adil Eltayeb Mohamed Yousif 《Australian & New Zealand Journal of Statistics》2015,57(2):189-201

Adaptive cluster sampling is an efficient method of estimating the parameters of rare and clustered populations. The method mimics how biologists would like to collect data in the field by targeting survey effort to localised areas where the rare population occurs. Another popular sampling design is inverse sampling. Inverse sampling was developed so as to be able to obtain a sample of rare events having a predetermined size. Ideally, in inverse sampling, the resultant sample set will be sufficiently large to ensure reliable estimation of population parameters. In an effort to combine the good properties of these two designs, adaptive cluster sampling and inverse sampling, we introduce inverse adaptive cluster sampling with unequal selection probabilities. We develop an unbiased estimator of the population total that is applicable to data obtained from such designs. We also develop numerical approximations to this estimator. The efficiency of the estimators that we introduce is investigated through simulation studies based on two real populations: crabs in Al Khor, Qatar and arsenic pollution in Kurdistan, Iran. The simulation results show that our estimators are efficient. 相似文献

9.

Efficient closed-form maximum a posteriori estimators for the gamma distribution

Francisco Louzada 《Journal of Statistical Computation and Simulation》2018,88(6):1134-1146

We proposed a new class of maximum a posteriori estimators for the parameters of the Gamma distribution. These estimators have simple closed-form expressions and can be rewritten as a bias-corrected maximum likelihood estimators presented by Ye and Chen [Closed-form estimators for the gamma distribution derived from likelihood equations. Am Statist. 2017;71(2):177–181]. A simulation study was carried out to compare different estimation procedures. Numerical results revels that our new estimation scheme outperforms the existing closed-form estimators and produces extremely efficient estimates for both parameters, even for small sample sizes. 相似文献

10.

Likelihood-Based Inference in Autoregressive Models with Scaled t-Distributed Innovations by Means of EM-Based Algorithms

H. Haghbin A. R. Nematollahi 《统计学通讯:模拟与计算》2013,42(10):2239-2252

This article applies the EM-based (ECM and ECME) algorithms to find the maximum likelihood estimates of model parameters in general AR models with independent scaled t-distributed innovations whenever the degrees of freedom are unknown. The ECME, sharing advantages with both EM and Newton–Raphson algorithms, is an extension of ECM, which itself is an extension of the EM algorithm. The ECM and ECME algorithms, which are analytically quite simple to use, are then compared based on the computational running time and the accuracy of estimation via a simulation study. The results demonstrate that the ECME is efficient and usable in practice. We also show how our method can be applied to the Wolfer's sunspot data. 相似文献

11.

Testing for a unit root in an arima(p,1,0) signal observed with ma(q) noise

Dong Wan Shin Sahadeb Sarkar 《统计学通讯:理论与方法》2013,42(9):2643-2670

An ARIMA(p,1,0) signal disturbed by MA(q) noise is an ARIMA(p,1, p+q+1) process restricted by nonlinear constraints on parameters. For this model with a unit root the restricted maximum likelihood estimator (RMLE) of the unit root is strongly consistent and it has the same limiting distribution as the ordinary least squares estimator of the unit root in an AR(1) model tabulated by Dickey and Fuller (1979). A modified RMLE is proposed which has the same limiting properties as the RMLE and is computationally much simpler. Simulation results show that our unit root tests based on the modified RMLE perform very well for small samples and compare favorably with the Said and Dickey (1985) tests with respect to both sizes and powers. An illustrative example from sample survey is given. 相似文献

12.

ESTIMATION FOR THE POSITIVE STABLE LAWS, I

P. J. Brockwell B. M. Brown 《Australian & New Zealand Journal of Statistics》1979,21(2):139-148

Estimation of scale and index parameters of positive stable laws is considered. Maximum likelihood estimation is known to be efficient, but very difficult to compute, while methods based on the sample characteristic function are computationally easy, but have uncertain efficiency properties.
In this paper an estimation method is presented which is reasonably easy to compute, and which has good efficiency properties, at least when the index α (0, 0.5). The method is based on an expression for the characteristic function of the logarithm of a positive stable random variable, and is derived by relating the stable estimation problem to that of location/scale estimation in extreme-value-distribution families, for which efficient methods are known.
The proposed method has efficiency which →1 as α→,but on the other hand, efficiencies deteriorate after α >0.5, and in fact appear to →0 as α+ 1. 相似文献

13.

Inference in Semi‐Parametric Dynamic Models for Repeated Count Data

下载免费PDF全文

Brajendra C. Sutradhar K.V. Vineetha Warriyar Nan Zheng 《Australian & New Zealand Journal of Statistics》2016,58(3):397-434

This paper deals with a longitudinal semi‐parametric regression model in a generalised linear model setup for repeated count data collected from a large number of independent individuals. To accommodate the longitudinal correlations, we consider a dynamic model for repeated counts which has decaying auto‐correlations as the time lag increases between the repeated responses. The semi‐parametric regression function involved in the model contains a specified regression function in some suitable time‐dependent covariates and a non‐parametric function in some other time‐dependent covariates. As far as the inference is concerned, because the non‐parametric function is of secondary interest, we estimate this function consistently using the independence assumption‐based well‐known quasi‐likelihood approach. Next, the proposed longitudinal correlation structure and the estimate of the non‐parametric function are used to develop a semi‐parametric generalised quasi‐likelihood approach for consistent and efficient estimation of the regression effects in the parametric regression function. The finite sample performance of the proposed estimation approach is examined through an intensive simulation study based on both large and small samples. Both balanced and unbalanced cluster sizes are incorporated in the simulation study. The asymptotic performances of the estimators are given. The estimation methodology is illustrated by reanalysing the well‐known health care utilisation data consisting of counts of yearly visits to a physician by 180 individuals for four years and several important primary and secondary covariates. 相似文献

14.

ESTIMATION OF FAMILIAL CORRELATIONS UNDER AUTOREGRESSIVE CIRCULAR COVARIANCE

《统计学通讯:理论与方法》2013,42(8-9):1811-1828

Circular covariance matrices play an important role in modeling phenomena in numerous epidemiological, communications and physical contexts. In this article, we propose a parsimonious, autoregressive type of circular covariance structure for modeling correlations between the “siblings” of a “family”. This structure, similar to AR(1) structure used in time series models, involves only two parameters. We derive the maximum likelihood estimators of these parameters, and discuss testing of hypotheses about the autoregressive parameter. Estimation of “parent-sib” correlation, namely, the interclass correlation, is also considered. Estimation of the parameters when there are unequal numbers of siblings in different families is also discussed. 相似文献

15.

New monte carlo results on the robustness of the anova f,w and f statistics

Rand R. Wilcox Ventura L. Charin Karen L. Thompson 《统计学通讯:模拟与计算》2013,42(4):933-943

Because the usual F test for equal means is not robust to unequal variances, Brown and Forsythe (1974a) suggest replacing F with the statistics F or W which are based on the Satterthwaite and Welch adjusted degrees of freedom procedures. This paper reports practical situations where both F and W give * unsatisfactory results. In particular, both F and W may not provide adequate control over Type I errors. Moreover, for equal variances, but unequal sample sizes, W should be avoided in favor of F (or F ), but for equal sample sizes, and possibly unequal variances, W was the only satisfactory statistic. New results on power are included as well. The paper also considers the effect of using F or W only after a significant test for equal variances has been obtained, and new results on the robustness of the F test are described. It is found that even for equal sample sizes as large as 50 per treatment group, there are practical situations where the F test does not provide adequately control over the probability of a Type I error. 相似文献

16.

Modelling a non-stationary BINAR(1) Poisson process

《Journal of Statistical Computation and Simulation》2012,82(15):3106-3126

ABSTRACT

Non-stationarity in bivariate time series of counts may be induced by a number of time-varying covariates affecting the bivariate responses due to which the innovation terms of the individual series as well as the bivariate dependence structure becomes non-stationary. So far, in the existing models, the innovation terms of individual INAR(1) series and the dependence structure are assumed to be constant even though the individual time series are non-stationary. Under this assumption, the reliability of the regression and correlation estimates is questionable. Besides, the existing estimation methodologies such as the conditional maximum likelihood (CMLE) and the composite likelihood estimation are computationally intensive. To address these issues, this paper proposes a BINAR(1) model where the innovation series follow a bivariate Poisson distribution under some non-stationary distributional assumptions. The method of generalized quasi-likelihood (GQL) is used to estimate the regression effects while the serial and bivariate correlations are estimated using a robust moment estimation technique. The application of model and estimation method is made in the simulated data. The GQL method is also compared with the CMLE, generalized method of moments (GMM) and generalized estimating equation (GEE) approaches where through simulation studies, it is shown that GQL yields more efficient estimates than GMM and equally or slightly more efficient estimates than CMLE and GEE. 相似文献

17.

A comparison of LS/ML and GMM estimation in a simple AR(1) model

Dimitrios V. Vougas 《统计学通讯:模拟与计算》2013,42(1):239-258

This paper compares least squares (LS)/maximum likelihood (ML) and generalised method of moments (GMM) estimation in a simple. Gaussian autoregressive of order one (AR(1)) model. First, we show that the usual LS/ML estimator is a corner solution to a general minimisation problem that involves two moment conditions, while the new GMM we devise is not. Secondly, we examine asymptotic and finite sample properties of the new GMM estimator in comparison to the usual LS/ML estimator in a simple AR(1) model. For both stable and unstable (unit root) specifications, we show asymptotic equivalence of the distributions of the two estimators. However, in finite samples, the new GMM estimator performs better. 相似文献

18.

ON THE COMMON SCALE PARAMETER OF SEVERAL PARETO POPULATIONS IN CENSORED SAMPLES

《统计学通讯:理论与方法》2013,42(3):451-462

The problem of estimation of an unknown common scale parameter of several Pareto distributions with unknown and possibly unequal shape parameters in censored samples is considered. A new class of estimators which includes both the maximum likelihood estimator (MLE) and the uniformly minimum variance unbiased estimator (UMVUE) is proposed and examined under a squared error loss. 相似文献

19.

Inference for the tail index of a GARCH(1,1) model and an AR(1) model with ARCH(1) errors

Rongmao Zhang Chenxue Li 《Econometric Reviews》2019,38(2):151-169

For a GARCH(1,1) sequence or an AR(1) model with ARCH(1) errors, one can estimate the tail index by solving an estimating equation with unknown parameters replaced by the quasi maximum likelihood estimation, and a profile empirical likelihood method can be employed to effectively construct a confidence interval for the tail index. However, this requires that the errors of such a model have at least a finite fourth moment. In this article, we show that the finite fourth moment can be relaxed by employing a least absolute deviations estimate for the unknown parameters by noting that the estimating equation for determining the tail index is invariant to a scale transformation of the underlying model. 相似文献

20.

Some inadmissibility results for estimating ordered uniform scale parameters

Neeraj Misra Ishwari D. Dhariyal 《统计学通讯:理论与方法》2013,42(3):675-685

Independent random samples (of possibly unequal sizes) are drawn from k (≥2) uniform populations having unknown scale parameters μ₁,…,μ_k. The problem of componentwise estimation of ordered parameters is investigated. The loss function is assumed to be squared error and the cases of known and unknown ordering among μ₁,…,μ_k. are dealt with separately. Sufficient conditions for an estimator to be inadmissible are provided and as a consequence, many natural estimators are shown to be inadmissible, Better estimators are provided. 相似文献