This paper numerically examines the size robustness of various conditional moment tests in misspecified tobit and probit models. The misspecifications considered include the incorrect exclusion of regressors, ignored heteroskedasticity and false distributional assumptions. An important feature of the experimental design is that it is based on an existing empirical study and is more realistic than many simulation studies. The tests are seen to have mixed performance depending on both the original null hypothesis being tested and type of misspecification encountered. 相似文献
We derive a non-parametric test for testing the presence of V(Xi,εi) in the non-parametric first-order autoregressive model Xi+1=T(Xi)+V(Xi,εi)+U(Xi)εi+1, where the function T(x) is assumed known. The test is constructed as a functional of a basic process for which we establish a weak invariance principle, under the null hypothesis and under stationarity and mixing assumptions. Bounds for the local and non-local powers are provided under a condition which ensures that the power tends to one as the sample size tends to infinity.The testing procedure can be applied, e.g. to bilinear models, ARCH models, EXPAR models and to some other uncommon models. Our results confirm the robustness of the test constructed in Ngatchou Wandji (1995) and in Diebolt & Ngatchou Wandji (1995). 相似文献
Lehmann & Stein (1948) proved the existence of non-similar tests which can be more powerful than best similar tests. They used Student's problem of testing for a non-zero mean given a random sample from the normal distribution with unknown variance as an example. This raises the question: should we use a non-similar test instead of Student's t test? Questions like this can be answered by comparing the power of the test with the power envelope. This paper discusses the difficulties involved in computing power envelopes. It reports an empirical comparison of the power of the t test and the power envelope and finds that the two are almost identical especially for sample sizes greater than 20. These findings suggest that, as well as being uniformly most powerful (UMP) within the class of similar tests, Student's t test is approximately UMP within the class of all tests. For practical purposes it might also be regarded as UMP when moderate or large sample sizes are involved. 相似文献
This paper compares the application of different versions of the simulated counterparts of the Wald test, the score test,
and the likelihood ratio test in one- and multiperiod multinomial probit models. Monte Carlo experiments show that the use
of the simple form of the simulated likelihood ratio test delivers relatively robust results regarding the testing of several
multinomial probit model specifications. In contrast, the inclusion of the Hessian matrix of the simulated loglikelihood function
into the simulated score test and (in the multiperiod multinomial probit model) particularly the inclusion of the quasi-maximum
likelihood theory into the simulated likelihood ratio test leads to substantial computational problems. The combined application
of the quasi-maximum likelihood theory with the simulated Wald test or the simulated score test is not systematically superior
to the application of the other versions of these two simulated classical tests either. Neither an increase in the number
of observations nor in the number of random draws in the incorporated Geweke-Hajivassiliou-Keane simulator systematically
lead to more precise conformities between the frequencies of type I errors and the basic significance levels. An increase
in the number of observations only decreases the frequencies of type II errors, particularly regarding the simulated classical
testing of multiperiod multinomial probit model specifications. 相似文献
We are concerned with three different types of multivariate chi-square distributions. Their members play important roles as limiting distributions of vectors of test statistics in several applications of multiple hypotheses testing. We explain these applications and consider the computation of multiplicity-adjusted p-values under the respective global hypothesis. By means of numerical examples, we demonstrate how much gain in level exhaustion or, equivalently, power can be achieved with corresponding multivariate multiple tests compared with approaches which are only based on univariate marginal distributions and do not take the dependence structure among the test statistics into account. As a further contribution of independent value, we provide an overview of essentially all analytic formulas for computing multivariate chi-square probabilities of the considered types which are available up to present. These formulas were scattered in the previous literature and are presented here in a unified manner. 相似文献
In this article three unit root tests that allow for a break in both the seasonal mean and linear trend of the data are proposed. The tests, which can be seen as small-sample corrected versions of already known asymptotic tests, are shown to perform very well in simulations, and much better than their asymptotic counterparts. 相似文献
Structural breaks in the level as well as in the volatility have often been exhibited in economic time series. In this paper, we propose new unit root tests when a time series has multiple shifts in its level and the corresponding volatility. The proposed tests are Lagrangian multiplier type tests based on the residual's marginal likelihood which is free from the nuisance mean parameters. The limiting null distributions of the proposed tests are the χ2distributions, and are affected not by the size and the location of breaks but only by the number of breaks.
We set the structural breaks under both the null and the alternative hypotheses to relieve a possible vagueness in interpreting test results in empirical work. The null hypothesis implies a unit root process with level shifts and the alternative connotes a stationary process with level shifts. The Monte Carlo simulation shows that our tests are locally more powerful than the OLSE-based tests, and that the powers of our tests, in a fixed time span, remain stable regardless the number of breaks. In our application, we employ the data which are analyzed by Perron (1990), and some results differ from those of Perron's (1990). 相似文献