首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 265 毫秒
1.
Nonparametric regression models are often used to check or suggest a parametric model. Several methods have been proposed to test the hypothesis of a parametric regression function against an alternative smoothing spline model. Some tests such as the locally most powerful (LMP) test by Cox et al. (Cox, D., Koh, E., Wahba, G. and Yandell, B. (1988). Testing the (parametric) null model hypothesis in (semiparametric) partial and generalized spline models. Ann. Stat., 16, 113–119.), the generalized maximum likelihood (GML) ratio test and the generalized cross validation (GCV) test by Wahba (Wahba, G. (1990). Spline models for observational data. CBMS-NSF Regional Conference Series in Applied Mathematics, SIAM.) were developed from the corresponding Bayesian models. Their frequentist properties have not been studied. We conduct simulations to evaluate and compare finite sample performances. Simulation results show that the performances of these tests depend on the shape of the true function. The LMP and GML tests are more powerful for low frequency functions while the GCV test is more powerful for high frequency functions. For all test statistics, distributions under the null hypothesis are complicated. Computationally intensive Monte Carlo methods can be used to calculate null distributions. We also propose approximations to these null distributions and evaluate their performances by simulations.  相似文献   

2.
Consider the problem of testing the composite null hypothesis that a random sample X1,…,Xn is from a parent which is a member of a particular continuous parametric family of distributions against an alternative that it is from a separate family of distributions. It is shown here that in many cases a uniformly most powerful similar (UMPS) test exists for this problem, and, moreover, that this test is equivalent to a uniformly most powerful invariant (UMPI) test. It is also seen in the method of proof used that the UMPS test statistic Is a function of the statistics U1,…,Un?k obtained by the conditional probability integral transformations (CPIT), and thus that no Information Is lost by these transformations, It is also shown that these optimal tests have power that is a nonotone function of the null hypothesis class of distributions, so that, for example, if one additional parameter for the distribution is assumed known, then the power of the test can not lecrease. It Is shown that the statistics U1, …, Un?k are independent of the complete sufficient statistic, and that these statistics have important invariance properties. Two examples at given. The UMPS tests for testing the two-parameter uniform family against the two-parameter exponential family, and for testing one truncation parameter distribution against another one are derived.  相似文献   

3.
Abstract

In time series, it is essential to check the independence of data by means of a proper method or an appropriate statistical test before any further analysis. Therefore, among different independence tests, a powerful and productive test has been introduced by Matilla-García and Marín via m-dimensional vectorial process, in which the value of the process at time t includes m-histories of the primary process. However, this method causes a dependency for the vectors even when the independence assumption of random variables is considered. Considering this dependency, a modified test is obtained in this article through presenting a new asymptotic distribution based on weighted chi-square random variables. Also, some other alterations to the test have been made via bootstrap method and by controlling the overlap. Compared with the primary test, it is obtained that not only the modified test is more accurate but also, it possesses higher power.  相似文献   

4.
ABSTRACT

The score test and the GOF test for the inverse Gaussian distribution, in particular the latter, are known to have large size distortion and hence unreliable power when referring to the asymptotic critical values. We show in this paper that with the appropriately bootstrapped critical values, these tests become second-order accurate, with size distortion being essentially eliminated and power more reliable. Two major generalizations of the score test are made: one is to allow the data to be right-censored, and the other is to allow the existence of covariate effects. A data mapping method is introduced for the bootstrap to be able to produce censored data that are conformable with the null model. Monte Carlo results clearly favour the proposed bootstrap tests. Real data illustrations are given.  相似文献   

5.
This paper provides some new results on the asymptotics of goodness-of-fit (GOF) tests based on minimum p-value statistics. In connection with detectability of sparse signals in high-dimensional data, various tests were proposed and investigated during the last decade, especially with respect to asymptotic properties. Minimum p-value GOF statistics were already investigated as minimum level attained statistics by Berk and Jones with respect to Bahadur efficiency. The distribution of minimum p-value GOF statistics is closely related to the distribution of higher criticism statistics, the distribution of the supremum of a normalized Brownian bridge, and the supremum of an Ornstein–Uhlenbeck process.  相似文献   

6.
Lehmann & Stein (1948) proved the existence of non-similar tests which can be more powerful than best similar tests. They used Student's problem of testing for a non-zero mean given a random sample from the normal distribution with unknown variance as an example. This raises the question: should we use a non-similar test instead of Student's t test? Questions like this can be answered by comparing the power of the test with the power envelope. This paper discusses the difficulties involved in computing power envelopes. It reports an empirical comparison of the power of the t test and the power envelope and finds that the two are almost identical especially for sample sizes greater than 20. These findings suggest that, as well as being uniformly most powerful (UMP) within the class of similar tests, Student's t test is approximately UMP within the class of all tests. For practical purposes it might also be regarded as UMP when moderate or large sample sizes are involved.  相似文献   

7.
In statistical modeling, we strive to specify models that resemble data collected in studies or observed from processes. Consequently, distributional specification and parameter estimation are central to parametric models. Graphical procedures, such as the quantile–quantile (QQ) plot, are arguably the most widely used method of distributional assessment, though critics find their interpretation to be overly subjective. Formal goodness of fit tests are available and are quite powerful, but only indicate whether there is a lack of fit, not why there is lack of fit. In this article, we explore the use of the lineup protocol to inject rigor into graphical distributional assessment and compare its power to that of formal distributional tests. We find that lineup tests are considerably more powerful than traditional tests of normality. A further investigation into the design of QQ plots shows that de-trended QQ plots are more powerful than the standard approach as long as the plot preserves distances in x and y to be the same. While we focus on diagnosing nonnormality, our approach is general and can be directly extended to the assessment of other distributions.  相似文献   

8.
Uniformly most powerful Bayesian tests (UMPBTs) are a new class of Bayesian tests in which null hypotheses are rejected if their Bayes factor exceeds a specified threshold. The alternative hypotheses in UMPBTs are defined to maximize the probability that the null hypothesis is rejected. Here, we generalize the notion of UMPBTs by restricting the class of alternative hypotheses over which this maximization is performed, resulting in restricted most powerful Bayesian tests (RMPBTs). We then derive RMPBTs for linear models by restricting alternative hypotheses to g priors. For linear models, the rejection regions of RMPBTs coincide with those of usual frequentist F‐tests, provided that the evidence thresholds for the RMPBTs are appropriately matched to the size of the classical tests. This correspondence supplies default Bayes factors for many common tests of linear hypotheses. We illustrate the use of RMPBTs for ANOVA tests and t‐tests and compare their performance in numerical studies.  相似文献   

9.
Testing the equality of variances of two linear models with common β-parameter is considered. A test based on least squares residuals (ASR test) is proposed, and it is shown that this test is invariant under the group of scale and translation changes. For some special cases, it is also proved that this test has a monotone power function. Finding the exact critical values of this test is not easy; an approximation is given to facilitate the computation of these. The powers of the BLUS test, the F-test and the new test are computed for various alternatives and compared in a particular case. The proposed test seems to be locally more powerful than the alternative tests.  相似文献   

10.
The intercomponent rank test suggested by Thompson (1991a) for the bivariate two sample problem is compared with the intracomponent rank test discussed by Puri and Sen (1971) and Hettmansperger (1984) and with the Hotelling T 2 test. Asymptotic relative efficiencies are discussed and the results of a simulation study are presented. Power studies show that for small sample sizes and small Type 1 error rates, say n = 5 and α = .01, the intercomponent rank test of Thompson (1991a) is somewhat liberal and the intracomponent test is quite conservative. For larger sample sizes and larger Type 1 error rates, both rank tests have improved properties under the null hypothesis. In almost all simulated cases, the intercomponent test is more powerful. In light of these studies it is suggested that the intercomponent rank test of Thompson (1991a), which has the added advantage of being easily computed with standard statistical software, is a strong competitor to the intracomponent rank test.  相似文献   

11.
We study the invariance properties of various test criteria which have been proposed for hypothesis testing in the context of incompletely specified models, such as models which are formulated in terms of estimating functions (Godambe, 1960) or moment conditions and are estimated by generalized method of moments (GMM) procedures (Hansen, 1982), and models estimated by pseudo-likelihood (Gouriéroux, Monfort, and Trognon, 1984b,c) and M-estimation methods. The invariance properties considered include invariance to (possibly nonlinear) hypothesis reformulations and reparameterizations. The test statistics examined include Wald-type, LR-type, LM-type, score-type, and C(α)?type criteria. Extending the approach used in Dagenais and Dufour (1991), we show first that all these test statistics except the Wald-type ones are invariant to equivalent hypothesis reformulations (under usual regularity conditions), but all five of them are not generally invariant to model reparameterizations, including measurement unit changes in nonlinear models. In other words, testing two equivalent hypotheses in the context of equivalent models may lead to completely different inferences. For example, this may occur after an apparently innocuous rescaling of some model variables. Then, in view of avoiding such undesirable properties, we study restrictions that can be imposed on the objective functions used for pseudo-likelihood (or M-estimation) as well as the structure of the test criteria used with estimating functions and generalized method of moments (GMM) procedures to obtain invariant tests. In particular, we show that using linear exponential pseudo-likelihood functions allows one to obtain invariant score-type and C(α)?type test criteria, while in the context of estimating function (or GMM) procedures it is possible to modify a LR-type statistic proposed by Newey and West (1987) to obtain a test statistic that is invariant to general reparameterizations. The invariance associated with linear exponential pseudo-likelihood functions is interpreted as a strong argument for using such pseudo-likelihood functions in empirical work.  相似文献   

12.
This paper examines the goodness-of-fit (GOF) test for a generalized asymmetric Student-t distribution (ASTD) and asymmetric exponential power distribution (AEPD). These distributions are known to include a broad class of distribution families and are quite suitable to modelling the innovations of financial time series. Despite their popularity, to our knowledge, no studies in the literature have so far investigated their affinity and differences in implementation. To fill this gap, we examine the empirical power behaviour of entropy-based GOF tests for hypotheses wherein the ASTD and AEPD play the role of null and alternative distributions. Our findings through a simulation study and real data analysis indicate that the two distributions are generally hard to distinguish and that the ASTD family accommodates AEPDs to a greater degree than the other way around for larger samples.  相似文献   

13.
ABSTRACT

The purpose of this paper is to use Bahadur's asymptotic relative efficiency measure to compare the performance of various tests of autoregressive (AR) versus moving average (MA) error processes in regression models. Tests to be examined include non-nested procedures of the models against each other, and classical procedures based upon testing both the AR and MA error processes against the more general autoregressive-moving average model.  相似文献   

14.
An adaptive test is proposed for the one-way layout. This test procedure uses the order statistics of the combined data to obtain estimates of percentiles, which are used to select an appropriate set of rank scores for the one-way test statistic. This test is designed to have reasonably high power over a range of distributions. The adaptive procedure proposed for a one-way layout is a generalization of an existing two-sample adaptive test procedure. In this Monte Carlo study, the power and significance level of the F-test, the Kruskal-Wallis test, the normal scores test, and the adaptive test were evaluated for the one-way layout. All tests maintained their significance level for data sets having at least 24 observations. The simulation results show that the adaptive test is more powerful than the other tests for skewed distributions if the total number of observations equals or exceeds 24. For data sets having at least 60 observations the adaptive test is also more powerful than the F-test for some symmetric distributions.  相似文献   

15.
The importance of the normal distribution for fitting continuous data is well known. However, in many practical situations data distribution departs from normality. For example, the sample skewness and the sample kurtosis are far away from 0 and 3, respectively, which are nice properties of normal distributions. So, it is important to have formal tests of normality against any alternative. D'Agostino et al. [A suggestion for using powerful and informative tests of normality, Am. Statist. 44 (1990), pp. 316–321] review four procedures Z 2(g 1), Z 2(g 2), D and K 2 for testing departure from normality. The first two of these procedures are tests of normality against departure due to skewness and kurtosis, respectively. The other two tests are omnibus tests. An alternative to the normal distribution is a class of skew-normal distributions (see [A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Statist. 12 (1985), pp. 171–178]). In this paper, we obtain a score test (W) and a likelihood ratio test (LR) of goodness of fit of the normal regression model against the skew-normal family of regression models. It turns out that the score test is based on the sample skewness and is of very simple form. The performance of these six procedures, in terms of size and power, are compared using simulations. The level properties of the three statistics LR, W and Z 2(g 1) are similar and close to the nominal level for moderate to large sample sizes. Also, their power properties are similar for small departure from normality due to skewness (γ1≤0.4). Of these, the score test statistic has a very simple form and computationally much simpler than the other two statistics. The LR statistic, in general, has highest power, although it is computationally much complex as it requires estimates of the parameters under the normal model as well as those under the skew-normal model. So, the score test may be used to test for normality against small departure from normality due to skewness. Otherwise, the likelihood ratio statistic LR should be used as it detects general departure from normality (due to both skewness and kurtosis) with, in general, largest power.  相似文献   

16.
ABSTRACT

A simple test based on Gini's mean difference is proposed to test the hypothesis of equality of population variances. Using 2000 replicated samples and empirical distributions, we show that the test compares favourably with Bartlett's and Levene's test for the normal population. Also, it is more powerful than Bartlett's and Levene's tests for some alternative hypotheses for some non-normal distributions and more robust than the other two tests for large sample sizes under some alternative hypotheses. We also give an approximate distribution to the test statistic to enable one to calculate the nominal levels and P-values.  相似文献   

17.
In this article, we propose several goodness-of-fit methods for location–scale families of distributions under progressively Type-II censored data. The new tests are based on order statistics and sample spacings. We assess the performance of the proposed tests for the normal and Gumbel models against several alternatives by means of Monte Carlo simulations. It has been observed that the proposed tests are quite powerful in comparison with an existing goodness-of-fit test proposed for progressively Type-II censored data by Balakrishnan et al. [Goodness-of-fit tests based on spacings for progressively Type-II censored data from a general location–scale distribution, IEEE Trans. Reliab. 53 (2004), pp. 349–356]. Finally, we illustrate the proposed goodness-of-fit tests using two real data from reliability literature.  相似文献   

18.
For location–scale families, we consider a random distance between the sample order statistics and the quasi sample order statistics derived from the null distribution as a measure of discrepancy. The conditional qth quantile and expectation of the random discrepancy on the given sample are chosen as test statistics. Simulation results of powers against various alternatives are illustrated under the normal and exponential hypotheses for moderate sample size. The proposed tests, especially the qth quantile tests with a small or large q, are shown to be more powerful than other prominent goodness-of-fit tests in most cases.  相似文献   

19.

In this article we examine the effect that logarithmic and power transformations have on the order of integration in raw time series. For this purpose, we use a version of the tests of Robinson (1994) that permits us to test I ( d ) statistical models. The results, obtained via Monte Carlo, show that there is no effect in the degree of dependence of the series when this type of transformations are employed, resulting thus in useful mechanisms to be applied when a more plausible economic interpretation of the data is required.  相似文献   

20.
Abstract

This paper discusses Johansen’s likelihood ratio tests to determine the cointegration rank under local alternative hypotheses in the vector autoregressive models (VARs) in which drift or linear trend related to the hypotheses is not dependent upon the sample size, and evaluates related asymptotic properties. We show that the test statistics diverge at the rate of the sample size even under one of local alternative hypotheses, owing to the existence of such a deterministic term. This implies that under our situations, the tests are far more powerful than those under the conventional local alternative hypotheses.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号