首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Abstract

We consider the classification of high-dimensional data under the strongly spiked eigenvalue (SSE) model. We create a new classification procedure on the basis of the high-dimensional eigenstructure in high-dimension, low-sample-size context. We propose a distance-based classification procedure by using a data transformation. We also prove that our proposed classification procedure has consistency property for misclassification rates. We discuss performances of our classification procedure in simulations and real data analyses using microarray data sets.  相似文献   

2.
We consider a class of long-range-dependent Gaussian processes defined in a semiparametric framework. We propose a new estimator of the long-range dependence parameter, based on the integration of the periodogram in two windows. We show that it is asymptotically Gaussian and calculate the rate of convergence. We optimise parameters defining the window function for the minimum mean-square-error criterion. In a Monte-Carlo study, we compare the proposed estimator with previously studied estimators.  相似文献   

3.
We discuss a general application of categorical data analysis to mutations along the HIV genome. We consider a multidimensional table for several positions at the same time. Due to the complexity of the multidimensional table, we may collapse it by pooling some categories. However, the association between the remaining variables may not be the same as before collapsing. We discuss the collapsibility of tables and the change in the meaning of parameters after collapsing categories. We also address this problem with a log-linear model. We present a parameterization with the consensus output as the reference cell as is appropriate to explain genomic mutations in HIV. We also consider five null hypotheses and some classical methods to address them. We illustrate methods for six positions along the HIV genome, through consideration of all triples of positions.  相似文献   

4.
We consider a problem of estimating the minimum effective and peak doses in the presence of covariates. We propose a sequential strategy for subject assignment that includes an adaptive randomization component to balance the allocation to placebo and active doses with respect to covariates. We conclude that either adjusting for covariates in the model or balancing allocation with respect to covariates is required to avoid bias in the target dose estimation. We also compute optimal allocation to estimate the minimum effective and peak doses in discrete dose space using isotonic regression.  相似文献   

5.
We consider an example in which the use of finitely additive probabilities leads to the seemingly paradoxical conclusion that we may reason to a foregone conclusion. We give an informal treatment of this paradox. We then develop a geometric approach which formalises this treatment, and offers a foundational resolution of the problem. We discuss this approach both in the context of assessing the value of such paradoxical data and also when used for finite approximation.  相似文献   

6.
"We show how conditional logistic regression can be used to estimate the probability of being enumerated in a census and apply the model to the 1990 Post-Enumeration Survey (PES) in the United States.... We discuss some special problems caused by the fact that the PES sample area is open to migration between the captures. We also consider the effect of data errors in estimation. We characterize hard-to-enumerate populations and give some tentative estimates of correlation bias."  相似文献   

7.
We introduce a new class of distributions called the Burr XII system of densities with two extra positive parameters. We provide a comprehensive treatment of some of its mathematical properties. We estimate the model parameters by maximum likelihood. We assess the performance of the maximum likelihood estimators in terms of biases and mean squared errors by means of a simulation study. We also introduce a new family of regression models based on this system of densities. The usefulness of the proposed models is illustrated by means of three real data sets.  相似文献   

8.
We study the properties of a model combining method, ARM (adaptive regression by mixing), in the ANOVA framework. We propose model instability measures as a guide to the appropriateness of model combining in applications. We further systematically investigate the relationship between ARM performance and the underlying model structure. We propose an approach to evaluating the importance of factors based on the combined estimates. A theoretical risk bound on the combined estimator is also obtained.  相似文献   

9.
We examine the sizes and powers of three tests of convergence of Markov Chain Monte Carlo draws: the Kolmogorov–Smirnov test, fluctuation test, and Geweke's test. We show that the sizes and powers are sensitive to the existence of autocorrelation in the draws. We propose a filtered test that is corrected for autocorrelation. We present a numerical illustration using the Federal funds rate.  相似文献   

10.
11.
We investigate certain objective priors for the parameters in a normal linear regression models with one of the explanatory variables subject to measurement error. We first show that the use of the standard non informative prior for normal linear regression without measurement error leads to an improper posterior in the measurement error model. We then derive the Jeffreys prior and reference priors, and show that they lead to proper posteriors. We use simulation study to compare the frequentist performance of the estimates derived using these priors, and the MLE.  相似文献   

12.
We propose and compare several methods of constructing wavelet-based confidence intervals for the self-similarity parameter in heavy-tailed observations. We use empirical coverage probabilities to assess the procedures by applying them to Linear Fractional Stable Motion with many choices of parameters. We find that the asymptotic confidence intervals provide empirical coverage often much lower than nominal. We recommend the use of resampling confidence intervals. We also propose a procedure for monitoring the constancy of the self-similarity parameter and apply it to Ethernet data sets.  相似文献   

13.
Decisions to undertake bio-medical studies might depend on the results of previous similar studies. So too might the timing of meta-analyses. We show how temporal dependence among the studies analyzed in the meta-analysis, as well as the timing of the meta-analysis itself, can bias the results of the meta-analysis. We show analytically and numerically that a “toy” meta-analysis is biased. We then study bias in a more realistic stochastic process model of meta-analysis. We conclude that in meta-analysis it is difficult of avoid bias that is caused by statistical dependence among studies.  相似文献   

14.
Some partially sequential nonparametric tests for detecting linear trend   总被引:1,自引:0,他引:1  
In the present study, we develop two nonparametric partially sequential tests for detecting possible presence of linear trend among the incoming series of observations. We assume that a sample of fixed size is available a priori from some unknown univariate continuous population and there is no sign of trend among these historical observations. Our proposed tests can be viewed as the sequential type tests for monitoring structural changes. We use partial sequential sampling schemes based on usual ranks as well as on sequential ranks. We provide detailed discussion on asymptotic studies related to the proposed tests. We compare the two tests under various situations. We also present some numerical results based on simulation studies. Proposed tests are extremely important in profit making in volatile market through Margin Trading. We illustrate the mechanism with a detailed analysis of a stock price data.  相似文献   

15.
We study estimation and hypothesis testing in single‐index panel data models with individual effects. Through regressing the individual effects on the covariates linearly, we convert the estimation problem in single‐index panel data models to that in partially linear single‐index models. The conversion is valid regardless of the individual effects being random or fixed. We propose an estimating equation approach, which has a desirable double robustness property. We show that our method is applicable in single‐index panel data models with heterogeneous link functions. We further design a chi‐squared test to evaluate whether the individual effects are random or fixed. We conduct simulations to demonstrate the finite sample performance of the method and conduct a data analysis to illustrate its usefulness.  相似文献   

16.
We consider a one-sample U-statistic with kernel of dimension 2. We obtain its asymptotic bias and skewness and its Edgeworth and Cornish-Fisher type expansions. We also consider in less detail the one sample U-statistic with kernel of arbitrary dimension.  相似文献   

17.
We use semi-parametric efficiency theory to derive a class of estimators for the state occupation probabilities of the continuous-time irreversible illness-death model. We consider both the setting with and without additional baseline information available, where we impose no specific functional form on the intensity functions of the model. We show that any estimator in the class is asymptotically linear under suitable assumptions about the estimators of the intensity functions. In particular, the assumptions are weak enough to allow the use of data-adaptive methods, which is important for making the identifying assumption of coarsening at random plausible in realistic settings. We suggest a flexible method for estimating the transition intensity functions of the illness-death model based on penalized Poisson regression. We apply this method to estimate the nuisance parameters of an illness-death model in a simulation study and a real-world application.  相似文献   

18.
We consider the 'final' (deciding) set in a tennis match. We examine whether it is true that the chances for both players to win the match are equal at the beginning of the final set, even though they were not equal at the beginning of the match. We also test whether it is easier for an unseeded woman to beat a seeded player than it is for an unseeded man, and whether male players are more closely equal in quality than are females. We examine whether the service dominance decreases in long matches, and whether winning the 'pre-final' set provides an advantage in the final set. We use almost 90 000 points at Wimbledon to test all five hypotheses.  相似文献   

19.
We consider the estimation of life length of people who were born in the seventeenth or eighteenth century in England. The data consist of a sequence of times of life events that is either ended by a time of death or is right-censored by an unobserved time of migration. We propose a semi parametric model for the data and use a maximum likelihood method to estimate the unknown parameters in this model. We prove the consistency of the maximum likelihood estimators and describe an algorithm to obtain the estimates numerically. We have applied the algorithm to data and the estimates found are presented.  相似文献   

20.
We propose an algorithm of multiple comparisons in two-factor testing of psychophysical thresholds. We use logistic regression combined with guessing rate and adopt the step-down procedure with Ryan–Einot–Gabriel–Welsch (REGW) significance levels for multiple test. We test hypotheses of main effects and interaction in two-factor problem by using the delta test statistics.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号