1.

Xiaoying Tian Jonathan Taylor 《Scandinavian Journal of Statistics》2017,44(2):480-499

In this paper, we seek to establish asymptotic results for selective inference procedures without the assumption of Gaussianity. The class of selection procedures we consider is determined by affine inequalities; we refer to these as affine selection procedures. Examples of affine selection procedures include selective inference along the solution path of the least absolute shrinkage and selection operator (LASSO), as well as selective inference after fitting the LASSO at a fixed value of the regularization parameter. We also consider some tests in penalized generalized linear models. Our result proves asymptotic convergence in the high‐dimensional setting where n<p, and n can be of the order of a logarithmic factor of the dimension p for some procedures.

2.

A new construction of quantitative screening designs

K. Chatterjee C. Koukouvinos S. Stylianou 《Statistics》2019,53(1):227-244

3.

Two new data-dependent choices of m when applying the m-out-of-n bootstrap to hypothesis testing

《Journal of Statistical Computation and Simulation》2012,82(12):2107-2120

The traditional non-parametric bootstrap (referred to as the n-out-of-n bootstrap) is a widely applicable and powerful tool for statistical inference, but in important situations it can fail. It is well known that by using a bootstrap sample of size m, different from n, the resulting m-out-of-n bootstrap provides a method for rectifying the traditional bootstrap inconsistency. Moreover, recent studies have shown that interesting cases exist where it is better to use the m-out-of-n bootstrap in spite of the fact that the n-out-of-n bootstrap works. In this paper, we discuss another case by considering its application to hypothesis testing. Two new data-based choices of m are proposed in this set-up. The results of simulation studies are presented to provide empirical comparisons between the performance of the traditional bootstrap and the m-out-of-n bootstrap, based on the two data-dependent choices of m, as well as on an existing method in the literature for choosing m. These results show that the m-out-of-n bootstrap, based on our choice of m, generally outperforms the traditional bootstrap procedure as well as the procedure based on the choice of m proposed in the literature.
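The resampling step behind an m-out-of-n bootstrap can be sketched as follows. This is a minimal illustration only: the function name `m_out_of_n_bootstrap`, its parameters and the fixed value of m in the usage lines are assumptions made here, and the data-dependent rules for choosing m proposed in the paper are not reproduced.

```python
import random
import statistics


def m_out_of_n_bootstrap(data, m, statistic, n_boot=1000, seed=0):
    """Return n_boot replicates of `statistic`, each computed on a
    resample of size m drawn with replacement from `data`.

    Setting m == len(data) gives the traditional n-out-of-n bootstrap.
    """
    if m < 1:
        raise ValueError("m must be a positive integer")
    rng = random.Random(seed)  # seeded so that runs are reproducible
    return [statistic(rng.choices(data, k=m)) for _ in range(n_boot)]


# Hypothetical usage: n = 50 observations, resamples of size m = 20.
data = [float(i) for i in range(1, 51)]
replicates = m_out_of_n_bootstrap(data, m=20, statistic=statistics.mean)
print(len(replicates), min(replicates), max(replicates))
```

In a testing context the replicates would be compared with the observed statistic to approximate a critical value or p-value; how m should be tuned for that purpose is exactly what the abstract above is about and is left open here.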

4.

Synthetic data method to incorporate external information into a current study

Tian Gu Jeremy M. G. Taylor Wenting Cheng Bhramar Mukherjee 《Revue canadienne de statistique》2019,47(4):580-603

We consider the situation where there is a known regression model that can be used to predict an outcome, Y, from a set of predictor variables X. A new variable B is expected to enhance the prediction of Y. A dataset of size n containing Y, X and B is available, and the challenge is to build an improved model for Y|X,B that uses both the available individual level data and some summary information obtained from the known model for Y|X. We propose a synthetic data approach, which consists of creating m additional synthetic data observations, and then analyzing the combined dataset of size n + m to estimate the parameters of the Y|X,B model. This combined dataset of size n + m now has missing values of B for m of the observations, and is analyzed using methods that can handle missing data (e.g., multiple imputation). We present simulation studies and illustrate the method using data from the Prostate Cancer Prevention Trial. Though the synthetic data method is applicable to a general regression context, to provide some justification, we show in two special cases that the asymptotic variances of the parameter estimates in the Y|X,B model are identical to those from an alternative constrained maximum likelihood estimation approach. This correspondence in special cases and the method's broad applicability make it appealing for use across diverse scenarios. The Canadian Journal of Statistics 47: 580–603; 2019 © 2019 Statistical Society of Canada

5.

New Generalizations of Cauchy Distribution

Dragan Ðorić 《Communications in Statistics - Theory and Methods》2013,42(21):3764-3776

The generalized skew-normal distribution introduced by Balakrishnan (2002; discussion of ‘Skew multivariate models related to hidden truncation and/or selective reporting’ by B. C. Arnold and R. J. Beaver, Test 11: 37–39) is used to obtain new generalizations of the univariate Cauchy distribution with two parameters, denoted by GC_{m,n}(a, b) with m and n non-negative integers and a, b ∈ R. For the cases (m, n) = (1, 2), (m, n) = (2, 1), (m, n) = (0, 3) and (m, n) = (3, 0), explicit forms of the density functions are derived and compared to previous generalizations of Cauchy and skew-Cauchy distributions.

6.

2^m4ⁿ designs with resolution III or IV containing clear two-factor interaction components

S. Zhao R. Zhang 《Statistical Papers》2008,49(3):441-454

Orthogonal arrays with mixed levels have become widely used in fractional factorial designs. It is highly desirable to know when such designs with resolution III or IV have clear two-factor interaction components (2fic’s). In this paper, we give a complete classification of the existence of clear 2fic’s in regular 2^m4ⁿ designs with resolution III or IV. The necessary and sufficient conditions for a 2^m4ⁿ design to have clear 2fic’s are given. Also, 2^m4ⁿ designs of 32 runs with the most clear 2fic’s are given for n = 1, 2.

7.

A nonparametric procedure for the analysis of balanced crossover designs

Serge Tardif François Bellavance Constance Van Eeden 《Revue canadienne de statistique》2005,33(4):471-488

The authors propose nonparametric tests for the hypothesis of no direct treatment effects, as well as for the hypothesis of no carryover effects, for balanced crossover designs in which the number of treatments equals the number of periods p, where p ≥ 3. They suppose that the design consists of n replications of balanced crossover designs, each formed by m Latin squares of order p. Their tests are permutation tests which are based on the n vectors of least squares estimators of the parameters of interest obtained from the n replications of the experiment. They obtain both the exact and limiting distribution of the test statistics, and they show that the tests have, asymptotically, the same power as the F‐ratio test.

8.

Construction of some mixed two- and four-level regular designs with GMC criterion

Tian-Fang Zhang Jian-Feng Yang Run-Chu Zhang 《Communications in Statistics - Theory and Methods》2017,46(17):8497-8509

The general minimum lower-order confounding (GMC) criterion selects optimal designs on the basis of the aliased effect-number pattern (AENP). The AENP and the GMC criterion have been developed to form GMC theory. Zhang et al. (2015; Zhang, T.F., Yang, J.F., Li, Z.M., Zhang, R.C., Construction of regular 2ⁿ4¹ designs with general minimum lower-order confounding, Commun. Stat. - Theory Methods 46:2724–2735) introduced the GMC 2ⁿ4^m criterion for choosing optimal designs and constructed all GMC 2ⁿ4¹ designs with N/4 + 1 ≤ n + 2 ≤ 5N/16. In this article, we analyze the properties of 2ⁿ4¹ designs and construct GMC 2ⁿ4¹ designs with 5N/16 + 1 ≤ n + 2 < N − 1, where n and N are, respectively, the numbers of two-level factors and runs. Further, GMC 2ⁿ4¹ designs with 16 and 32 runs are tabulated.

9.

Cramér-type moderate deviations for intermediate trimmed means

Nadezhda Gribkova 《Communications in Statistics - Theory and Methods》2017,46(23):11918-11932

10.

Polynomial Histograms for Multivariate Density and Mode Estimation

JUNMEI JING INGE KOCH KANTA NAITO 《Scandinavian Journal of Statistics》2012,39(1):75-96

Abstract. We consider the problem of efficiently estimating multivariate densities and their modes for moderate dimensions and an abundance of data. We propose polynomial histograms to solve this estimation problem. We present first‐ and second‐order polynomial histogram estimators for a general d‐dimensional setting. Our theoretical results include pointwise bias and variance of these estimators, their asymptotic mean integrated square error (AMISE), and optimal binwidth. The asymptotic performance of the first‐order estimator matches that of the kernel density estimator, while the second order has the faster rate of O(n^{−6/(d+6)}). For a bivariate normal setting, we present explicit expressions for the AMISE constants which show the much larger binwidths of the second order estimator and hence also more efficient computations of multivariate densities. We apply polynomial histogram estimators to real data from biotechnology and find the number and location of modes in such data.

11.

Sampling from a Discrete Distribution While Preserving Monotonicity

George S. Fishman Louis R. Moore III 《The American statistician》2013,67(3):219-223

This article describes a cutpoint sampling method for efficiently sampling from an n-point discrete distribution that preserves the monotone relationship between a uniform deviate and the random variate it generates. This property is useful for developing a sampling plan to reduce variance in a Monte Carlo or simulation study. The expected number of comparisons with this method is derived and shown to be bounded above by (m + n − 1)/n, where m denotes the number of cut-points. The alias sampling method, which is regarded as the most efficient table sampling technique, generally lacks the monotone property and requires 2n storage locations, whereas the proposed cutpoint sampling method requires m + n storage locations. The article describes two modifications for cases in which n is large and possibly infinite. It is shown that circumstances arise in which the cutpoint method requires fewer comparisons on average than the alias method does for exactly the same space requirement. The article also describes an algorithm to implement the proposed method.
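A cutpoint table of the general kind described above can be sketched as follows. This is a generic textbook-style version written for illustration, not the authors' algorithm; the function names `build_cutpoints` and `cutpoint_sample` are assumptions, and outcomes are indexed from 0. The table stores n cumulative probabilities and m cutpoints, which matches the m + n storage locations mentioned in the abstract.

```python
from itertools import accumulate


def build_cutpoints(probs, m):
    """Return (cdf, cut) where cut[j] is the smallest index i with cdf[i] > j/m."""
    cdf = list(accumulate(probs))
    cdf[-1] = 1.0  # guard against floating-point round-off in the last entry
    cut, i = [], 0
    for j in range(m):
        while cdf[i] <= j / m:
            i += 1
        cut.append(i)
    return cdf, cut


def cutpoint_sample(cdf, cut, u):
    """Map a uniform deviate u in [0, 1) to an outcome index by inversion.

    The search starts at the cutpoint for the subinterval containing u and
    scans forward, so the result is nondecreasing in u.
    """
    m = len(cut)
    i = cut[min(int(m * u), m - 1)]
    while u > cdf[i]:
        i += 1
    return i


# Hypothetical usage with a 4-point distribution and m = 4 cutpoints.
cdf, cut = build_cutpoints([0.1, 0.2, 0.3, 0.4], m=4)
print([cutpoint_sample(cdf, cut, u) for u in (0.05, 0.2, 0.55, 0.95)])
```

Because the outcome is a nondecreasing function of u, feeding the sampler common or antithetic uniform deviates preserves the induced correlation, which is the variance-reduction use the abstract refers to.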

12.

Improved Ordering Results for Fail-Safe Systems with Exponential Components

N. Balakrishnan Abedin Haidari Ghobad Barmalzan 《Communications in Statistics - Theory and Methods》2013,42(10):2010-2023

Let X_{2:n} and Y_{2:m} be the second order statistics from n independent exponential variables with hazards λ₁, …, λ_n, and from an independent exponential sample of size m with common hazard λ, respectively. When m ? n, we obtain necessary and sufficient conditions for comparing X_{2:n} and Y_{2:m} in mean residual life, dispersive, hazard rate, and likelihood ratio orderings based on some inequalities between the λ_i’s and λ. The established results show how one can compare an (n − 1)-out-of-n system consisting of heterogeneous components with exponential lifetimes with any (m − 1)-out-of-m system consisting of homogeneous components with exponential lifetimes.

13.

A series of single array 2^m factorial search designs for even m

Hooshang Talebi Elham Jalali 《Australian & New Zealand Journal of Statistics》2014,56(4):395-405

By means of a search design one is able to search for and estimate a small set of non‐zero elements from the set of higher order factorial interactions in addition to estimating the lower order factorial effects. One may be interested in estimating the general mean and main effects, in addition to searching for and estimating a non‐negligible effect in the set of 2‐ and 3‐factor interactions, assuming 4‐ and higher‐order interactions are all zero. Such a search design is called a ‘main effect plus one plan’ and is denoted by MEP.1. Construction of such a plan, for 2^m factorial experiments, has been considered and developed by several authors and leads to MEP.1 plans for an odd number m of factors. These designs are generally determined by two arrays, one specifying a main effect plan and the other specifying a follow‐up. In this paper we develop the construction of search designs for an even number of factors m, m≠6. The new series of MEP.1 plans is a set of single array designs with a well structured form. Such a structure allows for flexibility in arriving at an appropriate design with optimum properties for search and estimation.

14.

Optimal component test plans for a parallel system based on Type-II censoring

P. Vellaisamy M. Kumar 《Statistical Methodology》2008,5(5):454-461

Consider a parallel system with n independent components. Assume that the lifetime of the jth component follows an exponential distribution with a constant but unknown parameter λ_j, 1≤j≤n. We test r_j components of type-j for failure and compute the total time T_j of r_j failures for the jth component. Based on T=(T₁,T₂,…,T_n) and r=(r₁,r₂,…,r_n), we derive optimal reliability test plans which ensure the usual probability requirements on system reliability. Further, we solve the associated nonlinear integer programming problem by a simple enumeration of integers over the feasible range. An algorithm is developed to obtain integer solutions with minimum cost. Finally, some examples are discussed for various levels of producer’s and consumer’s risk to illustrate the approach. Our optimal plans lead to considerable savings in costs over the available plans in the literature.

15.

Improving the detection of unusual observations in high‐dimensional settings

Insha Ullah Matthew D.M. Pawley Adam N.H. Smith Beatrix Jones 《Australian & New Zealand Journal of Statistics》2017,59(4):449-462

Multivariate control charts are used to monitor stochastic processes for changes and unusual observations. Hotelling's T² statistic is calculated for each new observation and an out‐of‐control signal is issued if it goes beyond the control limits. However, this classical approach becomes unreliable as the number of variables p approaches the number of observations n, and impossible when p exceeds n. In this paper, we devise an improvement to the monitoring procedure in high‐dimensional settings. We regularise the covariance matrix to estimate the baseline parameter and incorporate a leave‐one‐out re‐sampling approach to estimate the empirical distribution of future observations. An extensive simulation study demonstrates that the new method outperforms the classical Hotelling T² approach in power, and maintains appropriate false positive rates. We demonstrate the utility of the method using a set of quality control samples collected to monitor a gas chromatography–mass spectrometry apparatus over a period of 67 days.
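For reference, the classical statistic mentioned in this abstract can be written out as below. The notation is an assumption made for illustration: x is a new observation, and x̄ and S are the sample mean and covariance of n baseline observations on p variables. The abstract does not say which regularisation the authors use, so the ridge-type form in the last comment is only one common possibility and not necessarily theirs.

```latex
T^2 = (x - \bar{x})^{\top} S^{-1} (x - \bar{x}),
\qquad
S = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})(x_i - \bar{x})^{\top}
% rank(S) <= n - 1, so S is singular and S^{-1} does not exist once p >= n.
% Illustrative ridge-type alternative (assumption), with \lambda > 0:
%   T^2_{\lambda} = (x - \bar{x})^{\top} (S + \lambda I_p)^{-1} (x - \bar{x})
```

The rank bound explains why the classical chart breaks down in the high-dimensional regime described above: the inverse becomes unstable as p approaches n and ceases to exist beyond it.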

16.

Using orthogonal array for constructing three-level search designs

Nabaz Esmailzadeh Zahra Zandi 《Communications in Statistics - Simulation and Computation》2017,46(3):1906-1917

We consider the problem of constructing search designs for 3^m factorial designs. By using projection properties of some three-level orthogonal arrays, some search designs are obtained for 3 ≤ m ≤ 11. The newly obtained orthogonal search designs are capable of searching for and identifying up to four two-factor interactions and estimating them along with the general mean and main effects. The resulting designs have very high searching probabilities; that is, besides the well-known orthogonal structure, they have a high ability to find the true effects.

17.

Central limit theorems for functionals of large sample covariance matrix and mean vector in matrix‐variate location mixture of normal distributions

Taras Bodnar Stepan Mazur Nestor Parolya 《Scandinavian Journal of Statistics》2019,46(2):636-660

In this paper, we consider the asymptotic distributions of functionals of the sample covariance matrix and the sample mean vector obtained under the assumption that the matrix of observations has a matrix‐variate location mixture of normal distributions. The central limit theorem is derived for the product of the sample covariance matrix and the sample mean vector. Moreover, we consider the product of the inverse sample covariance matrix and the mean vector for which the central limit theorem is established as well. All results are obtained under the large‐dimensional asymptotic regime, where the dimension p and the sample size n approach infinity such that p/n→c ∈ [0, +∞) when the sample covariance matrix does not need to be invertible and p/n→c ∈ [0,1) otherwise.

18.

A review of statistical methods in imaging genetics

Farouk S. Nathoo Linglong Kong Hongtu Zhu 《Revue canadienne de statistique》2019,47(1):108-131

With the rapid growth of modern technology, many biomedical studies are being conducted to collect massive datasets with volumes of multi‐modality imaging, genetic, neurocognitive and clinical information from increasingly large cohorts. Simultaneously extracting and integrating rich and diverse heterogeneous information in neuroimaging and/or genomics from these big datasets could transform our understanding of how genetic variants impact brain structure and function, cognitive function and brain‐related disease risk across the lifespan. Such understanding is critical for diagnosis, prevention and treatment of numerous complex brain‐related disorders (e.g., schizophrenia and Alzheimer's disease). However, the development of analytical methods for the joint analysis of both high‐dimensional imaging phenotypes and high‐dimensional genetic data, a big data squared (BD²) problem, presents major computational and theoretical challenges for existing analytical methods. Besides the high‐dimensional nature of BD², various neuroimaging measures often exhibit strong spatial smoothness and dependence and genetic markers may have a natural dependence structure arising from linkage disequilibrium. We review some recent developments of various statistical techniques for imaging genetics, including massive univariate and voxel‐wise approaches, reduced rank regression, mixture models and group sparse multi‐task regression. By doing so, we hope that this review may encourage others in the statistical community to enter into this new and exciting field of research. The Canadian Journal of Statistics 47: 108–131; 2019 © 2019 Statistical Society of Canada

19.

Randomly weighted sums and their maxima with heavy-tailed increments and dependence structure

Shijie Wang Yiyu Hu Jijiao He Xuejun Wang 《Communications in Statistics - Theory and Methods》2017,46(21):10851-10863

Consider the randomly weighted sums S_m(θ) = ∑_{i=1}^{m} θ_iX_i, 1 ≤ m ≤ n, and their maxima M_n(θ) = max_{1 ≤ m ≤ n} S_m(θ), where X_i, 1 ≤ i ≤ n, are real-valued and dependent according to a wide type of dependence structure, and θ_i, 1 ≤ i ≤ n, are nonnegative and arbitrarily dependent, but independent of X_i, 1 ≤ i ≤ n. Under some mild conditions on the right tails of the weights θ_i, 1 ≤ i ≤ n, we establish some asymptotic equivalence formulas for the tail probabilities of S_n(θ) and M_n(θ) in the cases where X_i, 1 ≤ i ≤ n, have dominatedly varying, long-tailed and subexponential distributions, respectively.

20.

Estimators of shift based on statistics of the Kolmogorov-Smirnov type

Alain Boulanger 《Revue canadienne de statistique》1983,11(4):271-284

This paper is concerned with the estimation of a shift parameter δ₀, based on some nonnegative functional H₁ of the pair (D^δ_N(x), F^δ_N(x)), where D^δ_N(x) = K_N {F_{2,n}(x) − F_{1,m}(x + δ)}, F^δ_N(x) = {mF_{1,m}(x + δ) + nF_{2,n}(x)}/N, where F_{1,m} and F_{2,n} are the empirical distribution functions of two independent random samples (N = m + n), and where K²_N = mn/N. First an estimator δ_N is defined as a value of δ minimizing a functional H of the type of H₁. A second estimator δ¹_N is also defined which is a linearized version of the first. Finite and asymptotic properties of these estimators are considered. It is also shown that most well-known test statistics of the Kolmogorov-Smirnov type are particular cases of such functionals H₁. The asymptotic distribution and the asymptotic efficiency of some estimators are given.
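The idea of estimating a shift by minimizing a Kolmogorov-Smirnov type functional can be sketched as follows. This is an illustration under stated assumptions, not the paper's estimator: the functional is taken to be the supremum of the absolute scaled difference between the two empirical distribution functions, the search is restricted to the pairwise differences between the samples (a simplification), and the names `edf`, `ks_distance` and `shift_estimate` are hypothetical.

```python
from bisect import bisect_right
from math import sqrt


def edf(sorted_sample, x):
    """Empirical distribution function of a sorted sample, evaluated at x."""
    return bisect_right(sorted_sample, x) / len(sorted_sample)


def ks_distance(x_sample, y_sample, delta):
    """sup over t of K_N * |F_2n(t) - F_1m(t + delta)|, with K_N^2 = m*n/N."""
    m, n = len(x_sample), len(y_sample)
    # The EDF of the values x - delta, evaluated at t, equals F_1m(t + delta).
    shifted = sorted(v - delta for v in x_sample)
    ys = sorted(y_sample)
    k_n = sqrt(m * n / (m + n))
    # Both EDFs are right-continuous step functions, so the supremum is
    # attained at one of the jump points of the pooled sample.
    return k_n * max(abs(edf(ys, t) - edf(shifted, t)) for t in shifted + ys)


def shift_estimate(x_sample, y_sample):
    """Return the candidate shift (a pairwise difference x - y) minimizing ks_distance."""
    candidates = sorted({x - y for x in x_sample for y in y_sample})
    return min(candidates, key=lambda d: ks_distance(x_sample, y_sample, d))


# Hypothetical usage: the first sample is the second shifted by exactly 2.
y = [0.5, 1.25, 3.0, 4.75, 6.5]
x = [v + 2.0 for v in y]
print(shift_estimate(x, y))
```

When several candidates tie for the minimum, this sketch simply returns the smallest one; a more careful version would report the whole minimizing set or its midpoint.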