期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A discrepancy bound for deterministic acceptance-rejection samplers beyond $$N^{-1/2}$$ in dimension 1

Houying Zhu Josef Dick 《Statistics and Computing》2017,27(4):901-911

In this paper we consider an acceptance-rejection (AR) sampler based on deterministic driver sequences. We prove that the discrepancy of an N element sample set generated in this way is bounded by $\mathcal {O} (N^{-2/3}\log N)$, provided that the target density is twice continuously differentiable with non-vanishing curvature and the AR sampler uses the driver sequence $\mathcal {K}_M= \{( j \alpha , j \beta ) ~~ mod~~1 \mid j = 1,\ldots ,M\},$ where $\alpha ,\beta $ are real algebraic numbers such that $1,\alpha ,\beta $ is a basis of a number field over $\mathbb {Q}$ of degree 3. For the driver sequence $\mathcal {F}_k= \{ ({j}/{F_k}, \{{jF_{k-1}}/{F_k}\} ) \mid j=1,\ldots , F_k\},$ where $F_k$ is the k-th Fibonacci number and $\{x\}=x-\lfloor x \rfloor $ is the fractional part of a non-negative real number x, we can remove the $\log $ factor to improve the convergence rate to $\mathcal {O}(N^{-2/3})$, where again N is the number of samples we accepted. We also introduce a criterion for measuring the goodness of driver sequences. The proposed approach is numerically tested by calculating the star-discrepancy of samples generated for some target densities using $\mathcal {K}_M$ and $\mathcal {F}_k$ as driver sequences. These results confirm that achieving a convergence rate beyond $N^{-1/2}$ is possible in practice using $\mathcal {K}_M$ and $\mathcal {F}_k$ as driver sequences in the acceptance-rejection sampler. 相似文献

2.

Automated selection of <Emphasis Type="Italic">r</Emphasis> for the <Emphasis Type="Italic">r</Emphasis> largest order statistics approach with adjustment for sequential testing

Brian Bader Jun Yan Xuebin Zhang 《Statistics and Computing》2017,27(6):1435-1451

The r largest order statistics approach is widely used in extreme value analysis because it may use more information from the data than just the block maxima. In practice, the choice of r is critical. If r is too large, bias can occur; if too small, the variance of the estimator can be high. The limiting distribution of the r largest order statistics, denoted by GEV$_r$, extends that of the block maxima. Two specification tests are proposed to select r sequentially. The first is a score test for the GEV$_r$ distribution. Due to the special characteristics of the GEV$_r$ distribution, the classical chi-square asymptotics cannot be used. The simplest approach is to use the parametric bootstrap, which is straightforward to implement but computationally expensive. An alternative fast weighted bootstrap or multiplier procedure is developed for computational efficiency. The second test uses the difference in estimated entropy between the GEV$_r$ and GEV$_{r-1}$ models, applied to the r largest order statistics and the $r-1$ largest order statistics, respectively. The asymptotic distribution of the difference statistic is derived. In a large scale simulation study, both tests held their size and had substantial power to detect various misspecification schemes. A new approach to address the issue of multiple, sequential hypotheses testing is adapted to this setting to control the false discovery rate or familywise error rate. The utility of the procedures is demonstrated with extreme sea level and precipitation data. 相似文献

3.

Rate of uniform consistency for a class of mode regression on functional stationary ergodic data

Mohamed Chaouch Naâmane Laïb Djamal Louani 《Statistical Methods and Applications》2017,26(1):19-47

The aim of this paper is to study the asymptotic properties of a class of kernel conditional mode estimates whenever functional stationary ergodic data are considered. To be more precise on the matter, in the ergodic data setting, we consider a random elements (X, Z) taking values in some semi-metric abstract space $E\times F$. For a real function $\varphi $ defined on the space F and $x\in E$, we consider the conditional mode of the real random variable $\varphi (Z)$ given the event “$X=x$”. While estimating the conditional mode function, say $\theta _\varphi (x)$, using the well-known kernel estimator, we establish the strong consistency with rate of this estimate uniformly over Vapnik–Chervonenkis classes of functions $\varphi $. Notice that the ergodic setting offers a more general framework than the usual mixing structure. Two applications to energy data are provided to illustrate some examples of the proposed approach in time series forecasting framework. The first one consists in forecasting the daily peak of electricity demand in France (measured in Giga-Watt). Whereas the second one deals with the short-term forecasting of the electrical energy (measured in Giga-Watt per Hour) that may be consumed over some time intervals that cover the peak demand. 相似文献

4.

Random projections for Bayesian regression

Leo N. Geppert Katja Ickstadt Alexander Munteanu Jens Quedenfeld Christian Sohler 《Statistics and Computing》2017,27(1):79-101

This article deals with random projections applied as a data reduction technique for Bayesian regression analysis. We show sufficient conditions under which the entire d-dimensional distribution is approximately preserved under random projections by reducing the number of data points from n to $k\in O({\text {poly}}(d/\varepsilon ))$ in the case $n\gg d$. Under mild assumptions, we prove that evaluating a Gaussian likelihood function based on the projected data instead of the original data yields a $(1+O(\varepsilon ))$-approximation in terms of the $\ell _2$ Wasserstein distance. Our main result shows that the posterior distribution of Bayesian linear regression is approximated up to a small error depending on only an $\varepsilon $-fraction of its defining parameters. This holds when using arbitrary Gaussian priors or the degenerate case of uniform distributions over $\mathbb {R}^d$ for $\beta $. Our empirical evaluations involve different simulated settings of Bayesian linear regression. Our experiments underline that the proposed method is able to recover the regression model up to small error while considerably reducing the total running time. 相似文献

5.

Point process-based Monte Carlo estimation

Clément Walter 《Statistics and Computing》2017,27(1):219-236

This paper addresses the issue of estimating the expectation of a real-valued random variable of the form $X = g(\mathbf {U})$ where g is a deterministic function and $\mathbf {U}$ can be a random finite- or infinite-dimensional vector. Using recent results on rare event simulation, we propose a unified framework for dealing with both probability and mean estimation for such random variables, i.e. linking algorithms such as Tootsie Pop Algorithm or Last Particle Algorithm with nested sampling. Especially, it extends nested sampling as follows: first the random variable X does not need to be bounded any more: it gives the principle of an ideal estimator with an infinite number of terms that is unbiased and always better than a classical Monte Carlo estimator—in particular it has a finite variance as soon as there exists $k \in \mathbb {R}> 1$ such that ${\text {E}}\left[ X^k \right] < \infty $. Moreover we address the issue of nested sampling termination and show that a random truncation of the sum can preserve unbiasedness while increasing the variance only by a factor up to 2 compared to the ideal case. We also build an unbiased estimator with fixed computational budget which supports a Central Limit Theorem and discuss parallel implementation of nested sampling, which can dramatically reduce its running time. Finally we extensively study the case where X is heavy-tailed. 相似文献

6.

On the $$L_p$$ norms of kernel regression estimators for incomplete data with applications to classification

Timothy Reese Majid Mojirsheibani 《Statistical Methods and Applications》2017,26(1):81-112

We consider kernel methods to construct nonparametric estimators of a regression function based on incomplete data. To tackle the presence of incomplete covariates, we employ Horvitz–Thompson-type inverse weighting techniques, where the weights are the selection probabilities. The unknown selection probabilities are themselves estimated using (1) kernel regression, when the functional form of these probabilities are completely unknown, and (2) the least-squares method, when the selection probabilities belong to a known class of candidate functions. To assess the overall performance of the proposed estimators, we establish exponential upper bounds on the $L_p$ norms, $1\le p<\infty $, of our estimators; these bounds immediately yield various strong convergence results. We also apply our results to deal with the important problem of statistical classification with partially observed covariates. 相似文献

7.

Hypotheses testing about the drift parameter in linear stochastic differential equation driven by stable processes

David Stibůrek 《Statistical Methods and Applications》2016,25(3):433-452

In this paper, we consider the problem of hypotheses testing about the drift parameter $\theta $ in the process $\text {d}Y^{\delta }_{t} = \theta \dot{f}(t)Y^{\delta }_{t}\text {d}t + b(t)\text {d}L^{\delta }_{t}$ driven by symmetric $\delta $-stable Lévy process $L^{\delta }_{t}$ with $\dot{f}(t)$ being the derivative of a known increasing function f(t) and b(t) being known as well. We consider the hypotheses testing $H_{0}: \theta \le 0$ and $K_{0}: \theta =0$ against the alternatives $H_{1}: \theta >0$ and $K_{1}: \theta \ne 0$, respectively. For these hypotheses, we propose inverse methods, which are motivated by sequential approach, based on the first hitting time of the observed process (or its absolute value) to a pre-specified boundary or two boundaries until some given time. The applicability of these methods is illustrated. For the case $Y^{\delta }_{0}=0$, we are able to calculate the values of boundaries and finite observed times more directly. We are able to show the consistencies of proposed tests for $Y^{\delta }_{0}\ge 0$ with $\delta \in (1,2]$ and for $Y^{\delta }_{0}=0$ with $\delta \in (0,2]$ under quite mild conditions. 相似文献

8.

$$D_s$$-optimality in copula models

Elisa Perrone Andreas Rappold Werner G. Müller 《Statistical Methods and Applications》2017,26(3):403-418

Optimum experimental design theory has recently been extended for parameter estimation in copula models. The use of these models allows one to gain in flexibility by considering the model parameter set split into marginal and dependence parameters. However, this separation also leads to the natural issue of estimating only a subset of all model parameters. In this work, we treat this problem with the application of the $D_s$-optimality to copula models. First, we provide an extension of the corresponding equivalence theory. Then, we analyze a wide range of flexible copula models to highlight the usefulness of $D_s$-optimality in many possible scenarios. Finally, we discuss how the usage of the introduced design criterion also relates to the more general issue of copula selection and optimal design for model discrimination. 相似文献

9.

Investigation of the widely applicable Bayesian information criterion

N. Friel J. P. McKeone C. J. Oates A. N. Pettitt 《Statistics and Computing》2017,27(3):833-844

The widely applicable Bayesian information criterion (WBIC) is a simple and fast approximation to the model evidence that has received little practical consideration. WBIC uses the fact that the log evidence can be written as an expectation, with respect to a powered posterior proportional to the likelihood raised to a power $t^*\in {(0,1)}$, of the log deviance. Finding this temperature value $t^*$ is generally an intractable problem. We find that for a particular tractable statistical model that the mean squared error of an optimally-tuned version of WBIC with correct temperature $t^*$ is lower than an optimally-tuned version of thermodynamic integration (power posteriors). However in practice WBIC uses the a canonical choice of $t=1/\log (n)$. Here we investigate the performance of WBIC in practice, for a range of statistical models, both regular models and singular models such as latent variable models or those with a hierarchical structure for which BIC cannot provide an adequate solution. Our findings are that, generally WBIC performs adequately when one uses informative priors, but it can systematically overestimate the evidence, particularly for small sample sizes. 相似文献

10.

Optimal regular graph designs

Sera Aylin Cakiroglu 《Statistics and Computing》2018,28(1):103-112

A typical problem in optimal design theory is finding an experimental design that is optimal with respect to some criteria in a class of designs. The most popular criteria include the A- and D-criteria. Regular graph designs occur in many optimality results, and if the number of blocks is large enough, an A-optimal (or D-optimal) design is among them (if any exist). To explore the landscape of designs with a large number of blocks, we introduce extensions of regular graph designs. These are constructed by adding the blocks of a balanced incomplete block design repeatedly to the original design. We present the results of an exact computer search for the best regular graph designs and the best extended regular graph designs with up to 20 treatments v, block size $k \le 10$ and replication r $\le 10$ and $r(k-1)-(v-1)\lfloor r(k-1)/(v-1)\rfloor \le 9$. 相似文献

11.

Estimates for cell counts and common odds ratio in three-way contingency tables by homogeneous log-linear models with missing data

Haresh D. Rochani Robert L. Vogel Hani M. Samawi Daniel F. Linder 《AStA Advances in Statistical Analysis》2017,101(1):51-65

Missing observations often occur in cross-classified data collected during observational, clinical, and public health studies. Inappropriate treatment of missing data can reduce statistical power and give biased results. This work extends the Baker, Rosenberger and Dersimonian modeling approach to compute maximum likelihood estimates for cell counts in three-way tables with missing data, and studies the association between two dichotomous variables while controlling for a third variable in $ 2\times 2 \times K $ tables. This approach is applied to the Behavioral Risk Factor Surveillance System data. Simulation studies are used to investigate the efficiency of estimation of the common odds ratio. 相似文献

12.

Semi-parametric bivariate polychotomous ordinal regression

Francesco Donat Giampiero Marra 《Statistics and Computing》2017,27(1):283-299

A pair of polychotomous random variables $(Y_1,Y_2)^\top =:{\varvec{Y}}$, where each $Y_j$ has a totally ordered support, is studied within a penalized generalized linear model framework. We deal with a triangular generating process for ${\varvec{Y}}$, a structure that has been employed in the literature to control for the presence of residual confounding. Differently from previous works, however, the proposed model allows for a semi-parametric estimation of the covariate-response relationships. In this way, the risk of model mis-specification stemming from the imposition of fixed-order polynomial functional forms is also reduced. The proposed estimation methods and related inferential results are finally applied to study the effect of education on alcohol consumption among young adults in the UK. 相似文献

13.

Objective Bayesian transformation and variable selection using default Bayes factors

E. Charitidou D. Fouskakis I. Ntzoufras 《Statistics and Computing》2018,28(3):579-594

In this work, the problem of transformation and simultaneous variable selection is thoroughly treated via objective Bayesian approaches by the use of default Bayes factor variants. Four uniparametric families of transformations (Box–Cox, Modulus, Yeo-Johnson and Dual), denoted by T, are evaluated and compared. The subjective prior elicitation for the transformation parameter $\lambda _T$, for each T, is not a straightforward task. Additionally, little prior information for $\lambda _T$ is expected to be available, and therefore, an objective method is required. The intrinsic Bayes factors and the fractional Bayes factors allow us to incorporate default improper priors for $\lambda _T$. We study the behaviour of each approach using a simulated reference example as well as two real-life examples. 相似文献

14.

Conditional density estimation using the local Gaussian correlation

Håkon Otneim Dag Tjøstheim 《Statistics and Computing》2018,28(2):303-321

Let $\mathbf {X} = (X_1,\ldots ,X_p)$ be a stochastic vector having joint density function $f_{\mathbf {X}}(\mathbf {x})$ with partitions $\mathbf {X}_1 = (X_1,\ldots ,X_k)$ and $\mathbf {X}_2 = (X_{k+1},\ldots ,X_p)$. A new method for estimating the conditional density function of $\mathbf {X}_1$ given $\mathbf {X}_2$ is presented. It is based on locally Gaussian approximations, but simplified in order to tackle the curse of dimensionality in multivariate applications, where both response and explanatory variables can be vectors. We compare our method to some available competitors, and the error of approximation is shown to be small in a series of examples using real and simulated data, and the estimator is shown to be particularly robust against noise caused by independent variables. We also present examples of practical applications of our conditional density estimator in the analysis of time series. Typical values for k in our examples are 1 and 2, and we include simulation experiments with values of p up to 6. Large sample theory is established under a strong mixing condition. 相似文献

15.

Wavelet regression estimations with strong mixing data

Junke Kou Youming Liu 《Statistical Methods and Applications》2018,27(4):667-688

Using a wavelet basis, we establish in this paper upper bounds of wavelet estimation on $ L^{p}({\mathbb {R}}^{d}) $ risk of regression functions with strong mixing data for $ 1\le p<\infty $. In contrast to the independent case, these upper bounds have different analytic formulae for $p\in [1, 2]$ and $p\in (2, +\infty )$. For $p=2$, it turns out that our result reduces to a theorem of Chaubey et al. (J Nonparametr Stat 25:53–71, 2013); and for $d=1$ and $p=2$, it becomes the corresponding theorem of Chaubey and Shirazi (Commun Stat Theory Methods 44:885–899, 2015). 相似文献

16.

Stochastically optimal bootstrap sample size for shrinkage-type statistics

Bei Wei Stephen M. S. Lee Xiyuan Wu 《Statistics and Computing》2016,26(1-2):249-262

In nonregular problems where the conventional $n$ out of $n$ bootstrap is inconsistent, the $m$ out of $n$ bootstrap provides a useful remedy to restore consistency. Conventionally, optimal choice of the bootstrap sample size $m$ is taken to be the minimiser of a frequentist error measure, estimation of which has posed a major difficulty hindering practical application of the $m$ out of $n$ bootstrap method. Relatively little attention has been paid to a stronger, stochastic, version of the optimal bootstrap sample size, defined as the minimiser of an error measure calculated directly from the observed sample. Motivated by this stronger notion of optimality, we develop procedures for calculating the stochastically optimal value of $m$. Our procedures are shown to work under special forms of Edgeworth-type expansions which are typically satisfied by statistics of the shrinkage type. Theoretical and empirical properties of our methods are illustrated with three examples, namely the James–Stein estimator, the ridge regression estimator and the post-model-selection regression estimator. 相似文献

17.

Model-free feature screening for ultrahigh dimensional censored regression

Tingyou Zhou Liping Zhu 《Statistics and Computing》2017,27(4):947-961

In this paper we design a sure independent ranking and screening procedure for censored regression (cSIRS, for short) with ultrahigh dimensional covariates. The inverse probability weighted cSIRS procedure is model-free in the sense that it does not specify a parametric or semiparametric regression function between the response variable and the covariates. Thus, it is robust to model mis-specification. This model-free property is very appealing in ultrahigh dimensional data analysis, particularly when there is lack of information for the underlying regression structure. The cSIRS procedure is also robust in the presence of outliers or extreme values as it merely uses the rank of the censored response variable. We establish both the sure screening and the ranking consistency properties for the cSIRS procedure when the number of covariates p satisfies $p=o\{\exp (an)\}$, where a is a positive constant and n is the available sample size. The advantages of cSIRS over existing competitors are demonstrated through comprehensive simulations and an application to the diffuse large-B-cell lymphoma data set. 相似文献

18.

A comparison of efficient approximations for a weighted sum of chi-squared random variables

Dean A. Bodenham Niall M. Adams 《Statistics and Computing》2016,26(4):917-928

In many applications, the cumulative distribution function (cdf) $F_{Q_N}$ of a positively weighted sum of N i.i.d. chi-squared random variables $Q_N$ is required. Although there is no known closed-form solution for $F_{Q_N}$, there are many good approximations. When computational efficiency is not an issue, Imhof’s method provides a good solution. However, when both the accuracy of the approximation and the speed of its computation are a concern, there is no clear preferred choice. Previous comparisons between approximate methods could be considered insufficient. Furthermore, in streaming data applications where the computation needs to be both sequential and efficient, only a few of the available methods may be suitable. Streaming data problems are becoming ubiquitous and provide the motivation for this paper. We develop a framework to enable a much more extensive comparison between approximate methods for computing the cdf of weighted sums of an arbitrary random variable. Utilising this framework, a new and comprehensive analysis of four efficient approximate methods for computing $F_{Q_N}$ is performed. This analysis procedure is much more thorough and statistically valid than previous approaches described in the literature. A surprising result of this analysis is that the accuracy of these approximate methods increases with N. 相似文献

19.

Functional principal component analysis of spatially correlated data

Chong Liu Surajit Ray Giles Hooker 《Statistics and Computing》2017,27(6):1639-1654

This paper focuses on the analysis of spatially correlated functional data. We propose a parametric model for spatial correlation and the between-curve correlation is modeled by correlating functional principal component scores of the functional data. Additionally, in the sparse observation framework, we propose a novel approach of spatial principal analysis by conditional expectation to explicitly estimate spatial correlations and reconstruct individual curves. Assuming spatial stationarity, empirical spatial correlations are calculated as the ratio of eigenvalues of the smoothed covariance surface Cov$(X_i(s),X_i(t))$ and cross-covariance surface Cov$(X_i(s), X_j(t))$ at locations indexed by i and j. Then a anisotropy Matérn spatial correlation model is fitted to empirical correlations. Finally, principal component scores are estimated to reconstruct the sparsely observed curves. This framework can naturally accommodate arbitrary covariance structures, but there is an enormous reduction in computation if one can assume the separability of temporal and spatial components. We demonstrate the consistency of our estimates and propose hypothesis tests to examine the separability as well as the isotropy effect of spatial correlation. Using simulation studies, we show that these methods have some clear advantages over existing methods of curve reconstruction and estimation of model parameters. 相似文献

20.

A coordinate descent algorithm for computing penalized smooth quantile regression

Abdallah Mkhadri Mohamed Ouhourane Karim Oualkacha 《Statistics and Computing》2017,27(4):865-883

The computation of penalized quantile regression estimates is often computationally intensive in high dimensions. In this paper we propose a coordinate descent algorithm for computing the penalized smooth quantile regression (cdaSQR) with convex and nonconvex penalties. The cdaSQR approach is based on the approximation of the objective check function, which is not differentiable at zero, by a modified check function which is differentiable at zero. Then, using the maximization-minimization trick of the gcdnet algorithm (Yang and Zou in, J Comput Graph Stat 22(2):396–415, 2013), we update each coefficient simply and efficiently. In our implementation, we consider the convex penalties $\ell _1+\ell _2$ and the nonconvex penalties SCAD (or MCP) $+ \ell _2$. We establishe the convergence property of the csdSQR with $\ell _1+\ell _2$ penalty. The numerical results show that our implementation is an order of magnitude faster than its competitors. Using simulations we compare the speed of our algorithm to its competitors. Finally, the performance of our algorithm is illustrated on three real data sets from diabetes, leukemia and Bardet–Bidel syndrome gene expression studies. 相似文献