Similar Documents
20 matching records found (search time: 468 ms)
1.
We consider a process that is observed as a mixture of two random distributions, where the mixing probability is an unknown function of time. The setup is built upon a wavelet‐based mixture regression. Two linear wavelet estimators are proposed. Furthermore, we consider three regularizing procedures for each of the two wavelet methods. We also discuss regularity conditions under which the consistency of the wavelet methods is attained and derive rates of convergence for the proposed estimators. A Monte Carlo simulation study is conducted to illustrate the performance of the estimators. Various scenarios for the mixing probability function are used in the simulations, in addition to a range of sample sizes and resolution levels. We apply the proposed methods to a data set consisting of array Comparative Genomic Hybridization data from glioblastoma cancer studies.

2.
The purpose of this paper is to investigate permutation tests for the equality of correlation coefficients between two independent populations. We discuss how to apply the permutation test to this problem and its asymptotic suitability. We also present simulation studies and an example using the Iris data.
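A minimal sketch of such a permutation test: since the two samples are independent, intact (x, y) pairs can be shuffled between groups under the null of equal correlations. The function name and details below are ours, not from the paper.

```python
import numpy as np

def perm_test_corr(x1, y1, x2, y2, n_perm=500, rng=None):
    """Two-sided permutation p-value for H0: rho1 == rho2, obtained by
    permuting intact (x, y) pairs across the two independent groups."""
    rng = np.random.default_rng(rng)
    pairs = np.vstack([np.column_stack([x1, y1]), np.column_stack([x2, y2])])
    n1 = len(x1)

    def stat(idx1, idx2):
        r1 = np.corrcoef(pairs[idx1, 0], pairs[idx1, 1])[0, 1]
        r2 = np.corrcoef(pairs[idx2, 0], pairs[idx2, 1])[0, 1]
        return abs(r1 - r2)

    obs = stat(np.arange(n1), np.arange(n1, len(pairs)))
    count = 0
    for _ in range(n_perm):
        perm = rng.permutation(len(pairs))
        if stat(perm[:n1], perm[n1:]) >= obs:
            count += 1
    return (count + 1) / (n_perm + 1)
```

With strongly different correlations (e.g. r ≈ 0.95 versus r ≈ 0), the observed statistic dominates nearly all permuted values, so the p-value is near its minimum of 1/(n_perm + 1).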

3.
We develop a new methodology for determining the location and dynamics of brain activity from combined magnetoencephalography (MEG) and electroencephalography (EEG) data. The resulting inverse problem is ill‐posed and is one of the most difficult problems in neuroimaging data analysis. In our development we propose a solution that combines data from three different modalities, magnetic resonance imaging (MRI), MEG and EEG, together. We propose a new Bayesian spatial finite mixture model that builds on the mesostate‐space model developed by Daunizeau & Friston [Daunizeau and Friston, NeuroImage 2007; 38, 67–81]. Our new model incorporates two major extensions: (i) we combine EEG and MEG data together and formulate a joint model for dealing with the two modalities simultaneously; (ii) we incorporate the Potts model to represent the spatial dependence in an allocation process that partitions the cortical surface into a small number of latent states termed mesostates. The cortical surface is obtained from MRI. We formulate the new spatiotemporal model and derive an efficient procedure for simultaneous point estimation and model selection based on the iterated conditional modes algorithm combined with local polynomial smoothing. The proposed method results in a novel estimator for the number of mixture components and is able to select active brain regions, which correspond to active variables in a high‐dimensional dynamic linear model. The methodology is investigated using synthetic data and simulation studies and then demonstrated on an application examining the neural response to the perception of scrambled faces. R software implementing the methodology, along with several sample datasets, is available at the GitHub repository https://github.com/v2south/PottsMix. The Canadian Journal of Statistics 47: 688–711; 2019 © 2019 Statistical Society of Canada

4.
This article considers a generalization of the functional linear regression in which an additional real variable influences the functional coefficient smoothly. We thus define a varying-coefficient regression model for functional data. We propose two estimators based, respectively, on conditional functional principal regression and on local penalized regression splines, and prove their pointwise consistency. Using one-day-ahead prediction of ozone concentration in the city of Toulouse, we assess the ability of such nonlinear functional approaches to produce competitive estimates.

5.
We present a scalable Bayesian modelling approach for identifying brain regions that respond to a certain stimulus and use them to classify subjects. More specifically, we deal with multi‐subject electroencephalography (EEG) data with a binary response distinguishing between alcoholic and control groups. The covariates are matrix‐variate with measurements taken from each subject at different locations across multiple time points. EEG data have a complex structure with both spatial and temporal attributes. We use a divide‐and‐conquer strategy and build separate local models, that is, one model at each time point. We employ Bayesian variable selection approaches using a structured continuous spike‐and‐slab prior to identify the locations that respond to a certain stimulus. We incorporate the spatio‐temporal structure through a Kronecker product of the spatial and temporal correlation matrices. We develop a highly scalable estimation algorithm, using likelihood approximation, to deal with the large number of parameters in the model. Variable selection is done via clustering of the locations based on their duration of activation. We use scoring rules to evaluate the prediction performance. Simulation studies demonstrate the efficiency of our scalable algorithm in terms of estimation and fast computation. We present results using our scalable approach on a case study of multi‐subject EEG data.
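The Kronecker-product construction mentioned above can be sketched in a few lines. The matrices below are illustrative placeholders (a small spatial correlation matrix and an AR(1)-style temporal one), not values from the paper.

```python
import numpy as np

# Illustrative spatial correlation for 3 locations.
S = np.array([[1.0, 0.5, 0.2],
              [0.5, 1.0, 0.5],
              [0.2, 0.5, 1.0]])

# AR(1)-style temporal correlation for 4 time points.
rho = 0.7
T = rho ** np.abs(np.subtract.outer(np.arange(4), np.arange(4)))

# Full (3*4) x (3*4) spatio-temporal correlation via the Kronecker product:
# entry for (location i, time s) vs (location j, time t) is S[i, j] * T[s, t].
C = np.kron(S, T)
```

Because both factors are positive definite, so is the Kronecker product, and the full matrix never needs to be inverted directly: inverses and determinants factor over S and T, which is what makes such structures computationally attractive.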

6.
This paper discusses the specific problems of age-period-cohort (A-P-C) analysis within the general framework of interaction assessment for two-way cross-classified data with one observation per cell. The A-P-C multiple classification model containing the effects of age groups (rows), periods of observation (columns), and birth cohorts (diagonals of the two-way table) is characterized as one of a special class of models involving interaction terms assumed to have very specific forms. The so-called A-P-C identification problem, which results from the use of a particular interaction structure for detecting cohort effects, is shown to manifest itself in the form of an exact linear dependency among the columns of the design matrix. The precise relationship holding among these columns is derived, as is an explicit formula for the bias in the parameter estimates resulting from an incorrect specification of an assumed restriction on the parameters required to solve the normal equations. Current methods for modeling A-P-C data are critically reviewed, an illustrative numerical example is presented, and one potentially promising analysis strategy is discussed. However, given the large number of possible sources for error in A-P-C analyses, it is strongly recommended that the results of such analyses be interpreted with a great deal of caution.
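The exact linear dependency is easy to exhibit with linearly coded effects, since cohort = period − age by construction; the paper works with the full dummy-coded multiple classification model, where the same dependency arises among the dummy columns. A minimal illustration:

```python
import numpy as np

# A 5x5 age-by-period table on a common linear coding.
age = np.repeat(np.arange(5), 5).astype(float)
period = np.tile(np.arange(5), 5).astype(float)
cohort = period - age  # birth cohort is determined by age and period

# Design matrix with intercept and linear age, period, and cohort effects.
X = np.column_stack([np.ones(25), age, period, cohort])

# The cohort column is an exact linear combination of the others,
# so the design matrix is rank deficient: rank 3 with 4 columns.
print(np.linalg.matrix_rank(X))  # 3
```

This is why the normal equations cannot be solved without imposing a restriction on the parameters, and why the choice of restriction drives the bias formula the paper derives.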

7.
The technique of data suppression for protecting sensitive information in a two-dimensional table from exact disclosure raises the computational problems of testing a given table of censored data for security, and searching for a secure suppression pattern of minimum size for a given table. We provide a polynomial security test to solve the former problem, and prove that the latter problem is intractable in the general case, but can be solved in linear time in the special case in which only sensitive cells are to be protected.

8.
In this paper we discuss constructing confidence intervals based on asymptotic generalized pivotal quantities (AGPQs). An AGPQ associates a distribution with the corresponding parameter, and then an asymptotically correct confidence interval can be derived directly from this distribution like Bayesian or fiducial interval estimates. We provide two general procedures for constructing AGPQs. We also present several examples to show that AGPQs can yield new confidence intervals with better finite-sample behaviors than traditional methods.
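To illustrate the pivotal-quantity idea in its simplest exact form (not one of the paper's AGPQ procedures): for a normal mean, the generalized pivotal quantity R = x̄ − T·s/√n with T ~ t(n−1) has a distribution whose quantiles reproduce the classical t-interval. The function below is a hypothetical sketch of that construction.

```python
import numpy as np

def gpq_mean_ci(xbar, s, n, level=0.95, n_sim=100_000, rng=None):
    """Monte Carlo confidence interval for a normal mean via the
    generalized pivotal quantity R = xbar - T * s / sqrt(n), T ~ t_{n-1}.
    In this classical case the result coincides with the usual
    t-interval; AGPQs extend the idea to problems with no exact pivot."""
    rng = np.random.default_rng(rng)
    R = xbar - rng.standard_t(n - 1, size=n_sim) * s / np.sqrt(n)
    alpha = 1 - level
    return np.quantile(R, [alpha / 2, 1 - alpha / 2])
```

For example, with x̄ = 10, s = 2, n = 25, the simulated interval is close to 10 ± 2.064 · 2/5 ≈ (9.17, 10.83).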

9.
Testing the Equality of Covariance Operators in Functional Samples
We propose a non‐parametric test for the equality of the covariance structures in two functional samples. The test statistic has a chi‐square asymptotic distribution with a known number of degrees of freedom, which depends on the level of dimension reduction needed to represent the data. A detailed analysis of the asymptotic properties is developed. Finite-sample performance is examined by a simulation study and an application to egg‐laying curves of fruit flies.

10.
We develop a general non-parametric approach to the analysis of clustered data via random effects. Assuming only that the link function is known, the regression functions and the distributions of both cluster means and observation errors are treated non-parametrically. Our argument proceeds by viewing the observation error at the cluster mean level as though it were a measurement error in an errors-in-variables problem, and using a deconvolution argument to access the distribution of the cluster mean. A Fourier deconvolution approach could be used if the distribution of the error-in-variables were known. In practice it is unknown, of course, but it can be estimated from repeated measurements, and in this way deconvolution can be achieved in an approximate sense. This argument might be interpreted as implying that large numbers of replicates are necessary for each cluster mean distribution, but that is not so; we avoid this requirement by incorporating statistical smoothing over values of nearby explanatory variables. Empirical rules are developed for the choice of smoothing parameter. Numerical simulations, and an application to real data, demonstrate the small-sample performance of this methodology. We also develop theory establishing statistical consistency.

11.
Modified Profile Likelihood for Fixed-Effects Panel Data Models
We show how modified profile likelihood methods, developed in the statistical literature, may be effectively applied to estimate the structural parameters of econometric models for panel data, with a remarkable reduction of bias with respect to ordinary likelihood methods. Initially, the implementation of these methods is illustrated for general models for panel data including individual-specific fixed effects and then, in more detail, for the truncated linear regression model and dynamic regression models for binary data formulated under different specifications. Simulation studies show the good behavior of the inference based on the modified profile likelihood, even when compared to an ideal, although infeasible, procedure (in which the fixed effects are known) and also to alternative estimators existing in the econometric literature. The proposed estimation methods are implemented in an R package that we make available to the reader.

12.
This paper deals with the problem of local sensitivity analysis in regression, i.e., how sensitive the results of a regression model (objective function, parameters, and dual variables) are to changes in the data. We use a general formula for local sensitivities in optimization problems to calculate the sensitivities in three standard regression problems (least squares, minimax, and least absolute values). Closed formulas for all sensitivities are derived. Sensitivity contours are presented to help in assessing the sensitivity of each observation in the sample. The dual problems of the minimax and least absolute values are obtained and interpreted. The proposed sensitivity measures are shown to deal more effectively with the masking problem than the existing methods. The methods are illustrated by their application to some examples and graphical illustrations are given.
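For the least-squares case, one closed-form local sensitivity is elementary: the derivative of the OLS coefficients with respect to the i-th response is (X'X)⁻¹xᵢ. The sketch below (our illustration, not the paper's general formula) verifies this against a finite-difference perturbation.

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.column_stack([np.ones(30), rng.normal(size=(30, 2))])
y = X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=30)

XtX_inv = np.linalg.inv(X.T @ X)
beta = XtX_inv @ X.T @ y

# Closed-form local sensitivity of the coefficients to a change in y_i:
# d beta / d y_i = (X'X)^{-1} x_i.
i = 4
sens = XtX_inv @ X[i]

# Finite-difference check: perturb y_i and refit.
eps = 1e-6
y2 = y.copy()
y2[i] += eps
fd = (XtX_inv @ X.T @ y2 - beta) / eps

print(np.allclose(sens, fd))  # True
```

Because the OLS estimator is linear in y, the finite difference matches the derivative exactly up to floating-point error; for the minimax and least-absolute-values problems the sensitivities come from the dual variables instead.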

13.
Consider a sequence of independent random variables X_1, X_2, …, X_n observed at n equally spaced time points, where X_i has a probability distribution that is known apart from the value of a parameter θ_i ∈ ℝ which may change from observation to observation. We consider the problem of estimating θ = (θ_1, θ_2, …, θ_n) given the observed values of X_1, X_2, …, X_n. The paper proposes a prior distribution for the parameters θ under which sets of parameter values exhibiting no change, or no change apart from a few sudden large changes, or many small changes, all have positive prior probability. Markov chain sampling may be used to calculate Bayes estimates of the parameters. We report the results of a Monte Carlo study based on Poisson-distributed data which compares the Bayes estimator with estimators obtained using cubic splines and with estimators derived from the Schwarz criterion. We conclude that the Bayes method is preferable in a minimax sense, since it never produces the disastrously large errors of the other methods and pays only a modest price for this degree of safety. All three methods are used to smooth mortality rates for oesophageal cancer in Irish males aged 65–69 over the period 1955 through 1994.

14.
Farmers in Sub-Saharan Africa have lower agricultural technology adoption rates than the rest of the world. It is believed that the past season's yield affects a farmer's capacity to take on the riskier improved seed variety, but this effect has not been studied. We quantify the effect of past season yield on improved corn seed use in future seasons while addressing the impact of the seed variety on yield. We develop a maximum likelihood method that addresses the fact that farmers self-select into a technology, making its effect on yield endogenous. The method is unique in that it models both lagged and endogenous effects in correlated discrete and continuous outcomes simultaneously. Owing to the presence of the lagged effect in a three-year dataset, we also propose a solution to the initial conditions problem and demonstrate its effectiveness with simulations. We use longitudinal survey data collected from Kenyan corn farmers over three years. Our results show that higher past season yield increased the likelihood of adoption in future seasons. The simulation and empirical studies indicate that ignoring the self-selection of improved seed use biases the results; we obtain a different sign in the covariance.

15.
Mixture models are commonly used in biomedical research to account for possible heterogeneity in a population. In this paper, we consider tests for homogeneity between two groups in the exponential tilt mixture models. A novel pairwise pseudolikelihood approach is proposed to eliminate the unknown nuisance function. We show that the corresponding pseudolikelihood ratio test has an asymptotic distribution given by a supremum of two squared Gaussian processes under the null hypothesis. To maintain the appeal of simplicity for conventional likelihood ratio tests, we propose two alternative tests, both shown to have a simple asymptotic distribution under the null. Simulation studies show that the proposed class of pseudolikelihood ratio tests performs well in controlling type I error and has competitive power compared with existing tests. The proposed tests are illustrated by an example of partial differential expression detection using microarray data from prostate cancer patients.

16.
Let X have a p-dimensional normal distribution with mean vector θ and identity covariance matrix I. In a compound decision problem consisting of squared-error estimation of θ, Strawderman (1971) placed a Beta(α, 1) prior distribution on a normal class of priors to produce a family of Bayes minimax estimators. We propose an incomplete Gamma(α, β) prior distribution on the same normal class of priors to produce a larger family of Bayes minimax estimators. We present the results of a Monte Carlo study to demonstrate the reduced risk of our estimators in comparison with the Strawderman estimators when θ is away from the zero vector.
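The kind of risk comparison described above can be sketched by Monte Carlo. The block below uses the classical James–Stein shrinkage estimator purely as an illustration of minimax risk reduction near the origin; it is not the Strawderman estimator or the incomplete-Gamma family of the paper.

```python
import numpy as np

def mc_risk(estimator, theta, n_rep=20_000, rng=None):
    """Monte Carlo squared-error risk of an estimator of theta
    based on a single observation X ~ N_p(theta, I)."""
    rng = np.random.default_rng(rng)
    X = theta + rng.standard_normal((n_rep, len(theta)))
    est = estimator(X)
    return np.mean(np.sum((est - theta) ** 2, axis=1))

def james_stein(X):
    """Classical James-Stein shrinkage estimator (illustration only)."""
    p = X.shape[1]
    norms = np.sum(X ** 2, axis=1, keepdims=True)
    return (1 - (p - 2) / norms) * X

theta = np.zeros(8)
r_mle = mc_risk(lambda X: X, theta, rng=1)  # constant risk p = 8
r_js = mc_risk(james_stein, theta, rng=1)   # risk 2 at the origin
```

The MLE has constant risk p, while shrinkage estimators trade risk near the prior's center for risk elsewhere; the paper's Monte Carlo study makes the analogous comparison for θ away from zero.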

17.
We study a Bayesian approach to recovering the initial condition for the heat equation from noisy observations of the solution at a later time. We consider a class of prior distributions indexed by a parameter quantifying “smoothness” and show that the corresponding posterior distributions contract around the true parameter at a rate that depends on the smoothness of the true initial condition and the smoothness and scale of the prior. Correct combinations of these characteristics lead to the optimal minimax rate. One type of priors leads to a rate-adaptive Bayesian procedure. The frequentist coverage of credible sets is shown to depend on the combination of the prior and true parameter as well, with smoother priors leading to zero coverage and rougher priors to (extremely) conservative results. In the latter case, credible sets are much larger than frequentist confidence sets, in that the ratio of diameters diverges to infinity. The results are numerically illustrated by a simulated data example.

18.
This paper deals with a class of nonparametric two-sample tests for ordered alternatives. The test statistics proposed are based on the number of observations from one sample that precede or exceed a threshold specified by the other sample, and they are extensions of Šidák's test. We derive their exact null distributions and also discuss a large-sample approximation. We then study their power properties exactly against the Lehmann alternative and make some comparative comments. Finally, we present an example to illustrate the proposed tests.
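A minimal example of such an exceedance statistic (a generic illustration, not the paper's exact proposals): count the x-observations exceeding the maximum of the y-sample. Under the null of equal continuous distributions, A ≥ a exactly when the a largest observations of the combined sample are all x's, which happens with probability ∏_{j<a} (m−j)/(m+n−j).

```python
def exceedance_test(x, y):
    """Exceedance statistic A = #{x_i > max(y)} and its exact null
    p-value P(A >= a) = prod_{j=0}^{a-1} (m - j) / (m + n - j),
    valid for continuous, identically distributed samples."""
    m, n = len(x), len(y)
    a = sum(xi > max(y) for xi in x)
    p = 1.0
    for j in range(a):
        p *= (m - j) / (m + n - j)
    return a, p
```

For x = [5, 6, 7] and y = [1, 2, 3, 4], all three x's exceed max(y), and the p-value is (3/7)(2/6)(1/5) = 1/35, the chance that the three largest of seven exchangeable observations all come from the x-sample.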

19.
The paper considers the problem of multiple testing under dependence in a compound decision theoretic framework. The observed data are assumed to be generated from an underlying two-state hidden Markov model. We propose oracle and asymptotically optimal data-driven procedures that aim to minimize the false non-discovery rate (FNR) subject to a constraint on the false discovery rate (FDR). It is shown that the performance of a multiple-testing procedure can be substantially improved by adaptively exploiting the dependence structure among hypotheses, and hence conventional FDR procedures that ignore this structural information are inefficient. Both theoretical properties and numerical performances of the procedures proposed are investigated. It is shown that the procedures proposed control FDR at the desired level, enjoy certain optimality properties and are especially powerful in identifying clustered non-null cases. The new procedure is applied to an influenza-like illness surveillance study for detecting the timing of epidemic periods.

20.
We propose a feasible general method for finding a minimax estimator of an unknown distribution function F in the nonparametric problem. As an application, some minimax estimators are proposed. Furthermore, some minimax binomial parametric problems are studied.
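A classical instance of the binomial minimax problem (a standard textbook example, not necessarily one of the estimators derived in the paper): under squared-error loss, d(X) = (X + √n/2)/(n + √n) for X ~ Binomial(n, p) has risk constant in p, equal to n/(4(n + √n)²), which is the hallmark of a minimax (equalizer) rule.

```python
import math

def risk(n, p):
    """Exact squared-error risk of d(X) = (X + sqrt(n)/2) / (n + sqrt(n))
    for X ~ Binomial(n, p), computed by summing over the support."""
    c = n + math.sqrt(n)
    r = 0.0
    for x in range(n + 1):
        pmf = math.comb(n, x) * p**x * (1 - p) ** (n - x)
        d = (x + math.sqrt(n) / 2) / c
        r += pmf * (d - p) ** 2
    return r

n = 10
const = n / (4 * (n + math.sqrt(n)) ** 2)  # the constant risk value
```

Evaluating risk(10, p) at any p reproduces the same constant, confirming the equalizer property numerically.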
