期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Composite likelihood estimation in multivariate data analysis

Yinshan Zhao Harry Joe 《Revue canadienne de statistique》2005,33(3):335-356

The authors propose two composite likelihood estimation procedures for multivariate models with regression/univariate and dependence parameters. One is a two‐stage method based on both univariate and bivariate margins. The other estimates all the parameters simultaneously based on bivariate margins. For some special cases, the authors compare their asymptotic efficiencies with the maximum likelihood method. The performance of the two methods is reasonable, except that the first procedure is inefficient for the regression parameters under strong dependence. The second approach is generally better for the regression parameters, but less efficient for the dependence parameters under weak dependence. 相似文献

2.

Univaiuate anu multivariate categorical data analysis for block designs

R.P. Bhargava 《统计学通讯:理论与方法》2013,42(11):1209-1231

Analysis for univariate and multivariate categorical data in block designs is given and illustrated through examples. The univariate analysis compares the treatments on the basis of their pooled frequency distributions (pooled over blocks). The test statistic used is called Q after Cochran (1950). The large sample null distribution of Q is a chi-square. Analysis of p-variate categorical data (kth variable having ck classes, K=1,...,p) can be done by treating it as a univariate categorical problem with [d] classes. Very often [d] is large in relation to the size of the experiment. This makes the expected frequencies for some of the cells very small, making the univariate method inapplicable. In these circumstances it may be reasonable to compare the treatments on the basis of marginal distributions up to the mth dimension, 1[d] , which is given in this paper. This method is also illustrated for missing observations 相似文献

3.

Multivariate analysis of variance for functional data

T. Górecki 《Journal of applied statistics》2017,44(12):2172-2189

Functional data are being observed frequently in many scientific fields, and therefore most of the standard statistical methods are being adapted for functional data. The multivariate analysis of variance problem for functional data is considered. It seems to be of practical interest similarly as the one-way analysis of variance for such data. For the MANOVA problem for multivariate functional data, we propose permutation tests based on a basis function representation and tests based on random projections. Their performance is examined in comprehensive simulation studies, which provide an idea of the size control and power of the tests and identify differences between them. The simulation experiments are based on artificial data and real labeled multivariate time series data found in the literature. The results suggest that the studied testing procedures can detect small differences between vectors of curves even with small sample sizes. Illustrative real data examples of the use of the proposed testing procedures in practice are also presented. 相似文献

4.

Analysis of distance for structured multivariate data and extensions to multivariate analysis of variance 总被引：3，自引：0，他引：3

J. C. Gower & W. J. Krzanowski 《Journal of the Royal Statistical Society. Series C, Applied statistics》1999,48(4):505-519

相似文献

5.

Nonparametric density estimation for multivariate bounded data

Taoufik Bouezmarni Jeroen V.K. Rombouts 《Journal of statistical planning and inference》2010,140(1):139-152

We propose a new nonparametric estimator for the density function of multivariate bounded data. As frequently observed in practice, the variables may be partially bounded (e.g. nonnegative) or completely bounded (e.g. in the unit interval). In addition, the variables may have a point mass. We reduce the conditions on the underlying density to a minimum by proposing a nonparametric approach. By using a gamma, a beta, or a local linear kernel (also called boundary kernels), in a product kernel, the suggested estimator becomes simple in implementation and robust to the well known boundary bias problem. We investigate the mean integrated squared error properties, including the rate of convergence, uniform strong consistency and asymptotic normality. We establish consistency of the least squares cross-validation method to select optimal bandwidth parameters. A detailed simulation study investigates the performance of the estimators. Applications using lottery and corporate finance data are provided. 相似文献

6.

Smoothed functional canonical correlation analysis of humidity and temperature data

Istem Koymen Keser Ipek Deveci Kocakoç 《Journal of applied statistics》2015,42(10):2126-2140

This paper focuses on smoothed functional canonical correlation analysis (SFCCA) to investigate the relationships and changes in large, seasonal and long-term data sets. The aim of this study is to introduce a guideline for SFCCA for functional data and to give some insights on the fine tuning of the methodology for long-term periodical data. The guidelines are applied on temperature and humidity data for 11 years between 2000 and 2010 and the results are interpreted. Seasonal changes or periodical shifts are visually studied by yearly comparisons. The effects of the ‘number of basis functions’ and the ‘selection of smoothing parameter’ on the general variability structure and on correlations between the curves are examined. It is concluded that the number of time points (knots), number of basis functions and the time span of evaluation (monthly, daily, etc.) should all be chosen harmoniously. It is found that changing the smoothing parameter does not have a significant effect on the structure of curves and correlations. The number of basis functions is found to be the main effector on both individual and correlation weight functions. 相似文献

7.

Optimal design for classification of functional data

Cai Li Luo Xiao 《Revue canadienne de statistique》2020,48(2):285-307

We study the design problem for the optimal classification of functional data. The goal is to select sampling time points so that functional data observed at these time points can be classified accurately. We propose optimal designs that are applicable to either dense or sparse functional data. Using linear discriminant analysis, we formulate our design objectives as explicit functions of the sampling points. We study the theoretical properties of the proposed design objectives and provide a practical implementation. The performance of the proposed design is evaluated through simulations and real data applications. The Canadian Journal of Statistics 48: 285–307; 2020 © 2019 Statistical Society of Canada 相似文献

8.

Characterizing persistent disturbing behavior using longitudinal and multivariate techniques

Jan Serroyen Liesbeth Bruckers Geert Rogiers 《Journal of applied statistics》2010,37(2):341-355

Persistent disturbing behavior (PDB) refers to a chronic condition in therapy-resistant psychiatric patients. Since these patients are highly unstable and difficult to maintain in their natural living environment and even in hospital wards, it is important to properly characterize this group. Previous studies in the Belgian province of Limburg indicated that the size of this group was larger than anticipated. Here, using a score calculated from longitudinal psychiatric registration data in 611 patients, we characterize the difference between PDB patients and a set of control patients. These differences are studied both at a given point in time, using discriminant analysis, as well as in terms of the evolution of the score over time, using longitudinal data analysis methods. Further, using clustering techniques, the group of PDB patients is split into two subgroups, characterized in terms of a number of ordinal scores. Such findings are useful from a scientific as well as from an organizational point of view. 相似文献

9.

Likelihood ratio test for independence with partial multivariate normal data

Serge B. Provost 《统计学通讯:理论与方法》2013,42(6):1763-1775

This article derives the likelihood ratio statistic to test the independence between (X ₁,…,X _r) and (X _r+1,…,X _k) under the assumption that (X ₁,…,X _k) has a multivariate normal distribution and that a sample of size n is available, where for N observation vectors all components are available, while for M = (n + N) observation vectors, the data on the last q components, (X_k-q+1,…,X _k) are missing (k+q≥r). 相似文献

10.

Markov chain models for multivariate repeated binary data analysis

Wei Tian Stewart J. Anderson 《统计学通讯:模拟与计算》2013,42(4):1001-1019

Repeated categorical outcomes frequently occur in clinical trials. Muenz and Rubinstein (1985) presented Markov chain models to analyze binary repeated data in a breast cancer study. We extend their method to the setting when more than one repeated outcome variable is of interest. In a randomized clinical trial of breast cancer, we investigate the dependency of toxicities on predictor variables and the relationship among multiple toxic effects. 相似文献

11.

Robust likelihood inferences for multivariate correlated data

Chien-Hung Chen 《Journal of applied statistics》2011,38(12):2901-2910

Multivariate normal, due to its well-established theories, is commonly utilized to analyze correlated data of various types. However, the validity of the resultant inference is, more often than not, erroneous if the model assumption fails. We present a modification for making the multivariate normal likelihood acclimatize itself to general correlated data. The modified likelihood is asymptotically legitimate for any true underlying joint distributions so long as they have finite second moments. One can, hence, acquire full likelihood inference without knowing the true random mechanisms underlying the data. Simulations and real data analysis are provided to demonstrate the merit of our proposed parametric robust method. 相似文献

12.

Empirical likelihood confidence intervals for nonparametric functional data analysis

Heng Lian 《Journal of statistical planning and inference》2012

We consider the problem of constructing confidence intervals for nonparametric functional data analysis using empirical likelihood. In this doubly infinite-dimensional context, we demonstrate the Wilk's phenomenon and propose a bias-corrected construction that requires neither undersmoothing nor direct bias estimation. We also extend our results to partially linear regression models involving functional data. Our numerical results demonstrate improved performance of the empirical likelihood methods over normal approximation-based methods. 相似文献

13.

Kernel estimations for multivariate density functional with bootstrap

Dewang Li 《统计学通讯:理论与方法》2017,46(9):4631-4641

In this article the bootstrap method is discussed for the kernel estimation of the multivariate density function. We have considered sample mean functional and constructed its consistency and asymptotic normality by bootstrap estimator. It has been shown that the bootstrap works for kernel estimates of multivariate density functional. The convergence rate with bootstrap for density has been proved. Finally, two simulations of application are given. 相似文献

14.

Discriminant analysis of survey data

Ching-Ho Leu Kam-Wah Tsui 《Journal of statistical planning and inference》1997,60(2):1115-290

We consider the problem of the effect of sample designs on discriminant analysis. The selection of the learning sample is assumed to depend on the population values of auxiliary variables. Under a superpopulation model with a multivariate normal distribution, unbiasedness and consistency are examined for the conventional estimators (derived under the assumptions of simple random sampling), maximum likelihood estimators, probability-weighted estimators and conditionally unbiased estimators of parameters. Four corresponding sampled linear discriminant functions are examined. The rates of misclassification of these four discriminant functions and the effect of sample design on these four rates of misclassification are discussed. The performances of these four discriminant functions are assessed in a simulation study. 相似文献

15.

Evaluation of process capability in multivariate nonlinear profiles

《Journal of Statistical Computation and Simulation》2012,82(12):2411-2428

ABSTRACT

Process capability indices measure the ability of a process to provide products that meet certain specifications. Few references deal with the capability of a process characterized by a functional relationship between a response variable and one or more explanatory variables, which is called profile. Specifically, there is not any reference analysing the capability of processes characterized by multivariate nonlinear profiles. In this paper, we propose a method to measure the capability of these processes, based on principal components for multivariate functional data and the concept of functional depth. A simulation study is conducted to assess the performance of the proposed method. An example from the sugar production illustrates the applicability of this approach. 相似文献

16.

Conjugate analysis of multivariate normal data with incomplete observations

Francesca Dominici Giovanni Parmigiani Merlise Clyde 《Revue canadienne de statistique》2000,28(3):533-550

The authors discuss prior distributions that are conjugate to the multivariate normal likelihood when some of the observations are incomplete. They present a general class of priors for incorporating information about unidentified parameters in the covariance matrix. They analyze the special case of monotone patterns of missing data, providing an explicit recursive form for the posterior distribution resulting from a conjugate prior distribution. They develop an importance sampling and a Gibbs sampling approach to sample from a general posterior distribution and compare the two methods. 相似文献

17.

A likelihood-based approach for multivariate one-sided tests with missing data

Guohai Zhou Lang Wu Rollin Brant J. Mark Ansermino 《Journal of applied statistics》2017,44(11):2000-2016

Inequality-restricted hypotheses testing methods containing multivariate one-sided testing methods are useful in practice, especially in multiple comparison problems. In practice, multivariate and longitudinal data often contain missing values since it may be difficult to observe all values for each variable. However, although missing values are common for multivariate data, statistical methods for multivariate one-sided tests with missing values are quite limited. In this article, motivated by a dataset in a recent collaborative project, we develop two likelihood-based methods for multivariate one-sided tests with missing values, where the missing data patterns can be arbitrary and the missing data mechanisms may be non-ignorable. Although non-ignorable missing data are not testable based on observed data, statistical methods addressing this issue can be used for sensitivity analysis and might lead to more reliable results, since ignoring informative missingness may lead to biased analysis. We analyse the real dataset in details under various possible missing data mechanisms and report interesting findings which are previously unavailable. We also derive some asymptotic results and evaluate our new tests using simulations. 相似文献

18.

Comparison of algorithms for replacing missing data in discriminant analysis

J.Twedt Daniel D.S. Gill 《统计学通讯:理论与方法》2013,42(6):1567-1578

We examined the impact of different methods for replacing missing data in discriminant analyses conducted on randomly generated samples from multivariate normal and non-normal distributions. The probabilities of correct classification were obtained for these discriminant analyses before and after randomly deleting data as well as after deleted data were replaced using: (1) variable means, (2) principal component projections, and (3) the EM algorithm. Populations compared were: (1) multivariate normal with covariance matrices ∑₁=∑₂, (2) multivariate normal with ∑₁≠∑₂ and (3) multivariate non-normal with ∑₁=∑₂. Differences in the probabilities of correct classification were most evident for populations with small Mahalanobis distances or high proportions of missing data. The three replacement methods performed similarly but all were better than non - replacement. 相似文献

19.

Bayesian analysis of multivariate threshold autoregressive models with missing data

Sergio A. Calderón V. Fabio H. Nieto 《统计学通讯:理论与方法》2017,46(1):296-318

In some fields, we are forced to work with missing data in multivariate time series. Unfortunately, the data analysis in this context cannot be carried out in the same way as in the case of complete data. To deal with this problem, a Bayesian analysis of multivariate threshold autoregressive models with exogenous inputs and missing data is carried out. In this paper, Markov chain Monte Carlo methods are used to obtain samples from the involved posterior distributions, including threshold values and missing data. In order to identify autoregressive orders, we adapt the Bayesian variable selection method in this class of multivariate process. The number of regimes is estimated using marginal likelihood or product parameter-space strategies. 相似文献

20.

Quantification of symmetry for functional data with application to equine lameness classification

Helle Sørensen Anders Tolver Maj Halling Thomsen Pia Haubro Andersen 《Journal of applied statistics》2012,39(2):337-360

This paper presents a study on symmetry of repeated bi-phased data signals, in particular, on quantification of the deviation between the two parts of the signal. Three symmetry scores are defined using functional data techniques such as smoothing and registration. One score is related to the L ₂-distance between the two parts of the signal, whereas the other two are constructed to specifically measure differences in amplitude and phase. Moreover, symmetry scores based on functional principal component analysis (PCA) are examined. The scores are applied to acceleration signals from a study on equine gait. The scores turn out to be highly associated with lameness, and their applicability for lameness quantification and detection is investigated. Four classification approaches turn out to give similar results. The scores describing amplitude and phase variation turn out to outperform the PCA scores when it comes to the classification of lameness. 相似文献