
1.

Cláudia Neves 《Journal of applied statistics》2014,41(4):915-916

2.

David Hand 《Significance》2008,5(1):11-14

Two ordinary computer discs containing the personal details, including the bank account numbers, of almost half the citizens of this country have gone missing. David Hand looks at data in the modern world and asks how secure it is and how secure it should be.

3.

Robust response surfaces, regression, and positive data analyses

Ramalingam Shanmugam 《Journal of Statistical Computation and Simulation》2019,89(3)

4.

Bayesian methods for data analysis, third edition

Z. Q. John Lu 《Journal of applied statistics》2010,37(4):705-706

5.

Convex rearrangements, generalized Lorenz curves, and correlated Gaussian data

Youri Davydov Davar Khoshnevisan Zhan Shi Ričardas Zitikis 《Journal of statistical planning and inference》2007

We propose a statistical index for measuring the fluctuations of a stochastic process ξ. This index is based on the generalized Lorenz curves and (modified) Gini indices of econometric theory.

6.

Using R for data management, statistical analysis, and graphics

Georgi N. Boshnakov 《Journal of applied statistics》2012,39(6):1382-1383

7.

Estimation of conditional mode with truncated, censored, and dependent data

Jong-Il Baek 《Communications in Statistics - Theory and Methods》2017,46(12):6000-6016

In this paper, we studied the uniform convergence with rates for the kernel estimator of the conditional mode function for a left truncated and right censored model. It is assumed that the lifetime observations with multivariate covariates form a stationary α-mixing sequence. Also, the asymptotic normality of the estimator is established.

8.

Clustering transformed compositional data using K-means, with applications in gene expression and bicycle sharing system data

Antoine Godichon-Baggioni Cathy Maugis-Rabusseau Andrea Rau 《Journal of applied statistics》2019,46(1):47-65

Although there is no shortage of clustering algorithms proposed in the literature, the question of the most relevant strategy for clustering compositional data (i.e. data whose rows belong to the simplex) remains largely unexplored in cases where the observed value is equal or close to zero for one or more samples. This work is motivated by the analysis of two applications, both focused on the categorization of compositional profiles: (1) identifying groups of co-expressed genes from high-throughput RNA sequencing data, in which a given gene may be completely silent in one or more experimental conditions; and (2) finding patterns in the usage of stations over the course of one week in the Velib' bicycle sharing system in Paris, France. For both of these applications, we make use of appropriately chosen data transformations, including the Centered Log Ratio and a novel extension called the Log Centered Log Ratio, in conjunction with the K-means algorithm. We use a non-asymptotic penalized criterion, whose penalty is calibrated with the slope heuristics, to select the number of clusters. Finally, we illustrate the performance of this clustering strategy, which is implemented in the Bioconductor package coseq, on both the gene expression and bicycle sharing system data.
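
As a rough illustration of the Centered Log Ratio named in this abstract, the sketch below applies the standard textbook definition (log of each part minus the mean log over all parts) to a single compositional profile. It is not taken from the paper or from the coseq package; the function name `clr` and the sample profile are hypothetical, and the authors' Log Centered Log Ratio extension is not reproduced here. The sketch raises an error on zero parts, which is the situation the abstract identifies as problematic.

```python
import math


def clr(composition):
    """Standard centered log-ratio: log(x_j) minus the mean of log(x) over all parts.

    Hypothetical helper for illustration only; it is not the coseq implementation.
    """
    if any(x <= 0 for x in composition):
        # The plain CLR is undefined when a part is zero (e.g. a silent gene).
        raise ValueError("CLR needs strictly positive parts; zeros must be handled first")
    logs = [math.log(x) for x in composition]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


# Hypothetical profile whose parts sum to 1 (a point on the simplex).
profile = [0.5, 0.3, 0.2]
transformed = clr(profile)
```

The transformed coordinates sum to zero by construction, so they can be passed to a Euclidean method such as K-means.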

9.

Sweeping, alignment and the analysis of unbalanced data

Richard Fawcett Penny Stewart 《Communications in Statistics - Theory and Methods》2013,42(12):3453-3471

The terms sweeping and alignment refer to the same process. Sweeping/alignment is used by data analysts as a technique for describing the effects of a model factor (e.g., treatments in a randomized block design) after the effects of nuisance parameters (e.g., blocks) have been removed from the data. In this paper sweeping/alignment is used as the basis for developing tests of factors in unbalanced experimental design models. Formulas are presented for treatment effects in randomized block designs with missing observations, and for interaction and main effects in unbalanced two-way factorial designs with empty cells.

10.

Case study in data analysis, no. 2

Jane F. Gentleman G. A. Whitmore 《Revue canadienne de statistique》1984,12(1):7-10

In November 1979, the derailment of a train passing through Mississauga, Ontario, caused the explosion of tank cars containing liquid propane and the leakage of chlorine through a hole in another tank car. Officials evacuated more than 200,000 people from the area, but firemen stayed, exposing themselves to noxious fumes from the explosions and fires. When the crisis was over, health officials administered health tests and questionnaires to the affected men and to a control group of unaffected firefighters. Health information was gathered again one and two years later. In this study, two independent sets of analysts examine the health data to determine whether exposure to hazardous chemicals at the derailment site had any lasting effects on the lung function of the Mississauga firefighters.

11.

Real data, real learning and the London Olympics

Neville Davies 《Significance》2006,3(2):94-96

The Olympic and Paralympic Games are coming to London in 2012, and there will be huge interest, especially among the young. Could the Games be used to involve students of all ages in a large scale project that involves and interests them to break down their fear of statistics and to motivate learning? Neville Davies of the Royal Statistical Society Centre for Statistical Education appeals for help in a major teaching and learning initiative based on the London Olympics. Significance is proud to announce the launch of a scheme that could bring thousands of students, nationally and internationally, to appreciate and value the usefulness of statistics.

12.

Analysis of longitudinal data with irregular, outcome-dependent follow-up

Haiqun Lin Daniel O. Scharfstein Robert A. Rosenheck 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(3):791-813

Summary. A frequent problem in longitudinal studies is that subjects may miss scheduled visits or be assessed at self-selected points in time. As a result, observed outcome data may be highly unbalanced and the availability of the data may be directly related to the outcome measure and/or some auxiliary factors that are associated with the outcome. If the follow-up visit and outcome processes are correlated, then marginal regression analyses will produce biased estimates. Building on the work of Robins, Rotnitzky and Zhao, we propose a class of inverse intensity-of-visit process-weighted estimators in marginal regression models for longitudinal responses that may be observed in continuous time. This allows us to handle arbitrary patterns of missing data as embedded in a subject's visit process. We derive the large sample distribution for our inverse visit-intensity-weighted estimators and investigate their finite sample behaviour by simulation. Our approach is illustrated with a data set from a health services research study in which homeless people with mental illness were randomized to three different treatments and measures of homelessness (as percentage days homeless in the past 3 months) and other auxiliary factors were recorded at follow-up times that are not fixed by design.

13.

Model-based clustering, classification, and discriminant analysis of data with mixed type

Ryan P. Browne Paul D. McNicholas 《Journal of statistical planning and inference》2012

We propose a mixture of latent variables model for the model-based clustering, classification, and discriminant analysis of data comprising variables with mixed type. This approach is a generalization of latent variable analysis, and model fitting is carried out within the expectation-maximization framework. Our approach is outlined and a simulation study conducted to illustrate the effect of sample size and noise on the standard errors and the recovery probabilities for the number of groups. Our modelling methodology is then applied to two real data sets and their clustering and classification performance is discussed. We conclude with discussion and suggestions for future work.

14.

Combining census, dual-system, and evaluation study data to estimate population shares

Zaslavsky AM 《Journal of the American Statistical Association》1993,88(423):1092-1105

"The 1990 [U.S.] census and Post-Enumeration Survey produced census and dual system estimates (DSE) of population by domain, together with an estimated sampling covariance matrix of the DSE. Estimates of the bias of the DSE were derived from various PES evaluation programs. Of the three sources, the unadjusted census is the least variable but is believed to be the most biased, the DSE is less biased but more variable, and the bias estimates may be regarded as unbiased but are the most variable. This article addresses methods for combining the census, the DSE, and bias estimates obtained from the evaluation programs to produce accurate estimates of population shares, as measured by weighted squared- or absolute-error loss functions applied to estimated population shares of domains."

15.

Probability-scale residuals for continuous, discrete, and censored data

Bryan E. Shepherd Chun Li Qi Liu 《Revue canadienne de statistique》2016,44(4):463-479

16.

Multivariate meta-analysis for data consortia, individual patient meta-analysis, and pooling projects

John Ritz Eugene Demidenko Donna Spiegelman 《Journal of statistical planning and inference》2008

We discuss maximum likelihood and estimating equations methods for combining results from multiple studies in pooling projects and data consortia using a meta-analysis model, when the multivariate estimates with their covariance matrices are available. The estimates to be combined are typically regression slopes, often from relative risk models in biomedical and epidemiologic applications. We generalize the existing univariate meta-analysis model and investigate the efficiency advantages of the multivariate methods, relative to the univariate ones. We generalize a popular univariate test for between-studies homogeneity to a multivariate test. The methods are applied to a pooled analysis of type of carotenoids in relation to lung cancer incidence from seven prospective studies. In these data, the expected gain in efficiency was evident, sometimes to a large extent. Finally, we study the finite sample properties of the estimators and compare the multivariate ones to their univariate counterparts.

17.

The relationship between the mean, median, and mode with grouped data

Shimin Zheng Eunice Mogusu Sreenivas P. Veeranki Megan Quinn Yan Cao 《Communications in Statistics - Theory and Methods》2017,46(9):4285-4295

It is widely believed that the median is “usually” between the mean and the mode for skewed unimodal distributions. However, this inequality is not always true, especially with grouped data. Unavailability of complete raw data further necessitates the importance of evaluating this characteristic in grouped data. There is a gap in the current statistical literature on assessing mean–median–mode inequality for grouped data. The study aims to evaluate the relationship between the mean, median, and mode with unimodal grouped data; derive conditions for their inequalities; and present their application.
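
To make the quantities in this abstract concrete, the sketch below computes the mean, median, and mode of grouped data using the usual textbook interpolation formulas. These formulas are an assumption here and may differ from the definitions the authors use; the function name `grouped_summary`, the class edges, and the frequencies are all hypothetical. The mode formula assumes classes of equal width.

```python
def grouped_summary(edges, freqs):
    """Textbook mean, median, and mode for grouped data (equal-width classes assumed).

    edges: class boundaries, len(edges) == len(freqs) + 1
    freqs: class frequencies, with at least one positive entry
    """
    n = sum(freqs)

    # Mean: frequency-weighted average of the class midpoints.
    mids = [(lo + hi) / 2 for lo, hi in zip(edges, edges[1:])]
    mean = sum(f * m for f, m in zip(freqs, mids)) / n

    # Median: linear interpolation inside the first class whose
    # cumulative frequency reaches n / 2.
    cumulative = 0
    for i, f in enumerate(freqs):
        if cumulative + f >= n / 2:
            width = edges[i + 1] - edges[i]
            median = edges[i] + (n / 2 - cumulative) / f * width
            break
        cumulative += f

    # Mode: interpolation inside the class with the highest frequency,
    # using the frequencies of its two neighbours (0 beyond the ends).
    k = max(range(len(freqs)), key=freqs.__getitem__)
    f1 = freqs[k]
    f0 = freqs[k - 1] if k > 0 else 0
    f2 = freqs[k + 1] if k + 1 < len(freqs) else 0
    width = edges[k + 1] - edges[k]
    denom = 2 * f1 - f0 - f2
    # If the neighbours tie with the modal class, fall back to the midpoint.
    mode = edges[k] + ((f1 - f0) / denom * width if denom else width / 2)

    return mean, median, mode


# Hypothetical right-skewed grouped data: four classes of width 10.
mean, median, mode = grouped_summary([0, 10, 20, 30, 40], [5, 8, 4, 3])
```

For this made-up example the median falls between the mode and the mean; the abstract's point is that such an ordering is not guaranteed for grouped data in general.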

18.

The construction of data to reflect the research objective, and how randomisation tests make such data usable

T. P. Hutchinson D. Cairns E. Chekaluk 《Statistical Papers》2002,43(3):349-359

When comparing the central values of two independent groups, should a t-test be performed, or should the observations be transformed into their ranks and a Wilcoxon-Mann-Whitney test performed? This paper argues that neither should automatically be chosen. Instead, provided that software for conducting randomisation tests is available, the chief concern should be with obtaining data values that are a good reflection of scientific reality and appropriate to the objective of the research; if necessary, the data values should be transformed so that this is so. The subsequent use of a randomisation (permutation) test will mean that failure of the transformed data values to satisfy assumptions such as normality and equality of variances will not be of concern.
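
A minimal sketch of a randomisation (permutation) test for two independent groups follows, assuming the test statistic is the absolute difference in group means and that the samples are small enough to enumerate every regrouping exactly. The function name `permutation_test_mean_diff` and the sample values are hypothetical; the paper does not prescribe this particular statistic or implementation.

```python
from itertools import combinations


def permutation_test_mean_diff(group_a, group_b):
    """Exact two-sided permutation p-value for the difference in group means.

    Enumerates every way of assigning len(group_a) of the pooled values to
    group A, so it is only practical for small samples.
    """
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    n_b = len(pooled) - n_a
    total = sum(pooled)

    def abs_mean_diff(sum_a):
        # Absolute difference between the mean of A and the mean of the rest.
        return abs(sum_a / n_a - (total - sum_a) / n_b)

    observed = abs_mean_diff(sum(group_a))

    extreme = 0
    splits = 0
    for indices in combinations(range(len(pooled)), n_a):
        splits += 1
        # Small tolerance so floating-point ties count as "at least as extreme".
        if abs_mean_diff(sum(pooled[i] for i in indices)) >= observed - 1e-12:
            extreme += 1
    return extreme / splits


# Hypothetical data: two groups of three observations each.
p_value = permutation_test_mean_diff([1, 2, 3], [4, 5, 6])
```

Because the p-value comes from the regroupings of the observed values themselves, no normality or equal-variance assumption enters the calculation, which is the property the abstract relies on.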

19.

Pooling multivariate data under W, LR and LM tests

B. M. Golam Kibria A. K. Md. E. Saleh 《Statistical Papers》2006,47(1):49-68

Two independent random samples are drawn from two multivariate normal populations with mean vectors μ1 and μ2 and a common variance-covariance matrix Σ. Ahmed and Saleh (1990) considered the preliminary test maximum likelihood estimator (PTMLE) for estimating μ1 based on Hotelling's $T_N^2$, when it is suspected that μ1=μ2. In this paper, the PTMLE based on the Wald (W), Likelihood Ratio (LR) and Lagrangian Multiplier (LM) tests are considered. Using the quadratic risk function, the conditions of superiority of the proposed estimator for departure parameter are derived. A max-min rule for the size of the preliminary test of significance is presented. It is demonstrated that the PTMLE based on W test produces the highest minimum guaranteed efficiencies compared to UMLE among the three test procedures.

20.

Using SAS for data management, statistical analysis and graphics

Jingyun Yang 《Pharmaceutical statistics》2012,11(4):346-346