期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Consistency of semiparametric maximum likelihood estimators for two‐phase sampling

Aad Van Der Vaart Jon A. Wellner 《Revue canadienne de statistique》2001,29(2):269-288

Semiparametric maximum likelihood estimators have recently been proposed for a class of two‐phase, outcome‐dependent sampling models. All of them were “restricted” maximum likelihood estimators, in the sense that the maximization is carried out only over distributions concentrated on the observed values of the covariate vectors. In this paper, the authors give conditions for consistency of these restricted maximum likelihood estimators. They also consider the corresponding unrestricted maximization problems, in which the “absolute” maximum likelihood estimators may then have support on additional points in the covariate space. Their main consistency result also covers these unrestricted maximum likelihood estimators, when they exist for all sample sizes. 相似文献

2.

Regression analysis of longitudinal data with outcome‐dependent sampling and informative censoring

Weining Shen Suyu Liu Yong Chen Jing Ning 《Scandinavian Journal of Statistics》2019,46(3):831-847

We consider a regression analysis of longitudinal data in the presence of outcome‐dependent observation times and informative censoring. Existing approaches commonly require a correct specification of the joint distribution of longitudinal measurements, the observation time process, and informative censoring time under the joint modeling framework and can be computationally cumbersome due to the complex form of the likelihood function. In view of these issues, we propose a semiparametric joint regression model and construct a composite likelihood function based on a conditional order statistics argument. As a major feature of our proposed methods, the aforementioned joint distribution is not required to be specified, and the random effect in the proposed joint model is treated as a nuisance parameter. Consequently, the derived composite likelihood bypasses the need to integrate over the random effect and offers the advantage of easy computation. We show that the resulting estimators are consistent and asymptotically normal. We use simulation studies to evaluate the finite‐sample performance of the proposed method and apply it to a study of weight loss data that motivated our investigation. 相似文献

3.

Regression Analysis of Longitudinal Data with Time‐Dependent Covariates and Informative Observation Times

XINYUAN SONG XIAOYUN MU LIUQUAN SUN 《Scandinavian Journal of Statistics》2012,39(2):248-258

Abstract. Longitudinal data frequently occur in many studies, and longitudinal responses may be correlated with observation times. In this paper, we propose a new joint modelling for the analysis of longitudinal data with time‐dependent covariates and possibly informative observation times via two latent variables. For inference about regression parameters, estimating equation approaches are developed and asymptotic properties of the proposed estimators are established. In addition, a lack‐of‐fit test is presented for assessing the adequacy of the model. The proposed method performs well in finite‐sample simulation studies, and an application to a bladder tumour study is provided. 相似文献

4.

Estimators Based on Data‐Driven Generalized Weighted Cramér‐von Mises Distances under Censoring – with Applications to Mixture Models

ERIC BEUTNER LAURENT BORDES 《Scandinavian Journal of Statistics》2011,38(1):108-129

Abstract. Estimators based on data‐driven generalized weighted Cramér‐von Mises distances are defined for data that are subject to a possible right censorship. The function used to measure the distance between the data, summarized by the Kaplan–Meier estimator, and the target model is allowed to depend on the sample size and, for example, on the number of censored items. It is shown that the estimators are consistent and asymptotically multivariate normal for every p dimensional parametric family fulfiling some mild regularity conditions. The results are applied to finite mixtures. Simulation results for finite mixtures indicate that the estimators are useful for moderate sample sizes. Furthermore, the simulation results reveal the usefulness of sample size dependent and censoring sensitive distance functions for moderate sample sizes. Moreover, the estimators for the mixing proportion seem to be fairly robust against a ‘symmetric’ contamination model even when censoring is present. 相似文献

5.

Robust linear discriminant analysis using S‐estimators

Christophe Croux Catherine Dehon 《Revue canadienne de statistique》2001,29(3):473-493

The authors consider a robust linear discriminant function based on high breakdown location and covariance matrix estimators. They derive influence functions for the estimators of the parameters of the discriminant function and for the associated classification error. The most B‐robust estimator is determined within the class of multivariate S‐estimators. This estimator, which minimizes the maximal influence that an outlier can have on the classification error, is also the most B‐robust location S‐estimator. A comparison of the most B‐robust estimator with the more familiar biweight S‐estimator is made. 相似文献

6.

State‐space model for proxy‐based millennial reconstruction

Terry C. K. Lee Min Tsao Francis W. Zwiers 《Revue canadienne de statistique》2010,38(3):488-505

It is important to study historical temperature time series prior to the industrial revolution so that one can view the current global warming trend from a long‐term historical perspective. Because there are no instrumental records of such historical temperature data, climatologists have been interested in reconstructing historical temperatures using various proxy time series. In this paper, the authors examine a state‐space model approach for historical temperature reconstruction which not only makes use of the proxy data but also information on external forcings. A challenge in the implementation of this approach is the estimation of the parameters in the state‐space model. The authors developed two maximum likelihood methods for parameter estimation and studied the efficiency and asymptotic properties of the associated estimators through a combination of theoretical and numerical investigations. The Canadian Journal of Statistics 38: 488–505; 2010 © 2010 Crown in the right of Canada 相似文献

7.

Block‐threshold‐adapted Estimators via a Maxiset Approach

Florent Autin Jean‐Marc Freyermuth Rainer Von Sachs 《Scandinavian Journal of Statistics》2014,41(1):240-258

We study the maxiset performance of a large collection of block thresholding wavelet estimators, namely the horizontal block thresholding family. We provide sufficient conditions on the choices of rates and threshold values to ensure that the involved adaptive estimators obtain large maxisets. Moreover, we prove that any estimator of such a family reconstructs the Besov balls with a near‐minimax optimal rate that can be faster than the one of any separable thresholding estimator. Then, we identify, in particular cases, the best estimator of such a family, that is, the one associated with the largest maxiset. As a particularity of this paper, we propose a refined approach that models method‐dependent threshold values. By a series of simulation studies, we confirm the good performance of the best estimator by comparing it with the other members of its family. 相似文献

8.

A Non‐Parametric Estimator of the Spectral Density of a Continuous‐Time Gaussian Process Observed at Random Times

JEAN‐MARC BARDET PIERRE R. BERTRAND 《Scandinavian Journal of Statistics》2010,37(3):458-476

Abstract. In numerous applications data are observed at random times and an estimated graph of the spectral density may be relevant for characterizing and explaining phenomena. By using a wavelet analysis, one derives a non‐parametric estimator of the spectral density of a Gaussian process with stationary increments (or a stationary Gaussian process) from the observation of one path at random discrete times. For every positive frequency, this estimator is proved to satisfy a central limit theorem with a convergence rate depending on the roughness of the process and the moment of random durations between successive observations. In the case of stationary Gaussian processes, one can compare this estimator with estimators based on the empirical periodogram. Both estimators reach the same optimal rate of convergence, but the estimator based on wavelet analysis converges for a different class of random times. Simulation examples and an application to biological data are also provided. 相似文献

9.

Automated selection of post‐strata using a model‐assisted regression tree estimator

Kelly S. McConville Daniell Toth 《Scandinavian Journal of Statistics》2019,46(2):389-413

Despite having desirable properties, model‐assisted estimators are rarely used in anything but their simplest form to produce official statistics. This is due to the fact that the more complicated models are often ill suited to the available auxiliary data. Under a model‐assisted framework, we propose a regression tree estimator for a finite‐population total. Regression tree models are adept at handling the type of auxiliary data usually available in the sampling frame and provide a model that is easy to explain and justify. The estimator can be viewed as a post‐stratification estimator where the post‐strata are automatically selected by the recursive partitioning algorithm of the regression tree. We establish consistency of the regression tree estimator and a variance estimator, along with asymptotic normality of the regression tree estimator. We compare the performance of our estimator to other survey estimators using the United States Bureau of Labor Statistics Occupational Employment Statistics Survey data. 相似文献

10.

An additive–multiplicative mean model for panel count data with dependent observation and dropout processes

Guanglei Yu Yang Li Liang Zhu Hui Zhao Jianguo Sun Leslie L. Robison 《Scandinavian Journal of Statistics》2019,46(2):414-431

This paper discusses regression analysis of panel count data with dependent observation and dropout processes. For the problem, a general mean model is presented that can allow both additive and multiplicative effects of covariates on the underlying point process. In addition, the proportional rates model and the accelerated failure time model are employed to describe possible covariate effects on the observation process and the dropout or follow‐up process, respectively. For estimation of regression parameters, some estimating equation‐based procedures are developed and the asymptotic properties of the proposed estimators are established. In addition, a resampling approach is proposed for estimating a covariance matrix of the proposed estimator and a model checking procedure is also provided. Results from an extensive simulation study indicate that the proposed methodology works well for practical situations, and it is applied to a motivating set of real data. 相似文献

11.

Analysis of generalized semiparametric mixed varying‐coefficients models for longitudinal data

Yanqing Sun Li Qi Fei Heng Peter B. Gilbert 《Revue canadienne de statistique》2019,47(3):352-373

The generalized semiparametric mixed varying‐coefficient effects model for longitudinal data can accommodate a variety of link functions and flexibly model different types of covariate effects, including time‐constant, time‐varying and covariate‐varying effects. The time‐varying effects are unspecified functions of time and the covariate‐varying effects are nonparametric functions of a possibly time‐dependent exposure variable. A semiparametric estimation procedure is developed that uses local linear smoothing and profile weighted least squares, which requires smoothing in the two different and yet connected domains of time and the time‐dependent exposure variable. The asymptotic properties of the estimators of both nonparametric and parametric effects are investigated. In addition, hypothesis testing procedures are developed to examine the covariate effects. The finite‐sample properties of the proposed estimators and testing procedures are examined through simulations, indicating satisfactory performances. The proposed methods are applied to analyze the AIDS Clinical Trial Group 244 clinical trial to investigate the effects of antiretroviral treatment switching in HIV‐infected patients before and after developing the T215Y antiretroviral drug resistance mutation. The Canadian Journal of Statistics 47: 352–373; 2019 © 2019 Statistical Society of Canada 相似文献

12.

Best monotone M‐estimators

Adam W. Kolkiewicz 《Revue canadienne de statistique》2003,31(3):329-347

The author shows how to find M‐estimators of location whose generating function is monotone and which are optimal or close to optimal. It is easy to identify a consistent sequence of estimators in this class. In addition, it contains simple and efficient approximations in cases where the likelihood function is difficult to obtain. In some neighbourhoods of the normal distribution, the loss of efficiency due to the approximation is quite small. Optimal monotone M‐estimators can also be determined in cases when the underlying distribution is known only up to a certain neighbourhood. The author considers the e‐contamination model and an extension thereof that allows the distributions to be arbitrary outside compact intervals. His results also have implications for distributions with monotone score functions. The author illustrates his methodology using Student and stable distributions. 相似文献

13.

Linear Increments with Non‐monotone Missing Data and Measurement Error

下载免费PDF全文

Shaun R. Seaman Daniel Farewell Ian R. White 《Scandinavian Journal of Statistics》2016,43(4):996-1018

Linear increments (LI) are used to analyse repeated outcome data with missing values. Previously, two LI methods have been proposed, one allowing non‐monotone missingness but not independent measurement error and one allowing independent measurement error but only monotone missingness. In both, it was suggested that the expected increment could depend on current outcome. We show that LI can allow non‐monotone missingness and either independent measurement error of unknown variance or dependence of expected increment on current outcome but not both. A popular alternative to LI is a multivariate normal model ignoring the missingness pattern. This gives consistent estimation when data are normally distributed and missing at random (MAR). We clarify the relation between MAR and the assumptions of LI and show that for continuous outcomes multivariate normal estimators are also consistent under (non‐MAR and non‐normal) assumptions not much stronger than those of LI. Moreover, when missingness is non‐monotone, they are typically more efficient. 相似文献

14.

The impact of period effects on dose level contrasts in alternating cross‐over designs for first‐time‐in‐human studies

Stephan Koehne‐Voss Heinz Schmidli David M. Smith Iris Pigeot 《Pharmaceutical statistics》2011,10(1):45-49

For first‐time‐in‐human studies with small molecules alternating cross‐over designs are often employed and at study end are analyzed using linear models. We discuss the impact of including a period effect in the model on the precision with which dose level contrasts can be estimated and quantify the bias of least squares estimators if a period effect is inherent in the data that is not accounted for in the model. We also propose two alternative designs that allow a more precise estimation of dose level contrasts compared with the standard design when period effects are included in the model. Copyright © 2010 John Wiley & Sons, Ltd. 相似文献

15.

Inference for Multi‐dimensional High‐frequency Data with an Application to Conditional Independence Testing

Markus Bibinger Per A. Mykland 《Scandinavian Journal of Statistics》2016,43(4):1078-1102

We find the asymptotic distribution of the multi‐dimensional multi‐scale and kernel estimators for high‐frequency financial data with microstructure. Sampling times are allowed to be asynchronous and endogenous. In the process, we show that the classes of multi‐scale and kernel estimators for smoothing noise perturbation are asymptotically equivalent in the sense of having the same asymptotic distribution for corresponding kernel and weight functions. The theory leads to multi‐dimensional stable central limit theorems and feasible versions. Hence, they allow to draw statistical inference for a broad class of multivariate models, which paves the way to tests and confidence intervals in risk measurement for arbitrary portfolios composed of high‐frequently observed assets. As an application, we enhance the approach to construct a test for investigating hypotheses that correlated assets are independent conditional on a common factor. 相似文献

16.

A Semiparametric Regression Model for Longitudinal Data with Non‐stationary Errors

下载免费PDF全文

Rui Li Chenlei Leng Jinhong You 《Scandinavian Journal of Statistics》2017,44(4):932-950

Motivated by the need to analyze the National Longitudinal Surveys data, we propose a new semiparametric longitudinal mean‐covariance model in which the effects on dependent variable of some explanatory variables are linear and others are non‐linear, while the within‐subject correlations are modelled by a non‐stationary autoregressive error structure. We develop an estimation machinery based on least squares technique by approximating non‐parametric functions via B‐spline expansions and establish the asymptotic normality of parametric estimators as well as the rate of convergence for the non‐parametric estimators. We further advocate a new model selection strategy in the varying‐coefficient model framework, for distinguishing whether a component is significant and subsequently whether it is linear or non‐linear. Besides, the proposed method can also be employed for identifying the true order of lagged terms consistently. Monte Carlo studies are conducted to examine the finite sample performance of our approach, and an application of real data is also illustrated. 相似文献

17.

Flexible Latent‐State Modelling of Old Faithful's Eruption Inter‐Arrival Times in 2009

Roland Langrock 《Australian & New Zealand Journal of Statistics》2012,54(3):261-279

This paper is concerned with the analysis of a time series comprising the eruption inter‐arrival times of the Old Faithful geyser in 2009. The series is much longer than other well‐documented ones and thus gives a more comprehensive insight into the dynamics of the geyser. Basic hidden Markov models with gamma state‐dependent distributions and several extensions are implemented. In order to better capture the stochastic dynamics exhibited by Old Faithful, the different non‐standard models under consideration seek to increase the flexibility of the basic models in various ways: (i) by allowing non‐geometric distributions for the times spent in the different states; (ii) by increasing the memory of the underlying Markov chain, with or without assuming additional structure implied by mixture transition distribution models; and (iii) by incorporating feedback from the observation process on the latent process. In each case it is shown how the likelihood can be formulated as a matrix product which can be conveniently maximized numerically. 相似文献

18.

Generalised quasi‐likelihood inference in a semi‐parametric binary dynamic mixed logit model

下载免费PDF全文

Nan Zheng Brajendra C. Sutradhar 《Australian & New Zealand Journal of Statistics》2018,60(3):343-373

There exists a recent study where dynamic mixed‐effects regression models for count data have been extended to a semi‐parametric context. However, when one deals with other discrete data such as binary responses, the results based on count data models are not directly applicable. In this paper, we therefore begin with existing binary dynamic mixed models and generalise them to the semi‐parametric context. For inference, we use a new semi‐parametric conditional quasi‐likelihood (SCQL) approach for the estimation of the non‐parametric function involved in the semi‐parametric model, and a semi‐parametric generalised quasi‐likelihood (SGQL) approach for the estimation of the main regression, dynamic dependence and random effects variance parameters. A semi‐parametric maximum likelihood (SML) approach is also used as a comparison to the SGQL approach. The properties of the estimators are examined both asymptotically and empirically. More specifically, the consistency of the estimators is established and finite sample performances of the estimators are examined through an intensive simulation study. 相似文献

19.

The role of reversals in order‐restricted inference

Michael D. Perlman Sanjay Chaudhuri 《Revue canadienne de statistique》2004,32(2):193-198

A statistical model is said to be an order‐restricted statistical model when its parameter takes its values in a closed convex cone C of the Euclidean space. In recent years, order‐restricted likelihood ratio tests and maximum likelihood estimators have been criticized on the grounds that they may violate a cone order monotonicity (COM) property, and hence reverse the cone order induced by C. The authors argue here that these reversals occur only in the case that C is an obtuse cone, and that in this case COM is an inappropriate requirement for likelihood‐based estimates and tests. They conclude that these procedures thus remain perfectly reasonable procedures for order‐restricted inference. 相似文献

20.

Inference in Semi‐Parametric Dynamic Models for Repeated Count Data

下载免费PDF全文

Brajendra C. Sutradhar K.V. Vineetha Warriyar Nan Zheng 《Australian & New Zealand Journal of Statistics》2016,58(3):397-434

This paper deals with a longitudinal semi‐parametric regression model in a generalised linear model setup for repeated count data collected from a large number of independent individuals. To accommodate the longitudinal correlations, we consider a dynamic model for repeated counts which has decaying auto‐correlations as the time lag increases between the repeated responses. The semi‐parametric regression function involved in the model contains a specified regression function in some suitable time‐dependent covariates and a non‐parametric function in some other time‐dependent covariates. As far as the inference is concerned, because the non‐parametric function is of secondary interest, we estimate this function consistently using the independence assumption‐based well‐known quasi‐likelihood approach. Next, the proposed longitudinal correlation structure and the estimate of the non‐parametric function are used to develop a semi‐parametric generalised quasi‐likelihood approach for consistent and efficient estimation of the regression effects in the parametric regression function. The finite sample performance of the proposed estimation approach is examined through an intensive simulation study based on both large and small samples. Both balanced and unbalanced cluster sizes are incorporated in the simulation study. The asymptotic performances of the estimators are given. The estimation methodology is illustrated by reanalysing the well‐known health care utilisation data consisting of counts of yearly visits to a physician by 180 individuals for four years and several important primary and secondary covariates. 相似文献