1.

Local logistic regression: an application to army penetration data

《Journal of Statistical Computation and Simulation》2012,82(1):35-50

There have been a number of procedures used to analyze non-monotonic binary data to predict the probability of response. Some classical procedures are the Up and Down strategy, the Robbins–Monro procedure, and other sequential optimization designs. Recently, nonparametric procedures such as kernel regression and local linear regression have been applied to this type of data. It is a well-known fact that kernel regression has problems fitting the data near the boundaries, and a drawback of local linear regression is that it may be “too linear” when fitting data from a curvilinear function. The procedure introduced in this paper is called local logistic regression (llogr), which fits a logistic regression function at each of the data points. An example is given using United States Army projectile data that supports the use of local logistic regression when analyzing non-monotonic binary data for certain response curves. Properties of local logistic regression will be presented along with simulation results that indicate some of the strengths of the procedure.
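The idea of fitting a logistic regression at each point can be sketched as below. This is only an illustration under stated assumptions, not the paper's implementation: the Gaussian kernel, the bandwidth, the small ridge term that keeps the Newton step well defined, and the synthetic dose-response data are all choices made for this example.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def local_logistic(x0, xs, ys, bandwidth, ridge=1e-6, iters=25):
    """Estimate P(y = 1 | x = x0) from a kernel-weighted logistic fit.

    Assumptions of this sketch: Gaussian kernel weights, a two-parameter
    model logit(p) = a + b * (x - x0), and Newton-Raphson updates.
    """
    weights = [math.exp(-0.5 * ((x - x0) / bandwidth) ** 2) for x in xs]
    a, b = 0.0, 0.0  # intercept and slope in the centred variable x - x0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y, w in zip(xs, ys, weights):
            d = x - x0
            p = sigmoid(a + b * d)
            r = w * (y - p)        # weighted score contribution
            v = w * p * (1.0 - p)  # weighted information contribution
            g0 += r
            g1 += r * d
            h00 += v
            h01 += v * d
            h11 += v * d * d
        # Tiny ridge penalty: keeps the 2x2 system invertible.
        g0 -= ridge * a
        g1 -= ridge * b
        h00 += ridge
        h11 += ridge
        det = h00 * h11 - h01 * h01
        da = (h11 * g0 - h01 * g1) / det
        db = (h00 * g1 - h01 * g0) / det
        a += da
        b += db
        if abs(da) + abs(db) < 1e-10:
            break
    # Because x is centred at x0, the fitted probability at x0 is sigmoid(a).
    return sigmoid(a)


def make_demo_data(trials=20):
    # Hypothetical non-monotonic response curve that peaks at x = 5.
    xs, ys = [], []
    for i in range(21):
        x = 0.5 * i
        p = 0.2 + 0.6 * math.exp(-((x - 5.0) ** 2) / 4.0)
        successes = round(trials * p)
        xs += [x] * trials
        ys += [1] * successes + [0] * (trials - successes)
    return xs, ys


xs, ys = make_demo_data()
p_mid = local_logistic(5.0, xs, ys, bandwidth=1.0)   # near the peak
p_edge = local_logistic(1.0, xs, ys, bandwidth=1.0)  # on the low shoulder
```

Centring the covariate at the target point means the local intercept alone gives the fitted probability there, so one small fit per evaluation point is enough to trace a non-monotonic curve.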

2.

A Baseline-free Procedure for Transformation Models Under Interval Censorship

Gu MG Sun L Zuo G 《Lifetime data analysis》2005,11(4):473-488

An important property of the Cox regression model is that the estimation of regression parameters using the partial likelihood procedure does not depend on its baseline survival function. We call such a procedure baseline-free. Using marginal likelihood, we show that a baseline-free procedure can be derived for a class of general transformation models under an interval censoring framework. The baseline-free procedure results in a simplified and stable computation algorithm for some complicated and important semiparametric models, such as frailty models and heteroscedastic hazard/rank regression models, where the estimation procedures so far available involve estimation of the infinite dimensional baseline function. A detailed computational algorithm using Markov Chain Monte Carlo stochastic approximation is presented. The proposed procedure is demonstrated through extensive simulation studies, showing the validity of asymptotic consistency and normality. We also illustrate the procedure with a real data set from a study of breast cancer. A heuristic argument showing that the score function is a mean zero martingale is provided.

3.

Imputation for semiparametric transformation models with biased-sampling data

H Liu J Qin Y Shen 《Lifetime data analysis》2012,18(4):470-503

Widely recognized in many fields including economics, engineering, epidemiology, health sciences, technology and wildlife management, length-biased sampling generates biased and right-censored data but often provides the best information available for statistical inference. Different from traditional right-censored data, length-biased data have unique aspects resulting from their sampling procedures. We exploit these unique aspects and propose a general imputation-based estimation method for analyzing length-biased data under a class of flexible semiparametric transformation models. We present new computational algorithms that can jointly estimate the regression coefficients and the baseline function semiparametrically. The imputation-based method under the transformation model provides an unbiased estimator regardless of whether or not the censoring is independent of the covariates. We establish large-sample properties using the empirical processes method. Simulation studies show that under small to moderate sample sizes, the proposed procedure has smaller mean square errors than two existing estimation procedures. Finally, we demonstrate the estimation procedure with a real data example.

4.

Empirical likelihood for nonlinear regression models with nonignorable missing responses

Zhihuang Yang Niansheng Tang 《Revue canadienne de statistique》2020,48(3):386-416

This article develops three empirical likelihood (EL) approaches to estimate parameters in nonlinear regression models in the presence of nonignorable missing responses. These are based on the inverse probability weighted (IPW) method, the augmented IPW (AIPW) method and the imputation technique. A logistic regression model is adopted to specify the propensity score. Maximum likelihood estimation is used to estimate parameters in the propensity score by combining the idea of importance sampling and imputing estimating equations. Under some regularity conditions, we obtain the asymptotic properties of the maximum EL estimators of these unknown parameters. Simulation studies are conducted to investigate the finite sample performance of our proposed estimation procedures. Empirical results provide evidence that the AIPW procedure exhibits better performance than the other two procedures. Data from a survey conducted in 2002 are used to illustrate the proposed estimation procedure.

5.

An approximate maximum likelihood procedure for parameter estimation in multivariate discrete data regression models

Andrew W. Roddam 《Journal of applied statistics》2001,28(2):273-279

This paper considers an alternative to iterative procedures used to calculate maximum likelihood estimates of regression coefficients in a general class of discrete data regression models. These models can include both marginal and conditional models and also local regression models. The classical estimation procedure is generally via a Fisher-scoring algorithm and can be computationally intensive for high-dimensional problems. The alternative method proposed here is non-iterative and is likely to be more efficient in high-dimensional problems. The method is demonstrated on two different classes of regression models.

6.

Multiple Hypothesis Testing for Variable Selection

Florian Rohart 《Australian & New Zealand Journal of Statistics》2016,58(2):245-267

We propose two new procedures based on multiple hypothesis testing for correct support estimation in high‐dimensional sparse linear models. We conclusively prove that both procedures are powerful and do not require the sample size to be large. The first procedure tackles the atypical setting of ordered variable selection through an extension of a testing procedure previously developed in the context of a linear hypothesis. The second procedure is the main contribution of this paper. It enables data analysts to perform support estimation in the general high‐dimensional framework of non‐ordered variable selection. A thorough simulation study and applications to real datasets using the R package mht show that our non‐ordered variable procedure produces excellent results in terms of correct support estimation as well as in terms of mean square errors and false discovery rate, when compared to common methods such as the Lasso, the SCAD penalty, forward regression or the false discovery rate procedure (FDR).

7.

More efficient logistic analysis using moving extreme ranked set sampling

Hani M. Samawi Haresh Rochani Daniel Linder Arpita Chatterjee 《Journal of applied statistics》2017,44(4):753-766

Logistic regression is the most popular technique available for modeling dichotomous dependent variables. It has extensive application in the social, medical, behavioral and public health sciences. In this paper we propose a more efficient logistic regression analysis based on the moving extreme ranked set sampling (MERSS_min) scheme, with ranking based on an easily available auxiliary variable known to be associated with the variable of interest (the response variable). The paper demonstrates that this approach provides a more powerful testing procedure, as well as more efficient odds ratio and parameter estimation, than simple random sampling (SRS). Theoretical derivation and simulation studies are provided. Real data from the 2011 Youth Risk Behavior Surveillance System (YRBSS) are used to illustrate the procedures developed in this paper.

8.

Robust Model-Free Multiclass Probability Estimation

Wu Y Zhang HH Liu Y 《Journal of the American Statistical Association》2010,105(489):424-436

Classical statistical approaches for multiclass probability estimation are typically based on regression techniques such as multiple logistic regression, or density estimation approaches such as linear discriminant analysis (LDA) and quadratic discriminant analysis (QDA). These methods often make certain assumptions on the form of probability functions or on the underlying distributions of subclasses. In this article, we develop a model-free procedure to estimate multiclass probabilities based on large-margin classifiers. In particular, the new estimation scheme is employed by solving a series of weighted large-margin classifiers and then systematically extracting the probability information from these multiple classification rules. A main advantage of the proposed probability estimation technique is that it does not impose any strong parametric assumption on the underlying distribution and can be applied for a wide range of large-margin classification methods. A general computational algorithm is developed for class probability estimation. Furthermore, we establish asymptotic consistency of the probability estimates. Both simulated and real data examples are presented to illustrate competitive performance of the new approach and compare it with several other existing methods.

9.

Diagnostic Displays for Assessing Leverage and Influence in Generalized Linear Models

Andy H. Lee 《Australian & New Zealand Journal of Statistics》1987,29(3):233-243

A general technique for assessing leverage and influential observations in Generalized Linear Models is described. The procedure takes the form of Half-Normal plots with envelopes derived from simulation to enhance overall assessment of the model. This procedure of assessment is more informative and provides additional insight compared with procedures based on the largest sample leverage and influence statistics. Application of the method is illustrated with an example in logistic regression.

10.

Comparing Liang-Zeger estimates with maximum likelihood in bivariate logistic regression

《Journal of Statistical Computation and Simulation》2012,82(3-4):133-148

The estimation methods of Liang and Zeger (1986) are compared to the method of maximum likelihood for a particular parametric model for bivariate logistic regression. The effect of correlation between the responses and intracluster correlation of the explanatory variables on the estimation procedures is studied. Specific comparisons via simulation are made in the context of an ophthalmologic data set.

11.

Estimation of semiparametric regression model with longitudinal data

Yanqing Sun 《Lifetime data analysis》2010,16(2):271-298

In a longitudinal study, an individual is followed up over a period of time. Repeated measurements on the response and some time-dependent covariates are taken at a series of sampling times. The sampling times are often irregular and depend on covariates. In this paper, we propose a sampling adjusted procedure for the estimation of the proportional mean model without having to specify a sampling model. Unlike existing procedures, the proposed method is robust to model misspecification of the sampling times. Large sample properties are investigated for the estimators of both regression coefficients and the baseline function. We show that the proposed estimation procedure is more efficient than the existing procedures. Large sample confidence intervals for the baseline function are also constructed by perturbing the estimation equations. A simulation study is conducted to examine the finite sample properties of the proposed estimators and to compare with some of the existing procedures. The method is illustrated with a data set from a recurrent bladder cancer study.

12.

A general structural model for decomposing time series and its analysis as a generalized regression model

Ralf Pauly 《Statistical Papers》1989,30(1):245-261

13.

An additive–multiplicative mean model for panel count data with dependent observation and dropout processes

Guanglei Yu Yang Li Liang Zhu Hui Zhao Jianguo Sun Leslie L. Robison 《Scandinavian Journal of Statistics》2019,46(2):414-431

This paper discusses regression analysis of panel count data with dependent observation and dropout processes. For the problem, a general mean model is presented that can allow both additive and multiplicative effects of covariates on the underlying point process. In addition, the proportional rates model and the accelerated failure time model are employed to describe possible covariate effects on the observation process and the dropout or follow‐up process, respectively. For estimation of regression parameters, some estimating equation‐based procedures are developed and the asymptotic properties of the proposed estimators are established. In addition, a resampling approach is proposed for estimating a covariance matrix of the proposed estimator and a model checking procedure is also provided. Results from an extensive simulation study indicate that the proposed methodology works well for practical situations, and it is applied to a motivating set of real data.

14.

Indirect estimation of (latent) linear models with ordinal regressors: A Monte Carlo study and some empirical illustrations

Martin Kukuk 《Statistical Papers》2002,43(3):379-399

Summary: This paper investigates the effects of ordinal regressors in linear regression models and in limited dependent variable models. Each ordered categorical variable is interpreted as a rough measurement of an underlying continuous variable as it is often done in microeconometrics for the dependent variable. It is shown that using ordinal indicators only leads to correct answers in a few special cases. In most situations, the usual estimators are biased. In order to estimate the parameters of the model consistently, the indirect estimation procedure suggested by Gourieroux et al. (1993) is applied. To demonstrate this method, first a simulation study is performed and then in a second step, two real data sets are used. In the latter case, continuous regressors are transformed into categorical variables to study the behavior of the estimation procedure. The method is extended to the case of limited dependent variable models. In general, the indirect estimators lead to adequate results. Received: March 27, 2000; revised version: March 6, 2001

15.

A simulation study on SPSS ridge regression and ordinary least squares regression procedures for multicollinearity data

John Zhang Mahmud Ibrahim 《Journal of applied statistics》2005,32(6):571-588

This study compares the SPSS ordinary least squares (OLS) regression and ridge regression procedures in dealing with multicollinear data. The LS regression method is one of the most frequently applied statistical procedures in practice. It is well documented that the LS method is extremely unreliable in parameter estimation when the independent variables are correlated (the multicollinearity problem). The Ridge Regression procedure deals with the multicollinearity problem by introducing a small bias in the parameter estimation. The application of Ridge Regression involves the selection of a bias parameter and it is not clear if it works better in applications. This study uses a Monte Carlo method to compare the results of the OLS procedure with the Ridge Regression procedure in SPSS.
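The contrast between the two estimators can be illustrated with a minimal two-predictor sketch. It does not reproduce the SPSS procedures or the study's Monte Carlo design: the data values, the no-intercept model, and the bias parameter `k = 0.1` are hypothetical choices for this example. Both fits solve the same 2x2 normal equations, with ridge adding `k` to the diagonal.

```python
def fit_two_predictors(x1, x2, y, k=0.0):
    """Solve (X'X + k*I) beta = X'y for a two-predictor, no-intercept model.

    k = 0 gives ordinary least squares; k > 0 gives ridge regression.
    """
    s11 = sum(a * a for a in x1) + k
    s22 = sum(b * b for b in x2) + k
    s12 = sum(a * b for a, b in zip(x1, x2))
    t1 = sum(a * c for a, c in zip(x1, y))
    t2 = sum(b * c for b, c in zip(x2, y))
    det = s11 * s22 - s12 * s12
    beta1 = (s22 * t1 - s12 * t2) / det
    beta2 = (s11 * t2 - s12 * t1) / det
    return beta1, beta2


# Hypothetical, nearly collinear predictors: x2 is x1 plus a tiny perturbation.
x1 = [-2.0, -1.0, 0.0, 1.0, 2.0]
x2 = [-1.99, -1.02, 0.0, 1.02, -0.01 + 2.0]
noise = [0.3, -0.2, 0.1, -0.3, 0.1]
y = [a + b + e for a, b, e in zip(x1, x2, noise)]

b_ols = fit_two_predictors(x1, x2, y)            # k = 0: OLS
b_ridge = fit_two_predictors(x1, x2, y, k=0.1)   # k > 0: ridge

norm_ols = sum(b * b for b in b_ols) ** 0.5
norm_ridge = sum(b * b for b in b_ridge) ** 0.5
```

With predictors this close to collinear, the determinant of X'X is near zero, so the OLS coefficients react strongly to the noise. Ridge trades a small bias for a shorter, more stable coefficient vector, and choosing `k` is the open question the abstract raises.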

16.

The Ubiquity of Statistics

William Kruskal 《The American statistician》2013,67(1):3-6

The use of biased estimation in data analysis and model building is discussed. A review of the theory of ridge regression and its relation to generalized inverse regression is presented along with the results of a simulation experiment and three examples of the use of ridge regression in practice. Comments on variable selection procedures, model validation, and ridge and generalized inverse regression computation procedures are included. The examples studied here show that when the predictor variables are highly correlated, ridge regression produces coefficients which predict and extrapolate better than least squares and is a safe procedure for selecting variables.

17.

Logistic regression in meta-analysis using aggregate data

Bei-Hung Chang Stuart Lipsitz Christine Waternaux 《Journal of applied statistics》2000,27(4):411-424

We derived two methods to estimate the logistic regression coefficients in a meta-analysis when only the 'aggregate' data (mean values) from each study are available. The estimators we proposed are the discriminant function estimator and the reverse Taylor series approximation. These two methods of estimation gave similar estimators using an example of individual data. However, when aggregate data were used, the discriminant function estimators were quite different from the other two estimators. A simulation study was then performed to evaluate the performance of these two estimators as well as the estimator obtained from the model that simply uses the aggregate data in a logistic regression model. The simulation study showed that all three estimators are biased. The bias increases as the variance of the covariate increases. The distribution type of the covariates also affects the bias. In general, the estimator from the logistic regression using the aggregate data has less bias and better coverage probabilities than the other two estimators. We concluded that analysts should be cautious in using aggregate data to estimate the parameters of the logistic regression model for the underlying individual data.

18.

On discrimination procedure with mixtures of continuous and categorical variables

Gafar Matanmi Oyeyemi George Chinanu Mbaeyi Saheed Ishola Salawu Bernard Olagboyega Muse 《Journal of applied statistics》2016,43(10):1864-1873

A discrimination procedure based on the location model is described and suggested for use in situations where the discriminating variables are mixtures of continuous and binary variables. Some procedures that have previously been employed in similar situations, such as Fisher's linear discriminant function and logistic regression, were compared with this method using the error rate (ER). Optimal ERs for these procedures are reported using real and simulated data for varying sample sizes and numbers of continuous and binary variables, and were used as a measure for assessing the performance of the various procedures. The suggested procedure performed considerably better in the cases considered and never produced a result that was poor when compared with the other procedures. Hence, the suggested procedure might be considered for such situations.

19.

A fuzzy robust regression approach applied to bedload transport data

Jalal Chachi 《Communications in Statistics - Simulation and Computation》2017,46(3):1703-1714

Fuzzy least-square regression can be very sensitive to unusual data (e.g., outliers). In this article, we describe how to fit an alternative robust-regression estimator in a fuzzy environment, which attempts to identify and ignore unusual data. The proposed approach concerns classical robust regression and estimation methods that are insensitive to outliers. In this regard, based on the least trimmed square estimation method, an estimation procedure is proposed for determining the coefficients of the fuzzy regression model for crisp input-fuzzy output data. The investigated fuzzy regression model is applied to bedload transport data, forecasting suspended load by discharge based on real-world data. The accuracy of the proposed method is compared with the well-known fuzzy least-square regression model. The comparison results reveal that the fuzzy robust regression model performs better than the other models in suspended load estimation for the particular dataset. This comparison is done based on a similarity measure between fuzzy sets. The proposed model is general and can be used for modeling natural phenomena whose available observations are reported as imprecise rather than crisp.

20.

The application of stochastic approximation methods to the bio-assay problem

Dan Anbar 《Journal of statistical planning and inference》1977,1(2):191-206

A new general model for the bio-assay problem is introduced. It is shown that when the slope of the dose-response curve and the median effective dose is known, the Robbins-Monro method yields an asymptotically optimal estimation procedure. Adaptive procedures are discussed for the case of unknown slope. Results of Monte Carlo studies are given.
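The Robbins-Monro recursion for locating a median effective dose can be sketched as follows. This is not the model introduced in the paper: the logistic dose-response curve, the true values `TRUE_ED50` and `TRUE_SLOPE`, the starting dose, and the seeded simulator are assumptions made for illustration. The gain constant is set to the reciprocal of the curve's derivative at the median dose, which is the standard choice when that slope is treated as known.

```python
import math
import random


def robbins_monro_median_dose(respond, start, gain, n_steps):
    """Run the recursion x_{n+1} = x_n - (gain / n) * (y_n - 0.5).

    respond(dose) must return a binary response (1 = response, 0 = none).
    """
    dose = start
    for n in range(1, n_steps + 1):
        y = respond(dose)
        # Step down after a response, up after a non-response.
        dose -= (gain / n) * (y - 0.5)
    return dose


# Hypothetical logistic dose-response curve used only for this simulation.
TRUE_ED50 = 2.0
TRUE_SLOPE = 1.0
rng = random.Random(0)  # fixed seed so the run is reproducible


def simulated_subject(dose):
    p = 1.0 / (1.0 + math.exp(-TRUE_SLOPE * (dose - TRUE_ED50)))
    return 1 if rng.random() < p else 0


# For a logistic curve the derivative at the median dose is slope / 4,
# so the gain is its reciprocal, 4 / slope.
estimate = robbins_monro_median_dose(
    simulated_subject, start=0.0, gain=4.0 / TRUE_SLOPE, n_steps=5000
)
```

When the slope is unknown, the gain itself has to be estimated as the experiment proceeds, which is the adaptive case the abstract mentions.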