Similar articles
20 similar articles found.
1.
In longitudinal studies, repeated observations are made on the same individual, so the response variables will usually be correlated. In analyzing such data, this dependence must be taken into account to avoid misleading inferences. The focus of this paper is to apply a logistic marginal model with Markovian dependence proposed by Azzalini [A. Azzalini, Logistic regression for autocorrelated data with application to repeated measures, Biometrika 81 (1994) 767–775] to study the influence of time-dependent covariates on the marginal distribution of the binary response in serially correlated binary data. We show how to construct the model so that the covariates relate only to the mean value of the process, independently of the association parameters. After formulating the proposed model for repeated measures data, the same approach is applied to missing data. An application is provided to the diabetes mellitus data of patients registered at the Bangladesh Institute of Research and Rehabilitation in Diabetes, Endocrine and Metabolic Disorders (BIRDEM) in 1984, using both time-stationary and time-varying covariates.
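To make the separation between the marginal mean and the serial association concrete, here is a minimal Python sketch (illustrative only, not the authors' code) that simulates a serially correlated binary series whose marginal means follow a logistic regression in the covariates while a lag-one odds ratio psi governs the Markov dependence; the joint probability of consecutive responses is recovered from the margins and psi via the Plackett formula, and all function names and data are made up for the example.

```python
import numpy as np
from scipy.special import expit

def joint_p11(a, b, psi):
    """P(Y_{t-1}=1, Y_t=1) given margins a, b and odds ratio psi (Plackett formula)."""
    if abs(psi - 1.0) < 1e-12:
        return a * b                       # psi = 1 means serial independence
    s = 1.0 + (a + b) * (psi - 1.0)
    return (s - np.sqrt(s * s - 4.0 * psi * (psi - 1.0) * a * b)) / (2.0 * (psi - 1.0))

def simulate_markov_binary(X, beta, psi, rng):
    """Binary series with logistic marginals p_t = expit(x_t' beta); the
    covariates affect only the marginal mean, independently of psi."""
    p = expit(X @ beta)
    y = np.empty(len(p), dtype=int)
    y[0] = rng.random() < p[0]
    for t in range(1, len(p)):
        p11 = joint_p11(p[t - 1], p[t], psi)
        cond = p11 / p[t - 1] if y[t - 1] else (p[t] - p11) / (1.0 - p[t - 1])
        y[t] = rng.random() < cond         # transition chosen so the margin p_t holds
    return y

rng = np.random.default_rng(0)
X = np.column_stack([np.ones(10), np.arange(10) / 10.0])   # intercept + time
y = simulate_markov_binary(X, beta=np.array([-0.5, 1.0]), psi=4.0, rng=rng)
```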

2.
In this paper the issue of making inferences with misclassified data from a noisy multinomial process is addressed. A Bayesian model for making inferences about the proportions and the noise parameters is developed. The problem is reformulated in a more tractable form by introducing auxiliary or latent random vectors. This allows for an easy-to-implement Gibbs sampling-based algorithm to generate samples from the distributions of interest. An illustrative example related to elections is also presented.
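A minimal sketch of the data-augmentation idea, under the assumption of Dirichlet priors on both the true proportions and the rows of the misclassification matrix (the toy counts and all names are illustrative; in practice the noise parameters need informative priors or partial knowledge to be identifiable):

```python
import numpy as np

rng = np.random.default_rng(1)
K = 3
n_obs = np.array([120, 60, 20])          # observed (noisy) category counts

p = np.full(K, 1.0 / K)                  # true-category proportions
M = np.full((K, K), 1.0 / K)             # M[j, k] = P(observe k | true category j)
alpha_p, alpha_M = np.ones(K), np.ones(K)

for it in range(2000):
    # Latent step: allocate the observations in each observed category k
    # to true categories, proportionally to p[j] * M[j, k].
    latent = np.zeros((K, K))            # latent[j, k] = count truly j, seen as k
    for k in range(K):
        w = p * M[:, k]
        latent[:, k] = rng.multinomial(n_obs[k], w / w.sum())
    # Conjugate Dirichlet updates given the completed (augmented) data.
    p = rng.dirichlet(alpha_p + latent.sum(axis=1))
    M = np.vstack([rng.dirichlet(alpha_M + latent[j]) for j in range(K)])
```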

3.
The number of parameters in a linear mixed effects (LME) model mushrooms in the case of multivariate repeated measures data. Computing these parameters becomes a real problem as the number of response variables or the number of time points increases, and the problem grows more intricate with the inclusion of additional random effects. A multivariate analysis is not possible in a small-sample setting. We propose a method to estimate these many parameters in bits and pieces from 'baby' models, taking a subset of response variables at a time, and finally combining these pieces to obtain the parameter estimates for the 'mother' model with all variables taken together. Applying this method, one can calculate the fixed effects, the best linear unbiased predictions (BLUPs) for the random effects in the model, and also the BLUPs at each time of observation for each response variable, in order to monitor the effectiveness of the treatment for each subject. The proposed method is illustrated with an example of multiple response variables measured over multiple time points arising from a clinical trial in osteoporosis.

4.
A full likelihood method is proposed to analyse continuous longitudinal data with non-ignorable (informative) missing values and non-monotone patterns. The problem arose in a breast cancer clinical trial where repeated assessments of quality of life were collected: patients rated their coping ability during and after treatment. We allow the missingness probabilities to depend on unobserved responses, and we use a multivariate normal model for the outcomes. A first-order Markov dependence structure for the responses is a natural choice and facilitates the construction of the likelihood; estimates are obtained via the Nelder–Mead simplex algorithm. Computations are difficult and become intractable with more than three or four assessments. Applying the method to the quality-of-life data results in easily interpretable estimates, confirms the suspicion that the data are non-ignorably missing and highlights the likely bias of standard methods. Although treatment comparisons are not affected here, the methods are useful for obtaining unbiased means and estimating trends over time.

5.
Recent research has extended standard methods for meta-analysis to more general forms of evidence synthesis, where the aim is to combine different data types or data summaries that contain information about functions of multiple parameters in order to make inferences about the parameters of interest. We consider one such scenario in which the goal is to make inferences about the association between a primary binary exposure and a continuously valued outcome in the presence of several confounding exposures, and where the data are available in various forms: individual participant data (IPD) with repeated measures, sample means aggregated over strata, and binary data generated by thresholding the underlying continuously valued outcome measure. We show that an estimator of the population mean of a continuously valued outcome can be constructed from binary threshold data provided that a separate estimate of the outcome standard deviation is available. The results of a simulation study show that this estimator has negligible bias but is less efficient than the sample mean; the minimum variance ratio is derived from a Taylor series expansion. Combining this estimator with sample means and IPD from different sources (such as a series of published studies) using both linear and probit regression does, however, improve the precision of estimation considerably by incorporating data that would otherwise have been excluded for being in the wrong format. We apply these methods to investigate the association between the G277S mutation in the transferrin gene and serum ferritin (iron) levels, separately in pre- and post-menopausal women, based on data from three published studies.
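The threshold-data estimator has a simple closed form when the outcome is assumed normal: if p = P(Y > c), then (c - mu)/sigma = Phi^{-1}(1 - p), so mu = c - sigma * Phi^{-1}(1 - p). A minimal Python sketch (function name and numbers are illustrative, not from the paper):

```python
import numpy as np
from scipy.stats import norm

def mean_from_threshold(n_above, n_total, cutoff, sigma):
    """Estimate the mean of a normal outcome from binary threshold data.
    P(Y > cutoff) = 1 - Phi((cutoff - mu)/sigma)  =>  mu = cutoff - sigma * Phi^{-1}(1 - p)."""
    p_hat = n_above / n_total
    return cutoff - sigma * norm.ppf(1.0 - p_hat)

# e.g. 30 of 100 women above a ferritin cutoff of 50, with sigma = 25
# taken from a separate source (all values hypothetical)
mu_hat = mean_from_threshold(30, 100, cutoff=50.0, sigma=25.0)
```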

6.
Population-level proportions of individuals that fall at different points in the spectrum of disease severity, from asymptomatic infection to severe disease, are often difficult to observe, but estimating these quantities can provide information about the nature and severity of the disease in a particular population. Logistic and multinomial regression techniques are often applied in infectious disease modeling of large populations and are suited to identifying variables associated with a particular disease or disease state. However, they are less appropriate for estimating infection state prevalence over time because they do not naturally accommodate known disease dynamics such as the duration of time an individual is infectious, heterogeneity in the risk of acquiring infection, and patterns of seasonality. We propose a Bayesian compartmental model to estimate latent infection state prevalence over time that easily incorporates known disease dynamics. We demonstrate how and why a stochastic compartmental model is a better approach for determining infection state proportions than multinomial regression, using a novel method for estimating Bayes factors for models with high-dimensional parameter spaces. We provide an example using visceral leishmaniasis in Brazil and present an empirically adjusted reproductive number for the infection.

7.
Missing outcome data constitute a serious threat to the validity and precision of inferences from randomized controlled trials. In this paper, we propose the use of a multistate Markov model for the analysis of incomplete individual patient data for a dichotomous outcome reported over a period of time. The model accounts for patients dropping out of the study and also for patients relapsing. The time of each observation is accounted for, and the model allows the estimation of time-dependent relative treatment effects. We apply our methods to data from a study comparing the effectiveness of two pharmacological treatments for schizophrenia. The model jointly estimates the relative efficacy and the dropout rate and allows a wide range of clinically interesting inferences to be made. Assumptions about the missingness mechanism and the unobserved outcomes of patients dropping out can be incorporated into the analysis. The presented method constitutes a viable candidate for analyzing longitudinal, incomplete binary data.
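As a rough illustration of the idea (a simplified stand-in, not the paper's model), dropout can be treated as one extra state of a discrete-time Markov chain, so that incomplete follow-up contributes to the likelihood rather than being discarded; in an analysis, a transition matrix like the one below would be estimated separately per treatment arm:

```python
import numpy as np

# states: 0 = no response, 1 = response, 2 = dropped out (absorbing)
P = np.array([[0.60, 0.25, 0.15],
              [0.20, 0.65, 0.15],
              [0.00, 0.00, 1.00]])   # illustrative transition probabilities

def log_lik(sequences, P):
    """Log likelihood of observed state sequences under a discrete-time
    Markov chain; dropout is simply one more state of the chain."""
    ll = 0.0
    for seq in sequences:
        for s, t in zip(seq[:-1], seq[1:]):
            ll += np.log(P[s, t])
    return ll

print(log_lik([[0, 1, 1, 2], [0, 0, 1, 1]], P))
```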

8.
The validity conditions for univariate or multivariate analyses of repeated measures are highly sensitive to the usual assumptions. In cancer experiments, the data are frequently heteroscedastic and strongly correlated over time, and standard analyses do not perform well. Alternative non-parametric approaches can contribute to an analysis of such longitudinal data. This paper describes a method for these situations, using the results from a comparative experiment in which tumour volume is evaluated over time. First, we apply the non-parametric approach proposed by Raz to construct a randomization F test for comparing treatments. A local polynomial fit is then conducted to estimate the growth curves and confidence intervals for each treatment. Finally, this technique is used to estimate the velocity of tumour growth.
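A minimal sketch of a randomization F test in this spirit, simplified to one summary per subject (say, each subject's mean log tumour volume over time) rather than the fitted-curve statistic of the paper; function names and data layout are illustrative:

```python
import numpy as np

def f_stat(y, g):
    """Between-group F-type statistic on subject-level summaries."""
    groups = [y[g == lab] for lab in np.unique(g)]
    grand = y.mean()
    msb = sum(len(x) * (x.mean() - grand) ** 2 for x in groups) / (len(groups) - 1)
    msw = sum(((x - x.mean()) ** 2).sum() for x in groups) / (len(y) - len(groups))
    return msb / msw

def randomization_f_test(y, g, n_perm=9999, seed=0):
    """P-value from re-randomizing treatment labels, avoiding the normality
    and homoscedasticity assumptions of the classical F test."""
    rng = np.random.default_rng(seed)
    obs = f_stat(y, g)
    hits = sum(f_stat(y, rng.permutation(g)) >= obs for _ in range(n_perm))
    return (hits + 1) / (n_perm + 1)
```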

9.
Statistical agencies make changes to the data collection methodology of their surveys to improve the quality of the data collected or to improve the efficiency with which they are collected. For reasons of cost it may not be possible to estimate the effect of such a change on survey estimates or response rates reliably without conducting an experiment embedded in the survey, in which some respondents are enumerated using the new method and some under the existing method. Embedded experiments are often designed for repeated and overlapping surveys; however, previous methods use sample data from only one occasion. The paper focuses on estimating the effect of a methodological change on estimates in the case of repeated surveys with overlapping samples from several occasions. Efficient design of an embedded experiment that covers more than one time point is also discussed. All inference is unbiased over an assumed measurement model, the experimental design and the complex sample design. Other benefits of the proposed approach include the following: it exploits the correlation between the samples on each occasion to improve estimates of treatment effects; treatment effects are allowed to vary over time; it is robust against incorrectly rejecting the null hypothesis of no treatment effect; and it allows a wide set of alternative experimental designs. The paper applies the proposed methodology to the Australian Labour Force Survey to measure the effect of replacing pen-and-paper interviewing with computer-assisted interviewing. This application considered alternative experimental designs in terms of their statistical efficiency and their risks to maintaining a consistent series. The proposed approach is significantly more efficient than using only one month of sample data in estimation.

10.
Standard tests for trends address the null hypothesis. This article presents a non-null hypothesis test for linear trends in proportions. A weighted least squares method is used to estimate the regression coefficient of the proportions. The non-null hypothesis sets the expectation of this coefficient equal to a prescribed regression coefficient margin, and its variance is used to construct a basic relationship for linear trends in proportions via the asymptotic normal method. Derivations of the sample size formula, the power function, and the test statistic follow. The expected power is obtained from the power function, and the observed power is exhibited by the Monte Carlo method; the agreement between the two is excellent. The test reduces to the classical test for linear trends in proportions when the margin is set to zero. It is the non-null counterpart of the classical test and can be applied to assess the clinical significance of trends among several proportions; by contrast, the classical test is restricted to assessing statistical significance. A set of data from a website is used to illustrate the methodology.
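A hedged sketch of the construction: estimate the trend coefficient by weighted least squares with binomial weights and compare it with a prescribed margin using an asymptotic normal statistic; setting margin=0 recovers a classical trend test. The data in the example are made up.

```python
import numpy as np
from scipy.stats import norm

def trend_test(r, n, x, margin=0.0):
    """WLS slope for proportions p_i = r_i/n_i on scores x_i, tested
    against a prescribed margin (margin=0 gives the classical null test)."""
    p = r / n
    w = n                                   # binomial weights
    xbar = np.average(x, weights=w)
    sxx = np.sum(w * (x - xbar) ** 2)
    b = np.sum(w * (x - xbar) * p) / sxx    # WLS regression coefficient
    var_b = np.sum(w**2 * (x - xbar) ** 2 * p * (1 - p) / n) / sxx**2
    z = (b - margin) / np.sqrt(var_b)
    return b, z, 2 * norm.sf(abs(z))

b, z, pval = trend_test(r=np.array([10, 18, 28]),
                        n=np.array([100, 100, 100]),
                        x=np.array([0.0, 1.0, 2.0]),
                        margin=0.05)
```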

11.
This paper deals with estimation of a green tree frog population in an urban setting using a repeated capture–mark–recapture (CMR) method over several weeks with an individual tagging system, which gives rise to a complicated generalization of the hypergeometric distribution. Based on maximum likelihood estimation, a parametric bootstrap approach is adopted to obtain interval estimates of the weekly population size, which is the main objective of our work. The method is computationally intensive, and implementing the resampling algorithm requires substantial programming. It can be applied to estimate the population size of any species based on repeated CMR data at multiple time points. Further, it is pointed out that the well-known Jolly–Seber method, which is based on some strong assumptions, either produces unrealistic estimates or involves assumptions that are not valid for our observed data set.
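The paper's likelihood arises from a complicated generalization of the hypergeometric distribution; the parametric bootstrap logic can nevertheless be illustrated on the much simpler two-sample Lincoln–Petersen setting (a stand-in for exposition, not the authors' model; counts are invented):

```python
import numpy as np

def lincoln_petersen(n1, n2, m):
    """Point estimate of population size from a two-sample CMR study:
    n1 marked, n2 recaptured sample, m marked among the recaptures."""
    return n1 * n2 / m

def parametric_bootstrap_ci(n1, n2, m, B=10000, seed=0):
    """Percentile CI: resample recapture counts from the fitted
    hypergeometric model and re-estimate N each time."""
    rng = np.random.default_rng(seed)
    N_hat = int(round(lincoln_petersen(n1, n2, m)))
    m_star = rng.hypergeometric(n1, N_hat - n1, n2, size=B)
    m_star = np.maximum(m_star, 1)            # guard against zero recaptures
    N_star = n1 * n2 / m_star
    return np.percentile(N_star, [2.5, 97.5])

print(parametric_bootstrap_ci(n1=60, n2=55, m=12))
```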

12.
There is a growing demand for public use data while at the same time there are increasing concerns about the privacy of personal information. One proposed method for accomplishing both goals is to release data sets that do not contain real values but yield the same inferences as the actual data. The idea is to view confidential data as missing and use multiple imputation techniques to create synthetic data sets. In this article, we compare techniques for creating synthetic data sets in simple scenarios with a binary variable.
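For a single binary variable with a Beta prior, the "confidential data as missing" idea reduces to drawing synthetic copies from the posterior predictive distribution, as in this minimal sketch (the Beta(1,1) prior and the toy data are assumptions of the example, not the article's choices):

```python
import numpy as np

def synthetic_binary_datasets(y, m=5, a=1.0, b=1.0, seed=0):
    """Treat the confidential values as missing and impute them from the
    Beta-Binomial posterior predictive distribution, producing m fully
    synthetic copies that support the same inference about the proportion."""
    rng = np.random.default_rng(seed)
    n, s = len(y), int(np.sum(y))
    out = []
    for _ in range(m):
        p_draw = rng.beta(a + s, b + n - s)    # draw from the posterior of p
        out.append(rng.binomial(1, p_draw, size=n))
    return out

synth = synthetic_binary_datasets(np.array([1, 0, 1, 1, 0, 0, 1, 0]), m=5)
```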

13.
Guimei Zhao, Statistics, 2017, 51(3): 609–614
In this paper, we deal with hypothesis testing problems for univariate linear calibration, in which a normally distributed response variable and an explanatory variable are involved, and observations of the response variable corresponding to known values of the explanatory variable are used to make inferences about a single unknown value of the explanatory variable. Uniformly most powerful unbiased tests for both one-sided and two-sided hypotheses are constructed and verified. The power behaviour of the proposed tests is numerically compared with that of the existing method, and simulations show that the proposed tests improve power.
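For orientation, the classical t-based test of this calibration hypothesis (a textbook sketch, not the paper's UMPU construction) can be written in a few lines: under H0: x0 = x00 the standardized prediction error of the new response follows a t distribution on n - 2 degrees of freedom. The data below are invented.

```python
import numpy as np
from scipy.stats import t as t_dist

def calibration_test(x, y, y0, x00):
    """t test of H0: x0 = x00 in univariate linear calibration: under H0
    the new response y0 has mean alpha + beta * x00."""
    n = len(x)
    xbar, sxx = x.mean(), np.sum((x - x.mean()) ** 2)
    beta = np.sum((x - xbar) * (y - y.mean())) / sxx
    alpha = y.mean() - beta * xbar
    s2 = np.sum((y - alpha - beta * x) ** 2) / (n - 2)   # residual variance
    se = np.sqrt(s2 * (1.0 + 1.0 / n + (x00 - xbar) ** 2 / sxx))
    t_stat = (y0 - (alpha + beta * x00)) / se
    return t_stat, 2 * t_dist.sf(abs(t_stat), n - 2)

rng = np.random.default_rng(6)
x = np.linspace(0.0, 10.0, 25)
y = 2.0 + 0.5 * x + rng.normal(scale=0.3, size=25)
y0 = 2.0 + 0.5 * 4.0 + rng.normal(scale=0.3)     # new response at true x0 = 4
print(calibration_test(x, y, y0, x00=4.0))
```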

14.
The zero-inflated Poisson (ZIP) distribution is widely used for modeling count data in which the frequency of zeros is higher than expected under the Poisson distribution. There are many methods for making inferences about the inflation parameter in ZIP models, e.g. methods for testing the Poisson (the inflation parameter is zero) against the ZIP distribution (the inflation parameter is positive). Most of these methods are based on maximum likelihood estimators, which have no explicit expression. The estimators obtained by the method of moments, however, are powerful enough and easy to obtain and implement. In this paper, we propose an approach based on the method of moments for making inferences about the inflation parameter in the ZIP distribution. Our method is compared with some recent methods via a simulation study and illustrated by an example.
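The moment estimators are indeed explicit: writing E[X] = (1 - pi) * lam and E[X^2] = (1 - pi) * lam * (1 + lam) gives lam = E[X^2]/E[X] - 1 and pi = 1 - E[X]/lam. A minimal sketch with simulated ZIP data (the simulation settings are illustrative):

```python
import numpy as np

def zip_moment_estimates(x):
    """Method-of-moments estimates for the ZIP(pi, lam) model:
    E[X] = (1-pi)*lam and E[X^2] = (1-pi)*lam*(1+lam), so
    lam = E[X^2]/E[X] - 1 and pi = 1 - E[X]/lam."""
    x = np.asarray(x, dtype=float)
    m1, m2 = np.mean(x), np.mean(x ** 2)
    lam = m2 / m1 - 1.0
    pi = 1.0 - m1 / lam
    return pi, lam

rng = np.random.default_rng(2)
# structural zeros with probability 0.3, otherwise Poisson(3)
x = rng.poisson(3.0, size=1000) * rng.binomial(1, 0.7, size=1000)
print(zip_moment_estimates(x))   # should be near (0.3, 3.0)
```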

15.
We consider varying coefficient models, which extend classical linear regression models in that the regression coefficients are replaced by functions of certain variables (for example, time); the covariates are also allowed to depend on other variables. Varying coefficient models are popular in longitudinal and panel data studies and have been applied in fields such as finance and the health sciences. We consider longitudinal data and estimate the coefficient functions by the flexible B-spline technique. An important question in a varying coefficient model is whether an estimated coefficient function is statistically different from a constant (or zero). We develop testing procedures based on the estimated B-spline coefficients, making use of nice properties of the B-spline basis. Our method allows longitudinal data in which repeated measurements for an individual can be correlated. We obtain the asymptotic null distribution of the test statistic. The power of the proposed testing procedures is illustrated on simulated data, where we highlight the importance of including the correlation structure of the response variable, and on real data.
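A simplified sketch of such a constancy test in the single-covariate model y = beta(t) * x + e: expand beta(t) in a B-spline basis and compare the spline fit with a constant-coefficient fit by an F-type statistic. This toy version ignores the within-subject correlation that the paper's procedure accounts for, and it assumes SciPy's BSpline.design_matrix (SciPy >= 1.8); all names and data are illustrative.

```python
import numpy as np
from scipy.interpolate import BSpline
from scipy.stats import f as f_dist

def constancy_test(t, x, y, n_knots=5, degree=3):
    """F-type test of H0: beta(t) constant in y = beta(t) * x + e,
    with beta(t) expanded in a B-spline basis."""
    knots = np.linspace(t.min() - 1e-9, t.max() + 1e-9, n_knots)
    tt = np.r_[[knots[0]] * degree, knots, [knots[-1]] * degree]
    B = BSpline.design_matrix(t, tt, degree).toarray()   # basis functions at t
    X1 = B * x[:, None]                                  # varying-coefficient design
    X0 = x[:, None]                                      # constant-coefficient design
    rss = lambda X: np.sum((y - X @ np.linalg.lstsq(X, y, rcond=None)[0]) ** 2)
    r0, r1 = rss(X0), rss(X1)
    df1, df2 = X1.shape[1] - 1, len(y) - X1.shape[1]
    F = ((r0 - r1) / df1) / (r1 / df2)
    return F, f_dist.sf(F, df1, df2)

rng = np.random.default_rng(8)
t = np.sort(rng.uniform(0.0, 1.0, 200))
x = rng.normal(size=200)
y = (1.0 + np.sin(2 * np.pi * t)) * x + rng.normal(scale=0.5, size=200)
print(constancy_test(t, x, y))
```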

16.
This paper extends the univariate time series smoothing approach provided by penalized least squares to a multivariate setting, thus allowing for joint estimation of several time series trends. The theoretical results are valid for the general multivariate case, but particular emphasis is placed on the bivariate situation from an applied point of view. The proposal is based on a vector signal-plus-noise representation of the observed data that requires only the first two sample moments and the specification of a single smoothing constant. A measure of the amount of smoothness of an estimated trend is introduced so that an analyst can set in advance a desired percentage of smoothness to be achieved by the trend estimate; the required smoothing constant is determined by the chosen percentage. Closed-form expressions for the smoothed estimated vector and its variance-covariance matrix are derived from a straightforward application of generalized least squares, thus providing best linear unbiased estimates for the trends. A detailed algorithm for estimating bivariate time series trends is also presented and justified. The theoretical results are supported by a simulation study and two real applications: one to Mexican and US macroeconomic data within the context of business cycle analysis, and the other to environmental data from a monitored site in Scotland.
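The closed-form solution can be sketched with a Whittaker-type second-difference penalty applied column-wise to a multivariate series under a single smoothing constant (a simplified stand-in for the paper's vector formulation; the smoothness-percentage calibration of the constant is omitted, and the data are simulated):

```python
import numpy as np

def pls_trend(y, lam):
    """Penalized-least-squares trend: minimize ||y - tau||^2 + lam * ||D2 tau||^2,
    whose closed-form solution is tau = (I + lam * D2'D2)^{-1} y, applied
    column-wise so all series share one smoothing constant."""
    y = np.atleast_2d(np.asarray(y, dtype=float))
    if y.shape[0] < y.shape[1]:
        y = y.T                                   # put the series in columns
    n = y.shape[0]
    D2 = np.diff(np.eye(n), n=2, axis=0)          # second-difference operator
    A = np.eye(n) + lam * D2.T @ D2
    return np.linalg.solve(A, y)

t = np.arange(120)
rng = np.random.default_rng(3)
y = np.column_stack([np.sin(t / 12) + 0.3 * rng.normal(size=120),
                     0.01 * t + 0.3 * rng.normal(size=120)])
trend = pls_trend(y, lam=1600.0)                  # joint bivariate smoothing
```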

17.
Synthetic likelihood is an attractive approach to likelihood-free inference when an approximately Gaussian summary statistic for the data, informative for inference about the parameters, is available. The synthetic likelihood method derives an approximate likelihood function from a plug-in normal density estimate for the summary statistic, with plug-in mean and covariance matrix obtained by Monte Carlo simulation from the model. In this article, we develop alternatives to Markov chain Monte Carlo implementations of Bayesian synthetic likelihoods with reduced computational overheads. Our approach uses stochastic gradient variational inference methods for posterior approximation in the synthetic likelihood context, employing unbiased estimates of the log likelihood. We compare the new method with a related likelihood-free variational inference technique in the literature, while at the same time improving the implementation of that approach in a number of ways. These new algorithms are feasible to implement in situations which are challenging for conventional approximate Bayesian computation methods, in terms of the dimensionality of the parameter and summary statistic.
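The plug-in synthetic log likelihood at a single parameter value can be sketched in a few lines (the toy model and names are illustrative; the article goes further, building a stochastic-gradient variational approximation on top of unbiased estimates of this quantity):

```python
import numpy as np
from scipy.stats import multivariate_normal

def synthetic_loglik(theta, s_obs, simulate, n_sim=200, seed=0):
    """Plug-in Gaussian synthetic log likelihood: simulate summary
    statistics at theta, fit their mean and covariance by Monte Carlo,
    and evaluate the observed summary under that normal density."""
    rng = np.random.default_rng(seed)
    S = np.array([simulate(theta, rng) for _ in range(n_sim)])
    mu, Sigma = S.mean(axis=0), np.cov(S, rowvar=False)
    return multivariate_normal.logpdf(s_obs, mean=mu, cov=Sigma)

# toy model: data ~ N(theta, 1); summaries = (mean, variance) of 50 draws
def simulate(theta, rng):
    x = rng.normal(theta, 1.0, size=50)
    return np.array([x.mean(), x.var()])

print(synthetic_loglik(0.5, s_obs=np.array([0.4, 1.1]), simulate=simulate))
```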

18.
Panel studies are statistical studies in which two or more variables are observed for two or more subjects at two or more points in time. Cross-lagged panel studies are those in which the variables are continuous and divide naturally into two sets, with interest centering on the cross-effects, that is, the impacts of each set of variables on the other. If a regression approach is taken, a regression structure is formulated for the cross-lagged model. This structure may assume that the regression parameters are homogeneous across waves and across subpopulations. Under such assumptions the methods of multivariate regression analysis can be adapted to make inferences about the parameters. These inferences are limited to the degree that homogeneity of the parameters is supported by the data. We consider the problem of testing the hypotheses of homogeneity, and the problem of making statistical inferences about the cross-effects should there be evidence against one of the homogeneity assumptions. We demonstrate the methods developed by applying them to two panel data sets.
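A minimal sketch of the cross-lagged regression structure for two variables observed at two waves: each wave-t variable is regressed on both lagged variables, and the coefficient on the other variable's lag is the cross-effect. Homogeneity across waves or subpopulations would equate such coefficients. The simulated data and names are illustrative.

```python
import numpy as np

def ols(X, y):
    """OLS coefficients with an intercept prepended."""
    X1 = np.column_stack([np.ones(len(y)), X])
    return np.linalg.lstsq(X1, y, rcond=None)[0]

rng = np.random.default_rng(5)
n = 300
x1, y1 = rng.normal(size=n), rng.normal(size=n)               # wave t-1
y2 = 0.5 * y1 + 0.2 * x1 + rng.normal(scale=0.5, size=n)      # wave t
x2 = 0.6 * x1 + 0.1 * y1 + rng.normal(scale=0.5, size=n)

b_y = ols(np.column_stack([y1, x1]), y2)   # [intercept, stability, cross-effect of x]
b_x = ols(np.column_stack([x1, y1]), x2)   # [intercept, stability, cross-effect of y]
```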

19.
We define a new family of influence measures based on divergence measures in the multivariate general linear model. Influence measures are obtained by quantifying the divergence between the sample distribution of an estimate obtained from all the observations and the sample distribution of the same estimate obtained when an observation is deleted. This approach is applied to best linear unbiased estimates of estimable functions. These diagnostics can therefore be applied to every statistical multivariate technique that can be formulated as this kind of model. Some examples are given to clarify the applicability of the introduced diagnostics.
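As a concrete instance of the idea (a sketch for the ordinary linear model; the paper works with the multivariate general linear model and a family of divergences), one can compare the estimated sampling distribution of the coefficient vector with all observations against the one computed with a single case deleted, using the Kullback-Leibler divergence between the two normal approximations:

```python
import numpy as np

def kl_mvn(mu0, S0, mu1, S1):
    """KL divergence between N(mu0, S0) and N(mu1, S1)."""
    k = len(mu0)
    S1_inv = np.linalg.inv(S1)
    d = mu1 - mu0
    return 0.5 * (np.trace(S1_inv @ S0) + d @ S1_inv @ d - k
                  + np.log(np.linalg.det(S1) / np.linalg.det(S0)))

def kl_influence(X, y):
    """Divergence-based influence of each case: full-data distribution of
    beta-hat versus the distribution with case i deleted."""
    n, p = X.shape
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    s2 = np.sum((y - X @ beta) ** 2) / (n - p)
    S = s2 * np.linalg.inv(X.T @ X)
    out = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        Xi, yi = X[keep], y[keep]
        bi = np.linalg.lstsq(Xi, yi, rcond=None)[0]
        s2i = np.sum((yi - Xi @ bi) ** 2) / (n - 1 - p)
        Si = s2i * np.linalg.inv(Xi.T @ Xi)
        out[i] = kl_mvn(beta, S, bi, Si)
    return out

rng = np.random.default_rng(7)
X = np.column_stack([np.ones(40), rng.normal(size=40)])
y = X @ np.array([1.0, 2.0]) + rng.normal(size=40)
infl = kl_influence(X, y)       # large values flag influential cases
```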

20.
Evidence from a number of methodological studies is used to assess the overall quality of data from the Panel Study of Income Dynamics (PSID). Despite substantial cumulative non-response over the nearly two decades spanned by the study, the sample is found to maintain its representation of the non-immigrant population of the United States. The most important reasons for this result are that the study's following rules ensure that the sample replaces itself in the same manner as the population (through the formation of new families by the offspring of old ones) and that non-response is largely unsystematic. Non-response also appears to be largely random with respect to parameters in a number of behavioral models. The accuracy of measures is assessed by comparing survey measures with national aggregates and with highly accurate individual validating data. PSID reports of transfer income compare more favorably with program aggregates than do reports from other large-scale surveys such as the Current Population Survey. Finally, although PSID survey measures are generally unbiased when compared with validating data, they contain amounts of measurement-error variance that range from trivially small to very large.
