首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Associations between multiple discrete measures are often due to collapsing over other variables. When the variables collapsed over are unobserved and continuous, log-multiplicative association models, including log-linear models with linear-by-linear interactions for ordinal categorical data and extensions of Goodman's (1979, 1985) RC(M) association model for multiple nominal and/or ordinal categorical variables, can be used to study the relationship between the observed discrete variables and the unobserved continuous ones, and to study the unobserved variables. The derivation and use of log-multiplicative association models as latent variable models for discrete variables are presented in this paper. The models are based on graphical models for discrete and continuous variables where the variables follow a conditional Gaussian distribution. The models have many desirable properties, including having schematic or graphical representations of the system of observed and unobserved variables, the log-multiplicative models can be read from the graphs, and estimates of the means, variances, and covariances of the latent variables given values on the observed variables are a function of the log-multiplicative model parameters. To illustrate some of the advantageous aspects of these models, two examples are presented. In one example, responses to items from the General Social Survey (Davis and Smith 1996) are modeled, and in the other example, panel data from two groups (Coleman 1964) are analyzed.  相似文献   

2.
3.
4.
《Social Networks》1987,9(1):1-36
In 1983, Holland, Laskey, and Leinhardt, using the ideas of Holland and Leinhardt, and Fienberg and Wasserman, introduced the notion of a stochastic blockmodel. The mathematics for stochastic a priori blockmodels, in which exogenous actor attribute data are used to partition actors independently of any statistical analysis of the available relational data, have been refined by several researchers and the resulting models used by many. Attempts to simultaneously partition actors and to perform relational data analyses using statistical methods that yield stochastic a posteriori blockmodels are still quite rare. In this paper, we discuss some old suggestions for producing such posterior blockmodels, and comment on other new suggestions based on multiple comparisons of model parameters, log-linear models for ordinal categorical data, and correspondence analysis. We also review measures for goodness-of-fit of a blockmodel, and we describe a natural approach to this problem using likelihood-ratio statistics generated from a popular model for relational data.  相似文献   

5.
This paper describes and contrasts two useful ways to employ a latent class variable as a mixture variable in regression analyses of panel data with a categorical dependent variable. One way is to model unobserved heterogeneity in the trajectory, or change in the distribution, of the dependent variable. Two models that accomplish this are the latent trajectory model and latent growth curve model for a categorical dependent variable having ordered categories. Each latent class here represents a distinct trajectory of the dependent variable. The latent trajectory model introduces covariate effects on the composition of latent classes, while the latent growth curve model introduces covariate effects on both the "intercept" and the "slope" of growth in logit, which may vary among latent classes.
The other useful way is to model unobserved heterogeneity in the state dependence of the dependent variable. Two models that accomplish this are introduced for a simultaneous analysis of response probability and response stability, and the latent class variable is employed to distinguish two latent populations that differ in the stability of responses over time. One of them is the switching multinomial logit model with a time-lagged dependent variable as its separation indicator, and the other is the mover-stayer regression model.
By applying these four models to empirical data, this paper demonstrates the usefulness of these models for panel-data analyses. Example programs for specifying these models based on the LEM program are also provided.  相似文献   

6.
HOW TO IMPUTE INTERACTIONS, SQUARES, AND OTHER TRANSFORMED VARIABLES   总被引:1,自引:0,他引:1  
Researchers often carry out regression analysis using data that have missing values. Missing values can be filled in using multiple imputation, but imputation is tricky if the regression includes interactions, squares, or other transformations of the regressors. In this paper, we examine different approaches to imputing transformed variables; and we find one simple method that works well across a variety of circumstances. Our recommendation is to transform, then impute —i.e., calculate the interactions or squares in the incomplete data and then impute these transformations like any other variable. The transform-then-impute method yields good regression estimates, even though the imputed values are often inconsistent with one another. It is tempting to try and "fix" the inconsistencies in the imputed values, but methods that do so lead to biased regression estimates. Such biased methods include the passive imputation strategy implemented by the popular ice command for Stata.  相似文献   

7.
We propose an alternative method of conducting exploratory latent class analysis that utilizes latent class factor models, and compare it to the more traditional approach based on latent class cluster models. We show that when formulated in terms of R mutually independent, dichotomous latent factors, the LC factor model has the same number of distinct parameters as an LC cluster model with R+1 clusters. Analyses over several data sets suggest that LC factor models typically fit data better and provide results that are easier to interpret than the corresponding LC cluster models. We also introduce a new graphical "bi-plot" display for LC factor models and compare it to similar plots used in correspondence analysis and to a barycentric coordinate display for LC cluster models. New results on identification of LC models are also presented. We conclude by describing various model extensions and an approach for eliminating boundary solutions in identified and unidentified LC models, which we have implemented in a new computer program.  相似文献   

8.
Although the methodology for handling ordinal and dichotomous observed variables in structural equation models (SEMs) is developing rapidly, several important issues are unresolved. One of these is the optimal test statistic to apply as a test of overall model fit. We propose a new "vanishing tetrad" test statistic for such models. We build on Bollen's (1990) simultaneous test statistic for testing multiple vanishing tetrads and on Bollen and Ting's (1993) confirmatory tetrad analysis (CTA) for hypothesis testing of model structures. These and other works on vanishing tetrads assume continuous observed variables and do not consider observed categorical variables. In this paper we present a method to test models when some or all of the observed variables are collapsed or categorical versions of underlying continuous variables. The test statistic that we provide is an alternative "overall fit" statistic for SEMs with censored, ordinal, or dichotomous observed variables. Furthermore, the vanishing tetrad test sometimes permits us to compare the fit of some models that are not nested in the traditional likelihood ratio test. We illustrate the new test statistic with examples and a small simulation experiment comparing it with two other tests of model fit for SEMs with ordinal or dichotomous endogenous variables.  相似文献   

9.
Effects of categorical variables in statistical models typically are reported in terms of comparison either with a reference category or with a suitably defined "mean effect," for reasons of parameter identification. A conventional presentation of estimates and standard errors, but without the full variance–covariance matrix, does not allow subsequent readers either to make inference on a comparison of interest that is not presented or to compare or combine results from different studies where the same variables but different reference levels are used. It is shown how an alternative presentation, in terms of "quasi standard errors," overcomes this problem in an economical and intuitive way. A primary application is the reporting of effects of categorical predictors, often called factors, in linear and generalized linear models, hazard models, multinomial–response models, generalized additive models, etc. Other applications include the comparison of coefficients between related regression equations—for example, log–odds ratios in a multinomial logit model—and the presentation of multipliers or "scores" in models with multiplicative interaction structure.  相似文献   

10.
Multilevel Latent Class Models   总被引:4,自引:0,他引:4  
The latent class (LC) models that have been developed so far assume that observations are independent. Parametric and nonparametric random–coefficient LC models are proposed here, which will make it possible to modify this assumption. For example, the models can be used for the analysis of data collected with complex sampling designs, data with a multilevel structure, and multiple–group data for more than a few groups. An adapted EM algorithm is presented that makes maximum–likelihood estimation feasible. The new model is illustrated with examples from organizational, educational, and cross–national comparative research.  相似文献   

11.
A General Class of Nonparametric Models for Ordinal Categorical Data   总被引:1,自引:0,他引:1  
This paper presents a general class of models for ordinal categorical data that can be specified by means of linear and/or log-linear equality and/or inequality restrictions on the (conditional) probabilities of a multiway contingency table. Some special cases are models with ordered local odds ratios, models with ordered cumulative response probabilities, order-restricted row association and column association models, and models for stochastically ordered marginal distributions. A simple unidimensional Newton algorithm is proposed for obtaining the restricted maximum-likelihood estimates. In situations in which there is some kind of missing data, this algorithm can be implemented in the M step of an EM algorithm. Computation of p-values of testing statistics is performed by means of parametric bootstrapping.  相似文献   

12.
In many applications observations have some type of clustering, with observations within clusters tending to be correlated. A common instance of this occurs when each subject in the sample undergoes repeated measurement, in which case a cluster consists of the set of observations for the subject. One approach to modeling clustered data introduces cluster-level random effects into the model. The use of random effects in linear models for normal responses is well established. By contrast, random effects have only recently seen much use in models for categorical data. This chapter surveys a variety of potential social science applications of random effects modeling of categorical data. Applications discussed include repeated measurement for binary or ordinal responses, shrinkage to improve multiparameter estimation of a set of proportions or rates, multivariate latent variable modeling, hierarchically structured modeling, and cluster sampling. The models discussed belong to the class of generalized linear mixed models (GLMMs), an extension of ordinary linear models that permits nonnormal response variables and both fixed and random effects in the predictor term. The models are GLMMs for either binomial or Poisson response variables, although we also present extensions to multicategory (nominal or ordinal) responses. We also summarize some of the technical issues of model-fitting that complicate the fitting of GLMMs even with existing software.  相似文献   

13.
An "effect display" is a graphical or tabular summary of a statistical model based on high-order terms in the model. Effect displays have previously been defined by Fox (1987, 2003) for generalized linear models (including linear models). Such displays are especially compelling for complicated models—for example, those including interactions or polynomial terms. This paper extends effect displays to models commonly used for polytomous categorical response variables: the multinomial logit model and the proportional-odds logit model. Determining point estimates of effects for these models is a straightforward extension of results for the generalized linear model. Estimating sampling variation for effects on the probability scale in the multinomial and proportional-odds logit models is more challenging, however, and we use the delta method to derive approximate standard errors. Finally, we provide software for effect displays in the R statistical computing environment.  相似文献   

14.
Ordinal response scales with a middle category are widely used in public opinion studies, psychology, medicine, computed tomography and other fields. The usual models in the statistical literature for ordinal response variables treat the case where the scale has a natural middle category no differently from the case where the scale does not have a middle category. This paper proposes new models for the analysis of ordinal response scales with middle categories, applying these to data collected in 1993-1994 on American opinion toward the balance between environmental quality and economic prosperity. Some of the models should also be useful when the scale does not have a natural middle category. The models are easily used to address issues of concern in empirical work—for example, stochastic ordering among covariate classes and asymmetry about the middle category. Log-linear models are considered in Section 2. The relationship between the normal distribution and a quadratic log-linear model with known scores, discussed in this section, is the basis for Section 3, which considers a log-nonlinear model with unknown scores estimated from the data. Section 4 shows how generalized log-linear and generalized log-nonlinear models can be used to simultaneously study whether the response is below, at, or above the midpoint, and the conditional distribution of responses above (below) the midpoint. These models are also useful when the response scale is viewed as nested and/or the response process is sequential.  相似文献   

15.
Procedures for ascertaining relative model adequacy in latent variable structural relations models are discussed. Under diverse methods of estimation, this determination may be assessed using the chi square goodness of fit statistic, incremental fit indices for covariance structure models, and latent variable coefficients of determination. An example from evaluation research is taken (cf. Magidson, 1977; Bentler & Woodward, 1978). Numerical sensitivity of parameter estimates under alternative model specifications is demonstrated. Interpretive implications based on these procedures are discussed in terms of parameter sensitivity to alternative model specifications.  相似文献   

16.
The standard latent class model is a finite mixture of indirectly observed multinomial distributions, each of which is assumed to exhibit statistical independence. Latent class analysis has been applied in a wide variety of research contexts, including studies of mobility, educational attainment, agreement, and diagnostic accuracy, and as measurement error models in social research. One of the attractive features of the latent class model in these settings is that the parameters defining the individual multinomials are readily interpretable marginal probabilities, conditional on the unobserved latent variable(s), that are often of substantive interest. There are, however, settings where the local-independence axiom is not supported, and hence it is useful to consider some form of local dependence. In this paper we consider a family of models defined in terms of finite mixtures of multinomial models where the multinomials are parameterized in terms of a set of models for the univariate marginal distributions and for marginal associations. Local dependence is introduced through the models for marginal associations, and the standard latent class model obtains as a special case. Three examples are analyzed with the models to illustrate their utility in analyzing complex cross-classifications.  相似文献   

17.
Recent developments have made model-based imputation of network data feasible in principle, but the extant literature provides few practical examples of its use. In this paper, we consider 14 schools from the widely used In-School Survey of Add Health (Harris et al., 2009), applying an ERGM-based estimation and simulation approach to impute the network missing data for each school. Add Health's complex study design leads to multiple types of missingness, and we introduce practical techniques for handing each. We also develop a cross-validation based method – Held-Out Predictive Evaluation (HOPE) – for assessing this approach. Our results suggest that ERGM-based imputation of edge variables is a viable approach to the analysis of complex studies such as Add Health, provided that care is used in understanding and accounting for the study design.  相似文献   

18.
A model is considered for the regression analysis of multivariate binary data such as repeated-measures data (for example, panel data) or multiple-indicators with measures of some underlying characteristic such as attitude or ability (for example, surveys or tests). The model is related to the usual Rasch model, the usual latent-class model, and other familiar models such as logistic regression. In addition to a regression specification, the model includes parameters that describe heterogeneity not accounted for by the predictors. In contrast to most other approaches, a nonparametric specification of the latent mixing distribution is used, leading to a formulation based on scaled latent classes. We examine the relationship between this model and several other models, give a tractable formulation of the likelihood function and likelihood equations, present an algorithm for maximum-likelihood estimation, and analyze marginal and conditional latent structures. The approach is illustrated with longitudinal data from the German Socioeconomic Panel.  相似文献   

19.
The interlinguistic and intercultural comparability of school satisfaction as a component of subjective well-being (SWB) is analysed using a sample of 11 to 14-year-olds extracted from the Children's Worlds international database, which includes 15 countries (N = 17,246). To this end, two multi-item scales and one single-item scale on overall life satisfaction were adopted as indicators of SWB. Six items on satisfaction with different facets of school life were included as subjective indicators of school satisfaction.Previous analyses conducted by different authors in several countries were replicated by means of multiple regression analysis in order to explore which items could be better used to test models which alternatively use one or two latent variables. Different options were considered and analysed using Structural Equations Modelling (SEM) to estimate the most appropriate model for a cross-country comparison of school satisfaction as a component of SWB. Multi-group SEM was used on six items regarding school-related satisfaction, related alternatively to one and two latent variables. We propose a multi-group model with two latent variables for school-related satisfaction related to the SLSS and OLS for the purposes of cross-cultural comparison; the model displays excellent fit indexes.  相似文献   

20.
Multiple imputation (MI), a two-stage process whereby missing data are imputed multiple times and the resulting estimates of the parameter(s) of interest are combined across the completed datasets, is becoming increasingly popular for handling missing data. However, MI can result in biased inference if not carried out appropriately or if the underlying assumptions are not justifiable. Despite this, there remains a scarcity of guidelines for carrying out MI. In this paper we provide a tutorial on the main issues involved in employing MI, as well as highlighting some common pitfalls and misconceptions, and areas requiring further development. When contemplating using MI we must first consider whether it is likely to offer gains (reduced bias or increased precision) over alternative methods of analysis. Once it has been decided to use MI, there are a number of decisions that must be made during the imputation process; we discuss the extent to which these decisions can be guided by the current literature. Finally we highlight the importance of checking the fit of the imputation model. This process is illustrated using a case study in which we impute missing outcome data in a five-wave longitudinal study that compared extremely preterm individuals with term-born controls.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号