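Joint Modeling and Variable Selection from Zero-Inflated Count Data
Cite this article: Hu Yanan, Tian Maozai. Joint Modeling and Variable Selection from Zero-Inflated Count Data[J]. Statistical Research (统计研究), 2019, 36(1): 104-114.
Authors: Hu Yanan  Tian Maozai
Affiliations: Department of Economic Statistics, Business School, Zhengzhou University; Center for Applied Statistics, Renmin University of China (a Ministry of Education Key Research Institute of Humanities and Social Sciences); Renmin University of China
Funding: Supported by the Renmin University of China Scientific Research Fund (Fundamental Research Funds for the Central Universities), project "Research on Robust Statistical Theory and Applications for Big Data Analysis" (18XNL012)
Abstract: Zero-inflated count data violate the variance-mean relationship of the Poisson distribution. They can be explained by a mixture distribution in which Poisson-distributed observations and observations fixed at zero (a degenerate distribution) each make up a certain proportion. Based on the adaptive elastic net, this paper studies joint modeling and variable selection for zero-inflated count data. For the zero-inflated Poisson distribution, latent variables are introduced to construct the complete likelihood of the zero-inflated Poisson model, which consists of two terms: a zero-inflation part and a Poisson part. Because the covariates may be collinear and sparse, the objective function is obtained by adding an adaptive elastic-net penalty to the likelihood function. The EM algorithm is then used to obtain sparse estimators of the regression coefficients, and the Bayesian information criterion (BIC) is used to determine the optimal tuning parameters. The paper also gives a theoretical proof of the large-sample properties of the estimators and a simulation study, and finally applies the proposed method to a real problem.

Keywords: Zero-inflated Poisson model  Variable selection  Joint modeling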
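
The abstract does not reproduce the model equations, so the block below is only a sketch of the standard zero-inflated Poisson (ZIP) regression setup that the description appears to follow. All notation is an assumption made for illustration: the mixing probability $p_i$, the Poisson mean $\mu_i$, the covariate vectors $z_i$ and $x_i$, the latent indicator $u_i$ (1 if observation $i$ comes from the degenerate zero state), the tuning parameters $\lambda_1, \lambda_2$ and the adaptive weights $\hat{w}_j$. The paper's exact parameterization, weights and treatment of intercepts may differ.

```latex
\begin{align*}
P(Y_i = 0) &= p_i + (1 - p_i)\,e^{-\mu_i}, \\
P(Y_i = k) &= (1 - p_i)\,\frac{e^{-\mu_i}\mu_i^{k}}{k!}, \quad k = 1, 2, \dots, \\
E(Y_i) &= (1 - p_i)\,\mu_i, \qquad
\operatorname{Var}(Y_i) = (1 - p_i)\,\mu_i\,(1 + p_i\mu_i), \\
\operatorname{logit}(p_i) &= z_i^{\top}\gamma, \qquad \log \mu_i = x_i^{\top}\beta, \\
\ell_c(\gamma, \beta) &= \sum_{i=1}^{n}\Bigl[u_i\, z_i^{\top}\gamma
    - \log\bigl(1 + e^{z_i^{\top}\gamma}\bigr)\Bigr]
  + \sum_{i=1}^{n}(1 - u_i)\Bigl[y_i\, x_i^{\top}\beta
    - e^{x_i^{\top}\beta} - \log y_i!\Bigr], \\
Q(\gamma, \beta) &= -\ell_c(\gamma, \beta)
  + \lambda_1 \sum_{j} \hat{w}_j\,\lvert\theta_j\rvert
  + \lambda_2 \sum_{j} \theta_j^{2},
  \qquad \theta = (\gamma^{\top}, \beta^{\top})^{\top}, \\
\hat{u}_i &= \begin{cases}
  \dfrac{p_i}{p_i + (1 - p_i)\,e^{-\mu_i}}, & y_i = 0, \\[1ex]
  0, & y_i \ge 1.
\end{cases}
\end{align*}
```

Under this sketch the variance exceeds the mean whenever $p_i > 0$, which is the broken variance-mean relationship mentioned in the abstract. The complete log-likelihood $\ell_c$ separates into a zero-inflation (logistic) term and a Poisson term, and the last line is the E-step that replaces the unobserved $u_i$ with its conditional expectation.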

Joint Modeling and Variable Selection from Zero-Inflated Count Data
Hu Yanan & Tian Maozai. Joint Modeling and Variable Selection from Zero-Inflated Count Data[J]. Statistical Research, 2019, 36(1): 104-114.
Authors: Hu Yanan & Tian Maozai
Abstract: Zero-inflated count data violate the mean-variance relationship of the Poisson distribution and can be explained by a mixture distribution composed, in certain proportions, of Poisson-distributed data and zero-valued observations (a degenerate distribution). This paper studies joint modeling and variable selection for zero-inflated count data based on the adaptive elastic-net technique. For the zero-inflated Poisson distribution, latent variables are introduced to construct the complete likelihood of the regression model, which consists of two components (zero-inflated and Poisson). To account for possible collinearity and sparsity among the covariates, the objective function is obtained by adding the adaptive elastic-net penalty to the likelihood function. The sparse estimator of the regression coefficients is then obtained by using the EM algorithm to optimize the objective function, and the Bayesian information criterion (BIC) is employed to determine the optimal tuning parameter. The paper also establishes the large-sample properties of the proposed estimator theoretically, examines its performance in a simulation study, and applies the method to real data.
Keywords: Zero-inflated Poisson Model  Variable Selection  Joint Modeling
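
To make the latent-variable EM idea in the abstract concrete, here is a minimal sketch in Python. It is deliberately much simpler than the paper's method: it fits an intercept-only zero-inflated Poisson model with no covariates and no adaptive elastic-net penalty, so both M-step updates have closed forms. The function names `simulate_zip` and `fit_zip_em`, the starting values and the stopping rule are all hypothetical choices for illustration, not taken from the paper.

```python
import numpy as np


def simulate_zip(n, pi, mu, seed=0):
    """Draw n values from a zero-inflated Poisson mixture.

    With probability pi an observation is a structural zero (degenerate
    part); otherwise it is drawn from Poisson(mu).
    """
    rng = np.random.default_rng(seed)
    structural_zero = rng.random(n) < pi      # latent indicator u_i
    counts = rng.poisson(mu, size=n)
    return np.where(structural_zero, 0, counts)


def fit_zip_em(y, n_iter=500, tol=1e-10):
    """EM for an intercept-only ZIP model (assumption: no covariates, no penalty).

    Returns (pi_hat, mu_hat).
    """
    y = np.asarray(y, dtype=float)
    is_zero = (y == 0)
    # Crude starting values: half of the observed zeros are structural.
    pi, mu = 0.5 * is_zero.mean(), max(y.mean(), 1e-8)
    for _ in range(n_iter):
        # E-step: posterior probability that each observed zero is structural;
        # positive counts cannot come from the degenerate part.
        u = np.where(is_zero, pi / (pi + (1.0 - pi) * np.exp(-mu)), 0.0)
        # M-step: closed-form maximizers of the expected complete log-likelihood.
        pi_new = u.mean()
        mu_new = ((1.0 - u) * y).sum() / (1.0 - u).sum()
        converged = abs(pi_new - pi) + abs(mu_new - mu) < tol
        pi, mu = pi_new, mu_new
        if converged:
            break
    return pi, mu


if __name__ == "__main__":
    y = simulate_zip(20000, pi=0.3, mu=2.5, seed=1)
    pi_hat, mu_hat = fit_zip_em(y)
    print(f"pi_hat={pi_hat:.3f}, mu_hat={mu_hat:.3f}")
```

In the regression setting described in the abstract, the two closed-form updates would instead become penalized fits of a logistic part and a Poisson part, each weighted by the E-step probabilities, with the tuning parameters chosen by BIC. That extension is not shown here.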
This article is indexed in VIP (维普) and other databases.