首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Regularized boxplot via convex clustering
Authors:Hosik Choi  J C Poythress  Jong-June Jeon  Changyi Park
Institution:1. Department of Applied Statistics, Kyonggi University, Suwon, Korea;2. Department of Statistics, University of Georgia, Athens, GA, Georgia;3. Department of Statistics, University of Seoul, Dongdaemun-gu, Seoul Korea;4. Natural Science Research Institute, University of Seoul, Dongdaemun-gu, Seoul, Korea
Abstract:A boxplot is a simple and effective exploratory data analysis tool for graphically summarizing a distribution of data. However, in cases where the quartiles in a boxplot are inaccurately estimated, these estimates can affect subsequent analyses. In this paper, we consider the problem of constructing boxplots in a bivariate setting with a categorical covariate with multiple subgroups, and assume that some of these boxplots can be clustered. We propose to use this grouping property to improve the estimation of the quartiles. We demonstrate that the proposed method more accurately estimates the quartiles compared to the usual boxplot. It is also shown that the proposed method identifies outliers effectively as a consequence of accurate quartiles, and possesses a clustering effect due to the group property. We then apply the proposed method to annual maximum precipitation data in South Korea and present its clustering results.
Keywords:Box-whisker plot  convex clustering  group comparison  shrinkage estimator
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号