首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Non-parametric estimation of data dimensionality prior to data compression: the case of the human development index
Authors:David Canning  Declan French
Institution:1. Department of Global Health and Population, Harvard School of Public Health , Harvard University , Boston , MA , USA;2. UKCRC Centre of Excellence for Public Health, Management School , Queens University , Belfast BT7 1NN, UK
Abstract:In many applications in applied statistics, researchers reduce the complexity of a data set by combining a group of variables into a single measure using a factor analysis or an index number. We argue that such compression loses information if the data actually have high dimensionality. We advocate the use of a non-parametric estimator, commonly used in physics (the Takens estimator), to estimate the correlation dimension of the data prior to compression. The advantage of this approach over traditional linear data compression approaches is that the data do not have to be linearised. Applying our ideas to the United Nations Human Development Index, we find that the four variables that are used in its construction have dimension 3 and the index loses information.
Keywords:development  well-being  dimension  measure  indicator
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号