首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 541 毫秒
1.
人工神经网络变量选取与隐藏单元数的确定   总被引:2,自引:0,他引:2  
根据多隐藏层所有训练样本误差平方和最小设计优化问题,求解并绘出计算流程图。Trevor等人认为隐藏单元过多比过少好,交叉验证估计(隐藏单元)正则化参数没有必要。还有一种通常做法是常常利用分类树挑选变量作为输入变量进行人工神经网络建模。而从人工神经网络与多元统计、传统回归和其他数据挖掘工具的区别和联系出发,认为这些观点和做法值得商酌;用ZIP编码实例说明隐藏单元过多不一定比过少好,实际数据分析中所需隐藏单元数的确定可以用交叉验证结合经验判断来实现,利用分类树选择的变量对于人工神经网络没有太大的效果;通过分类树删节变量以降低计算量的效果不如通过压缩隐藏单元个数降低计算量来得好;非完全问题“从简单到一般”思想与完全问题中选择所有变量的思想不矛盾。在总结了Le Cun等人的局部联结以有效降低权数思想的基础上,提出通过随机选择人工变量建立人工神经网络分布式模型系统的设想。  相似文献   

2.
The usual maximum likelihood estimators of the parameters of the von Mises distribution are shown to perform badly in small samples. In view of this and the fact that these estimators require a large amount of computation, alternative, simpler estimators are proposed. It is shown that these estimators are at least comparable to the traditional estimators and are, in many cases, superior to them. We also apply the procedure of jackknifing to the maximum likelihood estimator of the concentration parameter of the von Mises distribution and compare the properties of the jackknifed estimator with the other estimators considered in this paper.  相似文献   

3.
The USSR is moving from extensive to intensive type of population reproduction, not because of any governmental measures but as a reaction to objective circumstances. At present there is a reduction in the population cohort of working age. This has an adverse effect on production. And the number of the elderly is increasing; we must involve them more in the production process, in monitoring the service sphere, for instance. The infant mortality rate has declined sharply in the USSR, especially since World War II. The birthrate has also dropped, but it is still higher than in the US, France, or West Germany. Some authorities think that this (the ZPG) is a good idea, but this author thinks it is a bad idea, meaning a drop in society's productive forces. We need an active demographic policy, meaning one which ensures optimal reproduction of the population with an improvement in its health, culture, and living standard. The policy must be differentiated, as conditions vary from one part of the country to another. An example is the 1981 decree on assistance to families with children which provides for payments of 35-50 rubles/month to mothers who must stay home to look after children. The number of children per family to ensure reproduction must be 2.6 on the average. In the Central Asian republics family sizes of 4-5 children are anticipated, but in the RSFSR, Ukraine, Belorussia, and the Baltic republics the anticipated size of families is rather low. Measures must be taken to alter this by providing assistance to families with children.  相似文献   

4.
The joint efforts in the USSR by the Central Planning Commission and the State Statistical Commission to develop projections of family characteristics of the Soviet population are described. The projections, based on official data for the rural and urban populations in 1987, are for the years 1991, 1996, 2001, and 2006.  相似文献   

5.
The Cornish-Fisher expansion of the Pearson type VI distribution is known to be reasonably accurate when both degrees of freedom are relatively large (say greater than or equal to 5). However, when either or both degrees of freedom are less than 5, the accuracy of the computed percentage point begins to suffer; in some cases severely. To correct for this, the error surface in the degrees of freedom plane is modeled by least squares curve fitting for selected levels of tail probability (.025, .05, and .10) which can be used to adjust the percentage point obtained from the usual Cornish-Fisher expansion. This adjustment procedure produces a computing algorithm that computes percentage points of the Pearson type VI distribution at the above probability levels, accurate to at least + 1 in 3 digits in approximately 11 milliseconds per subroutine call on an IBM 370/145. This adjusted routine is valid for both degrees of freedom greater than or equal to 1.  相似文献   

6.
The X2 approximation to the distribution of the sample coefficient of variation of a normally distributed random variable, due to McKay 1932 is not nearly as accurate as suggested by the confirmatory studies of Fieller 1932 and Pearson 1932.The approximation will, however, be adequate in many practical situations and has the convenience of requiring nothing more than the readily available tables of x2.  相似文献   

7.
This work presents an optimal value to be used in the power transformation to transform the exponential to normality for statistical process control (SPC) applications. The optimal value is found by minimizing the sum of absolute differences between two distinct cumulative probability functions. Based on this criterion, a numerical search yields a proposed value of 3.5142, so the transformed distribution is well approximated by the normal distribution. Two examples are presented to demonstrate the effectiveness of using the transformation method and its applications in SPC. The transformed data are almost normally distributed and the performance of the individual charts is satisfactory. Compared to charts that use the original exponential data and probability control limits, the individual charts constructed using the transformed distribution are superior in appearance, ease of interpretation and implementation by practitioners.  相似文献   

8.
Estimation of the parameter in the problem of the Nile is treated as a decision problem with squared error loss, It is shown that the minimum risk scale equivariant estimator dominates the incomplete sufficient unbiased estimators considered by Iwase and Seto, Sharper bounds for the equivariant estimator are derived which may be used to obtain the values of the same from the sample with sufficient accuracy.  相似文献   

9.
洪兴建 《统计研究》2010,27(2):83-86
 由于很多收入抽样数据只是公布了相对简约的分组数据,如何依据信息不完整的分组数据估计样本基尼系数的范围是非常重要的。本文针对分组数据中各组收入的取值范围以及各组人均收入是否已知,从多个方面探讨了样本基尼系数的取值范围,并给出了相应的估算公式。最后,结合我国城乡居民收入的分组数据,实证分析了城乡收入基尼系数的范围。  相似文献   

10.
The population growth rate of the European dipper has been shown to decrease with winter temperature and population size. We examine here the demographic mechanism for this effect by analysing how these factors affect the survival rate. Using more than 20 years of capture-mark-recapture data (1974-1997) based on more than 4000 marked individuals, we perform analyses using open capture-mark-recapture models. This allowed us to estimate the annual apparent survival rates (probability of surviving and staying on the study site from one year to the next one) and the recapture probabilities. We partitioned the variance of the apparent survival rates into sampling variance and process variance using random effects models, and investigated which variables best accounted for temporal process variation. Adult males and females had similar apparent survival rates, with an average of 0.52 and a coefficient of variation of 40%. Chick apparent survival was lower, averaging 0.06 with a coefficient of variation of 42%. Eighty percent of the variance in apparent survival rates was explained by winter temperature and population size for adults and 48% by winter temperature for chicks. The process variance outweighed the sampling variance both for chick and adult survival rates, which explained that shrunk estimates obtained under random effects models were close to MLE estimates. A large proportion of the annual variation in the apparent survival rate of chicks appears to be explained by inter-year differences in dispersal rates.  相似文献   

11.
This is a history of the 1937 census of the USSR, the results of which were suppressed because of an alleged massive undercount. The author attempts to determine whether there was in fact an undercount in this census, and if so how significant it was. He concludes that those responsible were not guilty of producing a significant undercount, and in fact employed the methods required to keep the undercount to a bare mimimum.  相似文献   

12.
Surveillance to detect changes of spatial patterns is of interest in many areas such as environmental control and regional analysis. Here the interaction parameter of the Ising model, is considered. A minimal sufficient statistic and its asymptotic distribution are used. It is demonstrated that the convergence to normal, distribution is rapid. The main result is that when the lattice is large, all approximations are better in several respects. It is shown that, for large lattice sizes, earlier results on surveillance of a normally distributed random variable can be used in cases of most interest. The expected delay of alarm at a fixed level of false alarm probability is examined for some examples.  相似文献   

13.
This paper dwells on the choice between the ordinary least squares and the estimated generalized least squares estimators when the presence of heteroskedasticity is suspected. Since the estimated generalized least squares estimator does not dominate the ordinary least squares estimator completely over the whole parameter space, it is of interest to the researcher to know in advance whether the degree of severity of heteroskedasticity is such that OLS estimator outperforms the estimated generalized least squares (or 2SAE). Casting the problem in the non-spherical error mold and exploiting the principle underlying the Bayesian pretest estimator, an intuitive non-mathematical procedure is proposed to serve as an aid to the researcher in deciding when to use either the ordinary least squares (OLS) or the estimated generalized least squares (2SAE) estimators.  相似文献   

14.
The population growth rate of the European dipper has been shown to decrease with winter temperature and population size. We examine here the demographic mechanism for this effect by analysing how these factors affect the survival rate. Using more than 20 years of capture-mark-recapture data (1974-1997) based on more than 4000 marked individuals, we perform analyses using open capture-mark-recapture models. This allowed us to estimate the annual apparent survival rates (probability of surviving and staying on the study site from one year to the next one) and the recapture probabilities. We partitioned the variance of the apparent survival rates into sampling variance and process variance using random effects models, and investigated which variables best accounted for temporal process variation. Adult males and females had similar apparent survival rates, with an average of 0.52 and a coefficient of variation of 40%. Chick apparent survival was lower, averaging 0.06 with a coefficient of variation of 42%. Eighty percent of the variance in apparent survival rates was explained by winter temperature and population size for adults and 48% by winter temperature for chicks. The process variance outweighed the sampling variance both for chick and adult survival rates, which explained that shrunk estimates obtained under random effects models were close to MLE estimates. A large proportion of the annual variation in the apparent survival rate of chicks appears to be explained by inter-year differences in dispersal rates.  相似文献   

15.
We derive analytic expressions for the biases, to O(n? 1), of the maximum likelihood estimators of the parameters of the generalized Rayleigh distribution family. Using these expressions to bias-correct the estimators is found to be extremely effective in terms of bias reduction, and generally results in a small reduction in relative mean squared error. In general, the analytic bias-corrected estimators are also found to be superior to the alternative of bias-correction via the bootstrap.  相似文献   

16.
Mortality trends in the USSR from the middle of the nineteenth century to the present day are analyzed, with a focus on changes in life expectancy. The authors note that life expectancy increased up to 1964-1965, declined subsequently, and stabilized during the 1980s. Life expectancy has again started to rise since 1985. More detailed analyses of mortality differentials by sex and age and for the rural and urban populations are included. (SUMMARY IN ENG)  相似文献   

17.
Asymptotic cumulants of the maximum likelihood estimator of the canonical parameter in the exponential family are obtained up to the fourth order with the added higher-order asymptotic variance. In the case of a scalar parameter, the corresponding results with and without studentization are given. These results are also obtained for the estimators by the weighted score, especially for those using the Jeffreys prior. The asymptotic cumulants are used for reducing bias and mean square error to improve a point estimator and for interval estimation to have higher-order accuracy. It is shown that the kurtosis to squared skewness ratio of the sufficient statistic plays a fundamental role.  相似文献   

18.
The EM algorithm is a popular method for parameter estimation in situations where the data can be viewed as being incomplete. As each E-step visits each data point on a given iteration, the EM algorithm requires considerable computation time in its application to large data sets. Two versions, the incremental EM (IEM) algorithm and a sparse version of the EM algorithm, were proposed recently by Neal R.M. and Hinton G.E. in Jordan M.I. (Ed.), Learning in Graphical Models, Kluwer, Dordrecht, 1998, pp. 355–368 to reduce the computational cost of applying the EM algorithm. With the IEM algorithm, the available n observations are divided into B (B n) blocks and the E-step is implemented for only a block of observations at a time before the next M-step is performed. With the sparse version of the EM algorithm for the fitting of mixture models, only those posterior probabilities of component membership of the mixture that are above a specified threshold are updated; the remaining component-posterior probabilities are held fixed. In this paper, simulations are performed to assess the relative performances of the IEM algorithm with various number of blocks and the standard EM algorithm. In particular, we propose a simple rule for choosing the number of blocks with the IEM algorithm. For the IEM algorithm in the extreme case of one observation per block, we provide efficient updating formulas, which avoid the direct calculation of the inverses and determinants of the component-covariance matrices. Moreover, a sparse version of the IEM algorithm (SPIEM) is formulated by combining the sparse E-step of the EM algorithm and the partial E-step of the IEM algorithm. This SPIEM algorithm can further reduce the computation time of the IEM algorithm.  相似文献   

19.
A data set in the form of a 2 × 2 × 2 contingency table is presented and analyzed in detail. For instructional purposes, the analysis of the data can be used to illustrate some basic concepts in the loglinear model approach to the analysis of multidimensional contingency tables.  相似文献   

20.
Summary.  The World Health Organization revises the international classification of diseases about every 10 years to stay abreast of advances in medical science and to compare international health statistics. However, the new revision (i.e. the 10th revision) introduces discontinuities in mortality trends, making it impossible to compare the mortality statistics before and after the revision directly. The US National Center for Health Statistics published comparability ratios to correct the discontinuities between the two sets of mortality data: one coded by the ninth revision and the other by the 10th revision. We propose a parametric two-stage model to produce new comparability ratios and use these ratios to correct the discontinuities. The asymptotic behaviour of the comparability ratios is investigated. Our model not only measures the extent of discontinuities in trends in mortality but also can be used to forecast future mortality. Comparing with the National Center for Health Statistics's ratios, our comparability ratios smooth out the discontinuities better for most causes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号