期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A Semiparametric Model for Line Transect Sampling

Omar Eidous 《统计学通讯:理论与方法》2013,42(7):1211-1221

This paper introduces an appealing semiparametric model for estimating wildlife abundance based on line transect data. The proposed method requires the existence of a parametric model and then improves the estimator using a kernel method. Properties of the resultant estimator are derived and an expression for the asymptotic mean square error (AMSE) of the estimator is given. Minimization of the AMSE leads to an explicit formula for an optimal choice of the smoothing parameter. Small-sample properties of the proposed estimator using the parametric half-normal model are investigated and compared with the classical kernel estimator using both simulations and real data. Numerical results show that improvements over the classical kernel estimator often can be realized even when the true density is far from the half-normal model. 相似文献

2.

Asymptotic Unbiased Kernel Estimator for Line Transect Sampling

Omar Eidous M. K. Shakhatreh 《统计学通讯:理论与方法》2013,42(24):4353-4363

In this article, we propose a new estimator for the density of objects using line transect data. The proposed estimator combines the nonparametric kernel estimator with parametric detection function: the exponential or the half normal detection function to estimate the density of objects. The selection of the detection function depends on the testing of the shoulder condition assumption. If the shoulder condition is true then the half-normal detection function is introduced together with the kernel estimator. Otherwise, the negative exponential is combined with the kernel estimator. Under these assumptions, the proposed estimator is asymptotically unbiased and it is strongly consistent estimator for the density of objects using line transect data. The simulation results indicate that the proposed estimator is very successful in taking the advantage of the parametric detection function available. 相似文献

3.

Histogram Model with Plausible Parametric Detection Function for Line Transect Data

Omar Eidous 《统计学通讯:理论与方法》2014,43(1):1-12

This article develops a new model that combines between the histogram and plausible parametric detection function to estimate the population density (abundance) by using line transects technique. A parametric detection function is introduced to improve the properties of the classical histogram estimator. Asymptotic properties of the resulting estimator are derived and an expression for the asymptotic mean square error (AMSE) is given. A general formula for the optimal choice of the histogram bin width based on AMSE is derived. Moreover, other possible alternative procedures to select the bin width are suggested and studied via simulation technique. The results show the superiority of the proposed estimators over both the classical histogram and the usual kernel estimators in most reasonable cases. In addition, the simulation results indicate that the choice of a plausible detection function is less sensitive than the choice of a bin width on the performance of the proposed estimator. 相似文献

4.

Bootstrap MISE Estimators to Obtain Bandwidth for Kernel Density Estimation

Jeffrey C. Miecznikowski Dongliang Wang Alan Hutson 《统计学通讯:模拟与计算》2013,42(7):1455-1469

Research in the area of bandwidth selection was an active topic in the 1980s and 1990s, however, recently there has been little research in the area. We re-opened this investigation and have found a new method for estimating mean integrated squared error for kernel density estimators. We provide an overview of other methods to obtain optimal bandwidths and offer a comparison of these methods via a simulation study. In certain situations, our method of estimating an optimal bandwidth yields a smaller MISE than competing methods to compute bandwidths. This procedure is illustrated by an application to two data sets. 相似文献

5.

基于核密度估计的非线性时间序列聚类

张贝贝《统计教育》2010,(4):15-20

本文研究的是时间序列的聚类问题。由于现实世界中时间序列多数是非线性的,而现有的时间序列聚类问题大都是基于线性时间序列模型进行聚类的,本文提出了可以用于非线性时间序列的聚类方法。以时间序列的二维核密度估计之间的相似性作为非线性时间序列的距离度量,该距离度量方式是一种非参数的距离度量方法,考虑到了时间序列自相关结构的差异,能够粗糙地识别时间序列形状和动态相关结构的相似性。与理论研究结果相一致,我们的模拟实验结果也验证了这种距离度量的有效性。相似文献

6.

Kernel Smoothing Density Estimation when Group Membership is Subject to Missing

Tang W He H Gunzler D 《Journal of statistical planning and inference》2012,142(3):685-694

Density function is a fundamental concept in data analysis. Non-parametric methods including kernel smoothing estimate are available if the data is completely observed. However, in studies such as diagnostic studies following a two-stage design the membership of some of the subjects may be missing. Simply ignoring those subjects with unknown membership is valid only in the MCAR situation. In this paper, we consider kernel smoothing estimate of the density functions, using the inverse probability approaches to address the missing values. We illustrate the approaches with simulation studies and real study data in mental health. 相似文献

7.

Kernel Survival Function Estimation Based on Doubly Censored Data

Animikh Biswas 《统计学通讯:理论与方法》2013,42(7):1293-1307

This work concerns the estimation of a smooth survival function based on doubly censored data. We establish strong consistency and asymptotic normality for a kernel estimator. Moreover, we also obtain an asymptotic expression for the mean integrated squared error, which yields an optimum bandwidth in terms of readily estimable quantities. 相似文献

8.

Optimal False Discovery Rate Control with Kernel Density Estimation in a Microarray Experiment

Moonsu Kang 《统计学通讯:模拟与计算》2016,45(3):771-780

Most of current false discovery rate (FDR) procedures in a microarray experiment assume restrictive dependence structures, resulting in being less reliable. FDR controlling procedure under suitable dependence structures based on Poisson distributional approximation is shown. Unlike other procedures, the distribution of false null hypotheses is estimated by using kernel density estimation allowing for dependent structures among the genes. Furthermore, we develop an FDR framework that minimizes the false nondiscovery rate (FNR) with a constraint on the controlled level of the FDR. The performance of the proposed FDR procedure is compared with that of other existing FDR controlling procedures, with an application to the microarray study of simulated data. 相似文献

9.

Inversion Theorem Based Kernel Density Estimation for the Ordinary Least Squares Estimator of a Regression Coefficient

Dongliang Wang Alan D. Hutson 《统计学通讯:理论与方法》2013,42(8):1571-1579

The traditional confidence interval associated with the ordinary least squares estimator of linear regression coefficient is sensitive to non-normality of the underlying distribution. In this article, we develop a novel kernel density estimator for the ordinary least squares estimator via utilizing well-defined inversion based kernel smoothing techniques in order to estimate the conditional probability density distribution of the dependent random variable. Simulation results show that given a small sample size, our method significantly increases the power as compared with Wald-type CIs. The proposed approach is illustrated via an application to a classic small data set originally from Graybill (1961 Graybill, F.A. (1961). Introduction to Linear Statistical Models. Vol. 1. New York: McGraw-Hill Book Company. [Google Scholar]). 相似文献

10.

Kernel Density Estimation with Generalized Binning

M. Pawlak & U. Stadtmuller 《Scandinavian Journal of Statistics》1999,26(4):539-561

We propose kernel density estimators based on prebinned data. We use generalized binning schemes based on the quantiles points of a certain auxiliary distribution function. Therein the uniform distribution corresponds to usual binning. The statistical accuracy of the resulting kernel estimators is studied, i.e. we derive mean squared error results for the closeness of these estimators to both the true function and the kernel estimator based on the original data set. Our results show the influence of the choice of the auxiliary density on the binned kernel estimators and they reveal that non-uniform binning can be worthwhile. 相似文献

11.

Robust Likelihood Cross-Validation for Kernel Density Estimation

Ximing Wu 《商业与经济统计学杂志》2013,31(4):761-770

Likelihood cross-validation for kernel density estimation is known to be sensitive to extreme observations and heavy-tailed distributions. We propose a robust likelihood-based cross-validation method to select bandwidths in multivariate density estimations. We derive this bandwidth selector within the framework of robust maximum likelihood estimation. This method establishes a smooth transition from likelihood cross-validation for nonextreme observations to least squares cross-validation for extreme observations, thereby combining the efficiency of likelihood cross-validation and the robustness of least-squares cross-validation. We also suggest a simple rule to select the transition threshold. We demonstrate the finite sample performance and practical usefulness of the proposed method via Monte Carlo simulations and a real data application on Chinese air pollution. 相似文献

12.

Cross-validation Bandwidth Matrices for Multivariate Kernel Density Estimation 总被引：3，自引：0，他引：3

TARN DUONG MARTIN L. HAZELTON 《Scandinavian Journal of Statistics》2005,32(3):485-506

Abstract. The performance of multivariate kernel density estimates depends crucially on the choice of bandwidth matrix, but progress towards developing good bandwidth matrix selectors has been relatively slow. In particular, previous studies of cross-validation (CV) methods have been restricted to biased and unbiased CV selection of diagonal bandwidth matrices. However, for certain types of target density the use of full (i.e. unconstrained) bandwidth matrices offers the potential for significantly improved density estimation. In this paper, we generalize earlier work from diagonal to full bandwidth matrices, and develop a smooth cross-validation (SCV) methodology for multivariate data. We consider optimization of the SCV technique with respect to a pilot bandwidth matrix. All the CV methods are studied using asymptotic analysis, simulation experiments and real data analysis. The results suggest that SCV for full bandwidth matrices is the most reliable of the CV methods. We also observe that experience from the univariate setting can sometimes be a misleading guide for understanding bandwidth selection in the multivariate case. 相似文献

13.

对支持向量机混合核函数方法的再评估

魏瑾瑞《统计研究》2015,32(2):90-96

混合核函数方法并没有解决核函数的选择问题,只是将问题等价转换为权重参数的选择。同时该方法还需要分别为两个核函数确定参数,大大增加了算法的复杂程度,限制了支持向量机的泛化能力。事实上,调节核函数的参数对分类结果的影响要远大于选择什么类型的核函数,因此混合核函数方法实属“避轻就重”。实证分析表明,不同核函数对应的共同支持向量比例很高,存在很大程度的一致性,线性组合的意义并不大,这也是混合核函数方法无法有效提升分类性能的一个重要原因。相似文献

14.

Local Smoothing for Kernel Distribution Function Estimation

Santanu Dutta 《统计学通讯:模拟与计算》2015,44(4):878-891

The problem of bandwidth selection for kernel-based estimation of the distribution function (cdf) at a given point is considered. With appropriate bandwidth, a kernel-based estimator (kdf) is known to outperform the empirical distribution function. However, such a bandwidth is unknown in practice. In pointwise estimation, the appropriate bandwidth depends on the point where the function is estimated. The existing smoothing methods use one common bandwidth to estimate the cdf. The accuracy of the resulting estimates varies substantially depending on the cdf and the point where it is estimated. We propose to select bandwidth by minimizing a bootstrap estimator of the MSE of the kdf. The resulting estimator performs reliably, irrespective of where the cdf is estimated. It is shown to be consistent under i.i.d. as well as strongly mixing dependence assumption. Two applications of the proposed estimator are shown in finance and seismology. We report a dataset on the S & P Nifty index values. 相似文献

15.

Comparison of Kernel Density Estimators with Assumption on Number of Modes

Raphaël Coudret Gilles Durrieu Jérôme Saracco 《统计学通讯:模拟与计算》2015,44(1):196-216

A data-driven bandwidth choice for a kernel density estimator called critical bandwidth is investigated. This procedure allows the estimation to have as many modes as assumed for the density to estimate. Both Gaussian and uniform kernels are considered. For the Gaussian kernel, asymptotic results are given. For the uniform kernel, an argument against these properties is mentioned. These theoretical results are illustrated with a simulation study that compares the kernel estimators that rely on critical bandwidth with another one that uses a plug-in method to select its bandwidth. An estimator that consists in estimates of density contour clusters and takes assumptions on number of modes into account is also considered. Finally, the methodology is illustrated using environment monitoring data. 相似文献

16.

MDL Mean Function Selection in Semiparametric Kernel Regression Models

Jan G. De Gooijer Ao Yuan 《统计学通讯:理论与方法》2013,42(14):2237-2248

相似文献

17.

Post-Model-Selection Method for Density Estimation

Małgorzata Wojtyś 《统计学通讯:理论与方法》2013,42(17):3082-3098

相似文献

18.

Kernel Estimation of the Spectral Density of Stationary Random Closed Sets

Stephan Böhm Lothar Heinrich Volker Schmidt 《Australian & New Zealand Journal of Statistics》2004,46(1):41-51

A non‐parametric kernel estimator of the spectral density of stationary random closed sets is studied. Conditions are derived under which this estimator is asymptotically unbiased and mean‐square consistent. For the planar Boolean model with isotropic compact and convex grains, an averaged version of the kernel estimator is compared with the theoretical spectral density. 相似文献

19.

Estimating Percentiles of Time-to-Failure Distribution Obtained from a Linear Degradation Model Using Kernel Density Method

Mohammed Al-Haj Ebrahem Omar Eidous Gharam Kmail 《统计学通讯:模拟与计算》2013,42(9):1811-1822

In this article, we propose a nonparametric estimator for percentiles of the time-to-failure distribution obtained from a linear degradation model using the kernel density method. The properties of the proposed kernel estimator are investigated and compared with well-known maximum likelihood and ordinary least squares estimators via a simulation technique. The mean squared error and the length of the bootstrap confidence interval are used as the basis criteria of the comparisons. The simulation study shows that the performance of the kernel estimator is acceptable as a general estimator. When the distribution of the data is assumed to be known, the maximum likelihood and ordinary least squares estimators perform better than the kernel estimator, while the kernel estimator is superior when the assumption of our knowledge of the data distribution is violated. A comparison among different estimators is achieved using a real data set. 相似文献

20.

Kernel Estimation of Rate Function for Recurrent Event Data

CHIN-TSANG CHIANG MEI-CHENG WANG CHIUNG-YU HUANG 《Scandinavian Journal of Statistics》2005,32(1):77-91

Abstract. Recurrent event data are largely characterized by the rate function but smoothing techniques for estimating the rate function have never been rigorously developed or studied in statistical literature. This paper considers the moment and least squares methods for estimating the rate function from recurrent event data. With an independent censoring assumption on the recurrent event process, we study statistical properties of the proposed estimators and propose bootstrap procedures for the bandwidth selection and for the approximation of confidence intervals in the estimation of the occurrence rate function. It is identified that the moment method without resmoothing via a smaller bandwidth will produce a curve with nicks occurring at the censoring times, whereas there is no such problem with the least squares method. Furthermore, the asymptotic variance of the least squares estimator is shown to be smaller under regularity conditions. However, in the implementation of the bootstrap procedures, the moment method is computationally more efficient than the least squares method because the former approach uses condensed bootstrap data. The performance of the proposed procedures is studied through Monte Carlo simulations and an epidemiological example on intravenous drug users. 相似文献