Selection of regress or va riables when E(Y) is an unknown honlinear function |
| |
Authors: | J.R. Green M.F. Al-bayatti |
| |
Affiliation: | Department of Statistics andComputational Mathematics , The University , Liverpool, BX, 69 3, England |
| |
Abstract: | We consider the problem of deciding which of a set of p independent variables x1 X2J xs we are to regard as being functionally involved in the mean of a dependent normal random variable Y and estimating E( Y) in terms of the chosen x's. This mean is an unknown function (assumed to be doubly differentiable) of some or all of the x's, so that the problem is of wide relevance. We approximate to the hypersurface in two different ways, and select within each approximation: (a)For the situation where the mean of Y is assumed to be a linear function of the x's, we use ono of the optimum methods of selection. (b)More generally, in the space of the X's the function will be approximately linear in a relatively small region. Accordingly this p-dimensional space is subdivided into smaller regions by a clustering procedure, and a hyperplane if fitted with in each region to aproximate to the unknown responce surface.An adaption of an optimum-regressor-selection procedure is then used to assist in the selection of the regressors Approximate F tests are given to choose between models, including deciding how many x's to retain. Alternatively: the application of Akaike's Extended Maximum Likelihood Principle provides another way of choosing between the models and of selecting regressor variables. The methods are applied to data on glass manufacture. |
| |
Keywords: | Approximate nonlinear regression cluster analysis Extended Maximum Likeli¬hood Principle multiple correlation coefficient optimum regressor selection regression segmented linear models |
|
|