Bootstrap Confidence Intervals for Large-scale Multivariate Monotonic Regression Problems |
| |
Authors: | Oleg Sysoev Anders Grimvall |
| |
Affiliation: | Department of Computer and Information Science, Link?ping University, Link?ping, Sweden |
| |
Abstract: | Recently, the methods used to estimate monotonic regression (MR) models have been substantially improved, and some algorithms can now produce high-accuracy monotonic fits to multivariate datasets containing over a million observations. Nevertheless, the computational burden can be prohibitively large for resampling techniques in which numerous datasets are processed independently of each other. Here, we present efficient algorithms for estimation of confidence limits in large-scale settings that take into account the similarity of the bootstrap or jackknifed datasets to which MR models are fitted. In addition, we introduce modifications that substantially improve the accuracy of MR solutions for binary response variables. The performance of our algorithms is illustrated using data on death in coronary heart disease for a large population. This example also illustrates that MR can be a valuable complement to logistic regression. |
| |
Keywords: | Big data Bootstrap Confidence intervals Monotonic regression Pool-adjacent-violators algorithm. |
|
|