Sampling from spatial databases |
| |
Authors: | Frank Olken Doron Rotem |
| |
Affiliation: | (1) Information and Computing Sciences Division, Lawrence Berkeley Laboratory, 94720 Berkeley, CA, USA;(2) Management Information Systems Department, School of Business, San Jose State University, San Jose, CA, USA |
| |
Abstract: | This paper deals with techniques for obtaining random point samples from spatial databases. We seek random points from a continuous domain (usually 2) which satisfy a spatial predicate that is represented in the database as a collection of polygons. Several applications of spatial sampling (e.g. environmental monitoring, agronomy, forestry, etc) are described. Sampling problems are characterized in terms of two key parameters: coverage (selectivity), and expected stabbing number (overlap). We discuss two fundamental approaches to sampling with spatial predicates, depending on whether we sample first or evaluate the predicate first. The approaches are described in the context of both quadtrees and R-trees, detailing the sample first, acceptance/rejection tree, and partial area tree algorithms. A sequential algorithm, the one-pass spatial reservoir algorithm is also described. The relative performance of the various sampling algorithms is compared and choice of preferred algorithms is suggested. We conclude with a short discussion of possible extensions. |
| |
Keywords: | geographic information systems GIS quadtrees query processing query optimization R-trees random sampling relational databases reservoir sampling sampling algorithms simple random sampling sequential sampling spatial data structures spatial databases |
本文献已被 SpringerLink 等数据库收录! |
|