Validating protein structure using kernel density estimates |
| |
Authors: | Charles C. Taylor Kanti V. Mardia Marco Di Marzio Agnese Panzera |
| |
Affiliation: | 1. Department of Statistics , University of Leeds , Leeds , LS2 9JT , UK;2. Dipartimento di Metodi Quantitativi e Teoria Economica , Università di Chieti-Pescara , Viale Pindaro 42, 65127 , Pescara , Italy |
| |
Abstract: | Measuring the quality of determined protein structures is a very important problem in bioinformatics. Kernel density estimation is a well-known nonparametric method which is often used for exploratory data analysis. Recent advances, which have extended previous linear methods to multi-dimensional circular data, give a sound basis for the analysis of conformational angles of protein backbones, which lie on the torus. By using an energy test, which is based on interpoint distances, we initially investigate the dependence of the angles on the amino acid type. Then, by computing tail probabilities which are based on amino-acid conditional density estimates, a method is proposed which permits inference on a test set of data. This can be used, for example, to validate protein structures, choose between possible protein predictions and highlight unusual residue angles. |
| |
Keywords: | circular kernel conformational angle probability contour variable bandwidth von Mises density |
|
|