A dissimilarity matrix between protein atom classes based on Gaussian mixtures
2002 (English)In: Bioinformatics, ISSN 1367-4803, E-ISSN 1460-2059, Vol. 18, no 9, 1257-1263 p.Article in journal (Refereed) Published
Motivation: Previously, Rantanen et al. (2001; J. Mol. Biol., 313, 197-214) constructed a protein atom-ligand fragment interaction library embodying experimentally solved, high-resolution three-dimensional (3D) structural data from the Protein Data Bank (PDB). The spatial locations of protein atoms that surround ligand fragments were modeled with Gaussian mixture models, the parameters of which were estimated with the expectation-maximization (EM) algorithm. In the validation analysis of this library, there was strong indication that the protein atom classification, 24 classes, was too large and that a reduction in the classes would lead to improved predictions. Results: Here, a dissimilarity (distance) matrix that is suitable for comparison and fusion of 24 pre-defined protein atom classes has been derived. Jeffreys' distances between Gaussian mixture models are used as a basis to estimate dissimilarities between protein atom classes. The dissimilarity data are analyzed both with a hierarchical clustering method and independently by using multidimensional scaling analysis. The results provide additional insight into the relationships between different protein atom classes, giving us guidance on, for example, how to readjust protein atom classification and, thus, they will help us to improve protein-ligand interaction predictions.
Place, publisher, year, edition, pages
2002. Vol. 18, no 9, 1257-1263 p.
favorable binding-sites, meaningful hierarchical-classification, hydrogen-bonding regions, directed drug design, ligand probe groups, ludi, recognition, positions, superstar, molecules
IdentifiersURN: urn:nbn:se:kth:diva-21891ISI: 000178001400013OAI: oai:DiVA.org:kth-21891DiVA: diva2:340589
QC 201005252010-08-102010-08-10Bibliographically approved