Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Improving Contact Prediction along Three Dimensions
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. Aalto University, Finland.
2014 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 10, no 10, e1003847- p.Article in journal (Refereed) Published
Abstract [en]

Correlation patterns in multiple sequence alignments of homologous proteins can be exploited to infer information on the three-dimensional structure of their members. The typical pipeline to address this task, which we in this paper refer to as the three dimensions of contact prediction, is to (i) filter and align the raw sequence data representing the evolutionarily related proteins; (ii) choose a predictive model to describe a sequence alignment; (iii) infer the model parameters and interpret them in terms of structural properties, such as an accurate contact map. We show here that all three dimensions are important for overall prediction success. In particular, we show that it is possible to improve significantly along the second dimension by going beyond the pair-wise Potts models from statistical physics, which have hitherto been the focus of the field. These (simple) extensions are motivated by multiple sequence alignments often containing long stretches of gaps which, as a data feature, would be rather untypical for independent samples drawn from a Potts model. Using a large test set of proteins we show that the combined improvements along the three dimensions are as large as any reported to date.

Place, publisher, year, edition, pages
2014. Vol. 10, no 10, e1003847- p.
Keyword [en]
Multiple Sequence Alignments, Direct-Coupling Analysis, Correlated Mutations, Protein Families, Coevolution, Classification, Covariance, Database
National Category
Biochemistry and Molecular Biology
Identifiers
URN: urn:nbn:se:kth:diva-157218DOI: 10.1371/journal.pcbi.1003847ISI: 000344547900009PubMedID: 25299132Scopus ID: 2-s2.0-84908313189OAI: oai:DiVA.org:kth-157218DiVA: diva2:769772
Note

QC 20141209

Available from: 2014-12-09 Created: 2014-12-08 Last updated: 2017-12-05Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textPubMedScopus

Search in DiVA

By author/editor
Aurell, Erik
By organisation
Computational Biology, CB
In the same journal
PloS Computational Biology
Biochemistry and Molecular Biology

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 37 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf