Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
How to talk about protein-level false discovery rates in shotgun proteomics
KTH, School of Biotechnology (BIO), Gene Technology.ORCID iD: 0000-0002-5401-5553
KTH, School of Biotechnology (BIO), Gene Technology.
KTH, School of Biotechnology (BIO), Gene Technology.ORCID iD: 0000-0001-5689-9797
2016 (English)In: Proteomics, ISSN 1615-9853, E-ISSN 1615-9861, Vol. 16, no 18, 2461-2469 p.Article in journal (Refereed) Published
Abstract [en]

A frequently sought output from a shotgun proteomics experiment is a list of proteins that we believe to have been present in the analyzed sample before proteolytic digestion. The standard technique to control for errors in such lists is to enforce a preset threshold for the false discovery rate (FDR). Many consider protein-level FDRs a difficult and vague concept, as the measurement entities, spectra, are manifestations of peptides and not proteins. Here, we argue that this confusion is unnecessary and provide a framework on how to think about protein-level FDRs, starting from its basic principle: the null hypothesis. Specifically, we point out that two competing null hypotheses are used concurrently in today's protein inference methods, which has gone unnoticed by many. Using simulations of a shotgun proteomics experiment, we show how confusing one null hypothesis for the other can lead to serious discrepancies in the FDR. Furthermore, we demonstrate how the same simulations can be used to verify FDR estimates of protein inference methods. In particular, we show that, for a simple protein inference method, decoy models can be used to accurately estimate protein-level FDRs for both competing null hypotheses.

Place, publisher, year, edition, pages
Wiley-Blackwell, 2016. Vol. 16, no 18, 2461-2469 p.
Keyword [en]
Bioinformatics, Data processing and analysis, Mass spectrometry-LC-MS/MS, Protein inference, Simulation, Statistical analysis
National Category
Biophysics Bioinformatics and Systems Biology
Identifiers
URN: urn:nbn:se:kth:diva-196441DOI: 10.1002/pmic.201500431ISI: 000385813600005PubMedID: 27503675ScopusID: 2-s2.0-84988369698OAI: oai:DiVA.org:kth-196441DiVA: diva2:1050537
Note

QC 20161129

Available from: 2016-11-29 Created: 2016-11-14 Last updated: 2016-11-29Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textPubMedScopus

Search in DiVA

By author/editor
The, MatthewTasnim, AyeshaKäll, Lukas
By organisation
Gene Technology
In the same journal
Proteomics
BiophysicsBioinformatics and Systems Biology

Search outside of DiVA

GoogleGoogle Scholar

Altmetric score

Total: 12 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf