kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
HAPP: High-accuracy pipeline for processing deep metabarcoding data
Stockholm Univ, Dept Biochem & Biophys, Sci Life Lab, Natl Bioinformat Infrastruct Sweden, Solna, Sweden.
Swedish Museum Nat Hist, Dept Bioinformat & Genet, Stockholm, Sweden.
Swedish Museum Nat Hist, Dept Bioinformat & Genet, Stockholm, Sweden.
Lund Univ, Dept Lab Med, Sci Life Lab, Natl Bioinformat Infrastruct Sweden, Lund, Sweden.
Visa övriga samt affilieringar
2025 (Engelska)Ingår i: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 21, nr 11, artikel-id e1013558Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Deep metabarcoding offers an efficient and reproducible approach to biodiversity monitoring, but noisy data and incomplete reference databases challenge accurate diversity estimation and taxonomic annotation. Here, we introduce a novel algorithm, NEEAT, for removing spurious operational taxonomic units (OTUs) originating from nuclear-embedded mitochondrial DNA sequences (NUMTs) or sequencing errors. It integrates 'echo' signals across samples with the identification of unusual evolutionary patterns among similar DNA sequences. We also extensively benchmark current tools for chimera removal, taxonomic annotation and OTU clustering of deep metabarcoding data. The best performing tools/parameter settings are integrated into HAPP, a high-accuracy pipeline for processing deep metabarcoding data. Tests using CO1 data from BOLD and large-scale metabarcoding data on insects demonstrate that HAPP significantly outperforms existing methods, while enabling efficient analysis of extensive datasets by parallelizing computations across taxonomic groups.

Ort, förlag, år, upplaga, sidor
Public Library of Science (PLoS) , 2025. Vol. 21, nr 11, artikel-id e1013558
Nationell ämneskategori
Bioinformatik och beräkningsbiologi
Identifikatorer
URN: urn:nbn:se:kth:diva-375535DOI: 10.1371/journal.pcbi.1013558ISI: 001609505600001PubMedID: 41202092Scopus ID: 2-s2.0-105022268948OAI: oai:DiVA.org:kth-375535DiVA, id: diva2:2031277
Anmärkning

QC 20260122

Tillgänglig från: 2026-01-22 Skapad: 2026-01-22 Senast uppdaterad: 2026-01-22Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextPubMedScopus

Person

Andersson, Anders F.

Sök vidare i DiVA

Av författaren/redaktören
Andersson, Anders F.
Av organisationen
Science for Life Laboratory, SciLifeLabGenteknologi
I samma tidskrift
PloS Computational Biology
Bioinformatik och beräkningsbiologi

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetricpoäng

doi
pubmed
urn-nbn
Totalt: 6 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf