Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Ensemble models for spoofing detection in automatic speaker verification
Show others and affiliations
2019 (English)In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019, International Speech Communication Association, 2019, p. 1018-1022Conference paper, Published paper (Refereed)
Abstract [en]

Detecting spoofing attempts of automatic speaker verification (ASV) systems is challenging, especially when using only one modelling approach. For robustness, we use both deep neural networks and traditional machine learning models and combine them as ensemble models through logistic regression. They are trained to detect logical access (LA) and physical access (PA) attacks on the dataset released as part of the ASV Spoofing and Countermeasures Challenge 2019. We propose dataset partitions that ensure different attack types are present during training and validation to improve system robustness. Our ensemble model outperforms all our single models and the baselines from the challenge for both attack types. We investigate why some models on the PA dataset strongly outperform others and find that spoofed recordings in the dataset tend to have longer silences at the end than genuine ones. By removing them, the PA task becomes much more challenging, with the tandem detection cost function (t-DCF) of our best single model rising from 0.1672 to 0.5018 and equal error rate (EER) increasing from 5.98% to 19.8% on the development set.

Place, publisher, year, edition, pages
International Speech Communication Association, 2019. p. 1018-1022
Series
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISSN 2308-457X ; 2019
Keywords [en]
Anti-spoofing, ASVspoof 2019, Countermeasures, Logical access attack, Model ensemble, Physical access attack
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:kth:diva-265139DOI: 10.21437/Interspeech.2019-2505Scopus ID: 2-s2.0-85074719233OAI: oai:DiVA.org:kth-265139DiVA, id: diva2:1377356
Conference
20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019; Graz; Austria; 15 September 2019 through 19 September 2019
Note

QC 20191211

Available from: 2019-12-11 Created: 2019-12-11 Last updated: 2019-12-11Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records BETA

Sturm, Bob

Search in DiVA

By author/editor
Sturm, Bob
By organisation
Speech, Music and Hearing, TMH
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 104 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf