Idealized computational models for auditory receptive fields
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. ORCID iD: 0000-0002-9081-2170
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH. ORCID iD: 0000-0003-2926-6518
2015 (English). In: PLoS ONE, ISSN 1932-6203, E-ISSN 1932-6203, Vol. 10, no. 3, article id e0119032. Article in journal (Refereed), Published
Abstract [en]

We present a theory by which idealized models of auditory receptive fields can be derived in a principled axiomatic manner, from a set of structural properties to (i) enable invariance of receptive field responses under natural sound transformations and (ii) ensure internal consistency between spectro-temporal receptive fields at different temporal and spectral scales.

For defining a time-frequency transformation of a purely temporal sound signal, it is shown that the framework allows for a new way of deriving the Gabor and Gammatone filters as well as a novel family of generalized Gammatone filters, with additional degrees of freedom to obtain different trade-offs between the spectral selectivity and the temporal delay of time-causal temporal window functions.
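For orientation, the two classical filter families named above can be written down in a few lines. This is a minimal sketch using the standard textbook forms (a Gaussian or Gamma-shaped temporal window multiplied by a complex sinusoid), not code from the paper; the function names and the parameters `omega`, `sigma`, `mu` and `n` are illustrative assumptions. The generalized Gammatone family with unequal time constants is not covered here.

```python
import cmath
import math


def gabor(t, omega, sigma):
    """Complex Gabor function: Gaussian window (std sigma) times exp(i*omega*t)."""
    window = math.exp(-t * t / (2.0 * sigma * sigma)) / (math.sqrt(2.0 * math.pi) * sigma)
    return window * cmath.exp(1j * omega * t)


def gammatone(t, omega, mu, n):
    """Complex Gammatone function: time-causal Gamma window times exp(i*omega*t).

    The window t^(n-1) * exp(-t/mu) / (mu^n * Gamma(n)) is the impulse response
    of n identical first-order integrators (time constant mu) coupled in cascade.
    """
    if t < 0.0:
        return 0j  # time-causal: no response before the input arrives
    window = t ** (n - 1) * math.exp(-t / mu) / (mu ** n * math.gamma(n))
    return window * cmath.exp(1j * omega * t)
```

For example, `gammatone(t, 2 * math.pi * 1000.0, 0.002, 4)` samples a fourth-order filter tuned to 1 kHz. Increasing `n` or `mu` narrows the frequency response but delays the peak of the window, which sits at `(n - 1) * mu`; this is the trade-off between spectral selectivity and temporal delay that the abstract refers to.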

When applied to the definition of a second layer of receptive fields from a spectrogram, it is shown that the framework leads to two canonical families of spectro-temporal receptive fields, in terms of spectro-temporal derivatives of either spectro-temporal Gaussian kernels for non-causal time or a cascade of time-causal first-order integrators over the temporal domain and a Gaussian filter over the log-spectral domain. For each filter family, the spectro-temporal receptive fields can be either separable over the time-frequency domain or be adapted to local glissando transformations that represent variations in logarithmic frequencies over time. Within each domain of either non-causal or time-causal time, these receptive field families are derived by uniqueness from the assumptions.
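The time-causal, separable case can be illustrated with a small sketch: smooth each frequency channel of a spectrogram with a cascade of first-order integrators, smooth across channels with a Gaussian, then take a temporal difference. This is not the paper's implementation. The recursive update below is one common discretization of a first-order integrator, and the function names, the list-of-lists spectrogram layout and the parameters `mus`, `s` and `radius` are assumptions made for illustration. Channels are assumed to be evenly spaced in logarithmic frequency, and glissando-adapted (non-separable) fields are not covered.

```python
import math


def first_order_integrator(signal, mu):
    """Time-causal recursive smoothing: y[k] = y[k-1] + (x[k] - y[k-1]) / (1 + mu)."""
    out, y = [], 0.0
    for x in signal:
        y += (x - y) / (1.0 + mu)
        out.append(y)
    return out


def gaussian_kernel(s, radius):
    """Sampled Gaussian with variance s, normalized to unit sum."""
    w = [math.exp(-(i * i) / (2.0 * s)) for i in range(-radius, radius + 1)]
    total = sum(w)
    return [v / total for v in w]


def smooth_logfreq(column, kernel):
    """Convolve one time frame across frequency channels (edges clamped)."""
    r = len(kernel) // 2
    n = len(column)
    return [
        sum(kernel[j + r] * column[min(max(i + j, 0), n - 1)] for j in range(-r, r + 1))
        for i in range(n)
    ]


def separable_receptive_field(spectrogram, mus, s, radius=3):
    """Map spectrogram[channel][frame] to a first-order temporal derivative response."""
    # Temporal smoothing: cascade of first-order integrators, one pass per time constant.
    rows = []
    for row in spectrogram:
        for mu in mus:
            row = first_order_integrator(row, mu)
        rows.append(row)
    # Spectral smoothing: Gaussian over the channel axis, applied frame by frame.
    kernel = gaussian_kernel(s, radius)
    frames = [smooth_logfreq(list(col), kernel) for col in zip(*rows)]
    smoothed = [list(ch) for ch in zip(*frames)]
    # Temporal derivative approximated by a backward difference (zero before the start).
    return [[ch[k] - ch[k - 1] if k > 0 else ch[0] for k in range(len(ch))] for ch in smoothed]
```

Because every step only looks at the current and past frames, the response at frame `k` never depends on later input, which is the property that distinguishes the time-causal family from the non-causal Gaussian one.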

It is demonstrated how the presented framework allows for computation of basic auditory features for audio processing and that it leads to predictions about auditory receptive fields with good qualitative similarity to biological receptive fields measured in the inferior colliculus (ICC) and primary auditory cortex (A1) of mammals.

Place, publisher, year, edition, pages
PLoS, 2015. Vol. 10, no. 3, article id e0119032
Keywords [en]
Automatic Speech Recognition, Cat Striate Cortex, Inferior Colliculus, Feature-Extraction, Scale Selection, Natural Sounds, Gabor Analysis, Visual-Cortex, Time-Domain, Filter
National Category
Computer and Information Sciences
Research subject
Speech and Music Communication
Identifiers
URN: urn:nbn:se:kth:diva-160565
DOI: 10.1371/journal.pone.0119032
ISI: 000352134700031
PubMedID: 25822973
Scopus ID: 2-s2.0-84926628005
OAI: oai:DiVA.org:kth-160565
DiVA, id: diva2:790400
Funder
Swedish Research Council, 2010-4766, 2012-4685, 2014-4083
EU, FP7, Seventh Framework Programme, FET-Open 618067
Note

QC 20150407

Available from: 2015-02-24. Created: 2015-02-24. Last updated: 2018-09-13. Bibliographically approved

Open Access in DiVA

Full text (3894 kB), 91 downloads
File information
File name: FULLTEXT01.pdf
File size: 3894 kB
Checksum (SHA-512):
413b8d4b5d312a391c9626c318e8ea1b2596d8121ff5bdb3fb98131f01a57497925cc688f4a924e28de5e7acb3d6ffa280df833fa82e55fe978a0152ef518a60
Type: fulltext
Mimetype: application/pdf
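
A downloaded copy of the full text can be checked against the SHA-512 checksum listed in this record. The following is a minimal sketch using Python's standard `hashlib` module; the helper names and the local path in the usage note are assumptions, not part of the record.

```python
import hashlib

# SHA-512 checksum as listed in this record for FULLTEXT01.pdf.
EXPECTED_SHA512 = (
    "413b8d4b5d312a391c9626c318e8ea1b2596d8121ff5bdb3fb98131f01a57497"
    "925cc688f4a924e28de5e7acb3d6ffa280df833fa82e55fe978a0152ef518a60"
)


def sha512_hex(chunks):
    """Return the hex SHA-512 digest of an iterable of byte chunks."""
    digest = hashlib.sha512()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def sha512_of_file(path, chunk_size=1 << 20):
    """Hash a file in fixed-size chunks so large files need not fit in memory."""
    with open(path, "rb") as handle:
        return sha512_hex(iter(lambda: handle.read(chunk_size), b""))
```

Assuming the PDF was saved locally as `FULLTEXT01.pdf`, the comparison `sha512_of_file("FULLTEXT01.pdf") == EXPECTED_SHA512` should hold for an intact download.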

Other links

Publisher's full text; PubMed; Scopus; Preprint at arXiv:1404.2037

Authority records

Lindeberg, Tony; Friberg, Anders
