Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Lip Synchronization: from Phone Lattice to PCA Eigen-projections using Neural Networks
KTH, Skolan för datavetenskap och kommunikation (CSC), Centra, Centrum för Talteknologi, CTT. KTH, Skolan för datavetenskap och kommunikation (CSC), Tal, musik och hörsel, TMH, Tal-kommunikation.
2008 (engelsk)Inngår i: INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC , 2008, s. 2016-2019Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Lip synchronization is the process of generating natural lip movements from a speech signal. In this work we address the lip-sync problem using an automatic phone recognizer that generates a phone lattice carrying posterior probabilities. The acoustic feature vector contains the posterior probabilities of all the phones over a time window centered at the current time point. Hence this representation characterizes the phone recognition output including the confusion patterns caused by its limited accuracy. A 3D face model with varying texture is computed by analyzing a video recording of the speaker using a 3D morphable model. Training a neural network using 30 000 data vectors from an audiovisual recording in Dutch resulted in a very good simulation of the face on independent data sets of the same or of a different speaker.

sted, utgiver, år, opplag, sider
BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC , 2008. s. 2016-2019
Emneord [en]
lip synchronization, speech recognition, phone lattice, 3D morphable models, principal component analysis, audio visual speech
HSV kategori
Identifikatorer
URN: urn:nbn:se:kth:diva-29854ISI: 000277026101077Scopus ID: 2-s2.0-84867204708ISBN: 978-1-61567-378-0 (tryckt)OAI: oai:DiVA.org:kth-29854DiVA, id: diva2:399745
Konferanse
9th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2008)
Merknad
QC 20110222Tilgjengelig fra: 2011-02-23 Laget: 2011-02-17 Sist oppdatert: 2018-01-12bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

ScopusISCA

Søk i DiVA

Av forfatter/redaktør
Al Moubayed, Samer
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 699 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf