Audio-visual speaker verification using continuous fused HMMs

Posted on October 24, 2006 - Filed Under fhmm, publications, research, speech | Leave a Comment

Dean, David and Sridharan, Sridha and Wark, Tim (2006) Audio-visual speaker verification using continuous fused HMMs. In Proceedings HCSNet Workshop on the Use of Vision in HCI, Canberra, Australia.
This paper examines audio-visual speaker verification using a novel adaptation of fused hidden Markov models, in comparison to output fusion of individual classifiers in the audio [...]

Read More..>>

An examination of audio-visual fused HMMs for speaker recognition

Posted on October 24, 2006 - Filed Under fhmm, publications, research, speech | Leave a Comment

Dean, David and Wark, Tim and Sridharan, Sridha (2006) An examination of audio-visual fused HMMs for speaker recognition. In Proceedings Second Workshop on Multimodal User Authentication, Toulouse, France.
Fused hidden Markov models (FHMMs) have been shown to work well for the task of audio-visual speaker recognition, but only in an output decision-fusion configuration of both the [...]

Read More..>>

Comparing Audio and Visual Information for Speech Processing

Posted on October 24, 2006 - Filed Under publications, research, speech | Leave a Comment

Dean, David and Lucey, Patrick and Sridharan, Sridha and Wark, Tim (2005) Comparing Audio and Visual Information for Speech Processing. In Proceedings The Eighth International Symposium on Signal Processing and Its Applications, pages pp. 58-61, Sydney, Australia.
This paper examines the utility of audio-visual speech for the two related tasks of speech and speaker recognition. A [...]

Read More..>>

Audio-visual speaker identification using the CUAVE database

Posted on October 24, 2006 - Filed Under publications, research, speech | Leave a Comment

Dean, David and Lucey, Patrick and Sridharan, Sridha (2005) Audio-visual speaker identification using the CUAVE database. In Vatikiotis-Bateson, Eric and Burnham, Denis and Fels, Sidney, Eds. Proceedings Auditory-Visual Speech Processing 2005, British Columbia, Canada.
The freely available nature of the CUAVE database allows it to provide a valuable platform to form benchmarks and compare research. This [...]

Read More..>>

  • Pages

  • Recent Posts

  • Categories

  • Interesting from Elsewhere

  • Meta