Interspeech and AVSP 2007

Posted on October 12, 2007 - Filed Under biometrics, conference, research, speech

I recently attended two speech-related conferences in Europe. It seems I like my international conferences in twos. The first was Interspeech 2007 in Antwerp, Belgium, and the second was the International Conference on Auditory-Visual Speech Processing (AVSP) 2007 near Hilvarenbeek in the Netherlands. Both were good experiences and will be [...]

Journal Impact and Eigenfactor

Posted on July 19, 2007 - Filed Under impact, journals, research

I’ve been looking for good journals to submit a paper to for a while now, and working out the quality of journals is not the easiest thing to do. Traditionally, the impact factor as calculated and listed in Journal Citation Reports (JCR) has been used, but access to these impact factors is not free, [...]

Google Portrait and Sebastien Marcel interview

Posted on June 18, 2007 - Filed Under IDIAP, biometrics, face recognition, research

Have you seen Google Portrait? It is actually by Sebastien Marcel at IDIAP, not Google, but it is a nice little application of face detection. Basically you can type anything you like into the search box, and the site will search Google Images for your search term, and return any faces it finds in the [...]

An introduction to audio-visual speech recognition

Posted on April 30, 2007 - Filed Under audio-visual, research, speech

This is from an introduction to my latest paper, and I thought it might be useful to put up here. Feel free to leave any comments on this below.
Audio-visual Speech Recognition
Automatic speech recognition is a very mature area of research, and one that is increasingly becoming involved in our day-to-day lives. While many systems that [...]

Audio-visual speech and the McGurk effect

Posted on April 23, 2007 - Filed Under audio-visual, research, speech

It may not be immediately obvious to most, but speech is fundamentally a multimodal interaction. (Multimodal is the fancy-pants way of saying that the interaction occurs through more than one mode or channel of communication: audio, visual, gestural, etc.)
While we can communicate very well with audio alone, such as during a telephone call, our [...]

Audio-visual speaker verification using continuous fused HMMs

Posted on October 24, 2006 - Filed Under fhmm, publications, research, speech

Dean, David and Sridharan, Sridha and Wark, Tim (2006) Audio-visual speaker verification using continuous fused HMMs. In Proceedings HCSNet Workshop on the Use of Vision in HCI, Canberra, Australia.
This paper examines audio-visual speaker verification using a novel adaptation of fused hidden Markov models, in comparison to output fusion of individual classifiers in the audio [...]

An examination of audio-visual fused HMMs for speaker recognition

Posted on October 24, 2006 - Filed Under fhmm, publications, research, speech

Dean, David and Wark, Tim and Sridharan, Sridha (2006) An examination of audio-visual fused HMMs for speaker recognition. In Proceedings Second Workshop on Multimodal User Authentication, Toulouse, France.
Fused hidden Markov models (FHMMs) have been shown to work well for the task of audio-visual speaker recognition, but only in an output decision-fusion configuration of both the [...]

Comparing Audio and Visual Information for Speech Processing

Posted on October 24, 2006 - Filed Under publications, research, speech

Dean, David and Lucey, Patrick and Sridharan, Sridha and Wark, Tim (2005) Comparing Audio and Visual Information for Speech Processing. In Proceedings The Eighth International Symposium on Signal Processing and Its Applications, pages 58-61, Sydney, Australia.
This paper examines the utility of audio-visual speech for the two related tasks of speech and speaker recognition. A [...]

Audio-visual speaker identification using the CUAVE database

Posted on October 24, 2006 - Filed Under publications, research, speech

Dean, David and Lucey, Patrick and Sridharan, Sridha (2005) Audio-visual speaker identification using the CUAVE database. In Vatikiotis-Bateson, Eric and Burnham, Denis and Fels, Sidney, Eds. Proceedings Auditory-Visual Speech Processing 2005, British Columbia, Canada.
The freely available nature of the CUAVE database allows it to provide a valuable platform to form benchmarks and compare research. This [...]

QUT ePrints suggestions

Posted on October 24, 2006 - Filed Under QUT, research

I recently sent this email to QUT’s ePrints service, and I thought I’d post it up here too, in case anyone else is interested.
Hi,
I have been updating my QUT ePrints lately, and I would like to offer a few suggestions as to how the system could be improved.
1) In lists of publications, make author names [...]
