Audio-Visual Speech Recognition

2001. 07. 13.


Click here to start


Table of Contents

Audio-Visual Speech Recognition

Outline

Purpose of the lecture

Contents

Contents

Concepts and terminology

Concepts and terminology

Overview of speech production - physical aspects

Overview of speech production - physical aspects

Bimodality of human speech

Bimodality of human speech

Bimodality of human speech

Speechreading by humans

Computer speechreading systems

Strategies for combining audio and visual modalities of speech

Strategies for combining audio and visual modalities of speech

The visual speech recognition (lipreading) subsystem

Types of facial speech features

Types of facial speech features

Types of facial speech features

Types of facial speech features

Mouth region localization

Mouth/ lip contour extraction and tracking

Mouth/ lip contour extraction and tracking

Mouth/ lip contour extraction and tracking

Classification of visual speech features

Classification of visual speech features

Experimental frameworks

Experimental frameworks

Experimental frameworks

Experimental frameworks

Experimental frameworks

Examples of audio-visual speech recognition systems

Examples of audio-visual speech recognition systems

Examples of audio-visual speech recognition systems

Examples of audio-visual speech recognition systems

Examples of audio-visual speech recognition systems

Challenges in audio-visual speech recognition area

Conclusions

References

References

Demo - McGurk effect

Experimental frameworks

Author: Mihaela Gordan

Email: ssip@inf.u-szeged.hu

Home Page: http://www.inf.u-szeged.hu/~ssip