6.345 Automatic Speech Recognition

Spring 2003

Major components in a speech recognition system.
Major components in a Speech Recognition System, excerpted from Lecture 1. (Image by James Glass and Victor Zue.)

Course Highlights

6.345 is a course in the department's "Bioelectrical Engineering" concentration. This course offers a full set of lecture slides with accompanying speech samples, as well as homework assignments and other materials used in the course.

Course Description

6.345 introduces students to the rapidly developing field of automatic speech recognition. Its content is divided into three parts. Part I deals with background material in the acoustic theory of speech production, acoustic-phonetics, and signal representation. Part II describes algorithmic aspects of speech recognition systems including pattern classification, search algorithms, stochastic modelling, and language modelling techniques. Part III compares and contrasts the various approaches to speech recognition, and describes advanced techniques used for acoustic-phonetic modelling, robust speech recognition, speaker adaptation, processing paralinguistic information, speech understanding, and multimodal processing.

Special Features

Technical Requirements

Media player software, such as Quicktime® PlayerRealOne™ Player, or Windows Media® Player, is required to run the .wav files found on this course site.

RealOne™ is a trademark or a registered trademark of RealNetworks, Inc.
QuickTime® is a trademark of Apple Computer, Inc., registered in the U.S. and other countries.
Windows Media® is a registered trademark or trademark of Microsoft Corporation in the U.S. and/or other countries.

*Some translations represent previous versions of courses.

Donate Now


Dr. James Glass
Prof. Victor Zue

Course Meeting Times

Two sessions / week
1.5 hours / session