Registry

Module Specifications

Current Academic Year 2012 - 2013
Please note that this information is subject to change.

Module Title
Module Code
School
Online Module Resources

NFQ level 9 Credit Rating
Pre-requisite None
Co-requisite None
Compatibles None
Incompatibles None
Description
Effective engineering of human interfaces over telephone links requires the use of speech as both an input and an output medium. The module starts from a standard Digital Signal Processing (DSP) background. The student simultaneously acquires a theoretical understanding of, and practical experience with speech signals. The subject is uniquely amenable to experimentation using a sound-equipped PC. The lab exercises start with basic recording and playback, and progress to exploration of systems for synthesis, coding/compression and recognition of speech waveforms. The lectures introduce the theoretical background to methods to be explored with the PC-based labs. The lab exercises can all be performed off-campus using a typical domestic PC.

Learning Outcomes
1. Record speech signals and explain and interpret spectograms of the recordings.
2. Write code to perform basic analysis of speech waveforms.
3. Design a system to modify a speech waveform in both time and frequency domains.
4. Describe the all-pole model of speech production, and its use in coding.
5. Explain the operation of a speech coder and of a speech recognition system.
6. Implement elements of a concatenative speech synthesis system.
7. Describe the parts of the human hearing system.
8. Explain how features of human hearing are exploited to achieve compression of audio signals.



Workload Full-time hours per semester
Type Hours Description
Lecture36lectures
Lab52No Description
Independent learning102No Description
Total Workload: 190

All module information is indicative and subject to change. For further information,students are advised to refer to the University's Marks and Standards and Programme Specific Regulations at: http://www.dcu.ie/registry/examinations/index.shtml

Indicative Content and Learning Activities
Acoustics.
Acoustic Modelling of Speech: Physics + Anatomy.

Basic Methods.
Digital Modelling of Speech: Numerical Methods.

LPC.
Linear Predictive Coding: underlying math. modelling.

Hearing & Perception.
Hearing & Perception.

Coding.
Speech Coding: Waveform coding, Vocoding, hybrids.

Synthesis.
Speech Synthesis: Various models.

Recognition.
Speech Recognition: Basic introduction to fundamental methods.

Assessment Breakdown
Continuous Assessment% Examination Weight%
Course Work Breakdown
TypeDescription% of totalAssessment Date
LaboratoryPC-based exercises analysing and synthesising speech waveforms.30%n/a
Reassessment Requirement
Resit arrangements are explained by the following categories;
1 = A resit is available for all components of the module
2 = No resit is available for 100% continuous assessment module
3 = No resit is available for the continuous assessment component
This module is category
Indicative Reading List
  • McLoughlin, I.V.: 2009, Applied Speech and Audio Processing, CUP,
  • Howard D., Angus J.: 2006, Acoustics and Psychoacoustics, 3, Focal Press,
  • O'Shaughnessy, D.: 2000, Speech Communication, 2, IEEE Press,
  • Moore, B.: 2003, Intro to Psychology of Hearing, 5, Academic Press,
  • Zwicker, E.: 1999, the Ear as a Communication Receiver, ASA Press,
  • Deller, J., Proakis, J., Hansen, J.: 1999, Discrete Time Processing of Speech Signals, WileyBlackwell,
Other Resources
None
Array
Programme or List of Programmes
Timetable this semester: Timetable for
Archives: