ituu_logo
Verlauf

Optimising Client-Server Architectures for Spoken Language Dialogue Systems

Author: Dmitry Zaykovskiy

Status: in progress

Description:

The subject of this thesis is centred around optimising spoken language dialogue systems (SLDSs) architectures. The study builds upon previous work in the areas of speech signal coding, information transmission, automatic speech recognition and spoken natural language understanding. It aims at improving both the performance of the speech processing components and the upstream data transmission channel. The topic is motivated by an increasing use of the SLDSs technology in all (especially mobile) situations of life and for increasingly resource-consuming speaker-independent and large-vocabulary applications. This implies that the main processing steps of SLDSs are shifted from the client-side (e.g. Personal Digital Assistants - PDAs and mobile phones) to a powerful server computer. However, such a paradigm shift requires investigating data pre-processing, transmission and speech processing techniques that are optimally and mutually tuned.

In order to achieve the goal of an optimum overall SLDSs architecture, an experimental set-up containing a simulated speech data transmission channel front-end and a speech processing back-end including components for stochastically-based speech recognition and spoken natural language understanding are established. The major task of the thesis work consists of adapting and optimising the individual component approaches of the set-up so as to obtain an optimum overall system performance in terms of high accuracy and low transmission costs.

The utility of the developed approaches is illustrated and evaluated with real user data on applications for information retrieval and transaction services. The thesis is carried out in cooperation with the Information Transmission Group of the ITUU Department.

Further Information

PhD_Description.pdf