A speech and character combined recognition engine for mobile devices

Zhou, Xioabo; Xiao, Bin; Sha, Edwin H.‐M.; Suk, Soo‐Young; Chung, Hyun‐Yeol

doi:10.1108/17427370810890409

Volume 4, Issue 2

27 June 2008

Editors

Seng Loke

Research Article| June 27 2008

Soo‐Young Suk

Information Technology Research Institute, AIST, Tsukuba, Japan

Hyun‐Yeol Chung

School of EECS, Yeungnam University, Gyung‐San, Republic of Korea

Publisher: Emerald Publishing

Online ISSN: 1742-738X

Print ISSN: 1742-7371

International Journal of Pervasive Computing and Communications (2008) 4 (2): 232–249.

https://doi.org/10.1108/17427370810890409

Purpose

The purpose of this paper is to describe a speech and character combined recognition engine (SCCRE) developed to run on personal digital assistants (PDAs) and other mobile devices. The architecture of a distributed recognition system that provides a more convenient user interface is also discussed.

Design/methodology/approach

In SCCRE, feature extraction for speech and for characters is carried out separately, but recognition is performed in a single engine. The client recognition engine employs a continuous hidden Markov model (CHMM) structure with a variable parameter topology in order to minimize the number of model parameters and to reduce recognition time. The model also adopts the proposed successive state and mixture splitting (SSMS) method for generating context-independent models. SSMS optimizes the number of mixtures through splitting in the mixture domain and the number of states through splitting in the time domain.
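
The two splitting operations named above can be illustrated with a minimal sketch. It is not the authors' SSMS implementation: the abstract does not give the splitting criteria, so the code assumes a common textbook scheme in which the heaviest Gaussian of a state is split by perturbing its mean by a fraction of its standard deviation, and a state is split in time by duplicating it in a left-to-right chain. The names `Gaussian`, `split_heaviest`, `split_state`, `count_gaussians` and the perturbation factor `eps` are all hypothetical.

```python
import copy
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Gaussian:
    weight: float       # mixture weight
    mean: List[float]   # mean vector
    var: List[float]    # diagonal covariance (variances)


# A state is a list of Gaussians; a left-to-right model is a list of states.
State = List[Gaussian]


def split_heaviest(state: State, eps: float = 0.2) -> State:
    """Mixture-domain split (assumed scheme): replace the heaviest
    component by two copies whose means are shifted by +/- eps * std."""
    idx = max(range(len(state)), key=lambda i: state[i].weight)
    g = state[idx]
    offset = [eps * math.sqrt(v) for v in g.var]
    plus = Gaussian(g.weight / 2,
                    [m + o for m, o in zip(g.mean, offset)], list(g.var))
    minus = Gaussian(g.weight / 2,
                     [m - o for m, o in zip(g.mean, offset)], list(g.var))
    return state[:idx] + [plus, minus] + state[idx + 1:]


def split_state(model: List[State], idx: int) -> List[State]:
    """Time-domain split (assumed scheme): replace state idx by two
    successive copies, to be re-estimated separately afterwards."""
    first = copy.deepcopy(model[idx])
    second = copy.deepcopy(model[idx])
    return model[:idx] + [first, second] + model[idx + 1:]


def count_gaussians(model: List[State]) -> int:
    """Total number of Gaussians, the quantity a variable topology tries to keep small."""
    return sum(len(state) for state in model)


if __name__ == "__main__":
    model = [[Gaussian(1.0, [0.0, 1.0], [1.0, 4.0])]]
    model = split_state(model, 0)          # grow in the time domain
    model[0] = split_heaviest(model[0])    # grow in the mixture domain
    print(count_gaussians(model))
```

In a real system each split would be followed by re-estimation and accepted only if it improves some criterion such as likelihood; that selection step is what lets different states end up with different numbers of mixtures, and it is left out here because the abstract does not describe it.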

Findings

The recognition results show that, at the same recognition performance, the developed engine reduces the total number of Gaussians by up to 40 per cent compared with fixed-parameter models when applied to speech recognition for mobile devices. SSMS reduces the memory required for models to 65 per cent and that for processing to 82 per cent. Moreover, recognition time decreases by 17 per cent with the SSMS model while the recognition rate is maintained.

Originality/value

The proposed system will be very useful for many on‐line multimodal interfaces such as PDAs and mobile applications.
