HomeProjectsPeoplePublicatons
Search:
   
 

 

Pen and Speech Recognition in the User Interface for Mobile Multimedia Terminals

Shankar Narayanaswamy,  Ph.D. 1996 (advisor: Bob Brodersen).

 

Portable computers have soared in popularity over the last few years. Vendors are introducing new models with smaller form factors, longer battery life, communications capabilities and unique user interfaces using pen or audio input. The design and implementation of a networked user interface architecture using hand-writing, recognition and speech recognition is explored. Although the user interface, was designed for mobile multimedia terminals such as the InfoPad system, it is more, generally applicable in any application domain where pen and/or spoken input are preferable to keyboard input. We examine the kinds of handwriting and speech recognizers needed to provide an effective user interface. There are several aspects to this problem. Firstly, there is the user interaction model which determines where and how the user uses each input modality. Secondly, there is the applications programming model which determines the level of abstraction and the extent of encapsulation of the recognizer’s functionality. Thirdly, there is the service provision model which determines whether the recognizer is part of the application or whether it runs as a separate thread or process, or somewhere in between. The latter allows off-loading the recognition computation onto a remote, possibly specialized server and minimizes the impact of compute-intensive recognizers on other applications. The entire infrastructure for a pen and speech based user interface is described, including a software hidden Markov model based writer-independent hand-print recognizer, a VLSI hidden Markov model based large-vocabulary speaker independent continuous speech recognizer, type servers to handle the new data types and applications that exercise the entire user interface architecture.