![]() |
![]() |
Chin-Hui Lee
National University of Singapore
Monday, December 10, 2001, 11:00 am - 12:00 pm
Silleck Lounge, Brooklyn Campus, Polytechnic University
With the increasing availability of multimedia content over the internet and the growing popularity of cell phones and personal data assistants, mobile information access and remote transactions are now common practices for both home and business usages in our daily life. However, due to the simplification of keypad entries and the shrinkage in size of displays on these new information appliances, the traditional point-and-click graphical user interface (GUI) is becoming limited in usability. This calls for a more natural user interface that is multi-modal and incorporates human sensory mechanisms. Voice communication will play a key role here. Studies on human behavior, human language technologies and human information processing are critical to humanize new machine interfaces.
In this talk, we examine voice user interface (VUI) technologies, including speech recognition and understanding, speech synthesis, speaker authentication and dialogue processing. We discuss how to link Shannon^pi^s channel modeling and decoding paradigm and the hidden Markov modeling framework to solve many of the problems in human language technologies. Finally we investigate how new wireless and internet services and applications can be designed based on this new set of imperfect technologies. It is also important to know how to extend these voice technologies to multimedia signal processing, intelligent information processing, multi-lingual language processing and multi-modal databases and interfaces so that we can extract information from content, index content for retrieval, organize the retrieval mechanisms and prepare dialogue access interfaces to enhance human-machine communication.
Speaker's Bio
Chin-Hui Lee received the B.S. degree in Electrical Engineering from
National Taiwan University, Taipei, in 1973, the M.S. degree in
Engineering and Applied Science from Yale University, New Haven, in
1977, and the Ph.D. degree in Electrical Engineering with a minor in
Statistics from University of Washington, Seattle, in 1981.
After graduation, Dr. Lee joined Verbex Corporation, Bedford, MA, and was involved in research on connected word recognition. In 1984, he became affiliated with Digital Sound Corporation, Santa Barbara, where he engaged in research and product development in speech coding, speech synthesis, speech recognition and signal processing for the development of the DSC-2000 Voice Server. Since 1986, he has been with Bell Laboratories, Murray Hill, New Jersey, where he is now a Distinguished Member of Technical Staff and Director of the Dialogue Systems Research Department. His research interests include multimedia signal processing, speech and speaker recognition, speech and language modeling, adaptive and discriminative learning, spoken dialogue processing, biometric authentication and information retrieval. His research scope is reflected in a best seller, entitled "Automatic Speech and Speaker Recognition: Advanced Topics", published by the Kluwer Academic Publishers in 1996. Since July 2001 he is a visiting professor at School of Computing, National University of Singapore.
Dr. Lee has participated actively in professional societies. He is a member of the IEEE Signal Processing Society, Communication Society, and the European Speech Communication Association. He is also a lifetime member of the Computational Linguistic Society in Taiwan. In 1991-1995, he was an associate editor for the IEEE Transactions on Signal Processing and Transactions on Speech and Audio Processing. During the same period, he served as a member of the ARPA Spoken Language Coordination Committee. In 1995-1998 he was a member of the Speech Processing Technical Committee of the IEEE Signal Processing Society (SPS) and chaired the Speech TC from 1997 to 1998. In 1996 he helped promote the SPS Multimedia Signal Processing (MMSP) Technical Committee in which he is a founding member.
Dr. Lee is a Fellow of the IEEE. He has published more than 220 papers and 20 patents on the subject of automatic speech and speaker recognition. He received the SPS Senior Award in 1994 and the SPS Best Paper Award in 1997 and 1999, respectively. In 1997, he was also awarded the prestigious Bell Labs President's Gold Award for his contributions to the Lucent Speech Processing Solutions product. More recently he was named one of the six Distinguished Lecturers for the year 2000 by the IEEE Signal Processing Society.
For further information, please contact Nasir Memon.