Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Is Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy Ye-Yi Wang, Alex Acero, Ciprian Chelba IEEE Workshop on Automatic Speech Recognition and Understanding | January 2003 Project Project
Publication A Speech-Centric Perspective for Human-Computer Interface Li Deng, Alex Acero, Ye-Yi Wang, Kuansan Wang, Hsiao-Wuen Hon, Jasha Droppo, Milind Mahajan, Xuedong Huang Proc. of the IEEE Fifth Workshop on Multimedia Signal Processing | December 2002 Proc. of the IEEE Fifth Workshop on Multimedia Signal Processing
Publication Integrating Multiple Knowledge Sources for Utterance-Level Confidence Annotation in the CMU Communicator Spoken Dialog System Dan Bohus, Alexander I. Rudnicky CMU-CS-02-190 | November 2002 University of Washington Computer Science & Engineering Technical Report
Publication A System for Spoken Query Information Retrieval on Mobile Devices Eric Chang, Frank Seide, Helen M. Meng, Zhuoran Chen, Yu Shi, Yuk-Chi Li IEEE Transactions on Speech and Audio Processing | October 2002, Vol 10(8): pp. 531-541
Publication Automatic Speech Recognition for Wireless Mobile Devices. Richard C. Rose, Sarangarajan Parthasarathy September 2002
Publication Unsupervised speaker segmentation of telephone conversations. Aaron E. Rosenberg, Allen Gorin, Zhu Liu, Sarangarajan Parthasarathy ICSLP 2002 | September 2002
Publication A Multi-Class Approach for Modelling Out-of-Vocabulary Words Proc. Int. Conf. on Spoken Language Processing | September 2002 Proc. Int. Conf. on Spoken Language Processing
Publication Log-Domain Speech Feature Enhancement Using Sequential MAP Noise Estimation and a Phase-sensitive Model of the Acoustic Environment Li Deng, Jasha Droppo, Alex Acero Proc. International Conference on Spoken Language Processing | September 2002 Proc. International Conference on Spoken Language Processing