Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Conditional ML Estimation Using Rational Function Growth Transform Ciprian Chelba, Alex Acero Snowbird Learning Workshop, Proc. of the Snowbird Learning Workshop | April 2004 Proc. of the Snowbird Learning Workshop
Publication Enhancement of Log Mel Power Spectra of Speech Using a Phase-Sensitive Model of the Acoustic Environment and Sequential Estimation of the Corrupting Noise Li Deng, Jasha Droppo, Alex Acero IEEE Transactions on Speech and Audio Processing | March 2004, Vol 12: pp. 133-143
Publication The Use of SVM for Chinese New Word Identification Hongqiao Li, Chang-Ning Huang, Jianfeng Gao, Xiaozhong Fan March 2004
Publication Comparison of Sentential-Stress Allocation within Base Phrase among Different Reading Styles Min Chu, Mingzhen Bao Proc. of International Conference on Speech Prosody | March 2004
Publication A Hybrid Approach to Rendering Handwritten Characters Sara L. Su, Chenyu Wu, Ying-Qing Xu, Heung-Yeung Shum Proceedings of WSCG | February 2004
Publication Initial Development of a Voice-Activated Astronaut Assistant for Procedural Tasks: From Need to Concept to Prototype Gregory Aist, Dan Bohus, Brad Boven, Ellen Campana, Susana Early, Steven Phan Interactive Instruction Development | January 2004, Vol 16(3): pp. 32-36
Publication Error Awareness and Recovery in Task-Oriented Spoken Dialogue Systems Dan Bohus January 2004 January 2004
Publication Advances in Large Vocabulary Speech Recognition Geoffrey Zweig MSR-TR-2004-154 | January 2004 Advances in Computers, Elsevier Science
Publication Arc Minimization in Finite State Decoding Graphs with Cross-Word Decoding Context Geoffrey Zweig MSR-TR-2004-153 | January 2004 Computer Speech and Language. Vol. 18, 2004