The Llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 1206 | 2024 |
Multi-modal sensor based emotion recognition and emotional interface O Kalinli-Akbacak US Patent 9,031,293, 2015 | 243 | 2015 |
A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech. O Kalinli, SS Narayanan Interspeech 2007, 1941-1944, 2007 | 134 | 2007 |
Adaptive displays using gaze tracking O Kalinli US Patent 8,493,390, 2013 | 133 | 2013 |
Noise adaptive training for robust automatic speech recognition O Kalinli, ML Seltzer, J Droppo, A Acero IEEE Transactions on Audio, Speech, and Language Processing 18 (8), 1889-1901, 2010 | 103 | 2010 |
Prompting large language models with speech recognition abilities Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 93 | 2024 |
Interface using eye tracking contact lenses R Chen, O Kalinli US Patent 8,632,182, 2014 | 90 | 2014 |
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ... arXiv preprint arXiv:2104.02194, 2021 | 84 | 2021 |
Prominence detection using auditory attention cues and task-dependent high level information O Kalinli, S Narayanan IEEE Transactions on Audio, Speech, and Language Processing 17 (5), 1009-1024, 2009 | 83 | 2009 |
Apparatus and method for determining relevance of input speech O Kalinli US Patent App. 13/083,356, 2012 | 73 | 2012 |
Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition O Kalinli, ML Seltzer, A Acero 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 72 | 2009 |
Emotion recognition using auditory attention cues extracted from users voice O Kalinli-Akbacak US Patent 9,020,822, 2015 | 46 | 2015 |
Combining auditory attention cues with phoneme posterior scores for phone/vowel/syllable boundary detection O Kalinli-Akbacak US Patent 9,672,811, 2017 | 43 | 2017 |
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing O Kalinli, S Sundaram, S Narayanan 2009 IEEE International Workshop on Multimedia Signal Processing, 1-6, 2009 | 43 | 2009 |
Speech syllable/vowel/phone boundary detection using auditory attention cues O Kalinli, R Chen US Patent 8,756,061, 2014 | 36 | 2014 |
Method for tone/intonation recognition using auditory attention cues O Kalinli US Patent 8,676,574, 2014 | 33 | 2014 |
Semantic distance: A new metric for ASR performance analysis towards spoken language understanding S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer arXiv preprint arXiv:2104.02138, 2021 | 31 | 2021 |
A top-down auditory attention model for learning task dependent influences on prominence detection in speech O Kalinli, S Narayanan 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 27 | 2008 |
Dissecting user-perceived latency of on-device E2E speech recognition Y Shangguan, R Prabhavalkar, H Su, J Mahadeokar, Y Shi, J Zhou, C Wu, ... arXiv preprint arXiv:2104.02207, 2021 | 26 | 2021 |
Scaling ASR improves zero and few shot learning A Xiao, W Zheng, G Keren, D Le, F Zhang, C Fuegen, O Kalinli, Y Saraf, ... arXiv preprint arXiv:2111.05948, 2021 | 22 | 2021 |