Selected Publications
-
MULTIVOX: A Benchmark for Evaluating Voice Assistants for Multimodal Interactions
Ramaneswaran S*, Ashish Seth*, Nishit Anand, Utkarsh Tyagi,
Sonal Kumar, Sreyan Ghosh, Dinesh Manocha
EMNLP 2025
-
Do audio-visual large language models really see and hear?
Ramaneswaran S, Koushik Jayakumar, S Sakshi,
Sreyan Ghosh, Ruohan Gao, Dinesh Manocha
Under review, CVPR 2026
-
Peeking Into The Future For Contextual Biasing
Ramaneswaran S, Cindy Tseng, Eesung Kim,
Vijendra Raj Apsingekar, Yun Tang
Under review, ICASSP 2026
-
Do Audio-Language Models Understand Linguistic Variations?
Ramaneswaran S*, Sonal Kumar*, Hemant Giri*,
Ashish Seth, Sreyan Ghosh, Dinesh Manocha
NAACL 2025
-
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning
Ashish Seth*, Ramaneswaran S*, S Sakshi,
Sonal Kumar, Sreyan Ghosh, Dinesh Manocha
EMNLP 2024
-
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
S Sakshi*, Utkarsh Tyagi*, Sonal Kumar*, Ashish Seth*,
Ramaneswaran S*, Oriol Nieto et al.
ICLR 2025