Subhashini Venugopalan

Subhashini
Venugopalan

I am a Research Scientist at Google Research where I focus on developing LLM agents with specialized capabilities for scientific discovery and automated coding. My work centers on enabling models to reason over scientific data and complex codebases to significantly accelerate research workflows. Parallel to this, I lead research on evaluating video understanding to advance how multimodal models can make digital content more accessible to blind and low-vision users.

I am motivated by AI's potential to accelerate scientific breakthroughs and create assistive technologies that empower users. My contributions to healthcare and accessibility were featured in the documentary Healed through A.I. and recognized with a 2025 Bronze Cannes Lions award for Transformative Design.

I earned my Ph.D. from UT Austin, advised by Ray Mooney and co-advised by Kate Saenko and Trevor Darrell. This collaborative environment, enriched by research stays at UC Berkeley and Google Brain internships, enabled my doctoral work to contribute foundational models for video captioning and sequence-to-sequence multimodal reasoning.

Research Focus

Agents & Science

Designing autonomous agents capable of scientific coding, parsing literature, and aiding in complex research reasoning.

Multimodal Learning

Developing models that integrate visual perception, speech, and language reasoning for video understanding.

AI for Accessibility

Creating assistive technologies to empower users with diverse vision, speech and motor capabilities through ML.

Selected Talks

Publications

An AI system to help scientists write expert-level empirical software

An AI system to help scientists write expert-level empirical software

Eser Aygün*, Anastasiya Belyaeva*, Gheorghe Comanici*, Marc Coram*, Subhashini Venugopalan*, (+others), John C Platt, Michael P Brenner

Preprint 2025

CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning

CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning

Hao Cui, Zahra Shamsi, Gowoon Cheon, (+27 others), Subhashini Venugopalan

ICLR 2025

CBGB The Cloud-Based Geospatial Benchmark: Challenges and LLM Evaluation

CBGB The Cloud-Based Geospatial Benchmark: Challenges and LLM Evaluation

Jeffrey A. Cardille, Renee Johnston, Simon Ilyushchenko, Johan Kartiwa, Zahra Shamsi, Matthew Abraham, Khashayar Azad, Kainath Ahmed, Emma Bergeron Quick, Nuala Caughie, Noah Jencz, Karen Dyson, Andrea Puzzi Nicolau, Maria Fernanda Lopez-Ornelas, David Saah, Michael Brenner, Subhashini Venugopalan, Sameera S Ponda

Terrabytes @ ICML 2025, PMLR 2025

Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning

Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning

Chirag Nagpal*, Subhashini Venugopalan*, Jimmy Tobin, Marilyn Ladewig, Katherine Heller, Katrin Tomanek

ICASSP 2025

Towards a Single ASR Model That Generalizes to Disordered Speech

Towards a Single ASR Model That Generalizes to Disordered Speech

Jimmy Tobin, Katrin Tomanek, Subhashini Venugopalan

ICASSP 2025

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

Shraman Pramanick*, Rama Chellappa, Subhashini Venugopalan *

NeurIPS 2024

Using large language models to accelerate communication for eye gaze typing users with ALS

Using large language models to accelerate communication for eye gaze typing users with ALS

Shanqing Cai, Subhashini Venugopalan, Katie Seaver, Xiang Xiao, Katrin Tomanek, Sri Jalasutram, Meredith Ringel Morris, Shaun Kane, Ajit Narayanan, Robert L. MacDonald, Emily Kornman, Daniel Vance, Blair Casey, Steve M. Gleason, Philip Q. Nelson, Michael P. Brenner

Nature Comms 2024

SkipWriter: LLM-Powered Abbreviated Writing on Tablets

SkipWriter: LLM-Powered Abbreviated Writing on Tablets

Zheer Xu, Shanqing Cai, Mukund Varma T, Subhashini Venugopalan , Shumin Zhai

UIST 2024

A Design Space for Intelligent and Interactive Writing Assistants

A Design Space for Intelligent and Interactive Writing Assistants

Mina Lee, Katy Ilonka Gero, John Joon Young Chung, Simon Buckingham Shum, Vipul Raheja, Hua Shen, Subhashini Venugopalan , and others.

CHI 2024

Large Language Models As A Proxy For Human Evaluation In Assessing The Comprehensibility Of Disordered Speech Transcription

Large Language Models As A Proxy For Human Evaluation In Assessing The Comprehensibility Of Disordered Speech Transcription

Katrin Tomanek, Jimmy Tobin, Subhashini Venugopalan , Richard Cave, Katie Seaver, Jordan R. Green

ICASSP 2024

Speech Intelligibility Classifiers From 550K Disordered Speech Samples

Speech Intelligibility Classifiers From 550K Disordered Speech Samples

Subhashini Venugopalan , Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J.N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan Green, Michael P. Brenner

ICASSP 2023

Is Attention All That NeRF Needs?

Is Attention All That NeRF Needs?

Mukund Varma T*, Peihao Wang*, Xuxi Chen, Tianlong Chen, Subhashini Venugopalan , Zhangyang Wang

ICLR 2023

SpeakFaster Observer: Long-Term Instrumentation of Eye-Gaze Typing for Measuring AAC Communication

SpeakFaster Observer: Long-Term Instrumentation of Eye-Gaze Typing for Measuring AAC Communication

Shanqing Cai, Subhashini Venugopalan , Katrin Tomanek, Shaun Kane, Meredith Ringel Morris, Richard Cave, Bob MacDonald, Jon Campbell, Blair Casey, Emily Kornman, Daniel Vance, Jay Beavers

CHI 2023

Sparse Winning Tickets are Data-Efficient Image Recognizers

Sparse Winning Tickets are Data-Efficient Image Recognizers

Mukund Varma, Xuxi Chen, Zhenyu Zhang, Tianlong Chen, Subhashini Venugopalan , Zhangyang Wang

NeurIPS 2022

Context-Aware Abbreviation Expansion Using Large Language Models

Context-Aware Abbreviation Expansion Using Large Language Models

Shanqing Cai*, Subhashini Venugopalan *, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael P. Brenner

NAACL 2022

A machine-learning based objective measure for ALS disease severity

A machine-learning based objective measure for ALS disease severity

Fernando G. Vieira*, Subhashini Venugopalan *, Alan S. Premasiri, Maeve McNally, Aren Jansen, Kevin McCloskey, Michael P. Brenner, Steven Perrin

npj Digital Medicine 2022

Integrating deep learning and unbiased automated high-content screening to identify complex disease signatures in human fibroblasts.

Integrating deep learning and unbiased automated high-content screening to identify complex disease signatures in human fibroblasts.

Lauren Schiff, Bianca Migliori, Ye Chen, Deidre Carter, Caitlyn Bonilla, Jenna Hall, Minjie Fan, Edmund Tam, Sara Ahadi, Brodie Fischbacher, Anton Geraschenko, Christopher J. Hunter, Subhashini Venugopalan , and 30 others.

Nature Comms 2022

TRILLsson: Distilled Universal Paralinguistic Speech Representations

TRILLsson: Distilled Universal Paralinguistic Speech Representations

Joel Shor, Subhashini Venugopalan

INTERSPEECH 2022

Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases

Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases

Subhashini Venugopalan , Joel Shor, Manoj Plakal, Jimmy Tobin, Katrin Tomanek, Jordan Green, Michael Brenner

INTERSPEECH 2021

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

Andrei Kapishnikov, Subhashini Venugopalan , Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi

CVPR 2021

Scaling Symbolic Methods using Gradients for Neural Model Explanation

Scaling Symbolic Methods using Gradients for Neural Model Explanation

Subham Sekhar Sahoo, Subhashini Venugopalan , Li Li, Rishabh Singh, Patrick Riley

ICLR 2021

Predicting Risk of Developing Diabetic Retinopathy using Deep Learning

Predicting Risk of Developing Diabetic Retinopathy using Deep Learning

Ashish Bora, Siva Balasubramanian, Boris Babenko, Sunny Virmani, Subhashini Venugopalan , Akinori Mitani, Guilherme de Oliveira Marinho, Jorge Cuadros, Paisan Ruamviboonsuk, Greg S Corrado, Lily Peng, Dale R Webster, Avinash V Varadarajan, Naama Hammel, Yun Liu, Pinal Bavishi

THE LANCET, Digital Health 2021

Scientific Discovery by Generating Counterfactuals using Image Translation

Scientific Discovery by Generating Counterfactuals using Image Translation

Arunachalam Narayanaswamy*, Subhashini Venugopalan *, Dale R. Webster, Lily Peng, Greg Corrado, Paisan Ruamviboonsuk, Pinal Bavishi, Rory Sayres, Abigail Huang, Siva Balasubramanian, Michael Brenner, Philip Nelson, Avinash V. Varadarajan

MICCAI 2020

Attribution in Scale and Space

Attribution in Scale and Space

Shawn Xu, Subhashini Venugopalan , Mukund Sundararajan

CVPR 2020

Predicting optical coherence tomography-derived diabetic macular edema grades from fundus photographs using deep learning

Predicting optical coherence tomography-derived diabetic macular edema grades from fundus photographs using deep learning

Avinash V Varadarajan, Pinal Bavishi, Paisan Ruamviboonsuk, Peranut Chotcomwongse, Subhashini Venugopalan , Arunachalam Narayanaswamy, Jorge Cuadros, Kuniyoshi Kanai, George Bresnick, Mongkol Tadarati, Sukhum Silpa-Archa, Jirawut Limwattanayingyong, Variya Nganthavee, Joseph R Ledsam, Pearse A Keane, Greg S Corrado, Lily Peng, Dale R Webster

Nature Comms 2020

Batch Equalization with a Generative Adversarial Network

Batch Equalization with a Generative Adversarial Network

Wesley Wei Qian, Cassandra Xia, Subhashini Venugopalan , Arunachalam Narayanaswamy, Jian Peng, D Michael Ando

ECCB 2020

Detection of anaemia from retinal fundus images via deep learning

Detection of anaemia from retinal fundus images via deep learning

Akinori Mitani, Abigail Huang, Subhashini Venugopalan , Greg S Corrado, Lily Peng, Dale R Webster, Naama Hammel, Yun Liu, Avinash V Varadarajan

Nature BME 2019

It's easy to fool yourself: Case studies on identifying bias and confounding in bio-medical datasets

It's easy to fool yourself: Case studies on identifying bias and confounding in bio-medical datasets

Subhashini Venugopalan*, Arunachalam Narayanaswamy*, Samuel Yang*, Anton Gerashcenko, Scott Lipnick, Nina Makhortova, James Hawrot, Christine Marques, Joao Pereira, Michael Brenner, Lee Rubin, Brian Wainger, Marc Berndl

NeurIPS LMRL 2019

Applying Deep Neural Network Analysis to High-Content Image-Based Assays

Applying Deep Neural Network Analysis to High-Content Image-Based Assays

Samuel J Yang*, Scott L Lipnick*, Nina R Makhortova*, Subhashini Venugopalan*, Minjie Fan*, Zan Armstrong, Thorsten M Schlaeger, Liyong Deng, Wendy K Chung, Liadan O' Callaghan, Anton Geraschenko, Dosh Whye, Marc Berndl, Jon Hazard, Brian Williams, Arunachalam Narayanaswamy, D Michael Ando, Philip Nelson, Lee L Rubin

SLAS DISCOVERY: Advancing Life Sciences R&D 2019

Detecting cancer metastases on gigapixel pathology images

Detecting cancer metastases on gigapixel pathology images

Yun Liu, Krishna Gadepalli, Mohammad Norouzi, George E Dahl, Timo Kohlberger, Aleksey Boyko, Subhashini Venugopalan , Aleksei Timofeev, Philip Q Nelson, Greg S Corrado, Jason D Hipp, Lily Peng, Martin C Stumpe

ArXiv 2017

Captioning Images with Diverse Objects

Captioning Images with Diverse Objects

Subhashini Venugopalan , Lisa Hendricks, Marcus Rohrbach, Raymond Mooney, Kate Saenko, Trevor Darrell

CVPR 2017

Semantic Text Summarization of Long Videos

Semantic Text Summarization of Long Videos

Shagan Sah, Sourabh Kulhare, Allison Gray, Subhashini Venugopalan , Emily Prud'hommeaux, Raymond Ptucha

WACV 2017

Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs

Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs

Varun Gulshan, Lily Peng, Marc Coram, Martin C Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan , Kasumi Widner, Tom Madams, Jorge Cuadros, Ramasamy Kim, Rajiv Raman, Philip C Nelson, Jessica L Mega, Dale R Webster

JAMA 2016

Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text

Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text

Subhashini Venugopalan , Lisa Hendricks, Raymond Mooney, Kate Saenko

EMNLP 2016

Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data

Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data

Lisa Hendricks, Subhashini Venugopalan , Marcus Rohrbach, Raymond Mooney, Kate Saenko, Trevor Darrell

CVPR 2016

Sequence to Sequence - Video to Text

Sequence to Sequence - Video to Text

Subhashini Venugopalan , Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko

ICCV 2015

Translating Videos to Natural Language Using Deep Recurrent Neural Networks

Translating Videos to Natural Language Using Deep Recurrent Neural Networks

Subhashini Venugopalan , Huijun Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko

NAACL-HLT 2015

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Jeff Donahue, Lisa Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan , Kate Saenko, Trevor Darrell

CVPR 2015

Integrating Language and Vision to Generate Natural Language Descriptions of Videos in the Wild

Integrating Language and Vision to Generate Natural Language Descriptions of Videos in the Wild

Jesse Thomason*, Subhashini Venugopalan *, Sergio Guadarrama, Kate Saenko, Raymond Mooney

COLING 2014

YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-shot Recognition

YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-shot Recognition

Sergio Guadarrama,Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan , Raymond Mooney, Trevor Darrell, Kate Saenko

ICCV 2013