Subhashini Venugopalan
Research Scientist

I am an ML researcher at Google working on applications of vision, audio, and language technologies to problems motivated in healthcare and sciences. Prior to this, I was a PhD student at UT Austin working on natural language processing, and computer vision. I was a member of the Machine Learning Group, and was advised by Prof. Ray Mooney.

During my PhD I was also fortunate to work with Prof. Trevor Darrell's group at UC Berkeley and with Prof. Kate Saenko at Boston University. I spent several summers working on deep learning projects at Google Research (host: Varun Gulshan) and Google Brain (host: Andrea Frome). Prior to joining UT, I spent a year at IBM Research, India (Blue scholar).

In 2011, I obtained a Masters degree in Computer Science and Engineering from IIT, Madras. My thesis was advised by Prof. C. Pandu Rangan. In 2009, I graduated with a bachelor's degree in Information Technology from NITK, Surathkal.


Research

I work on machine learning applications motivated in healthcare and sciences. I am a key contributor to a number of works featuring in the Healed through A.I. documentary. Some of my work pertains to improving speech recognition systems for users with impaired speech, others to transfer learning for bio/medical data (e.g. detecting diabetic retinopathy, breast cancer), and I have also developed methods to interpret such vision/audio models (model explanation) for medical applications. During my graduate studies, I applied natural language processing and computer vision techniques to generate descriptions of events depicted in videos and images. Please refer to my Google Scholar page for an up-to-date list of my publications.


Talks


Broader talks covering different areas of my research, usually presented at multiple venues.


Publications


SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Shraman Pramanick*, Rama Chellappa, Subhashini Venugopalan*

NeurIPS. Vancouver, Canada. December 2024.
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Dataset]

SkipWriter: LLM-Powered Abbreviated Writing on Tablets
Zheer Xu, Shanqing Cai, Mukund Varma T, Subhashini Venugopalan, Shumin Zhai

UIST. Pittsburg, USA. October 2024.
  • [+Abstract]
  • [Paper (coming..soon)]
  • [Poster]
  • [Slides]

A Design Space for Intelligent and Interactive Writing Assistants
Mina Lee, Katy Ilonka Gero, John Joon Young Chung, Simon Buckingham Shum, Vipul Raheja, Hua Shen, Subhashini Venugopalan, and others.

CHI. Hawaii, USA. May 2024.
  • [+Abstract]
  • [PDF]
  • [Video]

Large Language Models As A Proxy For Human Evaluation In Assessing The Comprehensibility Of Disordered Speech Transcription
Katrin Tomanek, Jimmy Tobin, Subhashini Venugopalan, Richard Cave, Katie Seaver, Jordan R. Green

ICASSP. Seoul, Korea. April 2024
  • [+Abstract]
  • [PDF]
  • [Blogpost]
  • [Poster]

Speech Intelligibility Classifiers From 550K Disordered Speech Samples
Subhashini Venugopalan, Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J.N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan Green, Michael P. Brenner

ICASSP. Rhodes Island, Greece. June 2023.
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Slides]
  • [Poster]
  • [Video]

Is Attention All That NeRF Needs?
Mukund Varma T*, Peihao Wang*, Xuxi Chen, Tianlong Chen, Subhashini Venugopalan, Zhangyang Wang
*equal contribution

ICLR. Kigali, Rwanda. May 2023.
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Project Page]

SpeakFaster Observer: Long-Term Instrumentation of Eye-Gaze Typing for Measuring AAC Communication
Shanqing Cai, Subhashini Venugopalan, Katrin Tomanek, Shaun Kane, Meredith Ringel Morris, Richard Cave, Bob MacDonald, Jon Campbell, Blair Casey, Emily Kornman, Daniel Vance, Jay Beavers

CHI. Hamburg, Germany. April 2023.
  • [+Abstract]
  • [PDF]

Sparse Winning Tickets are Data-Efficient Image Recognizers
Mukund Varma, Xuxi Chen, Zhenyu Zhang, Tianlong Chen, Subhashini Venugopalan, Zhangyang Wang

NeurIPS. New Orleans, USA. Dec. 2022.
  • [+Abstract]
  • [PDF]
  • [Slides]
  • [Poster]
  • [Code]

Healthcare applications, Interpretability


Context-Aware Abbreviation Expansion Using Large Language Models
Shanqing Cai*, Subhashini Venugopalan*, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael P. Brenner
*equal contribution

NAACL. Seattle, USA (+Virtual). July 2022.
  • [+Abstract]
  • [PDF]
  • [Slides]

A machine-learning based objective measure for ALS disease severity
Fernando G. Vieira*, Subhashini Venugopalan*, Alan S. Premasiri, Maeve McNally, Aren Jansen, Kevin McCloskey, Michael P. Brenner, Steven Perrin
*equal contribution

npj Digital Medicine. April 2022
  • [+Abstract]
  • [PDF]

Integrating deep learning and unbiased automated high-content screening to identify complex disease signatures in human fibroblasts.
Lauren Schiff, Bianca Migliori, Ye Chen, Deidre Carter, Caitlyn Bonilla, Jenna Hall, Minjie Fan, Edmund Tam, Sara Ahadi, Brodie Fischbacher, Anton Geraschenko, Christopher J. Hunter, Subhashini Venugopalan, and 30 others.

Nature Communications. March 2022
  • [+Abstract]
  • [PDF]
  • [Tweet]

TRILLsson: Distilled Universal Paralinguistic Speech Representations
Joel Shor, Subhashini Venugopalan

INTERSPEECH 2022. Incheon, South Korea (+Virtual). September 2022.
  • [+Abstract]
  • [PDF]
  • [Blogpost]

Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases
Subhashini Venugopalan, Joel Shor, Manoj Plakal, Jimmy Tobin, Katrin Tomanek, Jordan Green, Michael Brenner

INTERSPEECH 2021. Brno, Czechia (+Virtual). August 2021.
  • [+Abstract]
  • [PDF]
  • [Slides]
  • [Video]

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise
Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi

IEEE Conference on Computer Vision and Pattern Recognition. Virtual. June 2021. (CVPR 2021)
  • [+Abstract]
  • [PDF]
  • [Project Page]
  • [Poster]
  • [Code]

Scaling Symbolic Methods using Gradients for Neural Model Explanation
Subham Sekhar Sahoo, Subhashini Venugopalan, Li Li, Rishabh Singh, Patrick Riley

International Conference on Learned Representations (ICLR). May 2021.
  • [+Abstract]
  • [PDF]
  • [Slides]
  • [Code]

Predicting Risk of Developing Diabetic Retinopathy using Deep Learning
Ashish Bora, Siva Balasubramanian, Boris Babenko, Sunny Virmani, Subhashini Venugopalan, Akinori Mitani, Guilherme de Oliveira Marinho, Jorge Cuadros, Paisan Ruamviboonsuk, Greg S Corrado, Lily Peng, Dale R Webster, Avinash V Varadarajan, Naama Hammel, Yun Liu, Pinal Bavishi

THE LANCET, Digital Health, January 2021.
  • [+Abstract]
  • [Paper]

Scientific Discovery by Generating Counterfactuals using Image Translation
Arunachalam Narayanaswamy*, Subhashini Venugopalan*, Dale R. Webster, Lily Peng, Greg Corrado, Paisan Ruamviboonsuk, Pinal Bavishi, Rory Sayres, Abigail Huang, Siva Balasubramanian, Michael Brenner, Philip Nelson, Avinash V. Varadarajan
*equal contribution

International Conference on Medical Image Computing and Computer Assisted Intervention (Lima, Peru) Virtual, October 2020. (MICCAI 2020)
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Slides]
  • [Video]

Attribution in Scale and Space
Shawn Xu, Subhashini Venugopalan, Mukund Sundararajan

IEEE Conference on Computer Vision and Pattern Recognition, (Seattle, USA) Virtual June 2020. (CVPR 2020)
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Slides]
  • [Video]

Predicting optical coherence tomography-derived diabetic macular edema grades from fundus photographs using deep learning
Avinash V Varadarajan, Pinal Bavishi, Paisan Ruamviboonsuk, Peranut Chotcomwongse, Subhashini Venugopalan, Arunachalam Narayanaswamy, Jorge Cuadros, Kuniyoshi Kanai, George Bresnick, Mongkol Tadarati, Sukhum Silpa-Archa, Jirawut Limwattanayingyong, Variya Nganthavee, Joseph R Ledsam, Pearse A Keane, Greg S Corrado, Lily Peng, Dale R Webster

Nature Communications, 2020.
  • [+Abstract]
  • [PDF]

Detection of anaemia from retinal fundus images via deep learning
Akinori Mitani, Abigail Huang, Subhashini Venugopalan, Greg S Corrado, Lily Peng, Dale R Webster, Naama Hammel, Yun Liu, Avinash V Varadarajan

Nature BioMedical Engineering, 2019.
  • [+Abstract]
  • [PDF]
  • [Blog Post]

Batch Equalization with a Generative Adversarial Network
Wesley Wei Qian, Cassandra Xia, Subhashini Venugopalan, Arunachalam Narayanaswamy, Jian Peng, D Michael Ando

European Conference on Computational Biology (ECCB) 2020.
(Also in Bioinformatics.)
  • [+Abstract]
  • [Paper]

It's easy to fool yourself: Case studies on identifying bias and confounding in bio-medical datasets
Subhashini Venugopalan*, Arunachalam Narayanaswamy*, Samuel Yang*, Anton Gerashcenko, Scott Lipnick, Nina Makhortova, James Hawrot, Christine Marques, Joao Pereira, Michael Brenner, Lee Rubin, Brian Wainger, Marc Berndl
*equal contribution

NeurIPS Learning Meaningful Representations of Life (LMRL) Workshop, 2019.
  • [+Abstract]
  • [PDF]
  • [Poster]

Applying Deep Neural Network Analysis to High-Content Image-Based Assays
Samuel J Yang*, Scott L Lipnick*, Nina R Makhortova*, Subhashini Venugopalan*, Minjie Fan*, Zan Armstrong, Thorsten M Schlaeger, Liyong Deng, Wendy K Chung, Liadan O’Callaghan, Anton Geraschenko, Dosh Whye, Marc Berndl, Jon Hazard, Brian Williams, Arunachalam Narayanaswamy, D Michael Ando, Philip Nelson, Lee L Rubin
*equal contribution

SLAS DISCOVERY: Advancing Life Sciences R&D, 2019
  • [+Abstract]
  • [PDF]
  • [Slides]

Detecting cancer metastases on gigapixel pathology images
Yun Liu, Krishna Gadepalli, Mohammad Norouzi, George E Dahl, Timo Kohlberger, Aleksey Boyko, Subhashini Venugopalan, Aleksei Timofeev, Philip Q Nelson, Greg S Corrado, Jason D Hipp, Lily Peng, Martin C Stumpe

ArXiv, 2017
  • [+Abstract]
  • [PDF]
  • [Blog Post]

Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs
Varun Gulshan, Lily Peng, Marc Coram, Martin C Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan, Kasumi Widner, Tom Madams, Jorge Cuadros, Ramasamy Kim, Rajiv Raman, Philip C Nelson, Jessica L Mega, Dale R Webster

The Journal of the American Medical Association (JAMA), 2016.
  • [+Abstract]
  • [PDF]
  • [Blog Post]
  • [Best of the Decade]

Vision and Language


Captioning Images with Diverse Objects
Subhashini Venugopalan, Lisa Hendricks, Marcus Rohrbach, Raymond Mooney, Kate Saenko, Trevor Darrell

IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, Hawaii, July 2017. (CVPR 2017)
  • [+Abstract]
  • [arXiv]
  • [Blog Post]
  • [Code]
  • [Project Page]

Semantic Text Summarization of Long Videos
Shagan Sah, Sourabh Kulhare, Allison Gray, Subhashini Venugopalan, Emily Prud'hommeaux, Raymond Ptucha

IEEE Winter Conference on Applications of Computer Vision Santa Rosa, California, March 2017. (WACV 2017)
  • [+Abstract]
  • [PDF]

Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan, Lisa Hendricks, Raymond Mooney, Kate Saenko

Conference on Empirical Methods in Natural Language Processing, Austin, Texas, November 2016. (EMNLP 2016)
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Project Page]

Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data
Lisa Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond Mooney, Kate Saenko, Trevor Darrell

IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, Nevada, June 2016. (CVPR 2016)
  • [+Abstract]
  • [arXiv]
  • [Code]
  • [Project Page]

Sequence to Sequence - Video to Text
Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko

International Conference on Computer Vision, Santiago, Chile, December 2015. (ICCV 2015)
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Project Page]

Translating Videos to Natural Language Using Deep Recurrent Neural Networks
Subhashini Venugopalan, Huijun Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko

North American Chapter of the Association for Computational Linguistics, Denver, Colorado, June 2015. (NAACL-HLT 2015)
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Project Page]

Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue, Lisa Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, Trevor Darrell

IEEE Conference on Computer Vision and Pattern Recognition, Boston, Massachusetts, June 2015. (CVPR 2015)
  • [+Abstract]
  • [PDF]
  • [Project page]

Integrating Language and Vision to Generate Natural Language Descriptions of Videos in the Wild
Jesse Thomason*, Subhashini Venugopalan*, Sergio Guadarrama, Kate Saenko, Raymond Mooney
*equal contribution

International Conference on Computational Linguistics, Dublin, Ireland, August 2014.(COLING 2014)
  • [+Abstract]
  • [PDF]
  • [Code]
  • [Project Page]

YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-shot Recognition
Sergio Guadarrama,Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, Kate Saenko

International Conference on Computer Vision, Sydney, Australia, December 2013.(ICCV 2013)
  • [+Abstract]
  • [PDF]
  • [Poster]


Other Research

In the past I have researched on topics in public policy, theoretical cryptography and social network analysis.


Topic based classification and pattern identification in patents
Subhashini Venugopalan, Varun Rai

Technological Forecasting and Social Change, 2014
  • [+Abstract]
  • [PDF]

People and Entity Retrieval in Implicit Social Networks
Suman K. Pathapati, Subhashini Venugopalan, Ashok P. Kumar, Anuradha Bhamidipaty

IEEE International Conference on Internet Multimedia Systems Architecture and Application, Bangalore, India December 2011. (IMSAA 2011)
  • [+Abstract]
  • [PDF]

A New Approach to Threshold Attribute Based Signatures
S. Sharmila Deva Selvi, Subhashini Venugopalan, C Pandu Rangan

  • [+Abstract]
  • [PDF]

Contact


github.com/vsubhashini