Shruthi Bannur

Microsoft

14 Papers

8 Citations

Shruthi Bannur is an academic researcher from Microsoft. The author has contributed to research in topics: Computer science & Semantics (computer science). The author has an hindex of 3, co-authored 3 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Shruthi Bannur's Top Papers

Chat about Author

Papers

•Book Chapter•10.1007/978-3-031-20059-5_1

Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Benedikt Böcking, +10 more

- 21 Apr 2022

- Lecture Notes in Computer Science

TL;DR: This paper proposed a self-supervised joint vision-language approach with a focus on better text modelling, which achieved state-of-the-art results in radiology natural language inference through its improved vocabulary and novel language pretraining objective leveraging semantics and discourse characteristics in radiological reports.

...read moreread less

189

Journal Article•10.48550/arXiv.2301.04558

Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

Shruthi Bannur, +14 more

- 11 Jan 2023

- arXiv.org

TL;DR: BioViL-T as discussed by the authors uses a CNN-Transformer hybrid multi-image encoder trained jointly with a text model, achieving state-of-the-art performance on progression classification, phrase grounding, and report generation.

...read moreread less

10.1145/3613904.3642013

Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

Nur Yildirim, +20 more

- 22 Feb 2024

TL;DR: This work engaged in an iterative, multidisciplinary design process to envision clinically relevant VLM interactions, and co-designed four VLM use concepts: Draft Report Generation, Augmented Report Review, Visual Search and Querying, and Patient Imaging History Highlights.

...read moreread less

Journal Article•10.48550/arxiv.2401.10815

RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision

Fernando P'erez-Garc'ia, +14 more

- 19 Jan 2024

- arXiv.org

TL;DR: RAD-DINO is introduced, a biomedical image encoder pre-trained solely on unimodal biomedical imaging data that obtains similar or greater performance than state-of-the-art biomedical language supervised models on a diverse range of benchmarks.

...read moreread less

Patent

Systems and methods for monitoring driver state

Akshay Uttama Nambi Srirangam Narashiman, +4 more

- 31 May 2018

TL;DR: In this article, a system and techniques for monitoring driver state are described, which is adapted to receive a set of color images of a person, such as image of a driver of a vehicle with varying levels of illumination in the images.

...read moreread less