Vision-Language Models for Biomedical Applications

Thapa, Surendrabikram; Naseem, Usman; Zhou, Luping; Kim, Jinman

Date

2024-10-28

Publisher

ACM

Abstract

Vision-language models (VLMs) are transforming the landscape of biomedical research and healthcare by enabling the seamless integration and interpretation of complex multimodal data, including medical images and clinical texts. Recognizing the growing impact of these models, the first international workshop on Vision-Language Models for Biomedicine (VLM4Bio) was held in conjunction with ACM Multimedia 2024. The workshop aimed to address the critical need for advanced techniques that can leverage VLMs in applications such as medical imaging, diagnostics, and personalized treatment. As healthcare data increasingly involves both visual and textual information, VLM4Bio provided a platform for interdisciplinary collaboration between experts in natural language processing, computer vision, biomedical engineering, and AI ethics. This paper provides an overview of the inaugural edition of the VLM4Bio workshop, summarizing the key discussions, contributions, and future directions for expanding the workshop’s scope and influence in subsequent editions.

Persistent link

https://hdl.handle.net/10919/121530

Collections

Journal Articles, Association for Computing Machinery (ACM)
Scholarly Works, Sanghani Center for Artificial Intelligence and Data Analytics
