ImageSI: Interactive Deep Learning for Image Semantic Interaction

Lin, Jiayue

ImageSI: Interactive Deep Learning for Image Semantic Interaction

Files

Lin_J_T_2024.pdf (3.33 MB)

Downloads: 77

Date

2024-06-04

Authors

Lin, Jiayue

Publisher

Virginia Tech

Abstract

Interactive deep learning frameworks are crucial for effectively exploring and analyzing complex image datasets in visual analytics. However, existing approaches often face challenges related to inference accuracy and adaptability. To address these issues, we propose ImageSI, a framework integrating deep learning models with semantic interaction techniques for interactive image data analysis. Unlike traditional methods, ImageSI directly incorporates user feedback into the image model, updating underlying embeddings through customized loss functions, thereby enhancing the performance of dimension reduction tasks. We introduce three variations of ImageSI, ImageSI $t e x t M D S −1$ , prioritizing explicit pairwise relationships from user interaction, and ImageSI $t e x t D R T r i p l e t$ and ImageSI $t e x t P H T r i p l e t$ , emphasizing clustering by defining groups of images based on user input. Through usage scenarios and quantitative analyses centered on algorithms, we demonstrate the superior performance of ImageSI $t e x t D R T r i p l e t$ and ImageSI $t e x t M D S −1$ in terms of inference accuracy and interaction efficiency. Moreover, ImageSI $t e x t P H T r i p l e t$ shows competitive results. The baseline model, WMDS $−1$ , generally exhibits lower performance metrics.

Keywords

Semantic Interaction, Deep Learning, Dimension Reduction, Images

Persistent link

https://hdl.handle.net/10919/119283

Collections

Masters Theses

Full item page

ImageSI: Interactive Deep Learning for Image Semantic Interaction

Files

TR Number

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

Persistent link

Collections