Siddiqui, A.

Latent diffusion for generative visual attribution in medical image diagnostics

Masters thesis

Siddiqui, A. 2023. Latent diffusion for generative visual attribution in medical image diagnostics. Masters thesis Middlesex University

Publication dates
Type	Masters thesis
Title	Latent diffusion for generative visual attribution in medical image diagnostics
Authors	Siddiqui, A.
Abstract	Visual attribution in medical imaging seeks to make evident the diagnostically-relevant components of a medical image, in contrast to the more common detection of diseased tissue deployed in conventional machine vision pipelines (due to the inherent learning nature of these latter models, they are typically not easily interpretable/explainable to clinicians). State-of-the-art techniques in visual attribution generally consist of different variants of deep neural networks, implemented as classifiers, or segmenters. However, they have not thus far included an explicit linguistic component. We here present a novel generative visual attribution technique, one that leverages latent diffusion models in combination with domain-specific large language models, in order to generate normal counterparts of abnormal images. The discrepancy between the two hence gives rise to a mapping indicating the diagnostically-relevant image components. To achieve this, we deploy image priors in conjunction with appropriate conditioning mechanisms in order to control the image generative process, including natural language text prompts acquired from medical science and applied radiology. We perform experiments and quantitatively evaluate our results on the COVID-19 Radiography Database containing labelled chest X-rays with differing pathologies via the Frechet Inception Distance (FID), Structural Similarity (SSIM) and Multi Scale Structural Similarity Metric (MS-SSIM) metrics obtained between real and generated images. The resulting system also exhibits a range of latent capabilities including super-resolution and zero-shot localized disease induction, which are evaluated with real examples from the cheXpert dataset.
Keywords	Visual Attribution; Explainable AI; Diffusion models; Medical imaging
Sustainable Development Goals	9 Industry, innovation and infrastructure
	3 Good health and well-being
Middlesex University Theme	Health & Wellbeing
Department name	Computer Science
	Science and Technology
Institution name	Middlesex University
Publisher	Middlesex University Research Repository
Online	28 Mar 2024
Publication process dates
Accepted	18 Sep 2023
Deposited	28 Mar 2024
Output status	Published
Accepted author manuscript	AASiddiqui thesis EMBARGO.pdf File Access Level Open
Related Output
Has version	https://doi.org/10.1038/s41598-024-81646-x
Language	English

Permalink -

https://repository.mdx.ac.uk/item/116z34

Log in to edit

Download files

Accepted author manuscript

	AASiddiqui thesis EMBARGO.pdf
File access level: Open

73
total views
17
total downloads
1
views this month
1
downloads this month

Latent diffusion for generative visual attribution in medical image diagnostics

Download files

Accepted author manuscript

73

17

1

1

Export as