Title

THE EFFECT OF LANGUAGE INPUT IN SEMANTIC SEGMENTATION AND GRADING OF COLORECTAL CANCER WITH LANGUAGE-VISION MODELS

Abstract

Abstract

Accurately diagnosing and grading colorectal cancer from histopathological images is crucial for effective treatment. Traditionally, this grading relies on visual examination by pathologists, a process that is inherently time-consuming, subjective, and prone to inter-observer variability. Deep learning offers a promising avenue for automating this process, but two major challenges remain: limited availability of large, annotated datasets and a lack of dedicated semantic segmentation models for this task. This study proposes a modified architecture of MaskCLIP, a powerful vision-language model, fine-tuned on our annotated colorectal cancer image dataset for semantic segmentation and grading of colorectal cancer images. This marks the first application of such a model to this specific task, offering a way to leverage textual information in histopathological analysis. We conduct a comparative analysis of vision-language and vision-only MaskCLIP variants to understand the impact of language guidance on model performance. By investigating whether integrating MaskCLIP as a vision-language module surpasses the capabilities of traditional vision-only models, we aim to uncover the potential benefits of incorporating textual data in histopathological analysis. This research provides valuable insights into the role of language in enhancing deep learning models for colorectal cancer analysis, particularly for semantic segmentation.

Supervisor(s)

Supervisor(s)

SINA SEHLAVER

Date and Location

Date and Location

2024-07-29 09:30:00

Category

Category

MSc_Thesis