Show simple item record

dc.contributor.authorFilippatou, Viktoria
dc.date.accessioned2024-11-28T09:24:09Z
dc.date.available2024-11-28T09:24:09Z
dc.date.issued2024-11-28
dc.identifier.urihttps://hdl.handle.net/2077/84376
dc.description.abstractFigurative language is an integral part of human communication and everyday life. As a Natural Language Processing task it has long been the focus of attention in research, and recently it has been translated into a vision and language task, where multi-modal models seem to outperform uni-modal ones. This thesis explores how a vision and language transformer-based model, specifically VisualBERT, understands figurative language -idioms, metaphors, and similes- and examines if its visual embeddings can be enhanced to align better with figurative meaning. Understanding these alignments is critical for assessing whether these models can truly grasp the abstract and symbolic layers of language, beyond surface-level pattern recognition. Through a series of experiments and attention analysis, this research highlights both the potential and limitations of a vision and language model, illuminating the broader challenges in grounding language to visual contexts.sv
dc.language.isoengsv
dc.subjectfigurative language, vision, language, VisualBertsv
dc.titleFINDING MEANING IN A HAYSTACK: On How Vision and Language Models Process Figurative Languagesv
dc.title.alternativeFINDING MEANING IN A HAYSTACK: On How Vision and Language Models Process Figurative Languagesv
dc.typeText
dc.setspec.uppsokHumanitiesTheology
dc.type.uppsokH2
dc.contributor.departmentUniversity of Gothenburg / Department of Philosophy,Lingustics and Theory of Scienceeng
dc.contributor.departmentGöteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteoriswe
dc.type.degreeStudent essay


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record