| dc.contributor.author | Filippatou, Viktoria | |
| dc.date.accessioned | 2024-11-28T09:24:09Z | |
| dc.date.available | 2024-11-28T09:24:09Z | |
| dc.date.issued | 2024-11-28 | |
| dc.identifier.uri | https://hdl.handle.net/2077/84376 | |
| dc.description.abstract | Figurative language is an integral part of human communication and everyday life. As a Natural Language Processing task it has long been a focus of research, and recently it has been recast as a vision and language task, where multi-modal models seem to outperform uni-modal ones. This thesis explores how a transformer-based vision and language model, specifically VisualBERT, understands figurative language (idioms, metaphors, and similes) and examines whether its visual embeddings can be enhanced to align better with figurative meaning. Understanding these alignments is critical for assessing whether such models can truly grasp the abstract and symbolic layers of language, beyond surface-level pattern recognition. Through a series of experiments and attention analysis, this research highlights both the potential and the limitations of a vision and language model, illuminating the broader challenges in grounding language in visual contexts. | sv |
| dc.language.iso | eng | sv |
| dc.subject | figurative language, vision, language, VisualBERT | sv |
| dc.title | FINDING MEANING IN A HAYSTACK: On How Vision and Language Models Process Figurative Language | sv |
| dc.title.alternative | FINDING MEANING IN A HAYSTACK: On How Vision and Language Models Process Figurative Language | sv |
| dc.type | Text | |
| dc.setspec.uppsok | HumanitiesTheology | |
| dc.type.uppsok | H2 | |
| dc.contributor.department | University of Gothenburg / Department of Philosophy, Linguistics and Theory of Science | eng |
| dc.contributor.department | Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori | swe |
| dc.type.degree | Student essay | |