Who is laughing now? Laughter-infused dialogue systems

Maraev, Vladislav

dc.contributor.author	Maraev, Vladislav
dc.date.accessioned	2022-08-16T08:07:19Z
dc.date.available	2022-08-16T08:07:19Z
dc.date.issued	2022-08-16
dc.identifier.isbn	978-91-8009-909-7 (print)
dc.identifier.isbn	978-91-8009-910-3 (PDF)
dc.identifier.uri	https://hdl.handle.net/2077/72030
dc.description	Denna avhandling banar väg för att inkludera skratt i dialogsystem på ett domänoberoende och språkvetenskapligt giltigt sätt, med hjälp av verktyg och metoder från datalingvistiken. Avhandlingen behandlar tre huvudsakliga områden. Det första området rör placeringen av skratt i relation till tal och andra beteenden. Vi visar att neurala nätverk kan förutsäga skratt från transkriberad dialog, medan människor är sämre på detta. Sådana modeller gör det möjligt för dialogsystem att förutsäga mänskligt skratt och, vid behov, placera skratt från systemet på lämpliga ställen. Vi undersöker också placeringen av skratt i förhållande till ögonrörelser och visar att skrattets funktion är relaterad till olika ögonrörelsemönster. Dessa resultat har viktiga följder för konversationella agenter och sociala robotar med avseende på generering och samordning av multimodalt beteende. Det andra området rör interaktionen mellan skratt, den kommunikativa avsikten hos användare och system, och den kontext i vilken avsikten realiseras. Vi lägger grunden till den centrala komponenten i ett dialogsystem genom att implementera en teoretiskt motiverad dialoghanterare baserad på linjär logik. Vår dialoghanterare utvidgas sedan till att hantera skratt som utgör eller ackompanjerar återkoppling, och skratt som utgör svar på ja/nej-frågor. Därtill undersöker vi hur skratt kan modifiera eller forma en dialoghandling, samt hur skratt kan förbättra neurala modeller för igenkänning av dialoghandlingar. Det tredje området är humor. Även om humor inte är en förutsättning för skratt så är de nära sammankopplade. Vi undersöker hur humor är relaterad till resonerande om sociala konventioner och andra inlärda och implicita antaganden. Vi visar hur dessa kan modelleras informellt och påvisar därigenom ett sätt på vilket situationsbunden kreativitet i samtal kan realiseras i konversationella agenter.	en
dc.description.abstract	This thesis paves the way for including laughter in spoken dialogue systems in a domain-general and linguistically valid way using computational linguistics tools and methods. The thesis is concerned with three main areas. The first area concerns the placement of laughter in relation to speech and other behaviours. We show that convolutional and recurrent neural networks can effectively predict laughs from dialogue transcripts, whereas human perforance in this task is significantly worse. Such models allow dialogue systems to predict user laughter and, if needed, put system laughter in an appropriate place. Further, we look at laughter placement in relation to gaze and show that laughter performing different pragmatic functions is related to different gaze patterns. These findings provide important implications for embodied conversational agents and social robots in regard to multimodal behaviour realisation and coordination. The second area is concerned with the interaction between laughter and the communicative intent of a user and system, as well as with the context in which the given intent occurs. We lay the groundwork for the central component of a spoken dialogue system by implementing a dialogue manager in a theoretically informed way using a proof-theoretic model based on linear logic. Our dialogue manager is then extended to support laughter functioning as feedback or as a signal accompanying system feedback, and an answer to polar questions. Additionally, we look at how laughter can modify or shape a dialogue act, and how the inclusion of laughter can improve Transformer-based deep learning models in the task of dialogue act recognition. The third area is humour. Even though humour is not necessary for laughter, they are closely related. We look at how humour is related to reasoning about social conventions and other learned and accommodated implicit assumptions. We show how these can be modelled informally, suggesting one way in which the goal of situational and conversational creativity can be realised in artificial agents.	en
dc.language.iso	eng	en
dc.subject	laughter, dialogue systems, humour, machine learning, reasoning, natural language processing, artificial agents	en
dc.title	Who is laughing now? Laughter-infused dialogue systems	en
dc.type	Text
dc.type.svep	Doctoral thesis	eng
dc.gup.mail	vladislav.maraev@gu.se	en
dc.type.degree	Doctor of Philosophy	en
dc.gup.origin	Göteborgs universitet. Humanistiska fakulteten	swe
dc.gup.origin	University of Gothenburg. Faculty of Humanities	eng
dc.gup.department	Department of Philosophy, Linguistics and Theory of Science ; Institutionen för filosofi, lingvistik och vetenskapsteori	en
dc.gup.defenceplace	29 augusti 2022, klockan 15:00 i sal J 439, Humanisten	en
dc.gup.defencedate	2022-08-29
dc.gup.dissdb-fakultet	HF

Files in this item

Name:: Thesis final_8aug.pdf
Size:: 2.353Mb
Format:: PDF
Description:: Thesis, full text

View/Open

Name:: spikblad final.pdf
Size:: 81.58Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Doctoral Theses / Doktorsavhandlingar Institutionen för filosofi, lingvistik och vetenskapsteori

Show simple item record