
Exploring natural language processing for single-word and multi-word lexical complexity from a second language learner perspective


Please use this identifier to cite or link to this item: http://hdl.handle.net/2077/66861

Files in This Item:

File                      Description   Size      Format
gupea_2077_66861_2.pdf    Cover         962 Kb    Adobe PDF
gupea_2077_66861_3.pdf    Abstract      85 Kb     Adobe PDF
gupea_2077_66861_4.pdf    Thesis        1684 Kb   Adobe PDF
Title: Exploring natural language processing for single-word and multi-word lexical complexity from a second language learner perspective
Authors: Alfter, David
E-mail: alfter.david@gmx.net
Issue Date: 9-Feb-2021
University: Göteborgs universitet. Humanistiska fakulteten
University of Gothenburg. Faculty of Humanities
Institution: Department of Swedish; Institutionen för svenska språket
Parts of work: Alfter, David and Yuri Bizzoni and Anders Agebjörn and Elena Volodina and Ildikó Pilán 2016. From distributions to labels: A lexical proficiency analysis using learner corpora. Proceedings of the joint workshop on NLP for Computer Assisted Language Learning and NLP for Language Acquisition at SLTC, Umeå, 16th November 2016 (No. 130, pp. 1-7). Linköping University Electronic Press.

Alfter, David and Elena Volodina 2018. Towards single word lexical complexity prediction. Proceedings of the thirteenth workshop on innovative use of NLP for building educational applications (pp. 79-88).

Alfter, David and Ildikó Pilán 2018. SB@GU at the Complex Word Identification 2018 Shared Task. Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications (pp. 315-321).

Alfter, David and Therese Lindström Tiedemann and Elena Volodina 2020. Crowdsourcing Relative Rankings of Multi-Word Expressions: Experts versus Non-Experts. Northern European Journal of Language Technology.

Alfter, David and Therese Lindström Tiedemann and Elena Volodina 2019. LEGATO: A flexible lexicographic annotation tool. In NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30-October 2, Turku, Finland (No. 167, pp. 382-388). Linköping University Electronic Press.

Graën, Johannes and David Alfter and Gerold Schneider 2020. Using Multilingual Resources to Evaluate CEFRLex for Learner Applications. Proceedings of The 12th Language Resources and Evaluation Conference (pp. 346-355).

Alfter, David and Lars Borin and Ildikó Pilán and Therese Lindström Tiedemann and Elena Volodina 2019. Lärka: From Language Learning Platform to Infrastructure for Research on Language Learning. In Selected papers from the CLARIN Annual Conference 2018, Pisa, 8-10 October 2018 (No. 159, pp. 1-14). Linköping University Electronic Press.

Alfter, David and Johannes Graën 2019. Interconnecting lexical resources and word alignment: How do learners get on with particle verbs? In Proceedings of the 22nd Nordic Conference on Computational Linguistics (pp. 321-326).
Date of Defence: 2021-03-02
Disputation: Tuesday, 2 March 2021, 13:00, lecture hall J309, Humanisten, Renströmsgatan
Degree: Doctor of Philosophy
Publication type: Doctoral thesis
Series/Report no.: Data linguistica 31
Keywords: natural language processing; lexical complexity; CEFR; second language learning; machine learning; crowdsourcing
Abstract: In this thesis, we investigate how natural language processing (NLP) tools and techniques can be applied to vocabulary aimed at second language learners of Swedish in order to classify vocabulary items into proficiency levels suitable for learners at different stages. In the first part, we use feature engineering to represent words as vectors and feed these vectors into machine learning algorithms in order to (1) learn CEFR labels from the input data and (2) predict the CEFR level...
ISBN: 978-91-87850-79-0
ISSN: 0347-948X
URI: http://hdl.handle.net/2077/66861
Appears in Collections: Doctoral Theses from University of Gothenburg / Doktorsavhandlingar från Göteborgs universitet
Doctoral Theses / Doktorsavhandlingar Institutionen för svenska språket