Show simple item record

dc.contributor.authorPilán, Ildikó
dc.date.accessioned2018-05-17T11:08:19Z
dc.date.available2018-05-17T11:08:19Z
dc.date.issued2018-05-17
dc.identifier.isbn978-91-87850-68-4
dc.identifier.issn0347-948X
dc.identifier.otherhttp://hdl.handle.net/2077/55895
dc.identifier.urihttp://hdl.handle.net/2077/55895
dc.description.abstractWith the ever-growing presence of electronic devices in our everyday lives, it is compelling to investigate how technology can contribute to make our language learning process more efficient and enjoyable. A fundamental piece in this puzzle is the ability to measure the complexity of the language that learners are able to deal with and produce at different stages of their progress. In this thesis work, we explore automatic approaches for modeling linguistic complexity at different levels of learning Swedish as a second and foreign language (L2). For these purposes, we employ natural language processing techniques to extract linguistic features and combine them with machine learning methods. We study linguistic complexity in two types of L2 texts: those written by experts for learners and those produced by learners themselves. Moreover, we investigate this type of data-driven analysis for the smaller unit of sentences. Automatic proficiency level prediction has a number of application potentials for the field of Intelligent Computer-Assisted Language Learning, out of which we investigate two directions. Firstly, this can facilitate locating learning materials suitable for L2 learners from corpora, which are valuable and easily accessible examples of authentic language use. We propose a framework for selecting sentences suitable as exercise items which, besides linguistic complexity, encompasses a number of additional criteria such as well-formedness and independence from a larger textual context. An empirical evaluation of the system implemented using these criteria indicated its usefulness in an L2 instructional setting. Secondly, linguistic complexity analysis enables the automatic evaluation of L2 texts which, besides being helpful for preparing learning materials, can also be employed for assessing learners' writing. We show that models trained partly or entirely on reading texts can effectively predict the proficiency level of learner essays, especially if some learner errors are automatically corrected in a pre-processing step. Both the sentence selection and the L2 text evaluation systems have been made freely available on an online learning platform.sv
dc.language.isoengsv
dc.relation.ispartofseriesData Linguisticasv
dc.relation.ispartofseries29sv
dc.relation.haspartPilán, Ildikó, Sowmya Vajjala and Elena Volodina 2016. A readable read: automatic assessment of language learning materials based on linguistic complexity. International Journal of Computational Linguistics and Applications (IJLCA) 7 (1): 143–159.sv
dc.relation.haspartPilán, Ildikó 2016. Detecting Context Dependence in Exercise Item Candidates Selected from Corpora. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications (BEA), 151–161.sv
dc.relation.haspartPilán, Ildikó, Elena Volodina and Lars Borin 2017. Candidate sentence selection for language learning exercises: from a comprehensive framework to an empirical evaluation. Traitement Automatique des Langues (TAL) Journal, Special issue on NLP for learning and teaching 57 (3): 67–91.sv
dc.relation.haspartPilán, Ildikó, Elena Volodina and Torsten Zesch 2016. Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks. Proceedings of the 26th International Conference on Computational Linguistics (COLING), 2101–2111.sv
dc.relation.haspartPilán, Ildikó, David Alfter and Elena Volodina 2016. Coursebook texts as a helping hand for classifying linguistic complexity in language learners’ writings. Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC), 120–126.sv
dc.relation.haspartPilán, Ildikó and Elena Volodina. Investigating the importance of linguistic complexity features across different datasets related to language learning. Submitted.sv
dc.subjectnatural language processingsv
dc.subjectlinguistic complexitysv
dc.subjectreadabilitysv
dc.subjectCEFRsv
dc.subjectsecond language learningsv
dc.subjectcorpus examplessv
dc.subjecttext classificationsv
dc.subjectmachine learningsv
dc.subjectdomain adaptationsv
dc.titleAutomatic proficiency level prediction for Intelligent Computer-Assisted Language Learningsv
dc.typeText
dc.type.svepDoctoral thesiseng
dc.gup.mailildiko.pilan@gmail.comsv
dc.type.degreeDoctor of Philosophysv
dc.gup.originGöteborgs universitet. Humanistiska fakultetenswe
dc.gup.originUniversity of Gothenburg. Faculty of Artseng
dc.gup.departmentDepartment of Swedish ; Institutionen för svenska språketsv
dc.gup.defenceplace13.15, Stora hörsalen (2150), Eklandagatan 86sv
dc.gup.defencedate2018-06-14
dc.gup.dissdb-fakultetHF


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record