Forensic comparison of voices, speech and speakers – Tools and Methods in Forensic Phonetics

Lindh, Jonas

dc.contributor.author	Lindh, Jonas
dc.date.accessioned	2017-05-16T08:47:25Z
dc.date.available	2017-05-16T08:47:25Z
dc.date.issued	2017-05-16
dc.identifier.isbn	978-91-629-0141-7
dc.identifier.isbn	978-91-629-0142-4
dc.identifier.uri	http://hdl.handle.net/2077/52188
dc.description.abstract	This thesis has three main objectives. The first objective (A) includes Study I, which investigates the parameter fundamental frequency (F0) and its robustness in different acoustic contexts by using different measures. The outcome concludes that using the alternative baseline as a measure will diminish the effect of low-quality recordings or varying speaking liveliness. However, both creaky voice and raised vocal effort induce intra-variation problems that are yet to be solved. The second objective (B) includes study II, III and IV. Study II investigates the differences between the results from an ear witness line-up experiment and the pairwise perceptual judgments of voice similarity performed by a large group of listeners. The study shows that humans seem to be much more focused on similarities of speech style than features connected to voice quality, even when recordings are played backwards. Study III investigates the differences between an automatic voice comparison system and humans’ perceptual judgments of voice similarity. The experiments’ results show that it is possible to see a correlation between how speakers were judged as more or less different using multidimensional scaling of similarity ranks compared to both the automatic system and the listeners. However, there are also differences due to the fact that human listeners include information about speech style and have difficulties weighting the parameters, i.e. ignoring them when they are contradictory. Study IV successfully investigates a new functional method for how to convert the perceptual similarity judgments made by humans and then compare those to the automatic system results within the likelihood ratio framework. It was discovered that the automatic system outperformed the naïve human listeners in this task (using a very small dataset). The third objective (C) includes study V. Study V investigates several statistical modelling techniques to calculate relevant likelihood ratios using simulations based on existing reference data in an authentic forensic case of a disputed utterance. The study presents several problems with modelling small datasets and develops methods to take into account the lack of data within the likelihood ratio framework. In summary, the thesis contains a larger historical background to forensic speaker comparison to guide the reader into the current research situation within forensic phonetics. The work further seeks to build a bridge between forensic phonetics and automatic voice recognition. Practical casework implications have been considered throughout the work on the basis of own experience as a forensic caseworker and through collaborative interaction with other parties working in the field, both in research and in forensic practice and law enforcement. Since 2005, the author has been involved in over 400 forensic cases and given testimony in several countries.	sv
dc.language.iso	eng	sv
dc.relation.haspart	Lindh, J. & Eriksson, A. (2007). Robustness of long time measures of fundamental frequency. In INTERSPEECH (pp. 2025-2028).	sv
dc.relation.haspart	Lindh, J. (2009). Perception of voice similarity and the results of a voice line-up. In Proceedings from FONETIK (pp. 186-189).	sv
dc.relation.haspart	Lindh, J. & Eriksson, A. (2010). Voice similarity-a comparison between judgements by human listeners and automatic voice comparison. In Proceedings from FONETIK (pp. 63-69).	sv
dc.relation.haspart	Lindh, J. & Morrison, G. S. (2011). Humans versus machine: forensic voice comparison on a small database of Swedish voice recordings. In Proceedings of ICPhS (Vol. 17, pp. 1254-1257).	sv
dc.relation.haspart	Morrison, G. S., Lindh, J. & Curran, J. M. (2014). Likelihood ratio calculation for a disputed-utterance analysis with limited available data. Speech Communication, 58, pp. 81-90. ::doi::10.1016/j.specom.2013.11.004	sv
dc.subject	forensic phonetics	sv
dc.subject	automatic voice recognition	sv
dc.subject	disputed utterance	sv
dc.subject	speech	sv
dc.subject	language technology	sv
dc.subject	phonetics	sv
dc.title	Forensic comparison of voices, speech and speakers – Tools and Methods in Forensic Phonetics	sv
dc.title.alternative	Tools and Methods in Forensic Phonetics	sv
dc.type	Text
dc.type.svep	Doctoral thesis	eng
dc.gup.mail	jonas.lindh@gu.se	sv
dc.type.degree	Doctor of Philosophy	sv
dc.gup.origin	Göteborgs universitet. Humanistiska fakulteten	swe
dc.gup.origin	University of Gothenburg. Faculty of Arts	eng
dc.gup.department	Department of Philosophy, Linguistics and Theory of Science ; Institutionen för filosofi, lingvistik och vetenskapsteori	sv
dc.gup.defenceplace	Onsdagen den 7 juni 2017, kl. 10.00, T307, Olof Wijksgatan 6	sv
dc.gup.defencedate	2017-06-07
dc.gup.dissdb-fakultet	HF

Files in this item

Name:: gupea_2077_52188_2.pdf
Size:: 70.90Kb
Format:: PDF
Description:: spikblad

View/Open

Name:: gupea_2077_52188_4.pdf
Size:: 2.685Mb
Format:: PDF
Description:: Thesis frame

View/Open

This item appears in the following Collection(s)

Doctoral Theses / Doktorsavhandlingar Institutionen för filosofi, lingvistik och vetenskapsteori
Doctoral Theses from University of Gothenburg / Doktorsavhandlingar från Göteborgs universitet

Show simple item record