Naïve listeners' prominence and boundary perception

Yoonsook Mo, Jennifer Cole, Eun Kyung Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

37 Citations (Scopus)

Abstract

This paper examines how ordinary listeners, naïve with respect to the phonetics and phonology of prosody, perceive the location of prosodic boundaries that demarcate speech "chunks" and prominences that serve a "highlighting" function, in spontaneous speech (Buckeye corpus). Over 70 naïve listeners marked the locations of prominences and boundaries in a real-time transcription task. Fleiss' multitranscribers' reliability tests show that naïve transcribers are consistent in their perception of prosodic boundaries and prominences. Specifically, we observe higher multi-transcriber agreement scores for boundary marking than for prominence marking. Variation between transcriptions of the same speech excerpt produced by different listeners reveals individual differences in the perception of prominences and boundaries. Variation in Fleiss' multi-transcribers' agreement scores for excerpts from different speakers suggests that speakers vary in how they structure an utterance prosodically and/or in how effectively they cue prosodic structure. We also find that nuclear prominences are more consistently perceived by naïve listeners than prenuclear prominences. The finding that naïve listeners agree well above chance on the location of prosodic events indicates that naïve transcription is a valid method for prosody analysis which can augment analysis based solely on expert labeling.

Original languageEnglish
Title of host publicationProceedings of the 4th International Conference on Speech Prosody, SP 2008
PublisherInternational Speech Communications Association
Pages735-738
Number of pages4
ISBN (Print)9780616220030
Publication statusPublished - 2008 Jan 1
Event4th International Conference on Speech Prosody 2008, SP 2008 - Campinas, Brazil
Duration: 2008 May 62008 May 9

Publication series

NameProceedings of the 4th International Conference on Speech Prosody, SP 2008

Other

Other4th International Conference on Speech Prosody 2008, SP 2008
CountryBrazil
CityCampinas
Period08/5/608/5/9

Fingerprint

Transcription
Speech analysis
Labeling
Nave
Listeners

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Computer Vision and Pattern Recognition
  • Human-Computer Interaction
  • Software
  • Mechanical Engineering

Cite this

Mo, Y., Cole, J., & Lee, E. K. (2008). Naïve listeners' prominence and boundary perception. In Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (pp. 735-738). (Proceedings of the 4th International Conference on Speech Prosody, SP 2008). International Speech Communications Association.
Mo, Yoonsook ; Cole, Jennifer ; Lee, Eun Kyung. / Naïve listeners' prominence and boundary perception. Proceedings of the 4th International Conference on Speech Prosody, SP 2008. International Speech Communications Association, 2008. pp. 735-738 (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).
@inproceedings{f61294e124a6454b9d10403a13f9922e,
title = "Na{\"i}ve listeners' prominence and boundary perception",
abstract = "This paper examines how ordinary listeners, na{\"i}ve with respect to the phonetics and phonology of prosody, perceive the location of prosodic boundaries that demarcate speech {"}chunks{"} and prominences that serve a {"}highlighting{"} function, in spontaneous speech (Buckeye corpus). Over 70 na{\"i}ve listeners marked the locations of prominences and boundaries in a real-time transcription task. Fleiss' multitranscribers' reliability tests show that na{\"i}ve transcribers are consistent in their perception of prosodic boundaries and prominences. Specifically, we observe higher multi-transcriber agreement scores for boundary marking than for prominence marking. Variation between transcriptions of the same speech excerpt produced by different listeners reveals individual differences in the perception of prominences and boundaries. Variation in Fleiss' multi-transcribers' agreement scores for excerpts from different speakers suggests that speakers vary in how they structure an utterance prosodically and/or in how effectively they cue prosodic structure. We also find that nuclear prominences are more consistently perceived by na{\"i}ve listeners than prenuclear prominences. The finding that na{\"i}ve listeners agree well above chance on the location of prosodic events indicates that na{\"i}ve transcription is a valid method for prosody analysis which can augment analysis based solely on expert labeling.",
author = "Yoonsook Mo and Jennifer Cole and Lee, {Eun Kyung}",
year = "2008",
month = "1",
day = "1",
language = "English",
isbn = "9780616220030",
series = "Proceedings of the 4th International Conference on Speech Prosody, SP 2008",
publisher = "International Speech Communications Association",
pages = "735--738",
booktitle = "Proceedings of the 4th International Conference on Speech Prosody, SP 2008",

}

Mo, Y, Cole, J & Lee, EK 2008, Naïve listeners' prominence and boundary perception. in Proceedings of the 4th International Conference on Speech Prosody, SP 2008. Proceedings of the 4th International Conference on Speech Prosody, SP 2008, International Speech Communications Association, pp. 735-738, 4th International Conference on Speech Prosody 2008, SP 2008, Campinas, Brazil, 08/5/6.

Naïve listeners' prominence and boundary perception. / Mo, Yoonsook; Cole, Jennifer; Lee, Eun Kyung.

Proceedings of the 4th International Conference on Speech Prosody, SP 2008. International Speech Communications Association, 2008. p. 735-738 (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Naïve listeners' prominence and boundary perception

AU - Mo, Yoonsook

AU - Cole, Jennifer

AU - Lee, Eun Kyung

PY - 2008/1/1

Y1 - 2008/1/1

N2 - This paper examines how ordinary listeners, naïve with respect to the phonetics and phonology of prosody, perceive the location of prosodic boundaries that demarcate speech "chunks" and prominences that serve a "highlighting" function, in spontaneous speech (Buckeye corpus). Over 70 naïve listeners marked the locations of prominences and boundaries in a real-time transcription task. Fleiss' multitranscribers' reliability tests show that naïve transcribers are consistent in their perception of prosodic boundaries and prominences. Specifically, we observe higher multi-transcriber agreement scores for boundary marking than for prominence marking. Variation between transcriptions of the same speech excerpt produced by different listeners reveals individual differences in the perception of prominences and boundaries. Variation in Fleiss' multi-transcribers' agreement scores for excerpts from different speakers suggests that speakers vary in how they structure an utterance prosodically and/or in how effectively they cue prosodic structure. We also find that nuclear prominences are more consistently perceived by naïve listeners than prenuclear prominences. The finding that naïve listeners agree well above chance on the location of prosodic events indicates that naïve transcription is a valid method for prosody analysis which can augment analysis based solely on expert labeling.

AB - This paper examines how ordinary listeners, naïve with respect to the phonetics and phonology of prosody, perceive the location of prosodic boundaries that demarcate speech "chunks" and prominences that serve a "highlighting" function, in spontaneous speech (Buckeye corpus). Over 70 naïve listeners marked the locations of prominences and boundaries in a real-time transcription task. Fleiss' multitranscribers' reliability tests show that naïve transcribers are consistent in their perception of prosodic boundaries and prominences. Specifically, we observe higher multi-transcriber agreement scores for boundary marking than for prominence marking. Variation between transcriptions of the same speech excerpt produced by different listeners reveals individual differences in the perception of prominences and boundaries. Variation in Fleiss' multi-transcribers' agreement scores for excerpts from different speakers suggests that speakers vary in how they structure an utterance prosodically and/or in how effectively they cue prosodic structure. We also find that nuclear prominences are more consistently perceived by naïve listeners than prenuclear prominences. The finding that naïve listeners agree well above chance on the location of prosodic events indicates that naïve transcription is a valid method for prosody analysis which can augment analysis based solely on expert labeling.

UR - http://www.scopus.com/inward/record.url?scp=84874894841&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84874894841&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84874894841

SN - 9780616220030

T3 - Proceedings of the 4th International Conference on Speech Prosody, SP 2008

SP - 735

EP - 738

BT - Proceedings of the 4th International Conference on Speech Prosody, SP 2008

PB - International Speech Communications Association

ER -

Mo Y, Cole J, Lee EK. Naïve listeners' prominence and boundary perception. In Proceedings of the 4th International Conference on Speech Prosody, SP 2008. International Speech Communications Association. 2008. p. 735-738. (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).