Perceptual normalization for speaking rate occurs below the level of the syllable

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA(2023)

引用 0|浏览0
暂无评分
摘要
Because speaking rates are highly variable, listeners must use cues like phoneme or sentence duration to normalize speech across different contexts. Scaling speech perception in this way allows listeners to distinguish between temporal contrasts, like voiced and voiceless stops, even at different speech speeds. It has long been assumed that this speaking rate normalization can occur over small units such as phonemes. However, phonemes lack clear boundaries in running speech, so it is not clear that listeners can rely on them for normalization. To evaluate this, we isolate two potential processing levels for speaking rate normalization & mdash;syllabic and sub-syllabic & mdash;by manipulating phoneme duration in order to cue speaking rate, while also holding syllable duration constant. In doing so, we show that changing the duration of phonemes both with unique spectro-temporal signatures (/kA/) and more overlapping spectro-temporal signatures (/wI/) results in a speaking rate normalization effect. These results suggest that when acoustic boundaries within syllables are less clear, listeners can normalize for rate differences on the basis of sub syllabic units. VC 2023 Acoustical Society of America.
更多
查看译文
关键词
perceptual normalization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要