Optimal combination of neural temporal envelope and fine structure cues to explain speech identification in background noise

  • Il Joon Moon
  • , Jong Ho Won
  • , Min Hyun Park
  • , D. Timothy Ives
  • , Kaibao Nie
  • , Michael G. Heinz
  • , Christian Lorenzi
  • , Jay T. Rubinstein

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

The dichotomy between acoustic temporal envelope (ENV) and fine structure (TFS) cues has stimulated numerous studies over the past decade to understand the relative role of acousticENVand TFS in human speech perception. Such acoustic temporal speech cues produce distinct neural discharge patterns at the level of the auditory nerve, yet little isknownabout the central neural mechanisms underlying the dichotomy in speech perception between neural ENV and TFS cues. We explored the question of how the peripheral auditory system encodes neural ENV and TFS cues in steady or fluctuating background noise, and how the central auditory system combines these forms of neural information for speech identification. We sought to address this question by (1) measuring sentence identification in background noise for human subjects as a function of the degree of available acoustic TFS information and (2) examining the optimal combination of neuralENVand TFS cues to explainhumanspeech perception performance using computational models of the peripheral auditory system and central neural observers. Speech-identification performance by human subjects decreased as the acoustic TFS information was degraded in the speech signals. The model predictions best matched human performance when a greater emphasis was placed on neural ENV coding rather than neural TFS. However, neural TFS cues were necessary to account for the full effect of background-noise modulations on human speech-identification performance.

Original languageEnglish
Pages (from-to)12145-12154
Number of pages10
JournalJournal of Neuroscience
Volume34
Issue number36
DOIs
StatePublished - 3 Sep 2014

Keywords

  • Computational model
  • Neural mechanism
  • Speech perception
  • Temporal cues

Fingerprint

Dive into the research topics of 'Optimal combination of neural temporal envelope and fine structure cues to explain speech identification in background noise'. Together they form a unique fingerprint.

Cite this