Neural Tracking Measures of Speech Intelligibility: Manipulating Intelligibility while Keeping Acoustics Unchanged

I M Dushyanthi Karunathilake; Joshua P Kulasingham; Jonathan Z Simon

doi:10.1101/2023.05.18.541269

Neural Tracking Measures of Speech Intelligibility: Manipulating Intelligibility while Keeping Acoustics Unchanged

bioRxiv [Preprint]. 2023 Oct 9:2023.05.18.541269. doi: 10.1101/2023.05.18.541269.

Authors

I M Dushyanthi Karunathilake¹, Joshua P Kulasingham², Jonathan Z Simon^{1

3

4}

Affiliations

¹ Department of Electrical and Computer Engineering, University of Maryland, College Park, MD, 20742, USA.
² Department of Electrical Engineering, Linköping University, SE.
³ Department of Biology, University of Maryland, College Park, MD 20742, USA.
⁴ Institute for Systems Research, University of Maryland, College Park, MD 20742, USA.

Abstract

Neural speech tracking has advanced our understanding of how our brains rapidly map an acoustic speech signal onto linguistic representations and ultimately meaning. It remains unclear, however, how speech intelligibility is related to the corresponding neural responses. Many studies addressing this question vary the level of intelligibility by manipulating the acoustic waveform, but this makes it difficult to cleanly disentangle effects of intelligibility from underlying acoustical confounds. Here, using magnetoencephalography (MEG) recordings, we study neural measures of speech intelligibility by manipulating intelligibility while keeping the acoustics strictly unchanged. Acoustically identical degraded speech stimuli (three-band noise vocoded, ~20 s duration) are presented twice, but the second presentation is preceded by the original (non-degraded) version of the speech. This intermediate priming, which generates a 'pop-out' percept, substantially improves the intelligibility of the second degraded speech passage. We investigate how intelligibility and acoustical structure affects acoustic and linguistic neural representations using multivariate Temporal Response Functions (mTRFs). As expected, behavioral results confirm that perceived speech clarity is improved by priming. TRF analysis reveals that auditory (speech envelope and envelope onset) neural representations are not affected by priming, but only by the acoustics of the stimuli (bottom-up driven). Critically, our findings suggest that segmentation of sounds into words emerges with better speech intelligibility, and most strongly at the later (~400 ms latency) word processing stage, in prefrontal cortex (PFC), in line with engagement of top-down mechanisms associated with priming. Taken together, our results show that word representations may provide some objective measures of speech comprehension.

Keywords: MEG; Speech Intelligibility; TRF; neural tracking; vocoded speech.

Publication types

Preprint

Grants and funding

R01 DC019394/DC/NIDCD NIH HHS/United States