论文标题
对听力受损的听众感知的语音质量的阶段失真调查
Investigation of Phase Distortion on Perceived Speech Quality for Hearing-impaired Listeners
论文作者
论文摘要
阶段是影响质量和清晰度的语音的关键组成部分。当前的语音增强算法开始解决相扭曲,但该算法集中于正常听众(NH)听众。尚不清楚相位增强是否有益于听力受损(HI)听众。我们通过听力研究调查了阶段失真对语音质量的影响,在该研究中,NH和HI听众使用Mushra程序提供了语音质量评分。在一组条件下,语音与4个不同的信噪比(SNRS)的Babble噪声从-5到10 dB混合。在另一组条件下,SNR固定为10 dB,嘈杂的语音在模拟的回响室中呈现,T60S的范围为100至1000毫秒。语音水平保持在65 dB的SPL中,用于NH听众,并将放大器用于HI听众以确保可听性。理想比率掩盖(IRM)用于模拟语音增强。两个客观指标(即PESQ和HASQI)用于比较主观和客观评分。结果表明,相失真对两组的感知质量产生负面影响,而PESQ与人类评分更加紧密相关。
Phase serves as a critical component of speech that influences the quality and intelligibility. Current speech enhancement algorithms are beginning to address phase distortions, but the algorithms focus on normal-hearing (NH) listeners. It is not clear whether phase enhancement is beneficial for hearing-impaired (HI) listeners. We investigated the influence of phase distortion on speech quality through a listening study, in which NH and HI listeners provided speech-quality ratings using the MUSHRA procedure. In one set of conditions, the speech was mixed with babble noise at 4 different signal-to-noise ratios (SNRs) from -5 to 10 dB. In another set of conditions, the SNR was fixed at 10 dB and the noisy speech was presented in a simulated reverberant room with T60s ranging from 100 to 1000 ms. The speech level was kept at 65 dB SPL for NH listeners and amplification was applied for HI listeners to ensure audibility. Ideal ratio masking (IRM) was used to simulate speech enhancement. Two objective metrics (i.e., PESQ and HASQI) were utilized to compare subjective and objective ratings. Results indicate that phase distortion has a negative impact on perceived quality for both groups and PESQ is more closely correlated with human ratings.