基于情感基音模板的情感语音合成

来源期刊:中南大学学报(自然科学版)2010年第6期

论文作者:陈明义 党培霞

文章页码:2258 - 2263

关键词:情感语音合成;情感基音模板;基音同步叠加算法;韵律参数

Key words:emotional speech synthesis; emotional pitch template; pitch synchronous overlap algorithm (PSOLA); prosodic parameters

摘    要:为了合成能够模拟表达说话人的情感状态的语音,提出一种基于情感基音模板的情感语音合成方法。该方法分别建立高兴、愤怒、悲伤和中立4种不同情感下的韵母基音模板库,建立4种声调模型,统计分析语音库中情感语音的韵律特征参数,运用基音同步叠加算法(PSOLA)合成含情感色彩的语音。实验以音节为合成单位,根据情感特征参数的统计分析结果调节合成语音的韵律特征,合成各种情感的语音。仿真实验结果表明:用情感基音模板合成的目标情感语音具有目标情感的音质色彩,再通过韵律参数调节,可合成较理想的情感语音。该方法可用于增加语音合成系统的智能化,提高人机交互的能力。

Abstract: In order to synthesize the speech which can express the speaker’s emotional state, a method of emotional speech synthesis based on the emotional pitch template was presented. By the method, happy, angry, sad and neutral vowel pitch template libraries were established, and four kinds of tone model were also established, the prosody characteristic parameters of the emotional speech were analyzed, and pitch synchronous overlap algorithm (PSOLA) to synthesis speech with emotional colors was used. Using the syllable as the synthetic unit, the prosodic parameters of the synthetic speech were adjusted according to the statistical analysis of the prosodic parameters to synthesize various emotional speech. Simulation results show that with the same prosodic parameters, the emotional speech synthesized with the targeted emotional pitch template has the tone color of the targeted emotion. After the adjustment of prosodic parameters, the ideal emotional speech can be gotten. The method can be used to increase the intelligence of speech synthesis system and improve the capabilities of human-computer interaction.

有色金属在线官网  |   会议  |   在线投稿  |   购买纸书  |   科技图书馆

中南大学出版社 技术支持 版权声明   电话:0731-88830515 88830516   传真:0731-88710482   Email:administrator@cnnmol.com

互联网出版许可证:(署)网出证(京)字第342号   京ICP备17050991号-6      京公网安备11010802042557号