Linguists call this "prosody," the ability to add correct stress, intonation or sentiment to spoken language.
语言学家称之为“韵律”,也就是在口语中正确重读、添加语调或情绪的能力。
Restricted by prosody hierarchy and disturbed by tone and intonation, it is a hard task to detect the stress of Chinese speech automatically.
汉语的重音由于受到声调、语调以及韵律单元层级的干扰和制约,对于重音的自动感知一直是比较困难的问题。
The prosody prediction is to estimate the intonation, rhythm, stress placement and timing.
韵律预测可准确估计合成语音的语调、节奏、重音的位置和时长信息等。
应用推荐