但是由于有些说话人特征隐藏在较长的语音片段中,通过添加长时信息可能会进一步提高系统的性能。
However, since a. part of speaker characteristics is. hidden in longer speech segments, the performance may be further improved by adding this long -term information.
他们发现语音信号通常都是低沉的,而音乐片段的音调范围却很广泛;二者都会随着时间的推移有不同的变化。
They found that speech signals are normally low-pitched and musical clips have a wide range of pitches; both vary only gradually over time.
定义了一种语言,可用语音用 XML 文档中的片段。
[W3C Recommendation] defines a language that can be used to refer to fragments of an XML document.
应用推荐