tokenization and text normalization 语言符号化和文本一般化
Text normalization is a procedure to generate information, such as pronunciation, rhythm and so on, for special symbols correctly.
文本标准化是对输入文本进行分析 ,生成其中非汉字符号的拼音、节奏等信息的过程。
参考来源 - 中文语音合成系统中的文本标准化方法 in C·2,447,543篇论文数据,部分数据来源于NoteExpress
以上来源于: WordNet
Chinese text normalization is the process of transforming non-Chinese character strings into their corresponding Chinese character strings to determine their pronunciations.
中文文本正则化是把非汉字字符串转化为汉字串以确定其读音的过程。
The normalization checking attribute, if set to on, will normalize the input text if necessary.
如果打开了规范化检查属性,那么会在必要时对输入文本进行规范化。
应用推荐