And it presents the method of Chinese name recognition without text segmentation.
提出了在不作分词处理的原始文本中进行中文姓名识别的方法。
参考来源 - 基于语料库的中文姓名识别方法研究 in CThis thesis puts forward two kinds of arithmetic resolving the problem about compellation recognition in automatic division of Japanese. One is based on the rule and the other is based on the combination of the rule and the statistics.
本文提出了基于规则及基于规则与统计相结合两种解决日文自动分词中姓名识别问题的算法,并具体应用于实践,最终确定基于规则与统计相结合的日文姓名识别算法可实现较高的精确率和召回率,并具有较大的灵活性。
参考来源 - 日文文节切分中的姓名识别·2,447,543篇论文数据,部分数据来源于NoteExpress
该方法是完全数据驱动的,不需要姓名识别模板和规则。
为了给出更易于理解的解释,我们把这个机制置于中文姓名识别的应用。
In order to give a more comprehensible explanation, we put such a mechanism into the application of Chinese name recognition.
对于姓名、地址和产品描述等文本数据,查看可变的数据格式是识别包含多个域的字段的关键。
For text data such as name, address, and product descriptions, a review of the varying data formats is critical to identify fields containing multiple domains.
应用推荐