OCR systems typically use a scanner to acquire the original image of a form document, which is then passed to a computer for layout analysis and character recognition.
An object-oriented method for layout analysis of form images is proposed. It introduces the concept of attributed relational graphs to describe the structure of the form layout and to express the complex, high-dimensional relationships among the components of the form.
Finally, the results of the data analysis are returned to the user, via browser or email, in the form of tables, text, or images.
After analyzing the strengths and weaknesses of automatic information-entry systems for form documents developed in China and abroad, this paper uses a contact image sensor (CIS) to capture the original image signal of the form document. The signal is processed in hardware, yielding a high-quality binary image.
This paper introduces the concepts of simple regions and frame lines and builds a complete model for table layout structure analysis, which automatically analyzes the layout structure of the various kinds of tables found in ordinary document images. A systematic table identification mechanism is also presented.