For the document approach we can come at microformats from the structured text side.
对于文档方面,我们可以从结构化文本开始接近微格式。
The design and implementation of Structured Text Retrieval System is introduced in this paper.
该文介绍了一个应用于结构化文本的检索系统的设计和实现。
But it sets up the remaining chapters. This chapter covers working with XML, HTML, and structured text in general.
但是这一章开启了后面的章节,概述的是使用XML(可扩展标记语言),HTML(超文本标记语言)以及组织好的文本来工作。
Structured text formats come down firmly on the content side, while YAML and JSON come down firmly on the data side.
结构化文本格式紧靠在内容一端,而YAML和JSON 则在数据一端。
Companies may also have a need to analyze semi-structured text (such as XML content) or other data types (such as audio and video).
公司还可能需要分析半结构化文本(比如XML内容)或其他数据类型(比如音频和视频)。
Amberfish is general purpose text retrieval software. It supports nested queries of semi-structured text in XML format and traditional unstructured searching.
格式的半结构文本的嵌入式查询以及传统的机构型的搜索。
Further compounding the issue is that a lot of the information needing to be processed is either unstructured or semi-structured text, which is difficult to query.
更为复杂的问题是,大量需要处理的数据是非结构化或者半结构化的,这就更难查询了。
Set the default starting HTML header level for structured text documents. The default is 3, which implies that top-level headers will be created with an H3 HTML tag.
设置结构化文本文档HTML标题级别。默认为3,这就意味着最高级别的标题会被打上H3 HTML标签。
More readily, you can send text formats that are simpler than XML: TAB - or comma-delimited lists, Markdown or other lightly structured text, YAML, or JSON alternatives to XML.
更易于为人接受的方法是,发送比XML简单的文本格式:由制表符或逗号分隔的列表、Markdown或其他结构化程度较低的文本、yaml或json。
While we'll use JSON to approach from the data end of the document-to-data spectrum, you can use structured text formats to handle microformats with document-colored glasses on.
虽然JSON的应用靠近文档-数据谱系中的数据一端,也可用结构文本格式处理文档色彩较浓的微格式。
Furthermore, the structured text from the company's surveys specifies the plane's route and tail number, day of flight, customer seat, and the pilot and crew that worked that flight.
此外,现在捷蓝航空公司结构化的调查报告具体陈列了飞机路线和机尾编号、飞行日期、座位编号以及相应航班的飞行员和机组人员。
Chapter 3 mainly researches the method of converting semi-structured emails to structured text data in mail preprocessing, especially the method of recognizing the potential characters of emails, etc.
第三章重点研究了在邮件预处理方面将半结构化的电子邮件转化为结构化的文本数据方法,特别是电子邮件潜在特征词的识别方法等。
The processing, searching, and displaying of data is a simple process compared to the free text or structured document searches.
与自由文本或结构化文档搜索比较而言,处理、搜索和显示数据是一个简单的过程。
By strong, I mean they must be able to extract actionable information from both structured data, such as databases and Web pages, and unstructured data, such as text, audio, and video.
我所谓的强大是指,那些解决方案必须能够从结构化数据(例如数据库和网页)和非结构化数据(例如文本、音频和视频)中提取可操作的信息。
The next section gives you a step-by-step example for this kind of integration: text analysis is used to extract structured information from a database table containing unstructured information.
下一小节将针对此类集成给出一个逐步指导示例:文本分析被用于从包含非结构化信息的数据库表中提取结构化信息。
Some browsers, though, might show the returned text in a bit more structured manner.
不过,一些浏览器可能以更结构化的方式显示已返回的文本。
For illustration, I built a DB2 structured database from a subset of the IMDB content, and included the trivia as text fields in this database.
为了进行说明,我使用IMDB内容的子集构建了一个DB2结构化数据库,将这些传记信息作为文本字段保存在数据库中。
We could include other evidence for a connection by either including additional structured data, such as database tables that show which people worked together on movies, or by deeper text analysis.
还可以包含其他证据,这可以通过包含其他结构化数据(比如用数据库表记录哪些人为同一部电影工作过),或者通过进行更深入的文本分析。
In a scenario where you only need to extract a single value from a response document, it can be more convenient to "cheat," treating the XML as a string of text rather than a structured document.
在只需要从响应文档中提取单一值的场景中,“欺骗性”地把XML当作文本字符串,而不把它当作结构化的文档对待,会更方便。
Figure 3 depicts a scenario where concepts in free-form text are first annotated and later written to a database table together with existing structured information.
在图3所示的场景中,先给形式自由的文本中的概念加注解,然后把它们与现有的结构化信息一起写到一个数据库表中。
In a mind map, as opposed to traditional note taking or a linear text, information is structured in a way that resembles much more closely how your brain actually works.
与传统的记笔记或者直线型文字不同,思维导图里的信息就像大脑实际思考过程一样进行组织。
So we consider a text as a structured entity, or perhaps as an entity which is structured and yet at the same time that's the case with Roland Barthes.
所以,我们把原文视为一个结构上的实体,或者是作为一个,有结构上的实体同时,这就是罗兰,巴特的例子。
You have learned how to setup the UIMA development environment and how to create your own annotator and use it in InfoSphere Warehouse to extract structured information from text input.
您了解了如何设置UIMA开发环境,如何创建自己的注释器,以及在InfoSphere Warehouse中使用定制注释器从文本输入提取结构化信息。
Text operators can then extract structured information from text columns and add them to the output as new columns containing found concepts like names, skills, dates etc..
然后,Text操作器可以从文本列中提取结构化信息,把它们作为新列(其中包含找到的姓名、技能、日期等概念)添加到输出中。
Yacc is a grammar parser; it reads text and can be used to turn a sequence of words into a structured format for processing.
Yacc是一种语法分析器,它可以读取文本并用来将单词序列转换为便于处理的结构化的格式。
Developers can feed Placemaker any kind of structured and unstructured data, including feeds and web pages, and the app will analyze the text and extract location data from it.
开发者可以向Placemaker输入任何形式的结构化和非结构化数据,包括频道、网页,程序会分析文本并从中提取位置数据。
Cognos 8 Reporting is able to consume structured information from many data sources, and it can be used to propagate the text analysis results to a wide audience.
Cognos8Reporting同样能够使用来自各种数据源的结构化信息,并且可用于将文本分析结果传播给广泛的受众。
Internet search engines have focused largely on crawling text on Web pages, but Google is knee-deep in research about how to analyze and organize structured data, a company scientist said Friday.
互联网的搜索引擎们把主要精力都放在采集web页面的文本信息上,但是google却在研究如何分析和组织结构化数据方面小有所成,该公司的一位科学家上周五表示。
Internet search engines have focused largely on crawling text on Web pages, but Google is knee-deep in research about how to analyze and organize structured data, a company scientist said Friday.
互联网的搜索引擎们把主要精力都放在采集web页面的文本信息上,但是google却在研究如何分析和组织结构化数据方面小有所成,该公司的一位科学家上周五表示。
应用推荐