现有的通用无损压缩算法往往对文本数据压缩比较有效,而对典型数值模拟数据的压缩则不理想。
Existing general-purpose lossless compression algorithms tend to be effective on text data but perform poorly on typical numerical simulation data.
提出了一个基于数据压缩的全文本数据库倒排索引结构,并在此结构上设计出了一些查找算法来获得更好的查找性。
An inverted index structure for full-text databases, based on data compression, is proposed, and several search algorithms are designed on this structure to obtain better search performance.
通过实例说明格式文本数据以通用文档编码格式存储的方式,以及存储数据压缩处理形成通用文档的方法。
Examples illustrate how formatted text data is stored in a universal document encoding format, and how stored data is compressed to form a universal document.