改进了连续属性离散化的贪心算法。
Improve a greedy algorithm for discretization of continuous attribute.
基于多连续属性离散化的数据预处理方法。
A data preprocessing method based on multi continuous attribute discretization.
连续属性离散化是粗糙集应用研究的重点内容之一。
The discretization of Continuous attributes is one of the important contents in application study of rough sets.
应用聚类方法研究了数量关联规则提取过程中的连续属性离散化问题。
This paper presents a cluster method for discretization in the processing of mining quantitative association rules.
连续属性离散化方法在人工智能、机器学习等很多方面具有重要应用。
Discretization algorithm for real value attributes is of very important USES in many areas such as intelligence and machine learning.
针对这些问题,提出了一种基于属性重要度的整体连续属性离散化方法。
Regarding this, this paper puts forward the discrete method of the overall continuous attributes which is based on the importance of attributes.
本文对基于粗集的数据预处理中数据补齐和连续属性离散化问题进行讨论。
This thesis discusses the question of data reinforce and continuous feature discretization which is based upon data preprocessing of rough set.
提出了一种基于断点重要性的配电网连续属性离散化方法,证明了该方法的有效性。
A continuous attribute discretization of the electric power distribution system is put forward based on the breakpoint importance, which is proved effectively.
首先提出主泛化决策等概念,在数据过滤方法的基础上,利用等价类的合并对属性离散化。
Its basic idea is to merge the equivalence classes of q on basis of the methods of data filtering.
该方法打破了传统连续属性离散化遍历搜索的思路,在保证效率的基础上显著提高了离散效果。
This method break the idea of traditional continual attribute discretization's traversal heavy search, obviously enhanced the separate effect on the bases of efficiency.
肿瘤诊断数据库中的属性常为数量型属性,因此如何将数量型属性离散化是挖掘关联规则的难点。
Attributes in the database of tumor diagnoses are usually quantitative attributes, so quantitative attribute discretization is a problem of mining association rules.
本文基于可辨识矩阵提出一种连续属性离散化的方法,并利用平均互信息量对离散化结果进行修正。
The paper puts forward a method of discretization of continuous properties based on discernibility matrix and revises the discrete result by average mutual information.
介绍了在数据库知识发现(KDD)中将连续属性离散化的一些方法,并提出使用值差分度量离散化的算法。
Some methods for dividing continuous attributes in KDD (knowledge discovery in database) and a method based on VDM (value difference metric) are presented.
提出了一种基于微粒群优化(PSO)算法的连续属性离散化方法,很好的解决了建模过程中连续属性的离散化问题。
An algorithm for discretization based on Particle swarm optimization (PSO) is presented, which can settle the problem of continuous attributes discretization in systema modeling perfectly.
通过对C4.5算法的研究与分析,针对该算法处理连续性属性的不足,采用一种基于信息熵的区间合并的属性离散化方法。
Based on C4.5 analysis and research, this paper gives the method of continuous attributes dispersed, that merge interval based on information entropy.
连续属性通过极大熵方法离散化。
Continuous attributes are discretized through maximum entropy method.
本文提出了一个新的决策表离散化算法,该算法在离散化数据的同时具有良好的属性约简功能。
In this paper, a novel decision table discretization algorithm is presented, which has fine attribute reduction function in time of data discretization and increases quality of classification.
本文针对传统的离散化技术所造成的信息丢失问题,提出了利用直觉模糊粗糙集合理论来进行属性约简的方法。
The technology of attribute reduction based on the intuitionistic fuzzy rough set theory is described as to the problem of information loss in the process of discretization.
连续属性的离散化在数据挖掘中有着非常重要的作用。
The discretization of continuous properties is very important in data mining.
连续属性的离散化是粗糙集理论的主要问题之一。
The discretization of real value attributes is one of the most main problems in rough sets theory.
论文分析了基于熵的离散化方法的不足,从估计训练样本的概率分布的角度出发,提出基于样本分布与熵相结合的处理数值型属性的方法。
By the method of estimating the probability distribution of training examples, a new and simple method of dealing with numeric attribute based on example distribution and entropy is turned out.
连续型属性的离散化问题是机器学习中的关键问题,是一个NP难题。
Discretization of continuous type of attributes is a key issue in machine learning, it is a NP puzzle.
连续属性的离散化是数据预处理的重要工作。
Discretization of numeric attribute is an important role of data preprocessing.
前者从数据的离散化,降维,和属性选择方面有效的解决了处理大规模高维数据库时的效率与精度之间的矛盾。
The former algorithm makes improvements from three aspects: discretization, reducing dimension, attribute selection, which effectively solves the conflict between efficiency and prediction precision.
对数据离散化及属性选择度量方法作了改进,以降低计算复杂度,提高算法学习速度。
For decreasing calculation complexity and increasing speed of algorithm learning, it presented some improvement of data discretization and attributes selection measure.
基于灰色系统和粗糙集的有关理论,提出了一种新的基于属性重要性的离散化算法。
Based on theory of grey system and rough sets, a new discretization algorithm of continuous attributes in decision table is offered.
连续属性的离散化是粗糙集理论亟待解决的关键问题之一。
Discretization of continuous attributes is always one of the key problems that need urgent solutions in rough sets theory.
对收集到的入侵数据进行预处理、数据离散化,属性约简,并依据生成的检测规则来分析入侵数据。
The algorithm includes data preconditioning, data discretization, attribute reduction, production of detection rules, and finally analysis of intrusion data with these rules.
依据粗集理论研究离散化数据的特点,考虑类分布信息,采用信息熵理论进行连续条件属性的离散化。
Data discrimination is the character of RS, considering distributed information of class, and continual condition attributes are described according to information entropy theory.
依据粗集理论研究离散化数据的特点,考虑类分布信息,采用信息熵理论进行连续条件属性的离散化。
Data discrimination is the character of RS, considering distributed information of class, and continual condition attributes are described according to information entropy theory.
应用推荐