The data mining technology based on the rough set theory can be trained through the training data set.
基于粗糙集理论的数据挖掘技术,通过数据训练集所训练得到的算法模型能够有效用于疾病诊断。
In the stage I, called as data preprocessing, the support vector machines for regression (SVMR) approach is used to filter out the outliers in the training data set.
第一阶段称为所谓的资料预先处理,即使用支援向量回归来找出训练资料集中的离异点并删除之。
This model adopted direct prediction method, so error accumulation effect was avoided. The training data set was real-time updated so that the model was always the newest.
该模式采用直接预报的方法,避免了误差累积效应;对训练集实时更新,以保证模式的不断更新;模式的输入比较简单,便于应用。
This gives you a much larger training set for each trial, meaning that your algorithm will have enough data to learn from, but it also gives a fairly large number of tests (20 instead of 5 or 10).
对每次尝试来说,训练集都非常大,这意味着你的算法有足够的数据进行学习,而且这样一来也提供了足够多的测试次数(20次,而不是5次或10次)。
Nevertheless, if you're in a bind for data, this can yield passable results with lower variance than simply using one test set and one training set.
尽管如此,比起只是简单的使用一个测试集和一个训练集,这种方法可以产生比较低的差异,还算是一个可以说得过去的结果。
If, for instance, you only have 20 samples, there's not much data to use for a training set and still leave a significant test set.
比如,你现在仅仅只有20个样本,对于训练集和有效的测试集来说,没有太多的数据。
Similarly, with machine learning algorithms, a common problem is over-fitting the data and essentially memorizing the training set rather than learning a more general classification technique.
同样,对于机器学习算法,一个通常的问题是过适合(原文为over -fitting,译者注)数据,以及主要记忆训练集,而不是学习过多的一般分类技术。
Rimm also notes that, during C-Path's training, the team had to include 42 samples from the validation set to account for differences in the staining technique used in the two data sets.
Rimm还强调,在C颈训练中必须包括来自验证集的42个样本,用来解释在两个数据集中运用染色技术的不同。
As an aside, some machine learning techniques use a fourth set of data, called a validation set, which is used during training to avoid overfitting.
另一方面,一些机器学习技术使用第四种数据集,称为验证集[validationset],它被用来在整个训练期间避免过适合[overfitting]。
Ensure that use training set is selected so we use the data set we just loaded to create our model.
请确保选中usetraining set以便我们使用刚载入的这个数据集来创建我们的模型。
The training set of data will be memorized, making the network useless on new data sets.
训练集数据将被记忆,而使在处理新数据方面网络无用。
Machine learning and data mining techniques are applied to acquire knowledge and build a concept reasoning network based on semantic dictionary and large training set.
在已有的英语语义词典及大量训练集的基础上,应用机器学习、数据挖掘等技术进行知识获取并最终形成若干个概念推理网。
Classification and clustering are both commonly used data mining methods. The advantage of classification is that the accuracy is higher, but the labeled training set is needed.
分类和聚类都是常用的数据挖掘方法,分类的优点是准确率较高,但需要带有类别标注的训练集;
We either generalize towards a concept or infer a set of general rules that describe the qualities of the training data.
我们既可以从一个概念进行推广也可以从一组描述训练数据性质的一般性规则中进行推断。
Adopted the data processing method of the normalization, choose the training sample of the neural network, the mathematical model of the consumer price index based on BP nerve network predicts set up.
采用归一化数据处理方法,选择神经网络的训练样本,建立基于BP神经网络的居民消费价格指数预测的数学模型。
And based on the experimental results of multi-dimensional data clustering, anomaly detection matrix is determined through identifying the training sample set and the machine self-learning.
然后根据对多维数据聚类的实验分析结果,通过对样本集的训练进行标识和机器自学习过程来判别异常检测矩阵。
That using the reduction attributes of rough set reduced some redundant attributes, improved the real time of data processing by support vector machine, and shorten the time for training sample.
利用粗糙集的属性约简性来约简掉一些冗余属性,提高了支持向量机进行数据处理的实时性,缩短了训练样本的时间。
There are usually few training samples in the tasks of content-based remote sensing image retrieval, which will lead to over-learning problem while using this small data set for training.
提出一种基于多分类器协同训练的遥感图像检索方法,该方法在不同特征集上分别建立分类器,利用不同分类器的协同性自动标记未知样本,从而有效解决了小样本问题。
Raw data preprocessing, the size of training data, as well as the size of input vectors have been studied separately, to find out the best parameter set of BP neural network prediction.
本文分别针对数据的预处理、训练数据集大小以及输入向量的大小分别进行了研究,以确定使用BP神经网络预测的一个最佳参数组合。
In that same post, I cited a third party who observed that, for a data scientist, having less data in their training set means they're susceptible to several modeling risks.
在博文中,我曾援引某位第三方数据科学家的观察结果,发现培训信息集合当中包含的数量量越小、几类常见风险状况的发生可能性就越高。
The experimental data obtained were divided into training set and testing data.
把试验获取的数据整理分为训练样本集和测试样本集。
Absrtact: During the incremental learning, with the increase of the training set, it is very costly to process these data in terms of time and memory consumption.
摘要:在增量学习过程中,随着训练集规模的增大,支持向量机的学习过程需要占用大量内存,寻优速度非常缓慢。
The data sets which consist of training images and testing images are collected, and video data set is collected for real-time vehicle detection.
本文建立了针对车辆前后视图的训练样本库和测试图像库,并且建立了用于实时车辆识别的视频库。
The training set data and test data were input into model for training and simulation;
将训练集数据和测试集数据先后输入网络模型进行学习训练和仿真测试;
Experimental results on KDDCUP1999 data-set show that the method is more effective than cluster SVM in reducing training samples and improving the training and detecting speed of SVM.
在KDDCUP 1999数据集上进行实验,结果表明,与聚类支持向量机方法相比,该方法能简化训练样本,提高SVM的训练和检测速度。
The experimental results show that the proposed algorithm obtains significant accuracy improvement while requiring a much smaller set of RSS training data than previous methods.
实验结果表明,提出算法的定位精度明显高于传统定位算法,且大大降低了离线阶段数据采集的工作量。
The experimental results show that the proposed algorithm obtains significant accuracy improvement while requiring a much smaller set of RSS training data than previous methods.
实验结果表明,提出算法的定位精度明显高于传统定位算法,且大大降低了离线阶段数据采集的工作量。
应用推荐