Training the HMMs of a speech recognition system with the traditional BW algorithm generally requires a large amount of speech data.
用传统的BW算法训练语音识别系统的HMM需要大量的语音数据。
The method applies the idea of semi-supervised learning to vector-quantize dialect speech data, forming a codebook model that carries supervision information.
该方法利用半监督的思想对方言语音数据进行矢量量化,形成具有监督信息的码本模型。
At present, MPEG2 is the most widely used coding technique; it is mainly used for transmitting speech and image data.
MPEG2是目前应用最广泛的一种编码技术,主要用于语音,图像数据传输。
Parametric Stochastic Trajectory Model: speaker recognition systems often do not have enough speech data for training.
参数化随机轨线模型:在说话人识别系统中,经常存在训练语料不足的问题。
A new algorithm that uses a fuzzy neural network to fuse speech data is proposed for speech recognition in high-noise environments.
针对高噪音环境中的语音识别问题,提出一种利用模糊神经网络进行语音数据融合的新算法。
To address the shortage of telephone speech data, this paper proposes an algorithm, implemented entirely by software simulation, for converting clean speech into telephone-quality speech.
针对电话语料比较缺乏的问题,提出了一种完全由软件模拟实现由纯净语音向电话质量语音转换的算法。
To improve the performance of speaker recognition under noisy conditions and with short speech samples, the extracted feature parameters were studied on the basis of the Vector Quantization (VQ) recognition algorithm.
为了使说话人识别系统在语音较短和存在噪声的环境下也具有较高的识别率,基于矢量量化识别算法,对提取的特征参数进行研究。
This paper describes the effect of additive noise and convolutional noise on speech data and presents a blind environmental compensation method for improving system robustness.
本文讨论了环境加性噪声和卷积噪声对语音数据的影响,以及提高系统的鲁棒性的盲的环境补偿方法。
The ISD1420 is a new-generation speech chip that stores recorded speech data permanently and offers high fidelity, low power consumption, and easy interfacing with microcontrollers.
ISD1420是一种录音数据永久保存、高保真、耗电小、适用于同单片机接口的新一代语音器件。
Before analysis, the speech data are pre-emphasized and weighted with a Hamming window, and silent segments are automatically detected and discarded using a frame energy threshold.
分析之前,语音样本经过频域预加权和时域汉明窗加权处理,并利用帧能量门限自动去除了样本中的寂静段。
They include patents on wireless data, speech coding, security, and encryption, according to Nokia.
按诺基亚的说法,它们包括无线数据、语音编码、安全和加密方面专利。
The 10 patents cover wireless data, speech coding, security and encryption and are "infringed by all Apple iPhone models shipped since the iPhone was introduced in 2007."
诺基亚的10项专利包括无线数据、语音编码、安全和加密,以及“所有出货的苹果iPhone机型中的技术。自2007年推出iPhone以来,所有的苹果iPhone机型都侵犯了诺基亚的专利”。
But entering data into a phone might ultimately be done not with fingers but with speech, or even directly by the brain.
但我们最终可能不会用手指把数据输入电话,而是用语言——甚至直接用我们的大脑。
When speaking specific data types it can help to tell the text-to-speech (TTS) system of the voice browser the type of information to convert into speech.
当用户说出特定的数据类型时,该信息会将要转换为语音的信息类型告诉语音浏览器的TTS系统。
Since much of the data we receive comes through speech, the Brain Fitness Program works with language and hearing to improve both speed and accuracy.
由于我们获得的信息大多是通过说话,因此这个健脑计划(the Brain Fitness Program)是通过语言和听力来提高思维速度和准确性的。
Google has shoveled vast financial and engineering resources into a collection of data mining and artificial intelligence systems, from speech recognition to machine translation to computer vision.
谷歌把大量财力和技术资源投入到了一系列数据挖掘和人工智能系统之中,从语音识别到机器翻译再到计算机视觉,不一而足。
Those larger implants should yield much more brain signal data that could in turn improve translation accuracy to the point that thought-to-speech translation could become a viable clinical solution.
这些大型的植入电极应当可以获取到更多的脑波数据来提高翻译的准确性,或许思维-语言翻译的临床应用将变得切实可行。
A device that can generate and propagate signals representing data or speech.
一种能生成和传送代表数据或话音的信号的装置。
This paper addresses the problem of speech recognition under telephone channel conditions using a data simulation method and HMM (Hidden Markov Model) adaptation.
该文研究了基于数据模拟方法和HMM(隐马尔科夫模型)自适应的电话信道条件下语音识别问题。
It plans to use our speech patterns - not just what we say but how we say it - in conjunction with other behavioural data, such as how we type, to build up a more reliable picture of our identity.
该项目计划利用我们的语言模式——这不光包括我们说话的内容,还包括我们说话的方式——以及打字习惯等的行为数据,建立一个更可靠的身份验证体系。
Experiments on chunking recognition and Part-of-Speech tagging are conducted to show that the new data structure greatly speeds up the feature matching process while keeping the same space complexity.
基本短语识别和词性标注的实验显示,这种新的数据结构的确能够极大地加快最大熵方法执行系统的速度,同时保持空间复杂度不变。
Speech recording is a very important part of the voyage data recorder (VDR).
在航海数据记录仪中,语音记录是十分重要的环节。
This paper proposes a pattern recognition method that can be used effectively for supervised continuous speech recognition, text understanding, and data file analysis.
提出一种模式识别方法,它能有效地用于有监督的连续语音识别、文稿理解和数据文件分析。
People are no longer satisfied with the transmission of speech signals and text data alone; they expect to use many kinds of representation media.
人们不再满足于单纯的语音信号和文字数据的传输,而是期盼使用多种表示媒体。
This research indicates that the encryption functional module can be applied to point-to-point speech communication and other low-speed data traffic.
研究表明,该加密功能模块可用于点对点的语音通信和其他低速率数据通信模型。
Based on the studies and experiments above, we can conclude that using data mining to extract and learn prosodic rules for speech synthesis is viable and effective.
通过以上研究实验表明,利用数据挖掘技术对语音合成中的韵律规则进行提取和学习是可行有效的。
This paper introduces a speech recognition system based on zero-crossing interval data, along with some of its key technologies and the design flow of the system.
介绍了基于过零间隔点技术的声纹识别系统和其中一些关键技术和系统的设计流程。
Because stressed speech is difficult to collect and the resulting training data are sparse, a speech recognizer performs poorly even when it is tested and trained under the same conditions.
一般情况下变异语音数据采集困难,获得的训练数据量少,这样即使测试环境和训练环境都相同,识别性能也不理想。