Parameter data
Part number: ISD-SR3000
Document pages: 43 of 120
File size: 1293K
Distributor: ISD-SR3000
2-2
2—SOFTWARE
ISD-SR3000
Voice Solutions in Silicon
2.2 RECOGNITION ENGINE
ISD-SR3000 uses a segmented triphone recognition process. The sampled speech utterance
is split into distinct phonetic sounds, the smallest units of speech. Because these phonemes
vary in both sound and duration, the processor must be able to determine boundaries between
the sounds. The ISD-SR3000 uses Hidden Markov Models to hypothesize boundaries between
sounds and to form probabilistic models on each possible combination.
The outputs are then classified by determining matches between the phonetic sounds and the
stored phoneme models. The acoustic models for the phonemes are gathered from a large
sample of speakers, allowing for a wide variation across accents, dialect, and gender. This
allows the recognizer to associate the sound segments with a number of possible phonemes,
enabling recognition when words are pronounced differently.
The phonemes are then matched to vocabulary words or phrases using a search routine. The
set of phonemes is compared to the vocabulary models for the active topics, and the recognized
word is returned. If the phonemes do not match any of the active vocabulary words, nothing is
returned. The ISD-SR3000 does not return a score with the word; it either recognizes a word,
or it does not.
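
The search step described above can be pictured with a small sketch. It is illustrative only and is not the ISD-SR3000 firmware: the lexicon, the phoneme symbols, and the function name `match_word` are all assumptions, and exact matching against a few alternative pronunciations stands in for the probabilistic search the device performs. What it does mirror is the behavior stated in the text: only the active vocabulary is searched, and the result is either a word or nothing, with no score.

```python
from typing import Optional, Sequence

# Hypothetical pronunciation lexicon: each word maps to the phoneme
# sequences accepted for it (several entries allow pronunciation variants).
LEXICON = {
    "yes": [("y", "eh", "s")],
    "no": [("n", "ow")],
    "dial": [("d", "ay", "l"), ("d", "ay", "ah", "l")],
}


def match_word(phonemes: Sequence[str], active_words: Sequence[str]) -> Optional[str]:
    """Return the active word whose pronunciation matches, or None.

    No confidence score is returned: a word is either recognized or not.
    """
    target = tuple(phonemes)
    for word in active_words:
        if target in LEXICON.get(word, []):
            return word
    return None


# Only words in the active vocabulary can be returned.
print(match_word(["d", "ay", "ah", "l"], ["dial", "yes"]))
print(match_word(["n", "ow"], ["yes"]))
```

In this sketch "no" is not returned in the second call because it is not in the active vocabulary, even though the lexicon contains it.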
2.2.1 TYPES OF RECOGNITION
The ISD-SR3000 is capable of both speaker-independent and speaker-defined recognition.
The recognition engine is continuous, allowing for multiple-word commands and connected
digits. However, there must be recognized silence before and after valid utterances. The length of
the silence is programmed into the host controller, and may be as short as 100 ms. The
commands and digits are speaker-independent, with models constructed from a large corpus of
speakers. The speaker-defined voicetags and commands are partially speaker-dependent.
However, they are constructed by creating acoustic models “on-the-fly” from the phoneme
base. This means only one training pass is required for entering the voicetags, and recognition
is possible with some variation in the way the name is spoken. The first pass is used to create
the phoneme model, and a second pass is used for recognition confirmation.
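
The silence requirement can be illustrated with a short sketch. It assumes a front end that has already labeled each fixed-length frame as speech or silence; the 10 ms frame length, the function name `find_utterances`, and the gating logic itself are assumptions made for illustration, not the device's documented algorithm. Only the 100 ms minimum comes from the text above.

```python
from typing import List, Tuple


def find_utterances(is_speech: List[bool], frame_ms: int = 10,
                    min_silence_ms: int = 100) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges of utterances bounded by silence.

    An utterance is kept only if at least min_silence_ms of silence both
    precedes and follows it. Shorter pauses are treated as part of the
    same utterance (for example, the gaps between connected digits).
    """
    need = -(-min_silence_ms // frame_ms)  # required silent frames, rounded up
    utterances = []
    silence_run = 0          # consecutive silent frames seen so far
    start = None             # first frame of the utterance in progress
    last_speech = None       # most recent speech frame
    led_by_silence = False   # did enough silence precede the utterance?
    for i, speech in enumerate(is_speech):
        if speech:
            if start is None:
                start = i
                led_by_silence = silence_run >= need
            last_speech = i
            silence_run = 0
        else:
            silence_run += 1
            if start is not None and silence_run >= need:
                if led_by_silence:
                    utterances.append((start, last_speech + 1))
                start = None
    return utterances


# 100 ms silence, speech with a short 30 ms pause inside it, 100 ms silence.
frames = [False] * 10 + [True] * 5 + [False] * 3 + [True] * 4 + [False] * 10
print(find_utterances(frames))
```

Speech that starts without leading silence, or that is still in progress when the input ends, is not reported by this sketch.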
2.2.2 GRAMMAR
A grammar is used to define the structure of the commands. The ISD-SR3000 is designed to
work with multiple topics or a finite-state grammar. This type of grammar is designed to limit
perplexity (the number of possible branches during recognition) by pre-defining the number of
allowable words at a given state. For example, a prompt that requires a “yes” or “no” response
has a perplexity of two. Greater perplexities increase the chances for substitution errors. During
recognition, a limited number of topics are active. Topics are groups of words that are active at
a given time. For example, in a voice dialing application, digit topics are active after the user
issues the “dial” command. No other topics are open (except the global topics such as “cancel”
or “help”) so that the recognizer is only trying to recognize digits. This type of grammar, combined
with active topics, inherently increases recognition accuracy.
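
A minimal sketch of a topic-based finite-state grammar, following the voice dialing example above. The state names, topic names, word lists, and function names are hypothetical and chosen only for illustration; the actual ISD-SR3000 grammar format is not shown in this section.

```python
# Global topic: words that stay active in every state.
GLOBAL_WORDS = {"cancel", "help"}

# Hypothetical topics: named groups of words.
TOPICS = {
    "commands": {"dial", "redial"},
    "digits": {"zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine"},
    "confirm": {"yes", "no"},
}

# Each grammar state opens a limited set of topics.
STATE_TOPICS = {
    "idle": ["commands"],
    "dialing": ["digits"],
    "confirming": ["confirm"],
}

# Transitions taken when a word is recognized in a given state.
TRANSITIONS = {("idle", "dial"): "dialing"}


def topic_words(state):
    """Words opened by the state's own topics (global words excluded)."""
    words = set()
    for name in STATE_TOPICS[state]:
        words |= TOPICS[name]
    return words


def active_words(state):
    """Everything the recognizer listens for in this state."""
    return topic_words(state) | GLOBAL_WORDS


def next_state(state, word):
    """Follow a transition if one is defined, otherwise stay put."""
    return TRANSITIONS.get((state, word), state)


state = next_state("idle", "dial")
print(state, sorted(active_words(state)))
print(len(topic_words("confirming")))  # the yes/no prompt: two branches
```

The count for the yes/no prompt covers only the prompt's own topic, matching the perplexity of two given in the text; the always-active global words add to the number of words the recognizer actually has to distinguish.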