JPS6320359B2 - - Google Patents
Info
- Publication number
- JPS6320359B2 JPS6320359B2 JP57178482A JP17848282A JPS6320359B2 JP S6320359 B2 JPS6320359 B2 JP S6320359B2 JP 57178482 A JP57178482 A JP 57178482A JP 17848282 A JP17848282 A JP 17848282A JP S6320359 B2 JPS6320359 B2 JP S6320359B2
- Authority
- JP
- Japan
- Prior art keywords
- dictionary
- likelihood
- phoneme
- word
- recognized
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 238000000034 method Methods 0.000 claims description 5
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
Description
(産業上の利用分野)
本発明は、入力音声に対して先ず音素認識を行
ない、この認識音素系列を音素表記された単語辞
書と照合して単語を認識する単語音声認識方法に
関するものである。
(従来例の構成とその問題点)
従来の単語音声認識方法を第1図とともに説明
する。第1図に示すように、入力音声に対して先
ず分析を行ない、この入力単語音声の特徴を抽出
して、入力単語音声を構成する音素を認識する。
この認識された音素系列を、単語辞書中の各辞書
項目の辞書音素系列と照合し、2つの音素系列間
の尤度を、音素間のコンフユージヨンマトリクス
(Confusion Matrix、以下C.M.と略す)を用い
て、各音素毎の認識確率を求めることにより算出
し、音素系列間の尤度が最大となる辞書項目をも
つて認識単語とするものである。
第1表は、前記単語音声認識方法に用いる単語
辞書の一例を示しており、各単語は第2表に示す
音素表記法に従つて表記されている。第2図は前
記C.M.の一部を示す。第2図において、縦は単
語辞書中の音素を示し、横は認識音素を示してい
る。また第2図中の数字は単語辞書中の各音素が
どのような音素に認識されるかの確率を%で示し
たものである。例えば第2図において、単語辞書
中の音素IがIと認識される確率は75%、Uに認
識される確率は5%、Aに認識される確率は0
%、脱落する確率は8%……等を示している。
(Industrial Application Field) The present invention relates to a word speech recognition method that first performs phoneme recognition on input speech, and then recognizes words by comparing the recognized phoneme sequence with a word dictionary in which phonemes are expressed. (Structure of conventional example and its problems) A conventional word speech recognition method will be explained with reference to FIG. As shown in FIG. 1, input speech is first analyzed, features of the input word speech are extracted, and phonemes making up the input word speech are recognized.
This recognized phoneme sequence is compared with the dictionary phoneme sequence of each dictionary item in the word dictionary, and the likelihood between the two phoneme sequences is calculated using a Confusion Matrix (hereinafter abbreviated as CM) between the phonemes. The recognition probability is calculated for each phoneme using the above method, and the dictionary entry with the maximum likelihood between phoneme sequences is determined as a recognized word. Table 1 shows an example of a word dictionary used in the word speech recognition method, and each word is written according to the phoneme notation shown in Table 2. FIG. 2 shows a part of the CM. In FIG. 2, the vertical lines indicate phonemes in the word dictionary, and the horizontal lines indicate recognized phonemes. Further, the numbers in FIG. 2 indicate the probability of what kind of phoneme each phoneme in the word dictionary is recognized as, expressed in percentage. For example, in Figure 2, the probability that the phoneme I in the word dictionary will be recognized as I is 75%, the probability that it will be recognized as U is 5%, and the probability that it will be recognized as A is 0.
%, the probability of dropping out is 8%, etc.
【表】【table】
【表】【table】
Claims (1)
素系列を得、この認識音素系列と、音素表記され
た単語辞書の各辞書項目の辞書音素系列との尤度
を計算して単語を認識するに際し、入力音声に対
し前記音素系列間の尤度を計算した時、この各辞
書項目毎の尤度に、予め各辞書項目毎に前記尤度
が第1位である辞書項目が何であるかによりそれ
ぞれ異なる値に定められている尤度重み値を加算
または乗算して重み付き尤度値を算出し、この重
み付き尤度値が最大となる辞書項目をもつて認識
単語とすることを特徴とする単語音声認識方法。1 Perform phoneme recognition on the input speech to obtain a recognized phoneme sequence, and recognize words by calculating the likelihood between this recognized phoneme sequence and the dictionary phoneme sequence of each dictionary item in the word dictionary with phoneme notation. When calculating the likelihood between the phoneme sequences for the input speech, the likelihood for each dictionary item is determined in advance based on which dictionary item has the highest likelihood for each dictionary item. A weighted likelihood value is calculated by adding or multiplying likelihood weight values set to different values, and the dictionary entry for which this weighted likelihood value is maximum is determined as a recognized word. Word speech recognition method.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP57178482A JPS5968796A (en) | 1982-10-13 | 1982-10-13 | Recognition of word voice |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP57178482A JPS5968796A (en) | 1982-10-13 | 1982-10-13 | Recognition of word voice |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JPS5968796A JPS5968796A (en) | 1984-04-18 |
| JPS6320359B2 true JPS6320359B2 (en) | 1988-04-27 |
Family
ID=16049242
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP57178482A Granted JPS5968796A (en) | 1982-10-13 | 1982-10-13 | Recognition of word voice |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPS5968796A (en) |
-
1982
- 1982-10-13 JP JP57178482A patent/JPS5968796A/en active Granted
Also Published As
| Publication number | Publication date |
|---|---|
| JPS5968796A (en) | 1984-04-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Campbell et al. | Language recognition with support vector machines | |
| JPS63225300A (en) | Pattern recognition equipment | |
| Ramli et al. | An improved syllabification for a better Malay language text-to-speech synthesis (TTS) | |
| JPS6320359B2 (en) | ||
| JPS6310439B2 (en) | ||
| EP0256081B1 (en) | Improvements in or relating to acoustic recognition | |
| JPS6320360B2 (en) | ||
| JPH0126080B2 (en) | ||
| JPS6411959B2 (en) | ||
| JP3240691B2 (en) | Voice recognition method | |
| JPS6338720B2 (en) | ||
| JPH0158520B2 (en) | ||
| JPH0158519B2 (en) | ||
| JPS61122781A (en) | Speech word processor | |
| JPH04291399A (en) | Voice recognizing method | |
| JPS63161498A (en) | Voice information input device | |
| JPH0323920B2 (en) | ||
| JPS6155680B2 (en) | ||
| JPS62217297A (en) | Word voice recognition equipment | |
| JPS62226196A (en) | Reference pattern sequential learning system | |
| JPS6335998B2 (en) | ||
| JPS6131878B2 (en) | ||
| KR970063032A (en) | Chaotic Circular Neural Network (CRNN) and its Learning Method and Speech Recognition Method | |
| KR930003011A (en) | Similar word recognition method | |
| JPH04180097A (en) | Word voice recognition device |