JPH0415960B2

Info

Publication number: JPH0415960B2
Application number: JP57039013A
Authority: JP
Inventors: Kazunaga Yoshida
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1982-03-12
Filing date: 1982-03-12
Publication date: 1992-03-19
Also published as: JPS58156998A

Description

[Detailed description of the invention]

The present invention relates to an information input device, and in particular to a device for inputting into a machine information produced by a human, such as speech uttered one phonemic unit at a time or handwritten characters.

The information input device according to the present invention is applicable whether the input information is speech or handwritten characters. For clarity, however, the following description covers only speech input. The unit of input information in the following description is a Japanese monosyllable such as "a", "i", "u", "e", or "o".

Speech input devices that recognize speech uttered one monosyllable at a time already exist. In one proposed method, monosyllables uttered in advance are registered as standard patterns, and an input monosyllable is recognized by pattern matching against those registered patterns. In such a registration-type speech input device, the choice of standard patterns is important. A syllable may be uttered poorly at registration time, or the user's actual utterances may drift away from the standard patterns as time passes after registration. Automatic learning of the standard patterns from the most recent input speech is an effective way to remove this effect. If the user always corrected every recognition error, the standard patterns could be updated from the input speech patterns. With monosyllables in particular, however, the user may overlook an error, so updating in this way risks storing an incorrect speech pattern as a standard pattern.

An object of the present invention is to provide an information input device that can correctly update the standard patterns from new input patterns without requiring the user to make corrections unit by unit (monosyllable by monosyllable in the case of speech).

The present invention comprises: a recognition unit that recognizes input information such as speech or handwritten characters, one predetermined unit at a time, based on standard patterns registered in advance, and outputs a recognition result; a word dictionary section in which words are stored as sequences of the predetermined units; a dictionary matching section that matches the contents of the dictionary section against the recognition result to obtain a word recognition result; and an update instruction section that judges from the word recognition result whether the recognition result for each predetermined unit is correct and instructs updating of the standard patterns based on that judgment.

The present invention is described in detail below with reference to a specific embodiment. FIG. 1 is a block diagram of one embodiment of the invention. In the figure, 1 is a microphone, 2 is an analysis section, 3 is a speech matching section serving as the recognition unit, 4 is a standard pattern memory section, 5 is a dictionary matching section, 6 is a word dictionary section, and 7 is an update instruction section. A monosyllable input through microphone 1 is analyzed in analysis section 2 and output as a speech pattern P. Analysis section 2 also retains P. In speech matching section 3, P is matched against the standard patterns R stored in standard pattern memory section 4. The monosyllable recognition result M consists of the top several candidates in order of likelihood, each output together with its likelihood value, that is, its similarity. The correct monosyllable normally appears among the top three or so candidates more than 99% of the time, so outputting about that many candidates is sufficient.
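The ranking step performed by speech matching section 3 can be sketched as follows. This is only an illustration: the patent does not say how similarity is computed, so the negated Euclidean distance, the two-dimensional feature vectors, and the names `similarity` and `top_candidates` are all assumptions made here.

```python
import math

def similarity(pattern, reference):
    """Hypothetical measure: negated Euclidean distance (larger means closer)."""
    return -math.dist(pattern, reference)

def top_candidates(pattern, standard_patterns, n=3):
    """Return the n best (syllable, similarity) pairs, most likely first."""
    scored = [(syl, similarity(pattern, ref))
              for syl, ref in standard_patterns.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:n]

# Toy feature vectors, invented for illustration only.
standard_patterns = {
    "ka": [1.0, 0.0],
    "na": [0.0, 1.0],
    "ma": [0.2, 1.0],
    "ga": [1.0, 0.3],
}
candidates = top_candidates([0.9, 0.1], standard_patterns)
```

Keeping the similarity value next to each candidate matters because the later dictionary matching step sums these values rather than looking only at the first-ranked syllable.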

Dictionary matching section 5 matches the recognition result M against the contents of word dictionary section 6 and outputs a word recognition result W. A "word" here is not limited to an ordinary word; it means any sequence of monosyllables and so includes phrases. When the monosyllable recognition results for one word have been input to dictionary matching section 5, it first selects and reads out those words in word dictionary section 6 that have the same number of characters. It then compares the recognized monosyllable sequence with the monosyllable sequence of each selected word and takes the sum of the similarities of the matching monosyllables as the similarity of that word. The word with the greatest similarity is the word recognition result.

For example, FIG. 2 shows one example of the monosyllable recognition results and their similarities when "ka-na-ga-wa" is input by voice. A larger similarity means a closer match. In this case the similarity of the word "kanagawa" is 80 + 50 + 80 + 80 = 290 and that of "kanazawa" is 80 + 50 + 10 + 80 = 220, so "kanagawa", which has the larger similarity, becomes the recognition result.
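The dictionary matching and the arithmetic above can be reproduced with a short sketch. FIG. 2 is not reproduced in this text, so only the values 80, 50, 80, 10, and 80 come from the description; the score of 60 for "ma", the extra dictionary entries, and the function name `match_word` are assumptions made for illustration.

```python
def match_word(recognition, dictionary):
    """Pick the dictionary word with the largest summed similarity.

    recognition: one dict per syllable position, mapping candidate -> similarity.
    dictionary:  words stored as tuples of monosyllables.
    """
    best_word, best_score = None, float("-inf")
    for word in dictionary:
        if len(word) != len(recognition):  # keep only words of matching length
            continue
        # A syllable that is not among the candidates contributes nothing.
        score = sum(cands.get(syl, 0) for syl, cands in zip(word, recognition))
        if score > best_score:
            best_word, best_score = word, score
    return best_word, best_score

# Candidate lists for "ka-na-ga-wa"; the 60 for "ma" is an invented value.
recognition = [
    {"ka": 80},
    {"ma": 60, "na": 50},
    {"ga": 80, "za": 10},
    {"wa": 80},
]
dictionary = [
    ("ka", "na", "ga", "wa"),
    ("ka", "na", "za", "wa"),
    ("yo", "ko", "ha", "ma"),
    ("to", "u", "kyo", "u", "to"),  # skipped: different length
]
word, score = match_word(recognition, dictionary)
```

Here `word` is the "kanagawa" entry with a score of 290, even though the first-ranked candidate in the second position is "ma". That is the property the device relies on: the dictionary can overrule a single misrecognized syllable.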

In this example it can be detected that the second syllable, "na", was misrecognized as "ma". Based on such a result signal DR from dictionary matching section 5, update instruction section 7 outputs an instruction signal C to update the standard pattern in standard pattern memory section 4. That is, the "na" pattern among the speech patterns P just input and held in analysis section 2 replaces the existing "na" standard pattern in standard pattern memory section 4.
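A minimal sketch of this error detection and replacement step follows. It assumes that an error is detected whenever the first-ranked candidate at a position differs from the syllable of the matched word; the similarity of 60 for "ma", the feature vectors, and the names `find_misrecognized` and `replace_patterns` are hypothetical.

```python
def find_misrecognized(recognition, word):
    """Return (position, wrong, correct) where the top candidate is not the word's syllable."""
    errors = []
    for pos, (cands, correct) in enumerate(zip(recognition, word)):
        top = max(cands, key=cands.get)  # first-ranked candidate at this position
        if top != correct:
            errors.append((pos, top, correct))
    return errors

def replace_patterns(standard_patterns, held_patterns, errors):
    """Overwrite the stored pattern of each correct syllable with the new input."""
    for pos, _wrong, correct in errors:
        standard_patterns[correct] = held_patterns[pos]

recognition = [
    {"ka": 80},
    {"ma": 60, "na": 50},   # "ma" outranks the intended "na" (60 is invented)
    {"ga": 80, "za": 10},
    {"wa": 80},
]
word = ("ka", "na", "ga", "wa")            # word recognition result W
held_patterns = [[1.0, 0.1], [0.1, 0.9], [1.0, 0.4], [0.5, 0.5]]  # patterns P, toy values
standard_patterns = {"ka": [1.0, 0.0], "na": [0.0, 1.0], "ma": [0.2, 1.0]}

errors = find_misrecognized(recognition, word)
replace_patterns(standard_patterns, held_patterns, errors)
```

After the call, the entry for "na" holds the pattern that was just spoken, and the user never had to correct the syllable by hand.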

Several methods other than the one above are conceivable for updating the standard patterns. For example, a counter may be provided for the standard pattern of each monosyllable. When a monosyllable recognition error is detected by matching against the dictionary, the counter of the misrecognized standard pattern is incremented. When the number of misrecognitions for a standard pattern reaches a predetermined count, that standard pattern is updated with a new pattern. Besides replacing the standard pattern with the new pattern, it is also conceivable to create a new standard pattern by averaging the input pattern and the existing standard pattern.
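The two variants can be sketched together as below. The text does not fix several details, so the following are assumptions: the counter is kept for the syllable that should have been recognized, the counter is reset after an update, the average uses equal weights, and the class name `CountedUpdater` and the threshold value are hypothetical. The text presents counting and averaging as separate options; they are combined here only for brevity.

```python
from collections import Counter

def average_pattern(reference, new_pattern):
    """Blend old and new patterns element by element (equal weights assumed)."""
    return [(r + p) / 2 for r, p in zip(reference, new_pattern)]

class CountedUpdater:
    """Update a standard pattern only after repeated misrecognitions."""

    def __init__(self, standard_patterns, threshold=3):
        self.standard_patterns = standard_patterns
        self.threshold = threshold
        self.error_counts = Counter()

    def report_error(self, syllable, new_pattern):
        """Count one misrecognition; update once the threshold is reached."""
        self.error_counts[syllable] += 1
        if self.error_counts[syllable] < self.threshold:
            return False
        old = self.standard_patterns[syllable]
        self.standard_patterns[syllable] = average_pattern(old, new_pattern)
        self.error_counts[syllable] = 0  # assumed: start counting afresh
        return True

patterns = {"na": [0.0, 2.0]}
updater = CountedUpdater(patterns, threshold=2)
first = updater.report_error("na", [2.0, 4.0])   # below threshold, no update
second = updater.report_error("na", [2.0, 4.0])  # threshold reached, averaged
```

Waiting for several errors makes the stored pattern less sensitive to a single unusual utterance, and averaging moves it toward the new input gradually instead of discarding the old pattern outright.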

The embodiment described above is only one example chosen for convenience of explanation, and the present invention is not limited to it. Various other methods are also conceivable for matching the input monosyllable sequence against the dictionary.

As stated at the outset, the present invention can also be applied to handwritten character input. In that case a single character is used as the unit in place of a monosyllable. Pattern matching against standard patterns is considered effective for tasks such as online handwritten character recognition, so the present invention can be applied there as well.

According to the present invention, an information input device is obtained that, in speech or character recognition, can update the standard pattern of each predetermined unit to a correct pattern.

[Brief explanation of drawings]

FIG. 1 is a block diagram of one embodiment of the present invention. FIG. 2 is a diagram showing an example of the relationship between monosyllable recognition results and similarities.

In the figures: 1, microphone; 2, analysis section; 3, speech matching section; 4, standard pattern memory section; 5, dictionary matching section; 6, word dictionary section; 7, update instruction section.

Claims

[Claims]

1. An information input device comprising: a recognition unit that recognizes input information such as speech or handwritten characters, one predetermined unit at a time, based on standard patterns registered in advance, and outputs a recognition result; a word dictionary section in which words are stored as sequences of the predetermined units; a dictionary matching section that matches the contents of the dictionary section against the recognition result to obtain a word recognition result; and an update instruction section that judges from the word recognition result whether the recognition result for each predetermined unit is correct and instructs updating of the standard patterns based on that judgment.