JPH0438358B2

JPH0438358B2 -

Info

Publication number: JPH0438358B2
Application number: JP59251902A
Authority: JP
Priority date: 1984-11-30
Filing date: 1984-11-30
Publication date: 1992-06-24
Also published as: JPS61130999A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は、人間の音声を認識する装置に係り、
特に、入力された音声情報を、予め登録された音
声パターンと照合して認識する複数の音声認識手
段を備えた音声認識装置の改良に関する。[Detailed Description of the Invention] [Field of Application of the Invention] The present invention relates to a device for recognizing human speech.
In particular, the present invention relates to an improvement of a speech recognition device including a plurality of speech recognition means that recognize input speech information by comparing it with a pre-registered speech pattern.

[Background of the invention]

例えば、特開昭51−28701号公報に開示されて
いるように、現在の音声認識精度は、不特定話者
では著しく低下するので、話者毎の特徴を抽出し
た音声パターンを予め登録しておき、これと音声
入力を照合して、それらの特徴の一致により認識
する手法が主流である。 For example, as disclosed in Japanese Patent Application Laid-Open No. 51-28701, the current speech recognition accuracy is significantly degraded for unspecified speakers, so it is necessary to register speech patterns in advance that extract features for each speaker. The mainstream method is to compare this with the voice input and recognize it based on the matching of those features.

ところで、音声認識装置を、複数の話者で使用
したい要求が強く、この場合には、１台の音声認
識装置に、複数の話者（例えば20名など）の各単
語毎の音声パターンを予め記憶させておく。そし
て、使用者（話者）の発生した単語と、記憶され
た音声パターンとを照合し、最も近い、あるい
は、予定の誤差の範囲にある音声パターンに対応
する言葉であるものと認識する。 By the way, there is a strong demand to use a speech recognition device with multiple speakers, and in this case, one speech recognition device can be used to record the speech patterns for each word of multiple speakers (for example, 20 people) in advance. Let me remember it. Then, the words produced by the user (speaker) are compared with the stored speech pattern, and recognized as words that correspond to the speech pattern that is closest or within the expected error range.

しかし、多数の話者の多数の単語を記憶するた
めには大きな記憶容量を必要とし、また、その中
から一致する音声パターンを認識するためには、
認識時間が長くなるという欠点がある。 However, remembering a large number of words from a large number of speakers requires a large memory capacity, and recognizing matching speech patterns from among them requires a large amount of storage capacity.
The disadvantage is that the recognition time is long.

このため、各話者の音声パターンを記憶させた
カセツト・メモリを各話者が所有し、音声認識装
置を使用するときに、上記カセツトをセツトする
手法も提案されている。 For this reason, a method has been proposed in which each speaker owns a cassette memory in which each speaker's voice pattern is stored, and the cassette is set when using the speech recognition device.

この手法によれば、メモリの総容量としては同
じであるが、特定話者の音声パターンのみとの照
合により、認識精度及び認識速度が向上する利点
がある。 According to this method, although the total memory capacity is the same, there is an advantage that recognition accuracy and recognition speed are improved by matching only the voice pattern of a specific speaker.

しかしながら、カセツト・メモリなどの紛失や
置場の問題が生じ、利用者にとつて不便である。 However, this is inconvenient for the user, as the cassette memory, etc., may be lost or stored.

[Purpose of the invention]

本発明の目的は、複数の音声認識装置を複数の
話者が使用する場合において、認識精度や認識速
度を損うことがなく、また、着脱メモリなどを必
要としない音声認識装置を提供することである。 An object of the present invention is to provide a speech recognition device that does not impair recognition accuracy or recognition speed when multiple speech recognition devices are used by multiple speakers, and does not require removable memory. It is.

[Summary of the invention]

本発明の特徴とするところは、複数の音声認識
手段のうちの任意のものを使用する話者を識別す
る手段と、複数話者毎の音声パターンを記憶する
共通の補助記憶手段と、識別された話者に対応す
る音声パターンを上記補助記憶手段いから該当す
る音声認識手段の持つ音声パターン記憶手段へ読
出し格納する制御手段を設け、共通の補助記憶手
段には、(1)、話者識別用の音声パターン（話者交
代用語）と、(2)、作業用の音声パターン（命令語
や作業用語など）とを予め記憶させておき、各音
声認識手段の使用直前には、話者識別用の音声パ
ターンを各音声認識手段内の音声パターン記憶手
段に読出し格納しておく。 The present invention is characterized by a means for identifying a speaker using any one of a plurality of speech recognition means, a common auxiliary storage means for storing speech patterns for each of the plurality of speech recognition means, and a common auxiliary storage means for storing speech patterns for each of the plurality of speech recognition means. The common auxiliary storage means includes (1) a control means for reading and storing a voice pattern corresponding to a speaker identified from the auxiliary storage means into a voice pattern storage means of the corresponding voice recognition means; (2) A speech pattern for tasks (speaker change terms) and (2) a speech pattern for tasks (commands, task terms, etc.) are stored in advance, and immediately before using each speech recognition means, speaker identification is performed. The speech patterns for the speech recognition means are read out and stored in the speech pattern storage means in each speech recognition means.

この状態で、任意の音声認識手段を例えば、Ａ
太郎が使用を開始するとき、その氏名などの話者
識別用音声パターンと符号する定められた話者交
代用語を発声する。 In this state, any voice recognition means, for example, A
When Taro starts speaking, he utters a predetermined speaker change term that is encoded with a voice pattern for identifying the speaker, such as his name.

これにより、話者が識別されると、制御手段
は、識別された話者に対応する作業用の音声パタ
ーンを共通の補助記憶手段から読出し、対応する
音声認識手段内の音声パターン記憶手段へ格納す
る。 Accordingly, when the speaker is identified, the control means reads the working speech pattern corresponding to the identified speaker from the common auxiliary storage means and stores it in the speech pattern storage means in the corresponding speech recognition means. do.

以後は、話者と１対１に対応した作業用の音声
パターンのみと、その話者の音声入力情報との照
合の下に音声認識が行われ、必要な作業が遂行さ
れる。 Thereafter, voice recognition is performed by comparing only the voice pattern for work in one-on-one correspondence with the speaker and the voice input information of that speaker, and the necessary work is performed.

この結果、着脱メモリなどを必要とせず、複数
の音声認識手段を夫々異る話者が同時に使用する
速度を損うこともない。 As a result, there is no need for a removable memory or the like, and the speed with which different speakers can simultaneously use a plurality of voice recognition means is not impaired.

[Embodiments of the invention]

以下、音声入力記憶装置に本発明を適用した一
実施例につき詳細に説明する。この実施例におい
ては、音声によるガイダンス、アンサーバツクを
行いつつ、作業者の作業とその結果を入力して記
憶する装置である。また、最初に行うべき、ある
いは新たな話者に対して行うべき音声パターンの
登録をも自由に行いうるものである。 Hereinafter, an embodiment in which the present invention is applied to a voice input storage device will be described in detail. In this embodiment, the device inputs and stores the worker's work and its results while providing voice guidance and answering. Furthermore, it is also possible to freely register voice patterns that should be performed for the first time or for a new speaker.

第１図は本発明に係る音声情報入力記憶装置の
一実施例の構成を示す。同図において、２台の音
声認識装置１Ａおよび１Ｂは音声入力用マイク６
Ａおよび６Ｂの音声信号を増幅する増幅器１１Ａ
および１１Ｂ、音声信号をデイジタル信号に変換
するＡ／Ｄ変換器１２Ａおよび１２Ｂ、あらかじ
め音声パターンを記憶しておく音声パターンメモ
リ１４Ａおよび１４Ｂ及び入力音声と音声パター
ンとを比較して音声認識をする音声認識制御回路
１３Ａおよび１３Ｂによつて構成されている。 FIG. 1 shows the configuration of an embodiment of a voice information input storage device according to the present invention. In the figure, two voice recognition devices 1A and 1B are voice input microphones 6.
Amplifier 11A that amplifies the audio signals of A and 6B
and 11B, A/D converters 12A and 12B that convert audio signals into digital signals, audio pattern memories 14A and 14B that store audio patterns in advance, and audio that performs audio recognition by comparing input audio and audio patterns. It is composed of recognition control circuits 13A and 13B.

一方、音声出力装置２Ａおよび２Ｂは音声出力
をするための音声を記憶しておく合成音声メモリ
２２Ａおよび２２Ｂ、音声認識結果に応じて合成
音声メモリ２２Ａおよび２２Ｂの記憶内容を選別
して出力する音声出力制御回路２１Ａおよび２１
Ｂ、音声出力制御回路２１Ａおよび２１Ｂの出力
信号をアナログ信号に変換するＤ／Ａ変換器２３
Ａおよび２３Ｂ、アナログ信号を増幅してスピー
カ（またはイヤホン）７Ａおよび７Ｂからアンサ
ーバツクの音声を発声させる増幅器２４Ａおよび
２４Ｂによつて構成されている。 On the other hand, the voice output devices 2A and 2B include synthesized voice memories 22A and 22B that store voices for outputting voice, and voices that select and output the stored contents of the synthesized voice memories 22A and 22B according to the voice recognition result. Output control circuits 21A and 21
B. D/A converter 23 that converts the output signals of the audio output control circuits 21A and 21B into analog signals.
A and 23B, and amplifiers 24A and 24B which amplify the analog signal and make the answer back sound come out from the speakers (or earphones) 7A and 7B.

補助記憶装置８は、複数の音声認識装置１Ａお
よび１Ｂに共通して使用されるもので、夫々の音
声パターンメモリ１４Ａおよび１４Ｂへ格納すべ
き音声パターンを記憶するものである。 The auxiliary storage device 8 is commonly used by the plurality of speech recognition devices 1A and 1B, and stores speech patterns to be stored in the respective speech pattern memories 14A and 14B.

また制御回路３は音声認識装置１Ａおよび１Ｂ
の音声認識制御回路１３Ａおよび１３Ｂを制御し
て音声認識結果を取り込んだり、音声出力装置２
Ａおよび２Ｂの音声出力制御回路２１Ａおよび２
１Ｂの制御をしてガイダンスやアンサーバツク音
をスピーカ７Ａおよび７Ｂから出力させたり、音
声認識装置１Ａおよび１Ｂの音声パターンメモリ
１４Ａおよび１４Ｂの音声バターンを補助記憶装
置８に記憶させたり、逆に補助記憶装置８の音声
パターンを音声認識装置１Ａおよび１Ｂあるいは
１Ａまたは１Ｂの音声パターンメモリ１４Ａおよ
び１４Ｂに移し換えたり、表示器（またはプリン
タ）５に制御状態や音声認識結果などを表示（ま
たはプリントアウト）したりする制御用コンピユ
ータである。この制御回路３は音声の他にキーボ
ード４によつても制御される。 In addition, the control circuit 3 includes voice recognition devices 1A and 1B.
control the voice recognition control circuits 13A and 13B of the voice recognition control circuits 13A and 13B of the
A and 2B audio output control circuits 21A and 2
1B to output guidance and answer back sounds from the speakers 7A and 7B, to store the voice patterns in the voice pattern memories 14A and 14B of the voice recognition devices 1A and 1B in the auxiliary storage device 8, and vice versa. You can transfer the voice patterns in the storage device 8 to the voice recognition devices 1A and 1B or the voice pattern memories 14A and 14B of 1A or 1B, and display (or print out) the control status and voice recognition results on the display (or printer) 5. ) is a control computer. This control circuit 3 is controlled not only by voice but also by a keyboard 4.

次に本発明の一実施例に使用する音声単語の一
例を第２図に示す。 Next, FIG. 2 shows an example of audio words used in an embodiment of the present invention.

音声単語は、話者交代をするための話者交代用
語（話者識別用の音声パターン）と、作業をする
ための作業用語ならびに作業に使用する命令語
（作業用の音声パターン）から成る。 The voice words consist of a speaker change term (speech pattern for speaker identification) for changing the speaker, a work term for performing a task, and a command word used for the task (speech pattern for the task).

まず、音声パターンの登録は、話者がマイク６
Ａまたは６Ｂを使つて音声単語を順次音声で読み
上げることによつて行なわれ、その音声は増幅器
１１Ａまたは１１Ｂ、Ａ／Ｄ変換器１２Ａまたは
１２Ｂ、音声認識制御回路１３Ａまたは１３Ｂを
介して音声パターンメモリ１４Ａまたは１４Ｂに
記憶される。この音声パターンメモリ１４Ａまた
は１４Ｂに記憶された音声パターンは補助記憶装
置８に話者毎に番地付けされて格納される。 First, to register the voice pattern, the speaker
A or 6B is used to sequentially read the audio words aloud, and the audio is sent to the audio pattern memory via the amplifier 11A or 11B, the A/D converter 12A or 12B, and the audio recognition control circuit 13A or 13B. 14A or 14B. The voice patterns stored in the voice pattern memory 14A or 14B are stored in the auxiliary storage device 8 with addresses assigned for each speaker.

音声パターンメモリ１４Ａおよび１４Ｂへの音
声単語の記憶の番地付けは、命令語と作業用語に
ついては話者共通の同一番地とし、話者交代用語
は話者毎に相異した番地とする。そして話者交代
モード（使用開始時や交代命令があつたとき）に
おいては話者全員の話者交代用語の音声パターン
のみを、音声パターンメモリ１４Ａあるいは１４
Ｂに収納しておき、話者交代完了後の作業モード
では、上記交代モードで識別された１人の話者の
命令語と作業用語の音声パターンを音声パターン
メモリ１４Ａまたは１４Ｂに移して音声でデータ
の入力を行う。 Regarding the address allocation for storing voice words in the voice pattern memories 14A and 14B, command words and working words are stored at the same address common to all speakers, and speaker change words are stored at different addresses for each speaker. In the speaker change mode (at the start of use or when a change command is given), only the voice patterns of the speaker change terms of all speakers are stored in the voice pattern memory 14A or 14.
In the work mode after the speaker change is completed, the voice pattern of the command word and work term of one speaker identified in the change mode is transferred to the voice pattern memory 14A or 14B and audibly recorded. Enter data.

次に本発明による音声情報入力の一実施例を第
３図を用いて説明する。 Next, an embodiment of audio information input according to the present invention will be described using FIG. 3.

スピーカ７Ａからの音声ガイダンス「氏名
は？」に対し、Ａ太郎が、マイク６Ａから音声で
「Ａ太郎」と発声すると、音声認識装置１Ａの音
声認識制御回路１３Ａによつて音声パターンメモ
リ１４Ａに記憶されている話者交代用の音声単語
の中から、入力音声と一致する単語「Ａ太郎」を
探し出して、その記憶番地あるいは対応するコー
ドを制御回路３に出力する。 In response to the voice guidance "What's your name?" from the speaker 7A, A-taro utters "A-taro" from the microphone 6A, which is stored in the speech pattern memory 14A by the speech recognition control circuit 13A of the speech recognition device 1A. The word "A-taro" that matches the input speech is searched out from among the speech words for speaker change, and its memory address or corresponding code is output to the control circuit 3.

制御回路３は音声単語コードの入力によりデー
タとして取り込んだり表示器５に表示したりする
他に音声出力制御回路２１Ａにアンサーバツクさ
せるための指令を発する。音声出力制御回路２１
Ａは制御回路３のアンサーバツク指令により合成
音声メモリ２２Ａ内の音声データを出力してＤ／
Ａ変換器２３Ａ、増幅器２４Ａを介してスピーカ
７Ａから「Ａ太郎」と発声させる。ここで、Ａ太
郎がマイク６Ａから「OK」と発声して入力する
と、音声認識装置１Ａの音声認識制御回路１３Ａ
によつて音声パターンメモリ１４Ａの話者交代用
単語の中から、入力音声と一致する単語「OK」
を探し出してその番地あるいはコードを制御回路
３に出力する。制御回路３はこれにより、話者が
Ａ太郎であることを識別し、補助記憶装置８に記
憶していたＡ太郎の作業用の音声パターンを音声
パターンメモリ１４Ａに読出して格納し、Ａ太郎
の作業モードにするとともに、音声出力装置２Ａ
を制御してスピーカ７Ａから「作業は？」と音声
ガイダンスを発する。 The control circuit 3 receives the voice word code as data and displays it on the display 5, as well as issues a command to the voice output control circuit 21A to answer. Audio output control circuit 21
A outputs the voice data in the synthesized voice memory 22A according to the answer back command from the control circuit 3, and outputs the voice data in the synthesized voice memory 22A.
"A-Taro" is uttered from the speaker 7A via the A converter 23A and the amplifier 24A. Here, when Taro A utters and inputs "OK" from the microphone 6A, the voice recognition control circuit 13A of the voice recognition device 1A
The word "OK" that matches the input voice is selected from among the words for speaker change in the voice pattern memory 14A.
and outputs the address or code to the control circuit 3. The control circuit 3 thereby identifies that the speaker is A-Taro, reads out and stores A-Taro's working speech pattern stored in the auxiliary storage device 8 in the speech pattern memory 14A, and stores A-Taro's work speech pattern stored in the auxiliary storage device 8. While switching to work mode, turn on the audio output device 2A.
is controlled to emit voice guidance from the speaker 7A asking, "What's the work?"

Ａ太郎が「入庫」と音声入力すると、音声認識
の結果「品番は？」とスピーカ７Ａからガイダン
スが返つてくるので、例えば「１、２、３」と音
声入力すると正しく認識されれば「１、２、３」
とアンサーバツクが返つてくる。次に「置場
は？」のガイダンスに対し「Ａ」と音声入力する
と音声認識の結果「Ａ」とアンサーバツクが返つ
てくる。 When A-taro inputs ``in stock'' by voice, the speaker 7A returns a guidance saying ``What is the product number?'' as a result of voice recognition.For example, if he inputs ``1, 2, 3'' by voice, if it is recognized correctly, ``1'' will be returned. , 2, 3”
An answer backup is returned. Next, if you input "A" by voice in response to the guidance "Where is the parking lot?", the voice recognition will return "A" as an answer.

以上により、Ａ太郎は、Ａ太郎の音声で自分の
作業用の音声パターンを補助記憶装置８から音声
認識装置１Ａに移した上で、自分の作業用音声パ
ターンのみとの照合による精度の高い、かつ高速
の認識を用いて、「品番123と置場Ａに入庫」とい
うデータを入力したことになる。 As a result of the above, A-Taro transfers his work voice pattern from A-Taro's voice from the auxiliary storage device 8 to the voice recognition device 1A, and then performs a highly accurate speech pattern by comparing it only with his own work voice pattern. In addition, using high-speed recognition, the data ``Product number 123 and warehouse A'' are entered.

Ａ太郎が作業を終了するときは、「交代」とマ
イク６Ａから入力すると作業モードから話者交代
モードに切り換る。すなわち、制御回路３は、補
助記憶装置８内の話者交代用音声パターンを読出
して、音声パターンメモリ１４Ａへ格納する。 When Taro A wants to finish his work, he inputs "change" through the microphone 6A, and the work mode is switched to the speaker change mode. That is, the control circuit 3 reads out the speaker change voice pattern from the auxiliary storage device 8 and stores it in the voice pattern memory 14A.

以上はＡ太郎がマイク６Ａから音声入力した場
合について説明したが、Ａ太郎がマイク６Ｂから
音声入力した場合も全く同様である。スピーカ７
Ｂからの音声ガイダンス「氏名は？」に対して、
Ａ太郎が、マイク６Ｂから音声で「Ａ太郎」と発
声すると、音声認識装置１Ｂの音声認識制御回路
１３Ｂによつて音声パターンメモリ１４Ｂに記憶
されている音声単語の中から入力音声と一致する
単語「Ａ太郎」を探し出してその記憶番地あるい
は対応するコードを制御回路３に出力する。制御
回路３の制御によつて音声出力装置２Ｂの増幅器
２４Ｂを介してスピーカ７Ｂから「Ａ太郎」と発
声させる。ここで、Ａ太郎がマイク６Ｂから
「OK」と発声して入力すると、音声認識装置１
Ｂの音声認識制御回路１３Ｂによつて登録音声メ
モリ１４Ｂの単語の中から入力音声と一致する音
声単語である「OK」を探し出してその番地ある
いはコードを制御回路３に出力す。これにより、
制御回路３は補助記憶装置８に記憶していたＡ太
郎の作業用の音声パターンを音声パターンメモリ
１４Ｂに移し換えて、Ａ太郎の作業モードにする
とともに、音声出力装置２Ｂを制御してスピーカ
７Ｂから「作業は？」と音声ガイダンスを発す
る。以下マイク６Ａからの音声入力時と全く同様
に作用する。 The above description has been made of the case where Taro A inputs voice from the microphone 6A, but the same applies to the case where Taro A inputs voice from the microphone 6B. speaker 7
In response to the voice guidance from B, “What is your name?”
When Taro A utters "A Taro" from the microphone 6B, the speech recognition control circuit 13B of the speech recognition device 1B selects a word that matches the input speech from among the speech words stored in the speech pattern memory 14B. It searches for "A-taro" and outputs its memory address or corresponding code to the control circuit 3. Under the control of the control circuit 3, "A-taro" is uttered from the speaker 7B via the amplifier 24B of the audio output device 2B. Here, when Taro A speaks and inputs "OK" from the microphone 6B, the voice recognition device 1
The voice recognition control circuit 13B of B searches the voice word "OK" that matches the input voice from among the words in the registered voice memory 14B and outputs its address or code to the control circuit 3. This results in
The control circuit 3 transfers the voice pattern for A-Taro's work stored in the auxiliary storage device 8 to the voice pattern memory 14B, sets it to A-Taro's work mode, and controls the voice output device 2B to output the speaker 7B. gives voice guidance asking, ``What's the work?'' Thereafter, the operation is exactly the same as when inputting audio from the microphone 6A.

今度はＢ太郎がマイク６Ａ（または６Ｂ）から
「Ｂ太郎」と音声入力すると音声認識の結果、今
度は音声パターンメモリ１４Ａ（または１４Ｂ）
には補助記憶装置８からＢ太郎の作業用音声パタ
ーンが入り、Ｂ太郎が音声データ入力をすること
ができるようになる。 This time, when B-taro inputs "B-taro" into the microphone 6A (or 6B), as a result of voice recognition, this time the voice pattern memory 14A (or 14B) is recorded.
B-taro's working voice pattern is entered from the auxiliary storage device 8, and B-taro can now input voice data.

以下同様にして、１組の補助記憶装置８に記憶
しておいた話者交代用並びに複数話者毎の作業用
の音声パターンを複数の音声認識装置１Ａおよび
１Ｂに導き出して自由に音声で話者交代およびデ
ータ入力をすることができる。音声パターンの登
録は１組の音声認識装置から行ない補助記憶装置
を介して他の音声認識装置に移し換えても良く、
また各音声認識装置からそれぞれ登録しても良
い。 Thereafter, in the same manner, the voice patterns for changing speakers and for working with multiple speakers stored in one set of auxiliary storage device 8 are derived to the plurality of voice recognition devices 1A and 1B to freely speak aloud. Ability to change personnel and enter data. The voice pattern may be registered from one voice recognition device and transferred to another voice recognition device via an auxiliary storage device.
Alternatively, the information may be registered from each voice recognition device.

ここで補助記憶装置８は集積回路のRAMや
ROMとしても良く、また、バブルカセツト、カ
セツトテープ、フロツピーデイスクなどとしても
良い。但し、新たな話者の音声パターンを自由に
登録するためには、ROM以外の記憶手段を用い
る。 Here, the auxiliary storage device 8 is an integrated circuit RAM or
It may be used as a ROM, or as a bubble cassette, cassette tape, floppy disk, etc. However, in order to freely register the voice pattern of a new speaker, a storage means other than ROM is used.

補助記憶装置８と登録音声メモリ１４Ａまたは
１４Ｂの音声パターンの読出し格納は、音声入力
による他にキーボード４から行なうようにしても
良い。さらに、音声認識結果を表示器５に表示し
て、音声出力装置２Ａおよび２Ｂを省略しても複
数の話者が複数の音声認識装置から交代して音声
情報を入力することができる。 Reading and storing of voice patterns in the auxiliary storage device 8 and the registered voice memory 14A or 14B may be performed from the keyboard 4 in addition to voice input. Furthermore, by displaying the voice recognition results on the display 5, a plurality of speakers can alternately input voice information from a plurality of voice recognition devices even if the voice output devices 2A and 2B are omitted.

第４図は本発明に係る音声情報入力装置の他の
一実施例の構成を示したもので、第１図と同一符
号のものは同一機能を有する。同図において、無
線機移動局３０Ａおよび３０Ｂはマイク６Ａおよ
び６Ｂの入力音声をアンテナ３３Ａおよび３３Ｂ
から電波を発射する送信機３１Ａおよび３１Ｂ、
アンテナ３３Ａおよび３３Ｂから電波を受信して
スピーカ７Ａおよび７Ｂから音声ガイダンスやア
ンサーバツクを発生させる受信機３２Ａおよび３
２Ｂによつて構成されている。無線機固定局２０
Ａおよび２０Ｂは無線機移動局３０Ａおよび３０
Ｂの電波をアンテナ２３Ａおよび２３Ｂを介して
受信して音声入出力装置１０Ａおよび１０Ｂの音
声認識装置１Ａおよび１Ｂに入力する受信機２１
Ａおよび２１Ｂ、音声入出力装置１０Ａおよび１
０Ｂの音声出力装置２Ａおよび２Ｂの出力音声を
アンテナ２３Ａおよび２３Ｂを介して無線機移動
局３０Ａおよび３０Ｂの受信機３２Ａおよび３２
Ｂへ電波を発射する送信機２２Ａおよび２２Ｂか
ら構成されている。音声パターンの登録はマイク
６Ａまたは６Ｂから話者が音声単語を順次音声で
読み上げることによつて行なわれる。マイク６Ａ
または６Ｂから入力された音声は無線機移動局３
０Ａおよび３０Ｂの送信機３１Ａまたは３１Ｂか
らアンテナ３３Ａまたは３３Ｂを介して電波が発
射される。この電波はアンテナ２３Ａまたは２３
Ｂを介して無線機固定局２０Ａまたは２０Ｂの受
信機２１Ａおよび２１Ｂで受信して音声認識装置
１Ａまたは１Ｂの登録音声メモリに登録される。
この登録音声メモリに登録された音声パターンは
補助記憶装置８に話者毎に番地付けされて格納さ
れる。また、補助記憶装置８に格納された音声パ
ターンはキーボード４の操作あるいは音声認識装
置１Ａまたは１Ｂへの音声入力によつて音声認識
装置１Ａまたは１Ｂそれぞれの音声パターンメモ
リに移される。マイク６Ａまたは６Ｂから話者の
音声データが入力されると無線機移動局３０Ａま
たは３０Ｂの送信機３１Ａまたは３１Ｂから電波
をとおして無線機固定局２０Ａまたは２０Ｂの受
信機２１Ａまたは２１Ｂで受信し音声認識装置１
Ａまたは１Ｂに入力される。音声認識結果のアン
サーバツクは音声出力装置２Ａまたは２Ｂから発
せられ送信機２２Ａまたは２２Ｂによつて電波と
なつて発射される。この電波は受信機３２Ａまた
は３２Ｂによつて受信されスピーカ７Ａまたは７
Ｂから発声される。話者はマイク６Ａまたは６Ｂ
から音声でデータを入力するとスピーカ７Ａまた
は７Ｂからアンサーバツクあるいはガイダンスが
発せられるのでこれを開きながら音声でデータを
入力する。 FIG. 4 shows the configuration of another embodiment of the audio information input device according to the present invention, and the same reference numerals as in FIG. 1 have the same functions. In the figure, radio mobile stations 30A and 30B transmit input audio from microphones 6A and 6B to antennas 33A and 33B.
transmitters 31A and 31B that emit radio waves from;
Receivers 32A and 3 receive radio waves from antennas 33A and 33B and generate voice guidance and answer back from speakers 7A and 7B.
2B. Radio fixed station 20
A and 20B are radio mobile stations 30A and 30
Receiver 21 receives the radio waves of B via antennas 23A and 23B and inputs them to voice recognition devices 1A and 1B of voice input/output devices 10A and 10B.
A and 21B, audio input/output devices 10A and 1
The output audio from the audio output devices 2A and 2B of 0B is transmitted to the receivers 32A and 32 of the radio mobile stations 30A and 30B via antennas 23A and 23B.
It is composed of transmitters 22A and 22B that emit radio waves to B. The voice pattern is registered by the speaker reading voice words one after another from the microphone 6A or 6B. Microphone 6A
Or the audio input from 6B is the radio mobile station 3
Radio waves are emitted from transmitters 31A or 31B of 0A and 30B via antennas 33A or 33B. This radio wave is transmitted by antenna 23A or 23
The received signal is received by the receivers 21A and 21B of the radio fixed station 20A or 20B via the radio terminal B, and is registered in the registered voice memory of the voice recognition device 1A or 1B.
The voice patterns registered in the registered voice memory are stored in the auxiliary storage device 8 with addresses assigned for each speaker. Further, the voice pattern stored in the auxiliary storage device 8 is transferred to the voice pattern memory of the voice recognition device 1A or 1B by operating the keyboard 4 or inputting voice to the voice recognition device 1A or 1B. When the speaker's voice data is input from the microphone 6A or 6B, it is received by the receiver 21A or 21B of the radio fixed station 20A or 20B through radio waves from the transmitter 31A or 31B of the radio mobile station 30A or 30B, and the voice is transmitted. Recognition device 1
Input to A or 1B. The answer box as a result of voice recognition is emitted from the voice output device 2A or 2B, and is emitted in the form of radio waves by the transmitter 22A or 22B. This radio wave is received by the receiver 32A or 32B and the speaker 7A or 7
Voiced by B. Speaker is microphone 6A or 6B
When inputting data by voice, an answer call or guidance is emitted from the speaker 7A or 7B, and the user inputs data by voice while opening the speaker.

以上の実施例では、１組の音声認識装置で音声
パターンと登録をすれば他の音声認識装置への音
声パターンの登録は発声することなく補助記憶装
置を利用して行うことができる。 In the embodiments described above, once a voice pattern is registered in one set of voice recognition devices, the voice pattern can be registered in another voice recognition device using the auxiliary storage device without uttering a voice.

以上の実施例では、話者交代用の音声パターン
をも、共通の補助記憶装置８に登録しておき、話
者交代モードでのみ、各音声認識装置１Ａ，１Ｂ
内の音声パターンメモリ１４Ａ，１４Ｂへ格納す
るようにしている。しかし、話者交代用の音声パ
ターンは、常時、各音声パターンメモリ１４Ａ，
１４Ｂが記憶しておくようにすることができる。
この場合、各音声パターンメモリの他の番地に、
作業用の音声パターンのうち、識別された話者に
対応するパターンが選択的に格納されることとな
る。 In the above embodiment, the voice pattern for speaker change is also registered in the common auxiliary storage device 8, and only in the speaker change mode, each voice recognition device 1A, 1B
The data is stored in the voice pattern memories 14A and 14B in the internal memory. However, the voice pattern for speaker change is always stored in each voice pattern memory 14A,
14B may be stored.
In this case, at other addresses in each voice pattern memory,
Among the working voice patterns, patterns corresponding to the identified speaker are selectively stored.

また、話者の識別にも音声認識手段を利用する
ものにつき説明したが、これは話者別のコード
を、キーボードその他のいかなる入力手段によつ
て入力するようにしてもよく、この場合には、制
御回路が簡単に話者を識別できる。 In addition, although we have described a system that uses voice recognition means to identify speakers, it is also possible to enter a code for each speaker using a keyboard or any other input means; in this case, , the control circuit can easily identify the speaker.

〔Effect of the invention〕

本発明によれば、複数の音声認識装置を複数の
話者が自由に使用でき、話者の識別によつて該当
話者の音声パターンを対応する音声認識装置の音
声パターン記憶手段へ格納することにより、認識
精度に優れた音声認識装置を提供することができ
る。 According to the present invention, a plurality of speech recognition devices can be freely used by a plurality of speakers, and by identifying the speaker, the speech pattern of the corresponding speaker can be stored in the speech pattern storage means of the corresponding speech recognition device. Accordingly, it is possible to provide a speech recognition device with excellent recognition accuracy.

[Brief explanation of drawings]

第１図は本発明の一実施例を示す音声情報入力
装置の構成を示すシステム構成図、第２図は第１
図に示した音声情報入力装置に使用する音声単語
の１例と、その記憶内容を示す図、第３図は話者
交代と作業の１例を示す音声情報入力の手順図、
第４図は本発明の他の一実施例を示す他の音声情
報入力装置の構成を示すシステム構成図である。１Ａ，１Ｂ……音声認識手段、２Ａ，２Ｂ……
音声出力装置、３……制御手段、４……キーボー
ド、５……表示器またはプリンタ、６Ａ，６Ｂ…
…マイク、７Ａ，７Ｂ……スピーカ、８……補助
記憶手段、１０Ａ，１０Ｂ……音声入出力装置、
２０Ａ，２０Ｂ……無線機固定局、３０Ａ，３０
Ｂ……無線機移動局、１４Ａ，１４Ｂ……音声パ
ターンメモリ。 FIG. 1 is a system configuration diagram showing the configuration of a voice information input device showing an embodiment of the present invention, and FIG.
FIG. 3 is a diagram showing an example of voice words used in the voice information input device shown in FIG.
FIG. 4 is a system configuration diagram showing the configuration of another voice information input device showing another embodiment of the present invention. 1A, 1B...Voice recognition means, 2A, 2B...
Audio output device, 3... Control means, 4... Keyboard, 5... Display device or printer, 6A, 6B...
...Microphone, 7A, 7B...Speaker, 8...Auxiliary storage means, 10A, 10B...Audio input/output device,
20A, 20B... Radio fixed station, 30A, 30
B... Radio mobile station, 14A, 14B... Voice pattern memory.

Claims

[Claims] 1. A microphone, a voice pattern storage means for storing a voice pattern, and a voice recognition system that recognizes voice by comparing the voice pattern stored in the voice pattern storage means with the voice input from the microphone. and an auxiliary storage means for storing the speech pattern for each speaker, wherein a plurality of the speech recognition sections are provided, and the auxiliary storage means is connected to the plurality of speech recognition sections. The voice pattern stored in this auxiliary storage means is used as the voice pattern for speaker identification and the voice pattern for work, and the voice pattern for speaker identification stored in this auxiliary storage means is used as the voice pattern. means for storing in a storage means; speaker identification means for identifying a speaker by comparing the stored voice pattern for speaker identification with the voice input from the microphone; and a speaker identified by the identification. and means for writing a working voice pattern from the auxiliary storage means into the voice pattern storage means. 2. The speech recognition device according to claim 1, wherein the auxiliary storage means is a floppy disk.