JPH0228878B2

JPH0228878B2 -

Info

Publication number: JPH0228878B2
Application number: JP56093028A
Authority: JP
Inventors: Gichu Oota
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1981-06-18
Filing date: 1981-06-18
Publication date: 1990-06-26
Also published as: JPS57208596A

Description

【発明の詳細な説明】本発明は音声認識回路に係り、特にテレビジヨ
ン受信機等の音声によるリモコン装置等に用いて
好適な音声認識回路に関するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a voice recognition circuit, and more particularly to a voice recognition circuit suitable for use in voice remote control devices such as television receivers.

音声認識回路は、人間が音声で動作を制御する
装置に使用される。ここで音声認識回路は、入力
音声を切り出し、その入力音声の特徴を抽出し、
この特徴パラメータと標準音声特徴パラメータと
を比較して、入力音声が装置の動作の何を制御す
るための命令であつたかを識別する。しかし、テ
レビジヨン受像機などにおいて、音声で命令語を
伝えて受信チヤンネルを切換える場合、テレビジ
ヨン受像機自身の再生音声が外部から与える命令
音声（例えば受信チヤンネルを２チヤンネルから
４チヤンネルに切換える場合の「チヤンネル４」
という命令音声）に妨害を与え、正確な命令語の
入力は困難である。 Speech recognition circuits are used in devices whose operations are controlled by humans using voice. Here, the speech recognition circuit cuts out the input speech, extracts the characteristics of the input speech,
This characteristic parameter is compared with the standard voice characteristic parameter to identify which of the operations of the device the input voice is a command for controlling. However, when switching the receiving channel by conveying a command word in voice on a television receiver, etc., the sound reproduced by the television receiver itself is used as the command voice given from the outside (for example, when switching the receiving channel from channel 2 to channel 4). "Channel 4"
This makes it difficult to input commands accurately.

そこで従来、特願昭54―91913号に示されるよ
うに、命令音声入力部に有指向性と無指向性の２
つのマイクロホンをもうけ、それぞれのマイクロ
ホンを互いに逆位相に接続し、指向性範囲外から
の音声信号すなわちテレビジヨン音声に対して、
指向性範囲内の命令音声のSN比を高めておき、
重要な「チヤンネル」、「デンゲン」、「ボリユー
ム」等の指令語の識別のための許容値を大きくと
り、これらが識別された場合、一時的にスピーカ
出力を断つかもしくは大巾に減衰せしめる方式が
提案されている。この方式では２つの指向性の異
なるマイクロホンを必要とし、また重要指令語の
識別許容値を大きくとつているためスピーカから
流れる音声中の指令類似語による誤動作によるミ
ユーテイング動作が起り易いなどの問題があつ
た。 Therefore, conventionally, as shown in Japanese Patent Application No. 54-91913, the command voice input section has two types: directional and non-directional.
The microphones are connected in opposite phases to each other, and the microphones are connected in opposite phases to each other.
Increase the SN ratio of the command voice within the directional range,
A method that sets a large tolerance value for identifying important command words such as "channel", "dengen", and "volume", and when these are identified, the speaker output is temporarily cut off or greatly attenuated. is proposed. This method requires two microphones with different directivity, and because it has a large tolerance for identifying important command words, there are problems such as erroneous muting due to command-like words in the voice coming from the speaker. Ta.

また特願昭54―67142号に示されるように、ス
ピーカからの音声と命令音声を集音するマイクロ
ホンからの信号と、スピーカを駆動する信号をレ
ベル調整器，位相調整器に通し調整した信号とを
信号加算回路に逆位相で印加し、スピーカ音声を
打ち消し、命令音声のみを認識回路に取り込む方
式が提案されている。しかしこの方式ではレベル
調整，位相調整をすべての周波数帯域で完全に行
うのは困難であり、スピーカ音声を完全に打ち消
すことはできず、命令音声認識の誤動作につなが
る。 In addition, as shown in Japanese Patent Application No. 54-67142, the signal from the microphone that collects the voice from the speaker and the command voice, and the signal that drives the speaker are adjusted by passing them through a level adjuster and a phase adjuster. A method has been proposed in which the command signal is applied to a signal addition circuit in opposite phase to cancel out the speaker sound, and only the command sound is taken into the recognition circuit. However, with this method, it is difficult to completely adjust the level and phase in all frequency bands, and the speaker sound cannot be completely canceled, leading to malfunctions in command voice recognition.

本発明の目的は、上記した従来技術の欠点をな
くし、テレビジヨン受像機など自身が放声手段を
有する装置に用いても、音声による命令入力を誤
動作なしに可能にする低価格の音声認識回路を提
供することにある。 An object of the present invention is to eliminate the above-mentioned drawbacks of the prior art, and to provide a low-cost voice recognition circuit that enables voice command input without malfunction even when used in devices such as television receivers that have their own voice emitting means. It is about providing.

本発明は、テレビジヨン受像機など自身が放声
手段を有する装置に対する音声による命令入力を
可能とするために、装置に付けられたマイクロホ
ンの集音信号から命令音声の出だし数＋msの音
声の存在をもつて、放声手段からの音声以外の命
令音声の存在を、それぞれの音声の零交差波数の
相違により検出し、該放声手段にミユーテイング
をかけ、ミユーテイング以降の音声を命令音声と
して音声認識部に取り込み、該命令語を識別する
音声認識回路である。特にスピーカ等の放声手段
に確実にミユーテイングをかける簡略な方法を提
案し、音声認識回路の認識率を向上させるもので
ある。 In order to make it possible to input voice commands to a device such as a television receiver that has its own sound emitting means, the present invention detects the presence of voice for the number of command voices + ms from the sound collection signal of a microphone attached to the device. The presence of a command voice other than the voice from the voice emitting means is detected based on the difference in the zero-crossing wave number of each voice, mutating is applied to the voice emitting means, and the voice after the mutating is input into the voice recognition unit as a command voice. , is a voice recognition circuit that identifies the command word. In particular, we propose a simple method for reliably muting sound emitting means such as speakers, thereby improving the recognition rate of speech recognition circuits.

以下本発明の実施例を図面に基づいて説明す
る。第１図は本発明の一実施例を示すブロツク図
であり、音声命令語によつて受信チヤンネル切
換、音量調整、電源投入などの機能を行なわしめ
るようにしたテレビジヨン受像機の要部を示すブ
ロツク図でもある。第１図において、１はテレビ
ジヨン受像機自体のスピーカ、２はテレビジヨン
音声にミユーテイングをかけるミユーテイング回
路、３は受信局の復調音声信号を増幅するテレビ
ジヨン音声増幅回路、４は音声命令語を集音する
マイクロホン、５はマイクロホン信号を増幅する
マイクロホン増幅回路、６，７は受信局の復調音
声信号およびマイクロホン信号を音圧零の点を中
心に上下にインフイニツトクリツピングを行い、
これを矩形波パルス列に変換する零交差波回路、
８，９は上記零交差波数を計数する計数回路、１
０は上記計数回路の計数値を比較する比較回路、
１１は命令音声を識別する音声認識部、１２はミ
ユーテイング回路へのミユーテイング信号、音声
認識部への命令音声取込み信号、命令音声の識別
結果からテレビジヨン受像機のチヤンネル切換な
どの制御信号を出力する制御回路である。 Embodiments of the present invention will be described below based on the drawings. FIG. 1 is a block diagram showing one embodiment of the present invention, showing the main parts of a television receiver that allows functions such as switching reception channels, adjusting volume, and turning on the power to be performed by voice commands. It is also a block diagram. In FIG. 1, 1 is the speaker of the television receiver itself, 2 is a muting circuit that mutates the television audio, 3 is a television audio amplification circuit that amplifies the demodulated audio signal of the receiving station, and 4 is the audio command word. A microphone for collecting sound, 5 a microphone amplification circuit for amplifying the microphone signal, 6 and 7 perform infinite clipping of the demodulated audio signal of the receiving station and the microphone signal up and down around the point of zero sound pressure;
A zero-crossing wave circuit that converts this into a rectangular wave pulse train,
8 and 9 are counting circuits for counting the above-mentioned zero-crossing wave numbers, 1
0 is a comparison circuit that compares the count values of the above-mentioned counting circuit;
Reference numeral 11 denotes a voice recognition unit that identifies command voices; 12 outputs a muting signal to a muting circuit, a command voice input signal to the voice recognition unit, and a control signal for switching channels of a television receiver, etc. based on the recognition result of the command voice. It is a control circuit.

第１図の動作を現在の受信チヤンネルのチヤン
ネル２をチヤンネル４に音声命令で変更する場合
を例に説明する。今チヤンネル２の復調音声信号
はテレビジヨン音声増幅回路３で増幅され、ミユ
ーテイング回路２でミユーテイングをかけられる
ことなくスピーカ１で再生されている。マイクロ
ホン４はスピーカ１よりの復調音声を周囲音とし
て集音し、電気信号に変換しマイクロホン増幅回
路５に供給している。命令音声がない場合、零交
差波回路６に入力される信号はスピーカを駆動す
る信号であり、他方零交差波回路７に入力される
信号はスピーカ１の信号をマイクロホン４で集音
した信号である。したがつて多少振幅、位相の相
違はあるがほぼ同一の信号となり零交差波回路
６，７の出力である矩形波パルス列は多少の振幅
の相違には影響されずほぼ同一のものとなる。計
数回路８，９はある周期たとえば40ms毎に零交
差波回路６，７からの矩形波パルスの数を計数す
る。この場合これらの計数値は計数周期をある時
間にしてあるため信号位相の多少のずれは計数値
に影響を及ぼさない。以上より音声命令のない場
合には計数回路８，９の計数値は同一のものとな
る。 The operation shown in FIG. 1 will be explained by taking as an example a case where the current reception channel 2 is changed to channel 4 by a voice command. The demodulated audio signal of channel 2 is now amplified by the television audio amplification circuit 3, and is reproduced by the speaker 1 without being muted by the muting circuit 2. The microphone 4 collects the demodulated sound from the speaker 1 as ambient sound, converts it into an electrical signal, and supplies it to the microphone amplifier circuit 5. When there is no command voice, the signal input to the zero-crossing wave circuit 6 is a signal that drives the speaker, and the signal input to the zero-crossing wave circuit 7 is a signal obtained by collecting the signal from the speaker 1 with the microphone 4. be. Therefore, although there are some differences in amplitude and phase, the signals are almost the same, and the rectangular wave pulse trains output from the zero-crossing wave circuits 6 and 7 are not affected by the slight difference in amplitude and are almost the same. Counting circuits 8 and 9 count the number of rectangular wave pulses from zero-crossing wave circuits 6 and 7 at a certain period, for example, every 40 ms. In this case, since these count values have a counting period set to a certain time, a slight shift in the signal phase does not affect the count values. From the above, when there is no voice command, the count values of the counting circuits 8 and 9 are the same.

次に復調音声がスピーカ１で再生されている時
「チヤンネル４」なる命令音声を発声した場合を
考えると、マイクロホン４はスピーカ１よりの復
調音声と命令音声の重畳した音声を周囲音として
集音しマイクロホン増幅器５を介して増幅し、零
交差波回路７に入力する。この時、零交差波回路
６には復調音声のみが入力されている。したがつ
て、各々の零交差波の計数値は大きく異なること
になる。 Next, consider the case where a command voice called "Channel 4" is uttered while the demodulated voice is being played back by speaker 1. Microphone 4 collects the superimposed voice of the demodulated voice from speaker 1 and the command voice as ambient sound. The signal is amplified via the microphone amplifier 5 and input to the zero crossing wave circuit 7. At this time, only demodulated audio is input to the zero crossing wave circuit 6. Therefore, the count values of each zero-crossing wave will differ greatly.

第２図に発明者が実験を行つた結果による零交
差波回路６，７の計数値とそれらの差を示す。実
験は計数周期を40ミリ秒にとり「関東地方の明日
の天気は」というテレビジヨン受像機のアナウン
ス音声発声中に「チヤンネル４」という命令音声
を発声した時のものである。第２図から明らかな
ようにアナウンサ音声に音声命令が重なつた時間
において、零交差波回路６，７の出力の計数値す
なわち計数回路８，９の計数値は大きく異なつて
いることがわかる。 FIG. 2 shows the counts of the zero-crossing wave circuits 6 and 7 and the difference between them based on the results of experiments conducted by the inventor. In the experiment, the counting period was set to 40 milliseconds, and the command voice ``Channel 4'' was uttered while the television receiver was making an announcement voice saying ``What is the weather in the Kanto region tomorrow?''. As is clear from FIG. 2, at the time when the voice command is superimposed on the announcer's voice, the count values of the outputs of the zero crossing wave circuits 6 and 7, that is, the count values of the counting circuits 8 and 9, are significantly different.

第１図に戻り、比較回路１０は計数回路８，９
の計数値を計数周期毎に比較するものであり、こ
の比較結果を周期毎に制御回路１２に送出してい
る。 Returning to FIG. 1, the comparison circuit 10 is the counting circuit 8, 9
The count values of are compared every counting cycle, and the comparison results are sent to the control circuit 12 every cycle.

制御回路１２は比較回路１０の結果を常に監視
しており、計数回路８，９の計数値が大きく異な
つた場合を比較回路１０の結果から知ることがで
き、計数値が大きく異なつてから最初の数＋ms
間でミユーテイング信号をミユーテイング回路２
に送出するとともに、音声認識部１１に命令音声
取り込み信号を送出する。こうすることにより音
声認識部１１は「チヤンネル４」という命令音声
の最初の数＋msの部分を除いた音声を、復調音
声に妨害されることなく取り込むことができる。 The control circuit 12 constantly monitors the results of the comparator circuit 10, and can tell from the results of the comparator circuit 10 when the count values of the counting circuits 8 and 9 are significantly different, and when the count values are significantly different, the first number + ms
The mutating signal is transmitted between the mutating circuit 2 and the mutating circuit 2.
At the same time, a command voice capture signal is sent to the voice recognition unit 11. By doing this, the voice recognition unit 11 can take in the voice excluding the first number + ms of the command voice "channel 4" without being interfered with by the demodulated voice.

音声認識部１１は、たとえば第３図に示すごと
く、音声信号の特徴を抽出する特徴抽出回路１
３、あらかじめ登録され、識別すべき標準音声の
特徴パターンを記憶する標準パターン記憶回路１
４、入力音声から抽出された特徴パターンと標準
パターンとを比較し、その類似度により入力音声
を特定する認識処理回路１５よりなるものであ
る。音声認識部１１に取り込まれた命令音声は特
徴抽出器１３により特徴パターンに変換され、標
準パターンとして標準音声記憶回路１４に登録さ
れている標準音声たとえば「チヤンネル１」，「チ
ヤンネル２」……などと認識処理回路１５で比較
され、その類似度の大小により「チヤンネル４」
であると特定され、「チヤンネル４」を表わすシ
ンボルを制御回路１２に送出する。 The speech recognition unit 11 includes a feature extraction circuit 1 that extracts features of a speech signal, as shown in FIG. 3, for example.
3. Standard pattern storage circuit 1 that is registered in advance and stores characteristic patterns of standard voices to be identified.
4. It consists of a recognition processing circuit 15 that compares the characteristic pattern extracted from the input voice with a standard pattern and identifies the input voice based on the degree of similarity. The command voice captured by the voice recognition unit 11 is converted into a feature pattern by the feature extractor 13, and the standard voice registered in the standard voice storage circuit 14 as a standard pattern, such as "Channel 1", "Channel 2", etc. and is compared by the recognition processing circuit 15, and depending on the degree of similarity, it is selected as "channel 4".
, and sends a symbol representing "channel 4" to the control circuit 12.

第１図に戻り、制御回路１２は音声認識部１１
の出力シンボルを取り込み、これに基づき、チヤ
ンネルをチヤンネル４に切換えるべく、テレビ受
像機チユーナ（図示せず）の局部発振周波数を変
更する制御信号（図示せず）をチユーナに送出す
る。その後、制御回路１２はミユーテイング回路
２にミユーテイング解除信号を送出し、ミユーテ
イングを解除する。以上で命令音声によるチヤン
ネル４の受信動作が完了する。 Returning to FIG. 1, the control circuit 12 includes the voice recognition section 11
Based on this, a control signal (not shown) for changing the local oscillation frequency of the television receiver tuner (not shown) is sent to the tuner in order to switch the channel to channel 4. Thereafter, the control circuit 12 sends a muting release signal to the muting circuit 2 to release the muting. This completes the channel 4 reception operation using the command voice.

音声認識部１１への命令音声取り込み信号は計
数回路８，９の計数値が大きく相違し、ミユーテ
イング信号を送出した時点から数百ミリ秒〜数秒
の任意の固定時間継続するものである。もちろん
その継続時間は使用する命令音声の時間に合わせ
なければならない。 The command voice input signal to the voice recognition unit 11 has a large difference in the count values of the counting circuits 8 and 9, and continues for an arbitrary fixed period of several hundred milliseconds to several seconds from the time when the muting signal is sent. Of course, its duration must match the time of the command voice used.

第４図に第１図の実施例における各部の動作タ
イミングの概略を計数周期をもとに、計数値とと
もに示すので参照されたい。 Please refer to FIG. 4, which shows an outline of the operation timing of each part in the embodiment of FIG. 1, based on the counting period, together with the counted values.

第１図において零交差波回路６，７はたとえば
演算増幅器によるシユミツト回路として簡単に構
成できるものである。そしてこのシユミツト回路
はシユミツトレベルを音圧零のレベルよりも多少
高くとることにより不感帯をもたせ、命令音声、
復調音声よりも振幅の小さな背景雑音には感応し
ないようにさせることにより命令音声の検出をよ
り確実にすることもできる。 In FIG. 1, the zero crossing wave circuits 6 and 7 can be easily constructed as Schmitt circuits using operational amplifiers, for example. This Schmitt circuit creates a dead zone by setting the Schmitt level a little higher than the level of zero sound pressure.
It is also possible to more reliably detect the command voice by making it insensitive to background noise having a smaller amplitude than the demodulated voice.

比較回路１０はデイジタルコンパレータで構成
し、制御回路１２に計数回路８，９の計数値の大
小関係のみを送出する構成でもよく、また減算回
路で構成し、その減算結果を制御回路１２に送出
する構成でもよい。 The comparator circuit 10 may be composed of a digital comparator and may be configured to send only the magnitude relationship between the count values of the counting circuits 8 and 9 to the control circuit 12, or may be composed of a subtraction circuit and the result of the subtraction is sent to the control circuit 12. It may be a configuration.

制御回路１２はランダムロジツクで構成しても
よいが、マイクロプロセツサーを用いても同様な
動作を行わせしめることが可能であることは明ら
かである。この場合、マイクロプロセツサーの演
算処理時間が許せば、計数回路８，９の計数動
作、比較回路１０の比較動作をマイクロプロセツ
サーのソフトウエアで行わせしめることが可能な
ことも明らかである。 Although the control circuit 12 may be constructed of random logic, it is clear that similar operations can be performed using a microprocessor. In this case, it is also clear that if the processing time of the microprocessor is allowed, the counting operations of the counting circuits 8 and 9 and the comparison operation of the comparator circuit 10 can be performed by the software of the microprocessor. .

またマイクロホン増幅器５に可変利得制御機能
をもたせ、零交差波回路６，７への入力振幅レベ
ルをほぼ同一になるようにしてもよい。本発明は
本質的に振幅レベルには依存しないものであるが
極端な振幅の相違は零交差計数値に相違をもたら
すためである。この可変利得制御は特願昭54―
67142号に示されるものに比べすこぶる粗い利得
合せでよい。またマイクロホン４は他の電気音響
変換器でもよいことは明らかである。 Further, the microphone amplifier 5 may be provided with a variable gain control function so that the input amplitude levels to the zero-crossing wave circuits 6 and 7 are substantially the same. This is because although the present invention is essentially independent of amplitude levels, extreme differences in amplitude will result in differences in zero-crossing counts. This variable gain control was applied in a patent application filed in 1973.
A much coarser gain adjustment is required than that shown in No. 67142. It is clear that the microphone 4 can also be another electroacoustic transducer.

第５図に本発明の他の実施例をブロツク図で示
す。第５図において第１図と同一符号は同一物を
示す。基本的な動作は第１図と同様であるため説
明は省く。第１図と第５図の違いは零交差波回路
６の入力をミユーテイング回路２の出力に接続
し、ミユーテイング回路２を通した復調音声を零
交差波回路に導いている点と、計数回路９の出力
を制御回路１２にも接続している点である。第５
図において命令音声取り込み信号は制御回路１２
より計数回路８，９の計数値が大きく相違した時
点（ミユーテイング信号を送出した時点）から計
数回路９の計数値が零あるいはあるしきい値以下
になつた時点まで送出されるものである。これは
ミユーテイングがかけられた時点からマイクロホ
ン４に集音される音声は命令音声のみとなり、零
交差波回路６に入力される信号はなくなり、零交
差波回路７に入力される信号は命令音声のみとな
るために、命令音声の検出は零交差波回路７の出
力の計数値すなわち計数回路９の計数値で判断で
きるためである。つまり制御回路１２は音声認識
部１１の命令音声取込み信号（すなわち命令音声
の切り出し）を自動的に命令音声の時間長に合わ
せて送出することを可能にしている。これによ
り、第１図の実施例におけるように命令音声取込
み信号を固定にした場合にくらべ、命令音声以外
の雑音を認識部１１に取り込む危険性は少なくな
り、認識率の向上が図れる。 FIG. 5 shows a block diagram of another embodiment of the invention. In FIG. 5, the same symbols as in FIG. 1 indicate the same parts. The basic operation is the same as that shown in FIG. 1, so the explanation will be omitted. The difference between FIG. 1 and FIG. 5 is that the input of the zero-crossing wave circuit 6 is connected to the output of the muting circuit 2, and the demodulated audio that has passed through the muting circuit 2 is guided to the zero-crossing wave circuit. The output of the control circuit 12 is also connected to the control circuit 12. Fifth
In the figure, the command voice capture signal is sent to the control circuit 12.
Therefore, the signal is sent from the time when the count values of the counting circuits 8 and 9 become significantly different (the time when the mutating signal is sent out) until the time when the count value of the counting circuit 9 becomes zero or below a certain threshold value. This is because from the time when muting is applied, the only voice collected by the microphone 4 is the command voice, the signal input to the zero-crossing wave circuit 6 disappears, and the signal input to the zero-crossing wave circuit 7 is only the command voice. This is because the command voice detection can be determined based on the count value of the output of the zero-crossing wave circuit 7, that is, the count value of the counting circuit 9. In other words, the control circuit 12 makes it possible to automatically send out the command voice capture signal (ie, cutting out the command voice) of the voice recognition unit 11 in accordance with the time length of the command voice. As a result, compared to the case where the command voice capture signal is fixed as in the embodiment of FIG. 1, there is less risk of noise other than the command voice being introduced into the recognition unit 11, and the recognition rate can be improved.

第６図に本発明の更に他の実施例をブロツク図
で示す。第６図において第５図と同一符号は同一
物を示す。基本的動作は第５図と同様なため説明
を省く。第６図で１６，１７はバンドパスフイル
ターを示す。このバンドパスフイルターは通常音
声帯域帯300Hz〜3000KHzのみを通過させるもの
であり、これにより音声以外の雑音が零交差波回
路６，７で零交差波に変換されるのを防止すると
ともに音声認識部１１には音声信号のみを入力す
ることを図つたものである。１８，１９は差分回
路である。これは信号とその信号がある時間遅延
した信号との差分を信号として出力するものであ
る。 FIG. 6 shows a block diagram of still another embodiment of the present invention. In FIG. 6, the same reference numerals as in FIG. 5 indicate the same parts. The basic operation is the same as that shown in FIG. 5, so the explanation will be omitted. In FIG. 6, 16 and 17 indicate band pass filters. This bandpass filter normally passes only the voice band 300Hz to 3000KHz, and thereby prevents noise other than voice from being converted into zero-crossing waves in the zero-crossing wave circuits 6 and 7, and also prevents noise from being converted into zero-crossing waves by the zero-crossing wave circuits 6 and 7. 11 is designed to input only audio signals. 18 and 19 are differential circuits. This outputs the difference between a signal and a signal delayed by a certain time as a signal.

第７図に零交差波回路への入力波形と出力波形
を示す。第７図で（１）で示す波形は長い周期を
もつ復調音声信号、（２）は短い周期をもつ命令
音声信号である。この場合、第１図，第５図で示
す零交差波回路６への入力は（１）で示す波形で
あり、零交差波回路７への入力は（１）で示す波
形と（２）で示す波形の合成された（３）の波形
である。零交差波回路６，７の出力はそれぞれ
ａ，ｂに示している。計数回路８，９はａ，ｂ波
形を計数するのである。（４），（５）に示す波形
は（１），（３）で示す波形を差分回路に通したも
のであり、ｃ，ｄで示す波形は（４），（５）の波
形を零交差波回路に通した波形である。 FIG. 7 shows the input waveform and output waveform to the zero-crossing wave circuit. In FIG. 7, the waveform indicated by (1) is a demodulated voice signal with a long period, and the waveform (2) is a command voice signal with a short period. In this case, the input to the zero-crossing wave circuit 6 shown in FIGS. 1 and 5 is the waveform shown in (1), and the input to the zero-crossing wave circuit 7 is the waveform shown in (1) and (2). This is the waveform (3) that is a composite of the waveforms shown in FIG. The outputs of the zero crossing wave circuits 6 and 7 are shown in a and b, respectively. Counting circuits 8 and 9 count the a and b waveforms. The waveforms shown in (4) and (5) are the waveforms shown in (1) and (3) passed through a differential circuit, and the waveforms shown in c and d are the waveforms shown in (4) and (5) passed through the zero crossing. This is the waveform passed through the wave circuit.

第６図においては零交差波回路６，７への入力
は（４），（５）の波形となり、計数回路８，９は
ｃ，ｄで示す波形を計数する。図から明らかに
ａ，ｂの示す波形の零交差数の差より、ｃ，ｄで
示す波形の零交差数の差の方が大きくなることが
わかる。これは第１，５図に示す回路よりも第６
図に示す回路の方が命令音声の存在をより顕著に
検出できることを示している。第６図に示す差分
回路１８，１９はその特性から微分回路としても
同様な効果の得られることは明らかである。また
音声認識部１１への入力を差分回路１９の出力か
ら得ることも可能で、この場合何ら音声認識部へ
影響を及ぼすことはない。 In FIG. 6, the inputs to the zero crossing wave circuits 6 and 7 have waveforms (4) and (5), and the counting circuits 8 and 9 count the waveforms shown as c and d. It is clearly seen from the figure that the difference in the number of zero crossings between the waveforms c and d is greater than the difference in the number of zero crossings between the waveforms a and b. This is better than the circuit shown in Figures 1 and 5.
This shows that the circuit shown in the figure can more clearly detect the presence of a command voice. It is clear that the difference circuits 18 and 19 shown in FIG. 6 can obtain the same effect as a differentiation circuit from their characteristics. It is also possible to obtain the input to the speech recognition section 11 from the output of the difference circuit 19, and in this case there is no influence on the speech recognition section.

以上の説明はテレビジヨン受像機の音声による
制御をもとに行なつてきたが、本発明は放声手段
を備えるものであれば被制御機器を選ばない。 Although the above explanation has been made based on voice control of a television receiver, the present invention is applicable to any device to be controlled as long as it is equipped with voice emitting means.

本発明によれば、命令音声の検出をある時間間
隔内の零交差数の相違により行うために振幅、位
相のずれに影響されることなく低価格な回路構成
で確実に検出することが可能となる。またこの検
出信号により機器の放声手段にミユーテイングを
かけることにより命令音声を正確に音声認識部に
入力できるため、誤認識を防ぐことができる。ま
たこの検出信号を命令音声の切り出し信号に使用
することも可能である。以上より、本発明は放声
手段を有する機器の音声認識回路としてきわめて
有効である。 According to the present invention, since the command voice is detected based on the difference in the number of zero crossings within a certain time interval, it is possible to reliably detect the command voice with a low-cost circuit configuration without being affected by amplitude and phase shifts. Become. Furthermore, by muting the voice emitting means of the device based on this detection signal, the command voice can be accurately input to the voice recognition section, thereby preventing erroneous recognition. It is also possible to use this detection signal as a command voice cutout signal. As described above, the present invention is extremely effective as a voice recognition circuit for equipment having a voice emitting means.

[Brief explanation of drawings]

第１図は本発明の一実施例を示すブロツク図、
第２図は本発明において用いる零交差波計数値の
実験結果を示す説明図、第３図は音声認識部の詳
細ブロツク図、第４図は第１図の実施例における
各部の動作タイミングの概略を示すタイミング
図、第５図は本発明の他の実施例を示すブロツク
図、第６図は本発明の更に別の実施例を示すブロ
ツク図、第７図は零交差波回路への入力波形と出
力波形を示す波形図、である。符号説明、１…スピーカ、２…ミユーテイング
回路、４…マイクロホン、６，７…零交差波回
路、８，９…計数回路、１０…比較回路、１１…
音声認識部、１２…制御回路、１６，１７…バン
ドパスフイルタ、１８，１９…差分回路。 FIG. 1 is a block diagram showing one embodiment of the present invention;
Fig. 2 is an explanatory diagram showing the experimental results of the zero-crossing wave count used in the present invention, Fig. 3 is a detailed block diagram of the speech recognition section, and Fig. 4 is an outline of the operation timing of each part in the embodiment of Fig. 1. FIG. 5 is a block diagram showing another embodiment of the present invention, FIG. 6 is a block diagram showing yet another embodiment of the present invention, and FIG. 7 is an input waveform to the zero-crossing wave circuit. and a waveform diagram showing output waveforms. Explanation of symbols, 1... Speaker, 2... Muting circuit, 4... Microphone, 6, 7... Zero crossing wave circuit, 8, 9... Counting circuit, 10... Comparing circuit, 11...
Speech recognition unit, 12...control circuit, 16, 17...bandpass filter, 18, 19...difference circuit.

Claims

[Claims] 1. A voice that can identify and recognize the second voice when a first voice generated from a specific sound emitting device and a second voice generated from another source are input. The recognition circuit includes an electro-acoustic transducer that collects ambient sounds including the first and second sounds, and an electric signal waveform from the transducer is inputted, and the waveform is inputted upward and downward around a point of zero sound pressure. A first zero-crossing wave circuit converts the pulse train into a rectangular wave pulse train obtained by infinite clipping and outputs the pulse train. A second pulse train that converts into a rectangular wave pulse train obtained by performing infinite clipping up and down around a point and outputs it.
a zero-crossing circuit; a first counting circuit that counts the rectangular wave pulse train from the first zero-crossing circuit;
It comprises a second counting circuit that counts the rectangular wave pulse train from the second zero-crossing circuit, and a comparison circuit that compares each counting result in both of the counting circuits, and the comparison results are different. The first sound generated from the voice emitting device is reduced based on the above, and the converted output signal of the second sound from the electroacoustic transducer after that time is taken into a speech recognition unit for speech recognition. A speech recognition circuit characterized by: 2. The speech recognition circuit according to claim 1, wherein a bandpass filter is connected in series to each of the first and second zero-crossing circuits. 3. The speech recognition circuit according to claim 1 or 2, characterized in that a difference circuit or a differential circuit is connected in series to the first and second zero-crossing circuits, respectively. circuit.