JPH0944183A

JPH0944183A - Level display device, voice recognition device and navigation device

Info

Publication number: JPH0944183A
Application number: JP7190723A
Authority: JP
Inventors: Eiji Yamamoto; 英二山本
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1995-07-26
Filing date: 1995-07-26
Publication date: 1997-02-14

Abstract

(57) [Abstract]

[Problem] To provide a level display device, a voice recognition device, and a navigation device that display whether the speaking volume is adequate for the voice to be distinguished from noise.

[Solution] The voice recognition device 9 has a microphone 1 that converts voice and ambient noise into an electric signal, an amplifier 2, an A/D converter 3, a talk switch 8 that switches between voice input and noise input by being turned on or off, a memory 5 that stores the noise level, a microprocessor 4 that stores the noise level in the memory 5 when the talk switch 8 is off and, when the talk switch 8 is on, compares the noise level with the voice level to detect whether the voice level is adequate, an output port 6, and a level indicator 7 that displays the voice level with an LED.

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a level display device, a voice recognition device, and a navigation device suitable for use in, for example, a navigation device that displays a road map or the like in response to voice input.

[0002]

[Prior Art] Various navigation devices for installation in automobiles and other vehicles have been developed. Such a navigation device comprises, for example, a large-capacity data storage means such as a CD-ROM storing road map data, a means for detecting the current position, and a display device that displays a road map of the vicinity of the detected current position based on data read from the data storage means.

[0003] The current position detection means may use a positioning system based on positioning satellites known as GPS (Global Positioning System), or may use autonomous navigation (dead reckoning), which tracks the change in position from the departure point based on information such as the vehicle's direction of travel and speed.

[0004] By key operation or the like, the display device can show a map not only of the vicinity of the current position but of any desired location, as long as map data for it exists.

[0005] In an automobile, for example, the display device of such a navigation device is generally installed near the driver's seat so that the driver can view a map of the vicinity of the current position while driving or while temporarily stopped, such as at a traffic signal.

[0006]

[Problems to Be Solved by the Invention] Such a navigation device must be operable without interfering with driving, so complicated operations are prohibited while the vehicle is moving. That is, when such a navigation device is installed in a vehicle, it is connected to some kind of traveling-state detection unit (for example, the parking brake switch of the automobile). All operations are enabled only when the state of this detection unit indicates that the vehicle is stopped, and complicated key operations are prohibited when the vehicle is not stopped (that is, while it is moving).

[0007] However, being unable to perform operations such as switching the displayed map while driving is inconvenient, and there has been a demand for advanced operations to be possible even while driving, without interfering with driving.

[0008] The present applicant therefore previously filed an application (Japanese Patent Application No. 7-110987) relating to a voice recognition device that performs voice processing to recognize only specific speech in a voice signal input from a microphone or the like and converts the recognized data into absolute coordinate position data for output, and to a navigation device that performs the same voice processing and displays a map of the vicinity of the absolute coordinate position corresponding to the recognized data.

[0009] In such a voice recognition device and navigation device, the microphone for voice input is mounted where the target voice can be picked up most efficiently. In a typical usage environment, however, noise around the microphone is picked up together with the desired voice.

[0010] Consequently, for the voice to be distinguished from the noise during voice recognition, the speaker must speak at or above a certain volume relative to the noise around the microphone, and there has been a demand for improvement on this point.

[0011] The present invention has been made in view of this point, and its object is to provide a level display device, a voice recognition device, and a navigation device that display whether the speaking volume is adequate for the voice to be distinguished from noise.

[0012]

[Means for Solving the Problems] The level display device of the present invention comprises control means that compares an input noise level with an input voice signal level and detects whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value, and performs a predetermined display when that ratio is detected to be equal to or greater than the predetermined value.

[0013] The voice recognition device of the present invention comprises: voice signal input means; storage means that stores the noise level input by the voice signal input means; control means that compares the noise level stored in the storage means with the voice signal level input by the voice signal input means and detects whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value; and display means that performs a predetermined display when the control means detects that the ratio is equal to or greater than the predetermined value.

[0014] The navigation device of the present invention comprises: a voice recognition unit having voice signal input means, storage means that stores the noise level input by the voice signal input means, control means that compares the noise level stored in the storage means with the voice signal level input by the voice signal input means and detects whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value, and display means that performs a predetermined display when the control means detects that the ratio is equal to or greater than the predetermined value; a conversion unit that converts the data of the specific speech recognized by the voice recognition unit into coordinate position data; map data storage means; and map data reading means that reads, from the map data storage means, the map data for the position indicated by the coordinate position data converted by the conversion unit and creates a video signal for map display.

[0015] The level display device of the present invention operates as follows. To secure a voice recognition rate at or above a certain level, the device operates so that an S/N at or above a certain level is secured. The control means compares the noise level with the voice level and determines whether the voice level, relative to the noise level at that time, gives an S/N at or above the level required for recognition. When the control means determines that the S/N is at or above that level, the predetermined display is performed. The speaker speaks while watching this display and adjusts the volume of his or her voice accordingly. In this way, the S/N that the level display device requires can be made explicit to the speaker.
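The check described above (is the ratio of voice level to noise level at or above a predetermined value?) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function names, the use of decibels, the assumption that levels are amplitudes, and the 12 dB default threshold are all hypothetical, since the specification gives no numeric values.

```python
import math


def snr_db(voice_level: float, noise_level: float) -> float:
    """Voice-to-noise ratio in dB, assuming both levels are positive amplitudes."""
    if voice_level <= 0 or noise_level <= 0:
        raise ValueError("levels must be positive")
    return 20.0 * math.log10(voice_level / noise_level)


def indicator_on(voice_level: float, noise_level: float,
                 min_snr_db: float = 12.0) -> bool:
    """True when the S/N reaches the (hypothetical) threshold, i.e. light the display."""
    return snr_db(voice_level, noise_level) >= min_snr_db
```

A plain ratio comparison would serve equally well; decibels are used here only because S/N is conventionally quoted that way.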

[0016] The voice recognition device of the present invention operates as follows. To secure a voice recognition rate at or above a certain level, the device operates so that an S/N at or above a certain level is secured. When the control means determines from the input instruction means that the current period is not a voice input period, the voice signal input means converts the ambient noise into an electric signal. This electric signal is amplified to a level suitable for signal processing, converted into a digital signal, and supplied to the control means. The control means supplies the digital signal representing the noise level to the storage means, which stores it. The noise level of the input is thus captured and written.

[0017] To avoid capturing a sudden loud noise that differs from the noise of the usage environment as the noise level, the control means samples the noise level and performs statistical processing that takes the average value. The control means also captures the latest value at fixed intervals and rewrites the stored noise level. When the control means determines from the input instruction means that the current period is a voice input period, the voice signal input means converts the voice uttered by the speaker into an electric signal. The converted electric signal is amplified to a level suitable for signal processing, converted into a digital signal, and supplied to the control means. The voice level of the input is thus captured.

[0018] Next, the control means reads the noise level stored in the storage means and compares it with the voice level. The control means then determines whether the voice level, relative to the noise level at that time, gives an S/N at or above the level required for recognition. When it determines that the S/N is at or above that level, it causes the display means to indicate this. The comparison between the voice level and the noise level is performed successively in a predetermined cycle, and the display of the comparison result is repeated at short intervals. The speaker speaks while watching this display and adjusts the volume of his or her voice. In this way, the S/N that the voice recognition device requires can be made explicit to the speaker.

[0019] The navigation device of the present invention operates as follows. In the voice recognition unit, when the control means determines that the S/N is at or above a certain level, it causes the display means to indicate this. When the display means has performed the display and the S/N required by the voice recognition unit has been obtained, the control means supplies the voice data to the conversion unit. The conversion unit converts the data of the specific speech recognized by the voice recognition unit into coordinate position data. The map data reading means reads, from the map data storage means, the map data for the position indicated by the coordinate position data converted by the conversion unit and creates a video signal for map display. The map display video signal needed for navigation is thus obtained.

[0020]

[Embodiments of the Invention] The present embodiment is described below, beginning with its configuration. FIG. 1 is a block diagram showing the configuration of the voice recognition device of this embodiment. The voice recognition device 9 has: a microphone 1 that converts voice and ambient noise into an electric signal; an amplifier 2 that amplifies the electric signal with a predetermined gain; an A/D converter 3 that converts the amplified electric signal into a digital signal; a talk switch 8 that switches between voice input and noise input by being turned on or off; a memory 5 that stores the noise level; a microprocessor 4 that stores the noise level in the memory 5 when the talk switch 8 is off and, when the talk switch 8 is on, compares the noise level with the voice level to detect whether the voice level is adequate; an output port 6 that outputs the voice level detected by the microprocessor 4; and a level indicator 7 that displays, with an LED, the voice level output to the output port 6. Here, the microphone 1 constitutes the voice signal input means, the memory 5 the storage means, the microprocessor 4 the control means, and the level indicator 7 the display means.

[0021] The operation of the voice recognition device configured in this way is described below. In an environment ideal for voice recognition, there is no external noise: only the voice to be recognized is picked up by the microphone 1, amplified by the amplifier 2, converted into a digital signal by the A/D converter 3, and supplied to the microprocessor 4. In a typical usage environment, however, ambient noise is present in most cases, and the voice signal reaches the microprocessor 4 with unwanted noise superimposed on it. The level ratio of the voice signal to the noise (S/N) directly affects the recognition rate of the voice recognition device. That is, to secure a recognition rate at or above a certain level, an S/N at or above a certain level must be secured. The operation that achieves this is described below.

[0022] FIG. 2 is a flowchart showing the operation of the voice recognition device of this embodiment. After the start in FIG. 2, the microprocessor 4 determines in step S1 whether the talk switch 8 is on. When the talk switch 8 is off, it supplies the microprocessor 4 with a signal indicating that the current period is not a voice input period. When the microprocessor 4 determines in step S1 that the talk switch 8 is off, the microphone 1 converts the ambient noise into an electric signal, which is supplied to the amplifier 2. The amplifier 2 amplifies the electric signal to a level suitable for signal processing and supplies it to the A/D converter 3. The A/D converter 3 converts the supplied analog electric signal into a digital signal, which is supplied to the microprocessor 4. In step S2, the microprocessor 4 supplies the digital signal representing the noise level to the memory 5, which stores it. The noise level of the microphone input is thus captured and written. The process then returns to step S1.

[0023] To avoid capturing a sudden loud noise that differs from the noise of the usage environment as the noise level, the microprocessor 4 samples the noise level and performs statistical processing that takes the average of the samples excluding the maximum and minimum values. The microprocessor 4 also captures the latest value at fixed intervals and rewrites the stored noise level.
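The statistical processing described above, averaging the noise samples after discarding the largest and smallest, can be sketched as follows. The function name and the requirement of at least three samples are assumptions made for illustration; the specification does not state how many samples are taken.

```python
from typing import Sequence


def estimate_noise_level(samples: Sequence[float]) -> float:
    """Average the samples after dropping one maximum and one minimum value."""
    if len(samples) < 3:
        raise ValueError("need at least three samples to trim both extremes")
    ordered = sorted(samples)
    trimmed = ordered[1:-1]  # discard the single smallest and single largest sample
    return sum(trimmed) / len(trimmed)
```

For example, with samples 10, 12, 11, 90, and 9, the burst of 90 and the low of 9 are discarded and the estimate is 11.0, so a single sudden noise does not inflate the stored level.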

[0024] When the talk switch 8 is on, it supplies the microprocessor 4 with a signal indicating that the current period is a voice input period. When the microprocessor 4 determines in step S1 that the talk switch 8 is on, the microphone 1 converts the voice uttered by the speaker into an electric signal, which is supplied to the amplifier 2. The amplifier 2 amplifies the electric signal to a level suitable for signal processing and supplies it to the A/D converter 3. The A/D converter 3 converts the supplied analog electric signal into a digital signal, which is supplied to the microprocessor 4. In step S3, the microprocessor 4 temporarily stores the digital signal representing the voice level in its internal RAM. The voice level of the microphone input is thus captured.

[0025] Next, in step S4, the microprocessor 4 reads the noise level stored in the memory 5 and the voice level temporarily stored in its internal RAM and compares the two. The microprocessor 4 then determines, also in step S4, whether the voice level, relative to the noise level at that time, gives an S/N at or above the level required for recognition. When the microprocessor 4 determines in step S4 that the S/N is not at or above that level, the process returns to step S1. When it determines in step S4 that the S/N is at or above that level, the microprocessor 4 causes the level indicator 7 to indicate this in step S6, and the process returns to step S1. The level indicator 7 indicates this by, for example, lighting an LED.
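One pass through the loop of FIG. 2 as described above can be sketched as a small state update. This is only an illustration under stated assumptions: the class and function names, the plain ratio threshold of 4.0, and turning the indicator off whenever the talk switch is off are hypothetical choices not specified in the text.

```python
from dataclasses import dataclass


@dataclass
class RecognizerState:
    noise_level: float = 0.0  # plays the role of memory 5
    led_on: bool = False      # plays the role of level indicator 7


def step(state: RecognizerState, talk_switch_on: bool, input_level: float,
         min_ratio: float = 4.0) -> RecognizerState:
    """Run one pass: S1 -> S2 when the switch is off, S1 -> S3 -> S4 (-> S6) when on."""
    if not talk_switch_on:
        # S1 (off) -> S2: treat the input as noise and store its level.
        state.noise_level = input_level
        state.led_on = False  # assumption: indicator stays dark outside voice input
        return state
    # S3: hold the voice level for this pass.
    voice_level = input_level
    # S4: is the voice level at least min_ratio times the stored noise level?
    # S6: if so, light the indicator; otherwise leave it dark and loop back to S1.
    state.led_on = voice_level >= min_ratio * state.noise_level
    return state
```

Calling `step` repeatedly with the current switch state and input level mimics the flowchart's return to step S1 after every branch.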

[0026] The comparison between the voice level of the microphone input and the noise level is performed successively in a predetermined cycle, and the lighting of the level indicator 7 that results from the comparison is repeated at short intervals. The speaker speaks while watching the level indicator 7 and adjusts the volume of his or her voice so that the indicator keeps lighting at short intervals. In this way, the S/N that the voice recognition device requires can be made explicit to the speaker immediately.

[0027] The above example describes a voice recognition device having the microphone 1, the amplifier 2, the A/D converter 3, the talk switch 8, the memory 5, the microprocessor 4, the output port 6, and the level indicator 7. Alternatively, the microphone 1, the amplifier 2, the A/D converter 3, the talk switch 8, the memory 5, the microprocessor 4, and the output port 6 may be provided integrally with the level indicator 7. As a further alternative, for example, a talk switch 8 corresponding to the record key of a recording device, the memory 5, and a microprocessor 4 supplied with a digital voice signal or a digital noise signal may be provided integrally with the level indicator 7. In this case too, the talk switch 8 indicates to the microprocessor 4 that the current period is not a voice input period when it is off and that it is a voice input period when it is on. In addition, the noise level is likewise rewritten with the latest value captured at fixed intervals.

[0028] Next, a navigation device using such a voice recognition device is described. FIG. 3 is a block diagram showing the configuration of a navigation device according to another embodiment. The configuration is described first; description of the parts shared with the voice recognition device 9 shown in FIG. 1 is omitted. The navigation device of this example has the voice recognition device 9, a navigation device 20, and a display device 40. A voice recognition data storage ROM 12 and a latitude/longitude conversion circuit 10 are connected to the microprocessor 4 in the voice recognition device 9, and a latitude/longitude conversion data storage ROM 11 is connected to the latitude/longitude conversion circuit 10. The voice recognition device 9 constitutes the voice recognition unit. The navigation device 20 has a GPS antenna 21, a current position detection circuit 22, an arithmetic circuit 23, a CD-ROM driver 24, a RAM 25, a vehicle speed sensor 26, operation keys 27, a video signal generation circuit 28, a voice synthesis circuit 31, and a speaker 32. The latitude/longitude conversion circuit 10 constitutes the conversion unit, the CD-ROM the map data storage means, and the CD-ROM driver 24 the map data reading means.

[0029] The operation of the navigation device configured in this way is described below; description of the parts shared with the voice recognition device 9 shown in FIG. 1 is omitted. When the level indicator 7 lights repeatedly at short intervals and the S/N required by the voice recognition device has been obtained, the microprocessor 4 performs voice recognition processing. That is, the microprocessor 4 compares the voice recognition data stored in the voice recognition data storage ROM 12 with the voice vector data of the microphone input according to a predetermined voice recognition algorithm (for example, an HMM: hidden Markov model) and, when it detects a match under predetermined conditions, reads out the character data stored in correspondence with the voice vector data.

[0030] The voice recognition data storage ROM 12 stores data such that only place names and words instructing operations of the navigation device are recognized. When the character code corresponding to the matched voice vector data is the character code of a place name, the microprocessor 4 reads this character code from the voice recognition data storage ROM 12 and supplies it to the latitude/longitude conversion circuit 10. The latitude/longitude conversion circuit 10 reads the latitude/longitude data corresponding to this character code, together with its accompanying data, from the latitude/longitude conversion data storage ROM 11. In the latitude/longitude conversion data storage ROM 11, a storage area is set for each character code identical to a place-name character code stored in the voice recognition data storage ROM 12; for each character code, the area stores the latitude and longitude of the place indicated by those characters and, as accompanying data, display scale data.
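The conversion described above, from a recognized place-name character code to latitude, longitude, and display scale, amounts to a keyed table lookup. The following sketch shows one way to model it; the table contents, key strings, scale values, and all names are hypothetical placeholders, and the coordinates are rough illustrative figures rather than data from the patent.

```python
from typing import NamedTuple, Optional


class MapTarget(NamedTuple):
    latitude: float      # degrees north
    longitude: float     # degrees east
    display_scale: int   # accompanying data: map scale denominator


# Stand-in for the latitude/longitude conversion data storage ROM 11 (illustrative values).
CONVERSION_TABLE = {
    "TOKYO": MapTarget(35.68, 139.77, 50_000),
    "OSAKA": MapTarget(34.69, 135.50, 50_000),
}


def to_coordinates(place_code: str) -> Optional[MapTarget]:
    """Return the entry for a recognized place-name code, or None if it is not stored."""
    return CONVERSION_TABLE.get(place_code)
```

Because the same character codes key both the recognition data and the conversion data, a recognized place name always has a matching entry, while operation commands simply bypass this lookup.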

[0031] The latitude/longitude data and its accompanying data read from the latitude/longitude conversion data storage ROM 11 are supplied to the arithmetic circuit 23 of the navigation device 20. The character code data for which the microprocessor 4 detected a match is also supplied to the arithmetic circuit 23. In the navigation device 20, the current position detection circuit 22 performs reception processing on the positioning signals from GPS satellites received by the antenna 21 and analyzes the received data to detect the current position. The detected current position data is latitude and longitude data, that is, the absolute position at that time.

[0032] The detected latitude and longitude data is supplied to the arithmetic circuit 23, which functions as the system controller that controls the operation of the navigation device 20. When latitude and longitude coordinate data is obtained, the arithmetic circuit 23 controls the CD-ROM driver 24 to read the road map data for the vicinity of that coordinate position from the CD-ROM (optical disc) storing the road map data. The arithmetic circuit 23 temporarily stores the road map data read by the CD-ROM driver 24 in the RAM 25 and uses the stored data to create display data for displaying a road map. The display data is created so that the map is displayed at the display scale set by operation of the operation keys 27 or the like.

[0033] The display data created by the arithmetic circuit 23 is supplied to the video signal generation circuit 28, which generates a video signal of a predetermined format based on the display data. The video signal is supplied to the display device 40. The display device 40 performs image reception processing based on the video signal and displays a road map or the like on its display panel. In addition to the road map near the current position, a road map of a position designated by, for example, operating the operation keys 27 is also displayed under the control of the arithmetic circuit 23. When the vehicle speed sensor 26 detects that the vehicle is traveling, the arithmetic circuit 23 does not accept operations of the operation keys 27 other than relatively simple ones. When the arithmetic circuit 23 needs to give some instruction by voice, it causes the voice synthesis circuit 31 to synthesize the instruction speech and outputs it from the speaker 32. In the above example, the voice recognition device 9 indicates to the microprocessor 4 that it is not a voice input section when the talk switch 8 is off, and that it is a voice input section when the talk switch 8 is on. Furthermore, the noise level is rewritten with the latest value at fixed intervals. In this way, even when the noise level of engine sound and the like changes with the speed of the automobile, the latest noise level is acquired at fixed intervals and compared with the voice level, so that the device can indicate when an S/N adequate for voice recognition is obtained.
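
The talk-switch behaviour described in this paragraph can be sketched as follows. This is a minimal illustration, not the patent's implementation: the class name `LevelMonitor`, the 15 dB default threshold, the use of amplitude levels (hence `20 * log10`), and the idea that each call with the switch off counts as one periodic noise sample are all assumptions.

```python
import math
from typing import Optional


class LevelMonitor:
    """Hypothetical model of the noise store and S/N check."""

    def __init__(self, threshold_db: float = 15.0) -> None:
        self.threshold_db = threshold_db
        # Plays the role of the stored noise level; None until first sample.
        self.noise_level: Optional[float] = None

    def process(self, level: float, talk_switch_on: bool) -> bool:
        """Return True when the indicator should light.

        With the talk switch off, the input is treated as noise and the
        stored noise level is overwritten with the latest value.
        With the talk switch on, the input is treated as voice and is
        compared with the stored noise level.
        """
        if not talk_switch_on:
            if level > 0:
                self.noise_level = level
            return False
        if self.noise_level is None or level <= 0:
            return False  # nothing to compare against yet
        snr_db = 20.0 * math.log10(level / self.noise_level)
        return snr_db >= self.threshold_db
```

With a stored noise level of 10 and the assumed 15 dB threshold, a voice level of 100 (20 dB above the noise) lights the indicator, while a voice level of 20 (about 6 dB) does not. If the noise later rises to 50, the same voice level of 100 no longer passes, which is the effect of refreshing the noise level that the paragraph describes.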

[0034] FIG. 4 shows the levels of voice recognition in the navigation device. In FIG. 4, the vertical axis shows, on a logarithmic scale, the number of words uttered by unspecified speakers (speaker-independent recognition) and by a specified speaker (speaker-dependent recognition), and the horizontal axis shows S/N [dB]. For place-name recognition, when the vocabulary is about 2000 words for unspecified speakers and about 20000 words for a specified speaker, the S/N is from about +15 [dB] to about +4 [dB]. For commands and the like, when the vocabulary is about 300 words for unspecified speakers and about 3000 words for a specified speaker, the S/N is about +13 [dB]; when it is about 70 words and about 700 words respectively, the S/N is about +3 [dB]. That is, for commands and the like, a predetermined recognition rate can be obtained even at a low S/N if the number of words is reduced, and the required S/N increases as the number of words grows. For example, when a single voice recognition device recognizes 2000 place-name words and 300 command words, both for unspecified speakers, an S/N of about +15 [dB] or more, the largest value required for place-name recognition, yields at least the predetermined recognition rate for all words.
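
The closing example of this paragraph amounts to taking the largest S/N requirement among the vocabularies that one device has to handle. A minimal sketch, assuming the approximate speaker-independent figures quoted above and taking +15 dB as the requirement for place names; the names `REQUIRED_SNR_DB` and `required_snr_db` are hypothetical:

```python
from typing import Iterable, Tuple

# Approximate values quoted in the description of FIG. 4
# (unspecified speakers). Keys are (vocabulary kind, word count).
REQUIRED_SNR_DB = {
    ("place_name", 2000): 15.0,
    ("command", 300): 13.0,
    ("command", 70): 3.0,
}


def required_snr_db(vocabularies: Iterable[Tuple[str, int]]) -> float:
    """S/N needed so that every listed vocabulary reaches the target rate."""
    return max(REQUIRED_SNR_DB[v] for v in vocabularies)
```

For the combination in the text, `required_snr_db([("place_name", 2000), ("command", 300)])` is governed by the place-name vocabulary, while a device that handles only the 70-word command set would need far less.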

【００３５】[0035]

[Effects of the Invention] According to the level display device of the present invention, control means is provided that compares the input noise level with the input voice signal level and detects whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value, and a predetermined display is performed when the ratio is detected to be equal to or greater than the predetermined value. Therefore, based on the input voice signal and noise, even when the background noise in the usage environment is relatively large and stationary, the utterance volume needed for voice to be recognized in that environment can be indicated clearly by a relative level display of the noise level and the input voice level rather than by a display of the absolute voice input level, and the quality of the level display can be improved.
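
The ratio test in this paragraph can be related to the S/N figures in decibels used elsewhere in the description. The following is a sketch under the assumption that the levels are positive amplitude values; the symbols $L_s$ (voice signal level), $L_n$ (noise level), $r_0$ (predetermined ratio) and $T$ (threshold in dB) are introduced here for illustration and do not appear in the patent.

```latex
\[
\frac{L_s}{L_n} \ge r_0
\quad\Longleftrightarrow\quad
20\log_{10}\frac{L_s}{L_n} \ge T,
\qquad T = 20\log_{10} r_0 .
\]
% Example: a threshold of T = 15 dB corresponds to
\[
r_0 = 10^{15/20} \approx 5.62 .
\]
```

If the levels were instead power values, the factor 20 would become 10 and the same 15 dB threshold would correspond to a ratio of about 31.6.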

[0036] Further, according to the level display device of the present invention, voice signal input means, storage means for storing the noise level input through the voice signal input means, and input instruction means for enabling or disabling the voice signal input operation of the voice signal input means are provided. The noise level is stored in the storage means when the input instruction means indicates that the voice signal input operation is disabled, and the control means compares the noise level stored in the storage means with the voice signal level when the input instruction means indicates that the voice signal input operation is enabled. The voice signal and the noise can thus be input by switching between them, and on that basis the utterance volume required for voice recognition can be indicated clearly by a relative level display of the noise level and the input voice level rather than by a display of the absolute voice input level, so the quality of the level display can be improved.

[0037] Further, according to the level display device of the present invention, the comparison operation of comparing the input noise level with the input voice signal level and the detection operation of detecting whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value are repeated every predetermined period. Therefore, even when the background noise in the usage environment changes, the latest noise level is used, and the utterance volume required for voice recognition can be indicated clearly by a relative level display of the noise level and the input voice level rather than by a display of the absolute voice input level, so the quality of the level display can be improved.

[0038] Further, the voice recognition device of the present invention comprises voice signal input means; storage means for storing the noise level input through the voice signal input means; control means for comparing the noise level stored in the storage means with the voice signal level input through the voice signal input means and detecting whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value; and display means for performing a predetermined display when the control means detects that the ratio is equal to or greater than the predetermined value. Therefore, when the background noise in the usage environment is relatively large and stationary, the utterance volume needed for voice to be recognized in that environment can be indicated clearly by a lit indication on the display means rather than by an abstract indication such as "louder" or "softer", and the voice recognition rate can be improved.

[0039] Further, according to the voice recognition device of the present invention, input instruction means for enabling or disabling the voice signal input operation of the voice signal input means is provided. The noise level is stored in the storage means when the input instruction means indicates that the voice signal input operation is disabled, and the noise level stored in the storage means is compared with the voice signal level when the input instruction means indicates that the voice signal input operation is enabled. The voice signal and the noise can thus be input by switching between them, and on that basis the utterance volume required for voice recognition can be indicated clearly by a lit indication on the display means rather than by an abstract indication such as "louder" or "softer", so the voice recognition rate can be improved.

[0040] Further, according to the voice recognition device of the present invention, the storage means for storing the noise level rewrites and stores the noise level every predetermined period. Therefore, even when the background noise in the usage environment changes, the latest noise level is used, and the utterance volume required for voice recognition can be indicated clearly by a lit indication on the display means rather than by an abstract indication such as "louder" or "softer", so the voice recognition rate can be improved.

[0041] Further, the navigation device of the present invention comprises: a voice recognition unit having voice signal input means, storage means for storing the noise level input through the voice signal input means, control means for comparing the noise level stored in the storage means with the voice signal level input through the voice signal input means and detecting whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value, and display means for performing a predetermined display when the control means detects that the ratio is equal to or greater than the predetermined value; a conversion unit for converting data of a specific voice recognized by the voice recognition unit into coordinate position data; map data storage means; and map data reading means for reading, from the map data storage means, map data of the position indicated by the coordinate position data converted by the conversion unit and creating a video signal for map display. Therefore, when the background noise in the usage environment is relatively large and stationary, the utterance volume needed for voice to be recognized in that environment can be indicated clearly by a lit indication on the display means rather than by an abstract indication such as "louder" or "softer". The voice recognition rate can be improved, and appropriate navigation operation can be performed as a result.

[0042] Further, according to the navigation device of the present invention, the voice recognition unit is provided with input instruction means for enabling or disabling the voice signal input operation of the voice signal input means. The noise level is stored in the storage means when the input instruction means indicates that the voice signal input operation is disabled, and the noise level stored in the storage means is compared with the voice signal level when the input instruction means indicates that the voice signal input operation is enabled. The voice signal and the noise can thus be input by switching between them, and on that basis the utterance volume required for voice recognition can be indicated clearly by a lit indication on the display means rather than by an abstract indication such as "louder" or "softer". The voice recognition rate can be improved, and appropriate navigation operation can be performed as a result.

[0043] Further, according to the navigation device of the present invention, the storage means for storing the noise level in the voice recognition unit rewrites and stores the noise level every predetermined period. Therefore, even when the background noise in the usage environment changes, the latest noise level is used, and the utterance volume required for voice recognition can be indicated clearly by a lit indication on the display means rather than by an abstract indication such as "louder" or "softer". The voice recognition rate can be improved, and appropriate navigation operation can be performed as a result.

[Brief description of drawings]

FIG. 1 is a block diagram showing the configuration of a voice recognition device according to an embodiment of the present invention.

FIG. 2 is a flowchart showing the operation of the voice recognition device according to the embodiment of the present invention.

FIG. 3 is a block diagram showing the configuration of a navigation device according to another embodiment of the present invention.

FIG. 4 is a diagram showing the levels of voice recognition in the navigation device according to the other embodiment of the present invention.

[Explanation of symbols]

1 Microphone (voice signal input means)
2 Amplifier
3 A/D converter
4 Microprocessor (control means)
5 Memory (storage means)
6 Output port
7 Level indicator (display means)
8 Talk switch (input instruction means)
9 Voice recognition device
10 Conversion circuit (conversion unit)
11 ROM for storing latitude/longitude conversion data
12 ROM for storing voice recognition data
20 Navigation device
21 GPS antenna
22 Current position detection circuit
23 Arithmetic circuit
24 CD-ROM driver (map data reading means)
25 RAM
26 Vehicle speed sensor
27 Operation keys
28 Video signal generation circuit
31 Voice synthesis circuit
32 Speaker
40 Display device

Continuation of the front page: (51) Int.Cl.⁶, Identification code, Internal reference number, FI, Technical display location // G08G 1/0962 9289-5L G06F 15/40 370E

Claims

[Claims]

1. A level display device comprising control means for comparing an input noise level with an input voice signal level and detecting whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value, wherein a predetermined display is performed when the ratio of the voice signal level is detected to be equal to or greater than the predetermined value.

2. The level display device according to claim 1, further comprising: voice signal input means; storage means for storing the noise level input through the voice signal input means; and input instruction means for enabling or disabling the voice signal input operation of the voice signal input means, wherein the noise level is stored in the storage means when the input instruction means indicates that the voice signal input operation is disabled, and the control means compares the noise level stored in the storage means with the voice signal level when the input instruction means indicates that the voice signal input operation is enabled.

3. The level display device according to claim 1, wherein the comparison operation of comparing the input noise level with the input voice signal level and the detection operation of detecting whether the ratio of the voice signal level to the noise level is equal to or greater than the predetermined value are repeated every predetermined period.

4. A voice recognition device comprising: voice signal input means; storage means for storing a noise level input through the voice signal input means; control means for comparing the noise level stored in the storage means with a voice signal level input through the voice signal input means and detecting whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value; and display means for performing a predetermined display when the control means detects that the ratio of the voice signal level is equal to or greater than the predetermined value.

5. The voice recognition device according to claim 4, further comprising input instruction means for enabling or disabling the voice signal input operation of the voice signal input means, wherein the noise level is stored in the storage means when the input instruction means indicates that the voice signal input operation is disabled, and the noise level stored in the storage means is compared with the voice signal level when the input instruction means indicates that the voice signal input operation is enabled.

6. The voice recognition device according to claim 4, wherein the storage unit for storing the noise level is configured to rewrite and store the noise level every predetermined period.

7. A navigation device comprising: a voice recognition unit having voice signal input means, storage means for storing a noise level input through the voice signal input means, control means for comparing the noise level stored in the storage means with a voice signal level input through the voice signal input means and detecting whether the ratio of the voice signal level to the noise level is equal to or greater than a predetermined value, and display means for performing a predetermined display when the control means detects that the ratio of the voice signal level is equal to or greater than the predetermined value; a conversion unit for converting data of a specific voice recognized by the voice recognition unit into coordinate position data; map data storage means; and map data reading means for reading, from the map data storage means, map data of the position indicated by the coordinate position data converted by the conversion unit and creating a video signal for map display.

8. The navigation device according to claim 7, wherein the voice recognition unit is provided with input instruction means for enabling or disabling the voice signal input operation of the voice signal input means, the noise level is stored in the storage means when the input instruction means indicates that the voice signal input operation is disabled, and the noise level stored in the storage means is compared with the voice signal level when the input instruction means indicates that the voice signal input operation is enabled.

9. The navigation device according to claim 7, wherein the storage means for storing the noise level in the voice recognition unit rewrites and stores the noise level every predetermined period.