JP7724305B2

JP7724305B2 - Information processing device, information processing method, program, and recording medium

Info

Publication number: JP7724305B2
Application number: JP2023556447A
Authority: JP
Inventors: 洋人河内; 壮志中川
Original assignee: Pioneer Corp
Current assignee: Pioneer Corp
Priority date: 2021-10-27
Filing date: 2022-10-25
Publication date: 2025-08-15
Anticipated expiration: 2042-10-25
Also published as: WO2023074654A1; JPWO2023074654A1

Description

本発明は、情報処理装置、情報処理方法、プログラムおよび記録媒体に関する。 The present invention relates to an information processing device, an information processing method, a program, and a recording medium.

近年、ユーザがウェイクワード（ウェイクアップワードあるいはホットワードともいう）を発話することにより音声アシスタントを起動させ、ユーザの発話により、操作指示、情報検索等を行う機器、例えば、スマートフォンやスマートスピーカ等が普及している。
一般に、この種の機器は、騒音や雑音（ノイズ）が含まれている環境下で動作させると、ユーザが発話した言葉を正しく検出できないことがある。
ここで、この種の機器を車輌に搭載した場合には、走行状態（走行速度、エンジン回転数等）や車両状態（エアコン作動状況、窓開閉状況等）により、騒音や雑音等の大きさが大きく変化するため、収音した音声からノイズを除去し、ユーザの発話した言葉を検出しやすくする技術が開示されている（例えば、特許文献１参照）。 In recent years, devices such as smartphones and smart speakers have become widespread, allowing users to activate voice assistants by speaking a wake word (also known as a wake-up word or hot word) and providing operational instructions, searching for information, and other functions based on the user's speech.
Generally, when this type of device is operated in an environment containing noise, it may not be possible to correctly detect words spoken by the user.
When this type of device is installed in a vehicle, the volume of noise and other disturbances varies greatly depending on the driving conditions (driving speed, engine speed, etc.) and vehicle conditions (air conditioner operation status, window opening/closing status, etc.). Therefore, technology has been disclosed that removes noise from the picked-up audio, making it easier to detect the words spoken by the user (see, for example, Patent Document 1).

特開２００９－２１０６４７号公報JP 2009-210647 A

上述した先行技術では、ユーザの発話音声、騒音や雑音等（ノイズ）が含まれた音声を収音し、その収音した音声に基づいてノイズを除去するためのフィルタ特性を決定している。
しかしながら、上述した先行技術では、機器の電源がオンされた直後では、フィルタ特性を決定するための音声が収音されていないため、ノイズを除去するための適切なフィルタ特性を決定することができず、ユーザが発話した言葉を正しく検出できないという課題が一例として挙げられる。 In the above-mentioned prior art, a user's speech and a voice containing noise are picked up, and filter characteristics for removing the noise are determined based on the picked-up voice.
However, in the above-mentioned prior art, immediately after the device is turned on, no audio is picked up to determine the filter characteristics, so appropriate filter characteristics for removing noise cannot be determined, and the words spoken by the user cannot be correctly detected, as an example of the problem.

本発明は、上述の一例として挙げられた課題に鑑みてなされたものであり、機器の電源がオンされた直後でも、ノイズを除去するための最適なフィルタ特性を決定し、ノイズを除去する情報処理装置、情報処理方法、プログラムおよび記録媒体を提供することを主な目的とする。 The present invention has been made in consideration of the problems cited as examples above, and its main objective is to provide an information processing device, information processing method, program, and recording medium that determine optimal filter characteristics for removing noise and remove noise even immediately after the device is turned on.

上記課題を解決するために、請求項１に記載の発明は、収音した音声に基づいて前記音声から雑音を除去するためのフィルタ特性を算出するフィルタ特性算出部と、前記フィルタ特性算出部によって算出された前記フィルタ特性に基づいて、収音した音声から雑音を除去するフィルタ部と、センサ情報に基づいて前記音声の収音環境を検出する収音環境検出部と、起動時を除く期間では、前記フィルタ特性算出部で算出された前記フィルタ特性を前記フィルタ部に設定するフィルタ制御部と、を備え、前記フィルタ制御部は、起動時のみにおいて、前記収音環境を取得し、該起動時の収音環境と、あらかじめ前記フィルタ特性算出部が算出した前記フィルタ特性と前記収音環境とを紐づけて格納しているフィルタテーブルと、に基づいて、前記フィルタ部に前記フィルタ特性を設定することを特徴とする情報処理装置である。 In order to solve the above problem, the invention described in claim 1 is an information processing device comprising: a filter characteristic calculation unit that calculates filter characteristics for removing noise from a collected sound based on the sound; a filter unit that removes noise from the collected sound based on the filter characteristics calculated by the filter characteristic calculation unit; a sound collection environment detection unit that detects the sound collection environment based on sensor information; and a filter control unit that sets the filter characteristics calculated by the filter characteristic calculation unit to the filter unit during periods other than startup, wherein the filter control unit acquires the sound collection environment only during startup, and sets the filter characteristics to the filter unit based on the sound collection environment at the time of startup and a filter table that stores the filter characteristics calculated in advance by the filter characteristic calculation unit in association with the sound collection environment.

また、請求項６に記載の発明は、フィルタ特性算出部と、フィルタ部と、収音環境検出部と、フィルタ制御部とを備えた情報処理装置における情報処理方法であって、前記フィルタ特性算出部が、収音した音声に基づいて前記音声から雑音を除去するためのフィルタ特性を算出する第１の工程と、前記フィルタ部が、前記フィルタ特性算出部によって算出された前記フィルタ特性に基づいて、収音した音声から雑音を除去する第２の工程と、前記収音環境検出部が、センサ情報に基づいて前記音声の収音環境を検出する第３の工程と、前記フィルタ制御部が、起動時を除く期間では、前記フィルタ特性算出部で算出された前記フィルタ特性を前記フィルタ部に設定する第４の工程と、を備え、前記第４の工程において、前記フィルタ制御部は、起動時のみにおいて、前記収音環境を取得し、該起動時の収音環境と、あらかじめ前記フィルタ特性算出部が算出した前記フィルタ特性と前記収音環境とを紐づけて格納しているフィルタテーブルと、に基づいて、前記フィルタ部に前記フィルタ特性を設定することを特徴とする情報処理方法である。 The invention of claim 6 is an information processing method for an information processing device including a filter characteristic calculation unit, a filter unit, a sound collection environment detection unit, and a filter control unit, the information processing method including: a first step in which the filter characteristic calculation unit calculates filter characteristics for removing noise from sound based on the sound that has been collected; a second step in which the filter unit removes noise from the sound based on the filter characteristics calculated by the filter characteristic calculation unit; a third step in which the sound collection environment detection unit detects the sound collection environment based on sensor information; and a fourth step in which the filter control unit sets the filter characteristics calculated by the filter characteristic calculation unit in the filter unit during a period other than startup, wherein in the fourth step, the filter control unit acquires the sound collection environment only during startup, and sets the filter characteristics in the filter unit based on the sound collection environment at the time of startup and a filter table that stores the filter characteristics calculated in advance by the filter characteristic calculation unit in association with the sound collection environment.

また、請求項７に記載の発明は、フィルタ特性算出部と、フィルタ部と、収音環境検出部と、フィルタ制御部とを備えた情報処理装置における情報処理方法をコンピュータに実行させるためのプログラムであって、前記フィルタ特性算出部が、収音した音声に基づいて前記音声から雑音を除去するためのフィルタ特性を算出する第１の工程と、前記フィルタ部が、前記フィルタ特性算出部によって算出された前記フィルタ特性に基づいて、収音した音声から雑音を除去する第２の工程と、前記収音環境検出部が、センサ情報に基づいて前記音声の収音環境を検出する第３の工程と、前記フィルタ制御部が、起動時を除く期間では、前記フィルタ特性算出部で算出された前記フィルタ特性を前記フィルタ部に設定する第４の工程と、を備え、前記第４の工程において、前記フィルタ制御部は、起動時のみにおいて、前記収音環境を取得し、該起動時の収音環境と、あらかじめ前記フィルタ特性算出部が算出した前記フィルタ特性と前記収音環境とを紐づけて格納しているフィルタテーブルと、に基づいて、前記フィルタ部に前記フィルタ特性を設定する情報処理方法をコンピュータに実行させるためのプログラムである。 Further, the invention described in claim 7 is a program for causing a computer to execute an information processing method in an information processing device including a filter characteristic calculation unit, a filter unit, a sound collection environment detection unit, and a filter control unit, the program including: a first step in which the filter characteristic calculation unit calculates filter characteristics for removing noise from sound based on the sound that has been collected; a second step in which the filter unit removes noise from the sound based on the filter characteristics calculated by the filter characteristic calculation unit; a third step in which the sound collection environment detection unit detects the sound collection environment based on sensor information; and a fourth step in which the filter control unit sets the filter characteristics calculated by the filter characteristic calculation unit in the filter unit during a period other than startup, wherein in the fourth step, the filter control unit acquires the sound collection environment only at startup, and sets the filter characteristics in the filter unit based on the sound collection environment at the time of startup and a filter table that stores the filter characteristics calculated in advance by the filter characteristic calculation unit in association with the sound collection environment.

また、請求項８に記載の発明は、フィルタ特性算出部と、フィルタ部と、収音環境検出部と、フィルタ制御部とを備えた情報処理装置における情報処理方法をコンピュータに実行させるためのプログラムを記録したコンピュータによって読み取り可能な非一過性の記録媒体であって、前記フィルタ特性算出部が、収音した音声に基づいて前記音声から雑音を除去するためのフィルタ特性を算出する第１の工程と、前記フィルタ部が、前記フィルタ特性算出部によって算出された前記フィルタ特性に基づいて、収音した音声から雑音を除去する第２の工程と、前記収音環境検出部が、センサ情報に基づいて前記音声の収音環境を検出する第３の工程と、前記フィルタ制御部が、起動時を除く期間では、前記フィルタ特性算出部で算出された前記フィルタ特性を前記フィルタ部に設定する第４の工程と、を備え、前記第４の工程において、前記フィルタ制御部は、起動時のみにおいて、前記収音環境を取得し、該起動時の収音環境と、あらかじめ前記フィルタ特性算出部が算出した前記フィルタ特性と前記収音環境とを紐づけて格納しているフィルタテーブルと、に基づいて、前記フィルタ部に前記フィルタ特性を設定する情報処理方法をコンピュータに実行させるためのプログラムを記録した記録媒体である。 The invention of claim 8 is a non-transitory computer-readable recording medium having recorded thereon a program for causing a computer to execute an information processing method in an information processing device including a filter characteristic calculation unit, a filter unit, a sound collection environment detection unit, and a filter control unit, the information processing method including a first step in which the filter characteristic calculation unit calculates filter characteristics for removing noise from a sound based on the sound collected, a second step in which the filter unit removes noise from the sound based on the filter characteristics calculated by the filter characteristic calculation unit, and a third step in which the sound collection environment detection unit calculates a filter characteristic for removing noise from the sound based on sensor information. The information processing method includes a third step of detecting a sound collection environment for the sound, and a fourth step of the filter control unit setting the filter characteristics calculated by the filter characteristic calculation unit in the filter unit during a period other than startup, wherein in the fourth step, the filter control unit acquires the sound collection environment only during startup, and sets the filter characteristics in the filter unit based on the sound collection environment at the time of startup and a filter table that stores the filter characteristics calculated in advance by the filter characteristic calculation unit and the sound collection environment in association with each other. This is a recording medium having a program recorded thereon to cause a computer to execute an information processing method.

本発明の実施例に係る情報処理装置の構成を示す図である。1 is a diagram illustrating a configuration of an information processing device according to an embodiment of the present invention. 本発明の実施例に係る情報処理装置のフィルタ制御部が生成および参照するフィルタテーブルを例示した図である。3 is a diagram illustrating a filter table generated and referenced by a filter control unit of the information processing device according to the embodiment of the present invention; FIG. 本発明の実施例に係る情報処理装置のフィルタ制御部の処理フローを示す図である。FIG. 4 is a diagram showing a processing flow of a filter control unit of the information processing device according to the embodiment of the present invention. 本発明の実施例に係る情報処理装置のフィルタ制御部がフィルタテーブルを参照するときの処理を例示する図である。10 is a diagram illustrating a process performed when a filter control unit of an information processing device according to an embodiment of the present invention refers to a filter table; 本発明の実施例に係る情報処理装置のフィルタ制御部がフィルタテーブルに収音環境とフィルタ特性とを追加するときの処理を例示した図である。10A and 10B are diagrams illustrating processing when a filter control unit of the information processing device according to the embodiment of the present invention adds a sound collection environment and a filter characteristic to a filter table. 本発明のその他の実施例に係る情報処理装置のセンサ部が取得するセンサ情報を例示した図である。FIG. 10 is a diagram illustrating sensor information acquired by a sensor unit of an information processing device according to another embodiment of the present invention.

本実施形態に係る情報処理装置は、収音した音声に基づいて音声から雑音を除去するためのフィルタ特性を算出するフィルタ特性算出部と、フィルタ特性算出部によって算出されたフィルタ特性に基づいて、収音した音声から雑音を除去するフィルタ部と、センサ情報に基づいて音声の収音環境を検出する収音環境検出部と、起動時に収音環境を取得し、該起動時の収音環境と、フィルタ特性算出部が算出したフィルタ特性と収音環境とを紐づけたフィルタテーブルとに基づいて、フィルタ部にフィルタ特性を設定するフィルタ制御部とを備えている。
フィルタ特性算出部は、収音した音声に基づいて、音声から雑音を除去するためのフィルタ特性を算出する。
フィルタ部は、フィルタ特性算出部において算出されたフィルタ特性に基づいて、収音した音声から雑音を除去する。
収音環境検出部は、カメラ画像、車両センサ等のセンサ情報に基づいて、音声を収音している収音環境を検出する。
フィルタ制御部は、情報処理装置の起動時に収音環境を取得し、その起動時の収音環境と、フィルタ特性算出部が算出したフィルタ特性と収音環境とを紐づけたフィルタテーブルと、に基づいて、フィルタ部に設定するフィルタ特性を決定し、そのフィルタ特性をフィルタ部に設定する。
フィルタ制御部は、情報処理装置の起動時の収音環境が、フィルタテーブルにある場合には、起動時の収音環境に紐づいたフィルタ特性をフィルタ部に設定し、フィルタテーブルにない場合には、最後にフィルタ部に設定したフィルタ特性をフィルタ部に設定する。
情報処理装置の起動時には、フィルタ特性を決定するための音声が収音されていないため、雑音を除去するための適切なフィルタ特性を決定することができない。
そのため、フィルタ制御部は、起動時の収音環境とフィルタテーブルとに基づいて、フィルタ部に設定するフィルタ特性を決定する。
これにより、フィルタ特性を決定するための音声が収音されていない起動時であっても、起動時の収音環境に基づいたフィルタ特性を設定することができるため、適切に音声から雑音を除去することができる。
また、フィルタ制御部は、情報処理装置の起動時を除く期間では、フィルタ特性算出部により算出されたフィルタ特性を、フィルタ部に設定する。
すなわち、情報処理装置の起動時以外では、収音部により収音された音声に基づいて、フィルタ特性算出部が算出したフィルタ特性を、フィルタ部に設定する。
これにより、情報処理装置の起動時を除く期間では、ユーザが発話する空間の音声に基づいて、フィルタ特性が算出されるため、最適なフィルタ特性を設定することができる。 The information processing device according to this embodiment includes a filter characteristic calculation unit that calculates filter characteristics for removing noise from audio based on collected audio, a filter unit that removes noise from the collected audio based on the filter characteristics calculated by the filter characteristic calculation unit, a sound collection environment detection unit that detects the audio collection environment based on sensor information, and a filter control unit that acquires the sound collection environment at startup and sets the filter characteristics in the filter unit based on the sound collection environment at startup and a filter table that links the filter characteristics calculated by the filter characteristic calculation unit with the sound collection environment.
The filter characteristic calculation unit calculates filter characteristics for removing noise from the audio based on the picked-up audio.
The filter unit removes noise from the collected voice based on the filter characteristics calculated by the filter characteristics calculation unit.
The sound collection environment detection unit detects the sound collection environment in which sound is collected based on camera images, sensor information from vehicle sensors, and the like.
The filter control unit acquires the sound collection environment when the information processing device is started up, and determines the filter characteristics to be set in the filter unit based on the sound collection environment at the time of startup and a filter table that links the filter characteristics calculated by the filter characteristic calculation unit with the sound collection environment, and sets the filter characteristics in the filter unit.
If the sound collection environment at the time of startup of the information processing device is in the filter table, the filter control unit sets the filter characteristics linked to the sound collection environment at the time of startup in the filter unit, and if the sound collection environment is not in the filter table, sets the filter characteristics last set in the filter unit in the filter unit.
When the information processing device is started up, no sound is collected for determining the filter characteristics, and therefore it is not possible to determine appropriate filter characteristics for removing noise.
Therefore, the filter control unit determines the filter characteristics to be set in the filter unit based on the sound collection environment at the time of startup and the filter table.
This allows the filter characteristics to be set based on the sound collection environment at the time of startup, even when no sound is collected to determine the filter characteristics, so that noise can be appropriately removed from the sound.
Further, the filter control unit sets the filter characteristics calculated by the filter characteristics calculation unit in the filter unit during a period other than when the information processing device is started up.
That is, except when the information processing device is started up, the filter characteristics calculated by the filter characteristics calculation unit based on the sound picked up by the sound pickup unit are set in the filter unit.
As a result, during the period other than the time when the information processing device is started up, the filter characteristics are calculated based on the sound in the space where the user speaks, so that the optimum filter characteristics can be set.

また、フィルタテーブルに起動時の収音環境がない場合でも、最後にフィルタ部に設定されたフィルタ特性が設定されるため、ユーザが発話する空間にとって、より良いフィルタ特性を設定することができる。
上述した、最後にフィルタ部に設定したフィルタ特性とは、情報処理装置が、電源オフ等により動作が停止されたときに、フィルタ部に設定されているフィルタ特性のことであり、次に情報処理装置が起動され、フィルタテーブルに起動時の収音環境がないときには、当該フィルタ特性がフィルタ部に設定される。
つまり、情報処理装置の起動時であっても、ユーザが発話する空間の音に基づいて算出された実績のあるフィルタ特性が設定されるため、その空間にとって、より適切なフィルタ特性を設定することができる。
また、フィルタ制御部は、フィルタ特性算出部においてフィルタ特性を算出したときの収音環境を取得し、その収音環境がフィルタテーブルにない場合には、その収音環境とフィルタ特性算出部において算出したフィルタ特性とを紐づけて、フィルタテーブルに追加する。
これにより、情報処理装置を動作させるだけで、収音環境毎に最適なフィルタ特性をフィルタテーブルに蓄積することができるため、フィルタ制御部は、フィルタテーブルを参照することにより、起動時に最適なフィルタ特性をフィルタ部に設定することができる。 Furthermore, even if the sound collection environment at the time of startup is not included in the filter table, the filter characteristics last set in the filter section are set, so that better filter characteristics can be set for the space in which the user speaks.
The filter characteristics last set in the filter unit as mentioned above refer to the filter characteristics set in the filter unit when the information processing device is stopped from operating due to power-off or the like, and when the information processing device is next started and the sound collection environment at the time of startup is not in the filter table, the filter characteristics are set in the filter unit.
In other words, even when the information processing device is started up, proven filter characteristics calculated based on the sound of the space in which the user is speaking are set, so that more appropriate filter characteristics can be set for that space.
Furthermore, the filter control unit acquires the sound collection environment when the filter characteristics are calculated by the filter characteristic calculation unit, and if the sound collection environment is not in the filter table, the sound collection environment is linked to the filter characteristics calculated by the filter characteristic calculation unit and added to the filter table.
This allows the optimal filter characteristics for each sound collection environment to be stored in the filter table simply by operating the information processing device, so that the filter control unit can set the optimal filter characteristics in the filter unit at startup by referring to the filter table.

＜実施例＞
図１から図５を用いて、本実施例に係る情報処理装置１について説明する。 <Example>
An information processing device 1 according to this embodiment will be described with reference to FIGS. 1 to 5. FIG.

＜情報処理装置１の構成＞
図１を用いて、本実施例に係る情報処理装置１の構成について説明する。
情報処理装置１は、収音部１０と、フィルタ部２０と、フィルタ特性算出部３０と、センサ部４０と、収音環境検出部５０と、フィルタ制御部６０と、を少なくとも含んで構成されている。 <Configuration of information processing device 1>
The configuration of an information processing device 1 according to this embodiment will be described with reference to FIG.
The information processing device 1 includes at least a sound collection unit 10 , a filter unit 20 , a filter characteristic calculation unit 30 , a sensor unit 40 , a sound collection environment detection unit 50 , and a filter control unit 60 .

収音部１０は、例えば、マイクロフォンで構成され、車室内の音声を収音し、収音した音声をフィルタ部２０およびフィルタ特性算出部３０に送信する。
収音部１０が収音した音声には、ユーザの発話音声と、マイクロフォン周辺で発生している雑音、騒音等が含まれている。
具体的には、車室内に設置されたマイクロフォンにより収音されている音声には、走行時のエンジン音、風切り音、ロードノイズ、エアコンの作動音、スピーカから出力されている音楽等が含まれている。
なお、マイクロフォンは、上述した車室内の音声を収音できればよいため、車両に設置されている、例えば、ハンズフリー通話用のマイクロフォンを用いて構成してもよい。 The sound collection unit 10 is configured by, for example, a microphone, collects sound from within the vehicle cabin, and transmits the collected sound to the filter unit 20 and the filter characteristic calculation unit 30 .
The sound picked up by the sound pickup unit 10 includes the user's speech and noises and other sounds occurring around the microphone.
Specifically, the sounds picked up by the microphone installed inside the vehicle cabin include engine noise, wind noise, road noise, air conditioner operation sounds, music output from the speakers, etc. while driving.
The microphone may be a microphone installed in the vehicle, for example, a microphone for hands-free calling, as long as it can pick up the above-mentioned sounds inside the vehicle.

フィルタ部２０は、後述するフィルタ制御部６０から受信したフィルタ特性に基づいて、収音部１０において収音した音声から雑音を除去する。
なお、雑音が除去された音声は、図示しない音声認識エンジンに入力され、ユーザの発話した言葉が検出される。 The filter unit 20 removes noise from the sound collected by the sound collection unit 10 based on the filter characteristics received from the filter control unit 60, which will be described later.
The noise-removed voice is input to a voice recognition engine (not shown) to detect the words spoken by the user.

フィルタ特性算出部３０は、収音部１０において収音された音声に基づいて、その音声から雑音を除去するためのフィルタ特性を算出する。
具体的には、フィルタ特性算出部３０は、収音部１０において収音した音声を、例えば２０秒間毎の音声データに分割し、分割された音声データ毎に雑音を除去するためのフィルタ特性を算出する。
なお、フィルタ特性算出部３０において算出されたフィルタ特性は、後述するフィルタ制御部６０に送信される。 The filter characteristic calculation unit 30 calculates, based on the sound collected by the sound collection unit 10, filter characteristics for removing noise from the sound.
Specifically, the filter characteristic calculation unit 30 divides the sound collected by the sound collection unit 10 into sound data of, for example, 20 seconds each, and calculates filter characteristics for removing noise for each of the divided sound data.
The filter characteristics calculated by the filter characteristics calculation unit 30 are transmitted to a filter control unit 60, which will be described later.

センサ部４０は、少なくとも、車両内を撮像するカメラ、車両の状態を検出するセンサにより構成され、取得したセンサ情報を、後述する収音環境検出部５０に送信する。
ここで、センサ部４０からセンサ情報として送信する画像は、車両内の画像が取得できればよいため、所謂、車両に設置されているドライブレコーダが撮像している画像を収音環境検出部５０に送信するようにしてもよい。
また、車両の状態を検出するセンサ情報としては、車速パルス、加速度センサ、ＧＰＳ信号、車両のＥＣＵ（Electronic Control Unit）に接続されている各種センサ情報等を例示することができる。 The sensor unit 40 is composed of at least a camera that takes images of the inside of the vehicle and a sensor that detects the state of the vehicle, and transmits the acquired sensor information to the sound collection environment detection unit 50, which will be described later.
Here, the image transmitted from the sensor unit 40 as sensor information only needs to be an image of the inside of the vehicle, so it is possible to transmit to the sound collection environment detection unit 50 an image captured by a drive recorder installed in the vehicle.
Examples of sensor information for detecting the state of the vehicle include vehicle speed pulses, acceleration sensors, GPS signals, and information from various sensors connected to an ECU (Electronic Control Unit) of the vehicle.

収音環境検出部５０は、センサ部４０からのセンサ情報に基づいて、収音環境を検出する。
具体的には、収音環境検出部５０は、車両内を撮像した画像を分析し、例えば、乗員の乗車位置、乗員の性別、車両の窓の開閉状態等を収音環境として検出する。
また、収音環境検出部５０は、車両の状態を示すセンサ情報に基づいて、車両の走行速度、エンジン回転数、エアコン作動状況等を収音環境として検出する。
なお、収音環境検出部５０は、検出した収音環境を、フィルタ制御部６０に送信する。
また、収音環境検出部５０は、フィルタ特性算出部３０がフィルタ特性を算出したときの収音環境を検出し、検出した収音環境をフィルタ制御部６０に送信する。
例えば、収音環境検出部５０は、フィルタ特性算出部３０において、フィルタ特性を算出している期間中の収音環境の平均値（エンジン回転数、走行速度等の平均値）を算出して、当該収音環境をフィルタ制御部６０に送信する。 The sound collection environment detection unit 50 detects the sound collection environment based on the sensor information from the sensor unit 40 .
Specifically, the sound collection environment detection unit 50 analyzes images captured inside the vehicle and detects, for example, the passenger's riding position, the passenger's gender, and the open/closed state of the vehicle windows as the sound collection environment.
The sound collection environment detection unit 50 also detects the vehicle's running speed, engine speed, air conditioner operation status, etc. as the sound collection environment based on sensor information indicating the vehicle's state.
The sound collection environment detection unit 50 transmits the detected sound collection environment to the filter control unit 60 .
Furthermore, the sound collection environment detection unit 50 detects the sound collection environment when the filter characteristic calculation unit 30 calculates the filter characteristic, and transmits the detected sound collection environment to the filter control unit 60 .
For example, the sound collection environment detection unit 50 calculates the average value of the sound collection environment (average value of engine rotation speed, driving speed, etc.) during the period when the filter characteristics are calculated in the filter characteristic calculation unit 30, and transmits the sound collection environment to the filter control unit 60.

フィルタ制御部６０は、起動時に収音環境を取得し、起動時の収音環境と、フィルタ特性算出部３０が算出したフィルタ特性とフィルタ特性を算出したときの収音環境とを紐づけたフィルタテーブルと、に基づいて、フィルタ部２０にフィルタ特性を設定する。
つまり、情報処理装置１の起動時には、フィルタ特性を決定するための音声がまだ収音されていないため、フィルタ特性算出部３０において、フィルタ特性を算出することができない。
そのため、フィルタ制御部６０は、起動時に取得した収音環境と、図２に示すようなフィルタテーブルとに基づいて、フィルタ部２０に設定するフィルタ特性を決定する。 The filter control unit 60 acquires the sound collection environment at startup and sets the filter characteristics in the filter unit 20 based on the sound collection environment at startup and a filter table that links the filter characteristics calculated by the filter characteristic calculation unit 30 with the sound collection environment when the filter characteristics were calculated.
That is, when the information processing device 1 is started up, the sound for determining the filter characteristics has not yet been collected, and therefore the filter characteristics calculation unit 30 cannot calculate the filter characteristics.
Therefore, the filter control unit 60 determines the filter characteristics to be set in the filter unit 20 based on the sound collection environment acquired at the time of startup and the filter table shown in FIG.

フィルタテーブルには、フィルタ特性算出部３０においてフィルタ特性が算出されたときの収音環境と、算出されたフィルタ特性とが紐づけられて、格納されている。
具体的には、センサ部４０のカメラ画像および車両センサ情報から検出された収音環境（乗車位置、性別、窓開閉状態、走行速度、エンジン回転数、エアコン作動状況）と、フィルタ特性算出部３０において算出されたフィルタ特性とが紐づけられて、フィルタテーブルに格納されている。
より具体的には、図２に示すように、収音環境Ｋ１～Ｋ５と、フィルタ特性Ｆ１～Ｆ５とが、それぞれ紐づけられて、フィルタテーブルに格納されている。 The filter table stores the sound collection environment when the filter characteristics are calculated by the filter characteristics calculation unit 30 and the calculated filter characteristics in association with each other.
Specifically, the sound collection environment (rider position, gender, window open/close status, driving speed, engine RPM, air conditioner operation status) detected from the camera image of the sensor unit 40 and vehicle sensor information is linked to the filter characteristics calculated by the filter characteristic calculation unit 30 and stored in the filter table.
More specifically, as shown in FIG. 2, sound collection environments K1 to K5 and filter characteristics F1 to F5 are associated with each other and stored in a filter table.

フィルタ制御部６０は、起動時に取得した収音環境と同じ収音環境がフィルタテーブルにある場合には、その収音環境に紐づいたフィルタ特性をフィルタテーブルから取得し、フィルタ部２０にそのフィルタ特性を設定する。
一方で、起動時に取得した収音環境と同じ収音環境がフィルタテーブルにない場合には、フィルタ部２０に最後に設定したフィルタ特性をフィルタ部２０に設定する。
上述した、フィルタ部２０に最後に設定したフィルタ特性とは、情報処理装置１が、電源オフ等により動作が停止されたときに、フィルタ部２０に設定されていたフィルタ特性のことであり、次に情報処理装置１が起動され、起動時に取得した収音環境がフィルタテーブルにないときには、当該フィルタ特性をフィルタ部２０に設定する。
なお、フィルタ制御部６０は、フィルタ部２０に最後に設定したフィルタ特性の値を、図示しないメモリに格納する。 If the filter table contains a sound collection environment that is the same as the sound collection environment acquired at startup, the filter control unit 60 acquires the filter characteristics associated with that sound collection environment from the filter table and sets those filter characteristics in the filter unit 20.
On the other hand, if the filter table does not contain the same sound collection environment as the sound collection environment acquired at the time of startup, the filter characteristics last set in the filter section 20 are set in the filter section 20 .
The filter characteristics last set in the filter unit 20 as mentioned above are the filter characteristics that were set in the filter unit 20 when the information processing device 1 was stopped from operating due to power-off or the like, and when the information processing device 1 is next started and the sound collection environment acquired at the time of startup is not in the filter table, the filter characteristics are set in the filter unit 20.
The filter control unit 60 stores the filter characteristic values that were last set in the filter unit 20 in a memory (not shown).

また、フィルタ制御部６０は、起動時を除く期間では、フィルタ特性算出部３０において算出されたフィルタ特性をフィルタ部２０に設定する。 In addition, the filter control unit 60 sets the filter characteristics calculated by the filter characteristic calculation unit 30 to the filter unit 20 during periods other than startup.

フィルタ制御部６０は、フィルタ特性算出部３０においてフィルタ特性が算出されたときの収音環境を収音環境検出部５０から取得し、その収音環境がフィルタテーブルにない場合には、その収音環境とフィルタ特性とを紐づけて、フィルタテーブルに追加する。
なお、フィルタ制御部６０の処理の詳細は、以下に説明する。 The filter control unit 60 acquires the sound collection environment at the time when the filter characteristics were calculated by the filter characteristics calculation unit 30 from the sound collection environment detection unit 50, and if the sound collection environment is not in the filter table, links the sound collection environment with the filter characteristics and adds it to the filter table.
The processing performed by the filter control unit 60 will be described in detail below.

＜フィルタ制御部６０の処理＞
図３から図５を用いて、フィルタ制御部６０の処理の詳細について説明する。 <Processing of the filter control unit 60>
The processing of the filter control unit 60 will be described in detail with reference to FIGS.

図３に示すように、車両のＡＣＣ電源（アクセサリ電源）がオン状態であるか否かを判定する（ステップＳ１００）。
車両のＡＣＣ電源がオン状態にないと判定した場合（ステップＳ１００の「ＮＯ」）には、処理をステップＳ１００に戻し、待機状態に移行する。
一方で、車両のＡＣＣ電源がオン状態にあると判定した場合（ステップＳ１００の「ＹＥＳ」）には、処理をステップＳ１１０に移行させる。 As shown in FIG. 3, it is determined whether the ACC power supply (accessory power supply) of the vehicle is in an ON state (step S100).
If it is determined that the ACC power supply of the vehicle is not in an ON state ("NO" in step S100), the process returns to step S100 and transitions to a standby state.
On the other hand, if it is determined that the ACC power supply of the vehicle is in the ON state ("YES" in step S100), the process proceeds to step S110.

車両のＡＣＣ電源がオン状態にあると判定した場合（ステップＳ１００の「ＹＥＳ」）には、収音環境検出部５０から収音環境を取得する（ステップＳ１１０）。
つまり、フィルタ制御部６０は、ＡＣＣ電源がオンされた（情報処理装置１がオンされた）直後に、現在の収音環境を収音環境検出部５０から取得する。 If it is determined that the ACC power supply of the vehicle is in an on state ("YES" in step S100), the sound collection environment is acquired from the sound collection environment detection unit 50 (step S110).
That is, the filter control unit 60 acquires the current sound collection environment from the sound collection environment detection unit 50 immediately after the ACC power supply is turned on (the information processing device 1 is turned on).

フィルタ制御部６０は、ステップＳ１１０で取得した収音環境が、フィルタテーブルにあるか否かを判定する（ステップＳ１２０）。
取得した収音環境がフィルタテーブルにあると判定した場合（ステップＳ１２０の「ＹＥＳ」）には、処理をステップＳ１３０に移行させる。
一方で、取得した収音環境がフィルタテーブルにないと判定した場合（ステップＳ１２０の「ＮＯ」）には、処理をステップＳ１４０に移行させる。 The filter control unit 60 determines whether the sound collection environment acquired in step S110 is in the filter table (step S120).
If it is determined that the acquired sound collection environment is in the filter table ("YES" in step S120), the process proceeds to step S130.
On the other hand, if it is determined that the acquired sound collection environment is not in the filter table ("NO" in step S120), the process proceeds to step S140.

ここで、収音環境検出部５０から取得した収音環境と同じ収音環境がフィルタテーブルにあるか否かを判定する方法について説明する。
収音環境検出部５０から取得した収音環境を示す情報の中には、走行速度やエンジン回転数等のように、走行中に大きく値が変化する収音環境がある。
そのため、収音環境検出部５０から受信した収音環境と同じ収音環境がフィルタテーブルにあるか否かを判定するときには、同じであるかを判定するための類似度を算出する。 Here, a method for determining whether or not the same sound collection environment as the sound collection environment acquired from the sound collection environment detection unit 50 exists in the filter table will be described.
The information indicating the sound collection environment acquired from the sound collection environment detection unit 50 includes sound collection environment information whose values change significantly during driving, such as the driving speed and engine rotation speed.
Therefore, when determining whether or not the filter table contains a sound collection environment that is the same as the sound collection environment received from the sound collection environment detection unit 50, a similarity is calculated to determine whether or not they are the same.

図４を用いて、起動時直後に取得した収音環境ＫＡの値と、フィルタテーブルにある収音環境Ｋ１～Ｋ３と、を比較した場合を例示して、類似度について説明する。
フィルタ制御部６０は、収音環境検出部５０から取得した起動時の収音環境の走行速度の値がＫ_Ａ１であった場合には、フィルタテーブルにある走行速度の値（Ｋ_１１、Ｋ_２１、Ｋ_３１）との差の絶対値を算出し、その値を類似度としている。
そして、フィルタ制御部６０は、算出した類似度が所定値より小さい場合（例えば、類似度＜１０Ｋｍ／ｈの場合）には、２つの走行速度の値は同じであると判定する。
また、エンジン回転数においても、同様に算出した類似度が所定値より小さい場合（例えば、類似度＜２００ｒｐｍの場合）には、２つのエンジン回転数の数値は同じであると判定する。 The similarity will be described with reference to FIG. 4, exemplifying a case where the value of the sound collection environment KA obtained immediately after startup is compared with the sound collection environments K1 to K3 in the filter table.
When the value of the traveling speed in the sound collection environment at the time of startup obtained from the sound collection environment detection unit 50 is K _A1 , the filter control unit 60 calculates the absolute value of the difference between this value and the traveling speed values (K ₁₁ , K ₂₁ , K ₃₁ ) in the filter table, and regards this value as the similarity.
If the calculated similarity is smaller than a predetermined value (for example, if the similarity is smaller than 10 km/h), the filter control unit 60 determines that the two travel speed values are the same.
Similarly, when the similarity calculated for the engine speed is smaller than a predetermined value (for example, when the similarity is smaller than 200 rpm), the two engine speeds are determined to be the same.

上述した類似度の判定において、走行速度およびエンジン回転数の双方とも同じであると判定され、さらに、その他の収音環境（乗車位置、性別、窓開閉状態、エアコン作動状況）が同じであると判定された場合に、フィルタ制御部６０は、同じ収音環境であると判定する（収音環境検出部５０から取得した収音環境ＫＡと収音環境Ｋ３とは、同じ収音環境であると判定する）。
なお、上述した類似度の判定を行ったときに、複数の収音環境と同じであると判定された場合には、例えば、類似度の値が一番小さい値となる収音環境を、同じ収音環境として判定する。 In the above-mentioned similarity determination, if it is determined that both the driving speed and the engine rotation speed are the same, and further if it is determined that the other sound collection environments (riding position, gender, window opening/closing status, air conditioning operation status) are the same, the filter control unit 60 determines that the sound collection environments are the same (the sound collection environment KA and the sound collection environment K3 obtained from the sound collection environment detection unit 50 are determined to be the same sound collection environment).
When the above-described similarity determination is performed and it is determined that the sound collection environments are the same as multiple sound collection environments, for example, the sound collection environment with the smallest similarity value is determined to be the same sound collection environment.

起動時に取得した収音環境（ステップＳ１１０において取得した収音環境）が、フィルタテーブルにあると判定した場合（ステップＳ１２０の「ＹＥＳ」）には、その収音環境に紐づいたフィルタ特性をフィルタテーブルから取得し、そのフィルタ特性をフィルタ部２０に設定する（ステップＳ１３０）。 If it is determined that the sound collection environment acquired at startup (sound collection environment acquired in step S110) is in the filter table ('YES' in step S120), the filter characteristics associated with that sound collection environment are acquired from the filter table and set in the filter unit 20 (step S130).

一方で、起動時に取得した収音環境（ステップＳ１１０において取得した収音環境）がフィルタテーブルにないと判定した場合（ステップＳ１２０の「ＮＯ」）には、図示しないメモリに格納されている、最後にフィルタ部２０に設定したフィルタ特性を、フィルタ部２０に設定する（ステップＳ１４０）。 On the other hand, if it is determined that the sound collection environment acquired at startup (sound collection environment acquired in step S110) is not in the filter table ("NO" in step S120), the filter characteristics last set in the filter unit 20, which are stored in a memory not shown, are set in the filter unit 20 (step S140).

フィルタ制御部６０は、フィルタ特性算出部３０において算出されたフィルタ特性を取得し、取得したフィルタ特性をフィルタ部２０に設定する（ステップＳ１５０）。
つまり、起動時から所定時間経過すれば、フィルタ特性算出部３０がフィルタ特性を算出することができるため、フィルタ制御部６０は、起動時を除く期間では、フィルタ特性算出部３０において算出されたフィルタ特性を、フィルタ部２０に設定する。 The filter control unit 60 acquires the filter characteristics calculated by the filter characteristics calculation unit 30, and sets the acquired filter characteristics in the filter unit 20 (step S150).
In other words, once a predetermined time has passed since startup, the filter characteristic calculation unit 30 can calculate the filter characteristics, and the filter control unit 60 sets the filter characteristics calculated by the filter characteristic calculation unit 30 to the filter unit 20 during periods other than startup.

フィルタ制御部６０は、ステップＳ１５０においてフィルタ部２０に設定したフィルタ特性を算出した時の収音環境を、収音環境検出部５０から取得する（ステップＳ１６０）。 The filter control unit 60 obtains the sound collection environment at the time when the filter characteristics set in the filter unit 20 were calculated in step S150 from the sound collection environment detection unit 50 (step S160).

フィルタ制御部６０は、フィルタ部２０に設定したフィルタ特性を、図示しないメモリに保存する（ステップＳ１７０）。
つまり、ステップＳ１７０では、最後にフィルタ部２０に設定したフィルタ特性の値をメモリに格納する処理が実行される。 The filter control unit 60 stores the filter characteristics set in the filter unit 20 in a memory (not shown) (step S170).
That is, in step S170, the process of storing the filter characteristic values last set in the filter unit 20 in memory is executed.

フィルタ制御部６０は、ステップＳ１６０において取得しく収音環境と同じ収音環境が、フィルタテーブルにあるか否かを判定する（ステップＳ１８０）。
同じ収音環境がフィルタテーブルにあると判定した場合（ステップＳ１８０の「ＹＥＳ」）には、処理をステップＳ２００に移行させる。
一方で、同じ収音環境がフィルタテーブルにないと判定した場合（ステップＳ１８０の「ＮＯ」）には、ステップＳ１５０においてフィルタ部２０に設定したフィルタ特性と、ステップＳ１６０で取得した収音環境とを紐づけて、フィルタテーブルに追加する（ステップＳ１９０）。 The filter control unit 60 determines whether or not the same sound collection environment as the sound collection environment acquired in step S160 exists in the filter table (step S180).
If it is determined that the same sound collection environment exists in the filter table ("YES" in step S180), the process proceeds to step S200.
On the other hand, if it is determined that the same sound collection environment is not in the filter table ("NO" in step S180), the filter characteristics set in the filter unit 20 in step S150 are linked to the sound collection environment acquired in step S160 and added to the filter table (step S190).

つまり、フィルタ制御部６０は、フィルタ特性算出部３０においてフィルタ特性が算出されたときの収音環境を収音環境検出部５０から取得し、その収音環境とフィルタ特性とを紐づけて、フィルタテーブルに追加する。
具体的には、収音環境検出部５０から取得した収音環境である、例えば、乗員の乗車位置、各乗員の性別、車両窓の開閉状態、走行速度、エンジン回転数、エアコン作動状況等の各情報と、フィルタテーブルにある収音環境とを比較し、同じ収音環境がフィルタテーブルにあるか否かを判定し、フィルタテーブルに同じ収音環境がない場合には、その収音環境とフィルタ特性とを紐づけてフィルタテーブルに追加する。
より具体的には、例えば、図５に示すように、フィルタテーブルに登録されている収音環境Ｋ１～Ｋ５の中に、収音環境検出部５０から取得した収音環境と同じ収音環境があるか否かを判定し、同じ収音環境がない場合には、新たな収音環境Ｋ６として、その収音環境と、その収音環境に紐づいたフィルタ特性Ｆ６とをフィルタテーブルに追加する。
なお、同じ収音環境がフィルタテーブルにあるか否かの判定方法は、上述したステップＳ１２０における判定方法と同じである。 That is, the filter control unit 60 acquires from the sound collection environment detection unit 50 the sound collection environment when the filter characteristics were calculated by the filter characteristics calculation unit 30, associates the sound collection environment with the filter characteristics, and adds them to the filter table.
Specifically, the sound collection environment information obtained from the sound collection environment detection unit 50, such as the passenger's riding position, the gender of each passenger, the open/close state of the vehicle windows, the driving speed, the engine speed, the air conditioning operation status, etc., is compared with the sound collection environment in the filter table to determine whether the same sound collection environment is in the filter table, and if the same sound collection environment is not in the filter table, the sound collection environment and the filter characteristics are linked and added to the filter table.
More specifically, for example, as shown in FIG. 5, it is determined whether or not there is a sound collection environment K1 to K5 registered in the filter table that is the same as the sound collection environment acquired from the sound collection environment detection unit 50. If there is no such sound collection environment, the sound collection environment K6 and the filter characteristic F6 associated with that sound collection environment are added to the filter table as a new sound collection environment K6.
The method for determining whether the same sound collection environment is in the filter table is the same as the determination method in step S120 described above.

フィルタ制御部６０は、車両のＡＣＣ電源（アクセサリ電源）がオン状態であるか否かを判定する（ステップＳ２００）。
車両のＡＣＣ電源がオン状態にあると判定した場合（ステップＳ２００の「ＹＥＳ」）には、処理をステップＳ１５０に移行させ、処理を継続させる。
一方で、車両のＡＣＣ電源がオン状態にないと判定した場合（ステップＳ２００の「ＮＯ」）には、処理を終了させる。 Filter control unit 60 determines whether the ACC power supply (accessory power supply) of the vehicle is in an ON state (step S200).
If it is determined that the ACC power supply of the vehicle is in the ON state ("YES" in step S200), the process proceeds to step S150 and continues.
On the other hand, if it is determined that the ACC power supply of the vehicle is not in an ON state ("NO" in step S200), the process is ended.

本実施例に係る情報処理装置１は、収音した音声に基づいて、その音声から雑音を除去するためのフィルタ特性を算出するフィルタ特性算出部３０と、フィルタ特性算出部３０によって算出されたフィルタ特性に基づいて、収音した音声から雑音を除去するフィルタ部２０と、センサ部４０に基づいて音声の収音環境を検出する収音環境検出部５０と、起動時に収音環境を取得し、該起動時の収音環境と、フィルタ特性算出部３０が算出したフィルタ特性と収音環境とを紐づけたフィルタテーブルと、に基づいて、フィルタ部２０にフィルタ特性を設定するフィルタ制御部６０と、を備えている。
フィルタ部２０は、フィルタ特性算出部３０において算出されたフィルタ特性に基づいて収音した音声から雑音を除去する。
フィルタ特性算出部３０は、収音部１０において収音された音声に基づいて、その音声から雑音を除去するためのフィルタ特性を算出する。
収音環境検出部５０は、カメラ画像、車両センサ等のセンサ部４０の情報に基づいて、音声を収音している収音環境を検出する。
フィルタ制御部６０は、情報処理装置１の起動時に収音環境を取得し、その起動時の収音環境とフィルタテーブルと、に基づいて、フィルタ部２０に設定するフィルタ特性を決定し、そのフィルタ特性をフィルタ部２０に設定する。
フィルタ制御部６０は、情報処理装置１の起動時の収音環境が、フィルタテーブルにある場合には、起動時の収音環境に紐づいたフィルタ特性をフィルタ部２０に設定し、フィルタテーブルにない場合には、最後にフィルタ部２０に設定したフィルタ特性をフィルタ部２０に設定する。
起動時直後には、フィルタ特性を決定するための音声が収音されていないため、適切なフィルタ特性を決定することができない。
そのため、フィルタ制御部６０は、起動時の収音環境とフィルタテーブルとに基づいて、フィルタ部２０に設定するフィルタ特性を決定する。
これにより、フィルタ特性を決定するための音声が収音されていない起動時であっても、起動時の収音環境に基づいたフィルタ特性を設定することができるため、適切に音声から雑音を除去することができる。
また、フィルタテーブルに起動時の収音環境がない場合でも、最後にフィルタ部２０に設定したフィルタ特性を設定する。
つまり、ユーザが発話する空間の音に基づいて算出された実績のあるフィルタ特性が設定されるため、その空間にとって、より適切なフィルタ特性を設定することができる。 The information processing device 1 according to this embodiment includes a filter characteristic calculation unit 30 that calculates filter characteristics for removing noise from collected sound based on the sound, a filter unit 20 that removes noise from the collected sound based on the filter characteristics calculated by the filter characteristic calculation unit 30, a sound collection environment detection unit 50 that detects the sound collection environment based on a sensor unit 40, and a filter control unit 60 that acquires the sound collection environment at startup and sets the filter characteristics in the filter unit 20 based on the sound collection environment at startup and a filter table that links the sound collection environment with the filter characteristics calculated by the filter characteristic calculation unit 30.
The filter unit 20 removes noise from the collected voice based on the filter characteristics calculated by the filter characteristics calculation unit 30 .
The filter characteristic calculation unit 30 calculates, based on the sound collected by the sound collection unit 10, filter characteristics for removing noise from the sound.
The sound collection environment detection unit 50 detects the sound collection environment in which sound is being collected based on information from the sensor unit 40 such as a camera image and a vehicle sensor.
The filter control unit 60 acquires the sound collection environment when the information processing device 1 is started up, and determines the filter characteristics to be set in the filter unit 20 based on the sound collection environment at the time of startup and the filter table, and sets the filter characteristics in the filter unit 20.
If the sound collection environment at the time of startup of the information processing device 1 is in the filter table, the filter control unit 60 sets the filter characteristics associated with the sound collection environment at the time of startup to the filter unit 20, and if the sound collection environment is not in the filter table, the filter control unit 60 sets the filter characteristics last set to the filter unit 20 to the filter unit 20.
Immediately after startup, no sound is collected to determine the filter characteristics, so appropriate filter characteristics cannot be determined.
Therefore, the filter control unit 60 determines the filter characteristics to be set in the filter unit 20 based on the sound collection environment at the time of startup and the filter table.
This allows the filter characteristics to be set based on the sound collection environment at the time of startup, even when no sound is collected to determine the filter characteristics, so that noise can be appropriately removed from the sound.
Furthermore, even if the sound collection environment at the time of startup is not present in the filter table, the filter characteristics last set in the filter section 20 are set.
That is, since proven filter characteristics calculated based on the sound of the space in which the user speaks are set, it is possible to set filter characteristics that are more appropriate for that space.

さらに、フィルタ制御部６０は、フィルタ特性算出部３０においてフィルタ特性を算出したときの収音環境を取得し、その収音環境がフィルタテーブルにない場合には、その収音環境とフィルタ特性算出部３０において算出したフィルタ特性とを紐づけて、フィルタテーブルに追加する。
フィルタ特性算出部３０において算出されるフィルタ特性は、収音環境によってフィルタ特性の値は大きく変化する。
例えば、収音した音声に含まれるエンジン音は、エンジン回転数によって、音の大きさや周波数が変化する。
また、収音した音声に含まれるロードノイズは、走行速度によって、音の大きさや周波数が変化する。
また、収音した音声に含まれる発話音声は、発話する性別によって、音の大きさや周波数が変化する。
そのため、フィルタ特性を算出した時の収音環境と、算出されたフィルタ特性とを紐づけて、フィルタテーブルに蓄積することにより、その車両、乗員にとって、最適なフィルタ特性を蓄積することができる。
すなわち、情報処理装置１を動作させるだけで、その空間に最適な収音環境毎のフィルタ特性をフィルタテーブルに蓄積することができるため、フィルタ制御部６０は、フィルタテーブルを参照することにより、最適なフィルタ特性を設定することができる。 Furthermore, the filter control unit 60 acquires the sound collection environment when the filter characteristics are calculated by the filter characteristic calculation unit 30, and if the sound collection environment is not in the filter table, it links the sound collection environment with the filter characteristics calculated by the filter characteristic calculation unit 30 and adds them to the filter table.
The filter characteristics calculated by the filter characteristic calculation unit 30 vary greatly in value depending on the sound collection environment.
For example, the volume and frequency of engine sounds contained in the collected audio change depending on the engine speed.
Furthermore, the volume and frequency of road noise contained in the picked-up audio changes depending on the driving speed.
Furthermore, the volume and frequency of the speech contained in the collected audio changes depending on the gender of the speaker.
Therefore, by linking the sound pickup environment at the time the filter characteristics were calculated with the calculated filter characteristics and storing them in a filter table, it is possible to store the filter characteristics that are optimal for the vehicle and occupants.
In other words, simply by operating the information processing device 1, the filter characteristics for each sound collection environment that are optimal for that space can be stored in the filter table, and the filter control unit 60 can set the optimal filter characteristics by referring to the filter table.

また、フィルタ制御部６０は、情報処理装置１の起動時を除く期間では、フィルタ特性算出部３０で算出されたフィルタ特性をフィルタ部２０に設定する。
すなわち、情報処理装置１の起動時以外では、フィルタ特性算出部３０が、収音部１０により収音した音声に基づいて算出したフィルタ特性をフィルタ部２０に設定する。
これにより、情報処理装置１の起動時以外は、ユーザが発話する空間の音声に基づいてフィルタ特性が算出されるため、最適なフィルタ特性を設定することができる。 Furthermore, the filter control unit 60 sets the filter characteristics calculated by the filter characteristics calculation unit 30 in the filter unit 20 during periods other than when the information processing device 1 is started up.
That is, except when the information processing device 1 is started up, the filter characteristic calculation unit 30 sets the filter characteristic calculated based on the sound picked up by the sound collection unit 10 in the filter unit 20 .
As a result, except when the information processing device 1 is started up, the filter characteristics are calculated based on the sound in the space where the user speaks, so that the optimal filter characteristics can be set.

また、センサ部４０のセンサ情報は、少なくとも車両内を撮像した画像と車両の走行状態を示す情報とを含んでいる。
すなわち、収音した音声に含まれる雑音を除去するフィルタ特性に影響を与える要因である、乗車位置、性別、窓開閉状態、走行速度、エンジン回転数等の情報に紐づけてフィルタテーブルを作成し、該フィルタテーブルと、起動時の収音環境により、フィルタ部２０に設定するフィルタ値を決定する。
これにより、ユーザの発話する空間にとって最適なフィルタ特性を、起動時の収音環境により決定することができるため、起動時であっても、収音した音声から雑音を除去することができる。 The sensor information of the sensor unit 40 includes at least an image captured inside the vehicle and information indicating the running state of the vehicle.
That is, a filter table is created in association with information such as the passenger position, gender, window opening/closing status, driving speed, and engine RPM, which are factors that affect the filter characteristics for removing noise contained in the collected voice, and the filter values to be set in the filter unit 20 are determined based on the filter table and the sound collection environment at the time of startup.
This allows the optimum filter characteristics for the space in which the user speaks to be determined based on the sound collection environment at the time of startup, making it possible to remove noise from the collected voice even at startup.

また、収音部１０の収音した音声には、雑音とユーザの発話音声とが含まれ、フィルタ部２０において雑音が除去された音声は、音声認識エンジンに送信される。
すなわち、フィルタ制御部６０の制御により、情報処理装置１が起動直後であっても、最適なフィルタ特性がフィルタ部２０に設定されるため、収音部１０において収音した音声から雑音を除去することができる。
これにより、音声認識エンジンにおける、発話音声の認識率を向上させることができる。 The sound collected by the sound collection unit 10 contains noise and the user's speech, and the sound from which the noise has been removed by the filter unit 20 is sent to a speech recognition engine.
In other words, under the control of the filter control unit 60, optimal filter characteristics are set in the filter unit 20 even immediately after the information processing device 1 is started up, so that noise can be removed from the sound picked up by the sound pickup unit 10.
This makes it possible to improve the recognition rate of the spoken voice in the voice recognition engine.

＜その他の実施例＞
上述した収音環境検出部５０では、乗員の乗車位置、各乗員の性別、車両窓の開閉状態、走行速度、エンジン回転数、エアコン作動状況等を収音環境として検出していたが、図６に示すような情報をさらに検出するようにしてもよい。
具体的には、カメラで車両内外を撮像した画像から、現在の天候状態や周辺車両の走行状態を収音環境として検出するようにしてもよい。
降雨時は晴天時に比べ、走行騒音が増える可能性があるため、現在の天候を収音環境の条件として追加することにより、より適切なフィルタ特性を設定することができる。
また、車両周辺にトラックやバイク等が走行している場合には、走行騒音が増える可能性があるため、自車両周辺の走行車両の状態を収音環境として検出することにより、より適切なフィルタ特性を設定することができる。
また、自車両のＧＰＳ（Global Positioning System）情報から、現在車両が走行している位置を収音環境として検出してもよい。
例えば、高速道路、住宅街、市街地等の走行位置特有に発生する雑音があるため、走行位置を把握することにより、より適切なフィルタ特性を設定することができる。 <Other Examples>
The sound collection environment detection unit 50 described above detects the passenger positions, the gender of each passenger, the open/closed state of the vehicle windows, the traveling speed, the engine speed, the air conditioner operation status, etc. as the sound collection environment, but may also detect further information such as that shown in FIG. 6.
Specifically, the current weather conditions and the driving conditions of surrounding vehicles may be detected as the sound collection environment from images taken inside and outside the vehicle by a camera.
Since road noise may increase during rainy weather compared to sunny days, more appropriate filter characteristics can be set by adding the current weather as a condition of the sound collection environment.
In addition, if trucks, motorcycles, etc. are traveling around the vehicle, there is a possibility that the noise from the vehicles traveling around the vehicle may increase, so by detecting the state of the vehicles traveling around the vehicle as the sound collection environment, more appropriate filter characteristics can be set.
Alternatively, the current location of the vehicle may be detected as the sound collection environment from GPS (Global Positioning System) information of the vehicle.
For example, there is noise that occurs uniquely in driving locations such as highways, residential areas, and urban areas, so by knowing the driving location, more appropriate filter characteristics can be set.

また、上述した収音環境検出部５０では、車両内を撮像する画像に基づいて、発話者の位置等を検出していたが、マイクアレイを用いて収音した音声から、乗員の乗車位置を収音環境として検出するようにしてもよい。
これにより、車両内を撮像するカメラが設置できない場合でも、乗員の乗車位置を把握することができる。 Furthermore, the above-mentioned sound collection environment detection unit 50 detects the position of the speaker, etc. based on images taken inside the vehicle, but it is also possible to detect the passenger's riding position as the sound collection environment from the sound collected using a microphone array.
This makes it possible to determine the passenger's position even if a camera cannot be installed to capture images inside the vehicle.

また、上述した情報処理装置１では、フィルタ制御部６０においてフィルタテーブルを生成したが、収音環境検出部５０から受信した収音環境を、インターネット回線を介してサーバに送信し、サーバにおいてフィルタテーブルを作成するようにしてもよい。
これにより、フィルタ制御部６０におけるフィルタテーブル作成処理の負荷をなくすことができるため、消費電力を低減することができる。
また、フィルタテーブルを格納および作成するためのメモリの容量を小さくすること、もしくは、削除することができるため、情報処理装置１のコストダウンを図ることができる。
また、サーバにフィルタテーブルを保管することにより、フィルタテーブルを他のユーザと共有することができる。
具体的には、例えば、サーバにおいて、同じ車種毎のフィルタテーブルのデータ集計および分析を行い、車種毎に共有できるフィルタテーブルを生成する。
これにより、共有のフィルタテーブルを参照して、フィルタ特性を設定することができるため、情報処理装置１Ａを初めて使用する、起動直後であっても、最適なフィルタ特性を設定することができる。 Furthermore, in the above-described information processing device 1, the filter table is generated in the filter control unit 60, but the sound collection environment received from the sound collection environment detection unit 50 may be transmitted to a server via an Internet line, and the filter table may be generated in the server.
This eliminates the load of the filter table creation process on the filter control unit 60, thereby reducing power consumption.
Furthermore, the memory capacity for storing and creating the filter table can be reduced or eliminated, which allows the cost of the information processing device 1 to be reduced.
Furthermore, by storing the filter table on a server, the filter table can be shared with other users.
Specifically, for example, the server aggregates and analyzes data from filter tables for the same vehicle type, and generates a filter table that can be shared for each vehicle type.
This allows the filter characteristics to be set by referring to the shared filter table, so that optimal filter characteristics can be set even when the information processing device 1A is used for the first time or immediately after startup.

上述した情報処理装置１では、起動時に取得した収音環境がフィルタテーブルにない場合には、最後にフィルタ部２０に設定したフィルタ特性をフィルタ部２０に設定していたが、起動時に取得した収音環境に一番近い収音環境に紐づいたフィルタ特性をフィルタ部２０に設定するようにしてもよい。
具体的には、例えば、上述した類似度が所定値以上であった場合でも、類似度が一番小さい値を示す収音環境を、同じ収音環境と判定し、その収音環境に紐づいたフィルタ特性をフィルタ部２０に設定する。
これにより、起動時に取得した収音環境に一番近い収音環境に紐づいたフィルタ特性をフィルタ部２０に設定できるため、ユーザが発話する空間にとって、最適なフィルタ特性を設定することができる。 In the above-described information processing device 1, if the sound collection environment acquired at startup is not in the filter table, the filter characteristics last set in the filter unit 20 are set in the filter unit 20, but it is also possible to set the filter characteristics associated with the sound collection environment that is closest to the sound collection environment acquired at startup in the filter unit 20.
Specifically, for example, even if the above-mentioned similarity is equal to or greater than a predetermined value, the sound collection environment showing the smallest similarity value is determined to be the same sound collection environment, and the filter characteristics associated with that sound collection environment are set in the filter unit 20.
This allows the filter unit 20 to be set with filter characteristics associated with the sound collection environment that is closest to the sound collection environment acquired at startup, making it possible to set filter characteristics that are optimal for the space in which the user speaks.

以上、この発明の実施例につき、図面を参照して詳述してきたが、具体的な構成はこの実施例に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The above describes in detail an embodiment of the present invention with reference to the drawings, but the specific configuration is not limited to this embodiment and also includes designs that do not deviate from the gist of the present invention.

１；情報処理装置
１０；収音部
２０；フィルタ部
３０；フィルタ特性算出部
４０；センサ部
５０；収音環境取得部
６０；フィルタ制御部 REFERENCE SIGNS LIST 1; information processing device 10; sound collection unit 20; filter unit 30; filter characteristic calculation unit 40; sensor unit 50; sound collection environment acquisition unit 60; filter control unit

Claims

a filter characteristic calculation unit that calculates a filter characteristic for removing noise from a collected voice based on the collected voice;
a filter unit that removes noise from the collected voice based on the filter characteristics calculated by the filter characteristics calculation unit;
a sound collection environment detection unit that detects a sound collection environment of the sound based on sensor information;
a filter control unit that sets the filter characteristics calculated by the filter characteristics calculation unit to the filter unit during a period other than the start-up period;
Equipped with
the filter control unit acquires the sound collection environment only at startup, and sets the filter characteristics in the filter unit based on the sound collection environment at startup and a filter table that associates the filter characteristics calculated in advance by the filter characteristic calculation unit with the sound collection environment.
An information processing device comprising:

The information processing device according to claim 1, characterized in that, if the sound collection environment at the time of startup is in the filter table, the filter control unit sets the filter characteristics associated with the sound collection environment at the time of startup to the filter unit, and if the sound collection environment is not in the filter table, the filter control unit sets the filter characteristics last set to the filter unit to the filter unit.

3. The information processing apparatus according to claim 1, wherein the filter control unit acquires the sound collection environment when the filter characteristics are calculated by the filter characteristics calculation unit, and when the sound collection environment is not in the filter table, the sound collection environment and the filter characteristics calculated by the filter characteristics calculation unit are linked to each other and added to the filter table.

2. The information processing apparatus according to claim 1 , wherein the sensor information includes at least an image captured inside the vehicle and information indicating a running state of the vehicle.

2. The information processing apparatus according to claim 1 , wherein the collected voice includes the noise and a voice uttered by the user, and the voice from which the noise has been removed by the filter unit is transmitted to a voice recognition engine.

An information processing method in an information processing device including a filter characteristic calculation unit, a filter unit, a sound collection environment detection unit, and a filter control unit,
a first step in which the filter characteristic calculation unit calculates a filter characteristic for removing noise from the audio based on the audio;
a second step in which the filter unit removes noise from the collected voice based on the filter characteristics calculated by the filter characteristics calculation unit;
a third step in which the sound collection environment detection unit detects the sound collection environment based on sensor information;
a fourth step in which the filter control unit sets the filter characteristics calculated by the filter characteristics calculation unit in the filter unit during a period other than a start-up period;
Equipped with
the fourth step is an information processing method characterized in that the filter control unit acquires the sound collection environment only at startup, and sets the filter characteristics in the filter unit based on the sound collection environment at startup and a filter table that associates the filter characteristics calculated in advance by the filter characteristic calculation unit with the sound collection environment.

A program for causing a computer to execute an information processing method in an information processing device including a filter characteristic calculation unit, a filter unit, a sound collection environment detection unit, and a filter control unit,
a first step in which the filter characteristic calculation unit calculates a filter characteristic for removing noise from the audio based on the audio;
a second step in which the filter unit removes noise from the collected voice based on the filter characteristics calculated by the filter characteristics calculation unit;
a third step in which the sound collection environment detection unit detects the sound collection environment based on sensor information;
a fourth step in which the filter control unit sets the filter characteristics calculated by the filter characteristics calculation unit in the filter unit during a period other than a start-up period;
Equipped with
In the fourth step, the filter control unit acquires the sound collection environment only at startup, and sets the filter characteristics in the filter unit based on the sound collection environment at startup and a filter table that stores the filter characteristics calculated in advance by the filter characteristic calculation unit in association with the sound collection environment.

A non-transitory computer-readable recording medium storing a program for causing a computer to execute an information processing method in an information processing device including a filter characteristic calculation unit, a filter unit, a sound collection environment detection unit, and a filter control unit,
a first step in which the filter characteristic calculation unit calculates a filter characteristic for removing noise from the audio based on the audio;
a second step in which the filter unit removes noise from the collected voice based on the filter characteristics calculated by the filter characteristics calculation unit;
a third step in which the sound collection environment detection unit detects the sound collection environment based on sensor information;
a fourth step in which the filter control unit sets the filter characteristics calculated by the filter characteristics calculation unit in the filter unit during a period other than a start-up period;
Equipped with
a recording medium having a program recorded thereon for causing a computer to execute an information processing method in which, in the fourth step, the filter control unit acquires the sound collection environment only at startup, and sets the filter characteristics in the filter unit based on the sound collection environment at startup and a filter table that associates and stores the filter characteristics calculated in advance by the filter characteristic calculation unit with the sound collection environment.