JP6233023B2

JP6233023B2 - Acoustic processing apparatus, acoustic processing method, and acoustic processing program

Info

Publication number: JP6233023B2
Application number: JP2014000178A
Authority: JP
Inventors: 純也藤本; 桂樹岡林
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2014-01-06
Filing date: 2014-01-06
Publication date: 2017-11-22
Anticipated expiration: 2034-01-06
Also published as: JP2015130550A

Description

The present invention relates to an acoustic processing device, an acoustic processing method, and an acoustic processing program.

There is a technique that makes a person perceive a sound as coming from an arbitrary direction by using a head impulse response (HRIR: Head Related Impulse Response), which represents the temporal characteristics of a sound as it is affected by the person's head and pinnae on its way into the left and right ears. This kind of technique is called sound image localization technology or stereophonic technology.

Head impulse responses differ from person to person. A technique has therefore been proposed in which head impulse responses are measured using each of a plurality of sound sources placed around an individual's head, and the measured head impulse responses are interpolated with one another to obtain a head impulse response for an arbitrary direction (see, for example, Patent Document 1). A method has also been proposed in which the interpolation between an individual's head impulse responses measured for a plurality of directions uses a common head impulse response obtained by statistical learning on head impulse responses measured and accumulated for a plurality of persons (see, for example, Patent Documents 2 and 3).

Patent Document 1: JP 2000-166000 A
Patent Document 2: JP 2008-527821 A (published Japanese translation of a PCT application)
Patent Document 3: JP 2010-45489 A

In the conventional technique that generates a head impulse response for an arbitrary direction by interpolating an individual's head impulse responses measured for a plurality of directions, the individual's head impulse responses are measured with a plurality of sound sources arranged around the individual's head, for example at equal intervals. This kind of technique therefore often requires a large-scale measuring apparatus, and because the measurement is repeated for each of the surrounding sound sources, the measurement work is also laborious.

Meanwhile, to realize a service using sound image localization technology for each of a large number of persons, for example persons gathered at an exhibition, it is desirable to prepare head impulse responses for arbitrary directions for each of those persons.

However, preparing a large-scale measuring apparatus and repeating laborious work to measure the head impulse responses of each of a large number of persons is difficult to realize because of constraints on the place where the measuring apparatus is installed and constraints on time.

An object of the acoustic processing device, acoustic processing method, and acoustic processing program of the present disclosure is to provide a technique that realizes good sound image localization using head impulse responses measured for each individual in only some directions.

According to one aspect, an acoustic processing device includes: a prediction unit that predicts, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response for when sound reaches the head from a second direction outside the predetermined range; and a correction unit that corrects a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted by the prediction unit. The prediction unit includes: a specifying unit that specifies, as a positional relationship between the head and a sound source installed in the first direction when the impulse response was measured, a positional relationship for which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and a calculation unit that calculates, based on the positional relationship specified by the specifying unit, the delay time predicted for when sound reaches the head from the second direction.

According to another aspect, an acoustic processing method includes: a prediction step of predicting, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response for when sound reaches the head from a second direction outside the predetermined range; and a correction step of correcting a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted in the prediction step. The prediction step includes: a specifying step of specifying, as a positional relationship between the head and a sound source installed in the first direction when the impulse response was measured, a positional relationship for which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and a calculation step of calculating, based on the positional relationship specified in the specifying step, the delay time predicted for when sound reaches the head from the second direction.

According to another aspect, an acoustic processing program causes a computer to execute processing that includes: a prediction step of predicting, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response for when sound reaches the head from a second direction outside the predetermined range; and a correction step of correcting a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted in the prediction step. The prediction step includes: a specifying step of specifying, as a positional relationship between the head and a sound source installed in the first direction when the impulse response was measured, a positional relationship for which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and a calculation step of calculating, based on the positional relationship specified in the specifying step, the delay time predicted for when sound reaches the head from the second direction.

The acoustic processing device, acoustic processing method, and acoustic processing program of the present invention can realize good sound image localization using head impulse responses measured for each individual in only some directions.

FIG. 1 is a diagram showing an embodiment of the acoustic processing device.
FIG. 2 is a diagram showing an example of the range over which the measuring device shown in FIG. 1 measures individual head impulse responses.
FIG. 3 is a diagram showing an example of an individual head impulse response.
FIG. 4 is a diagram showing an example of the relationship between delay time and sound source direction.
FIG. 5 is a diagram showing an example of correcting the delay time of a common head impulse response.
FIG. 6 is a diagram showing the operation of the acoustic processing device shown in FIG. 1.
FIG. 7 is a diagram showing an example of the positional relationship between the microphones shown in FIG. 2 and a sound source.
FIG. 8 is a diagram showing another embodiment of the acoustic processing device.
FIG. 9 is a diagram showing an example of the delay times calculated by the calculation unit shown in FIG. 8.
FIG. 10 is a diagram showing another embodiment of the acoustic processing device.
FIG. 11 is a diagram showing an example of the weights set by the weight setting unit shown in FIG. 10.
FIG. 12 is a diagram showing another embodiment of the acoustic processing device.
FIG. 13 is a diagram showing an example of the positional relationship between the person shown in FIG. 12 and exhibits in an exhibition hall.
FIG. 14 is a diagram showing an example of the hardware configuration of the acoustic processing device.
FIG. 15 is a diagram showing an example of the hardware configuration of the position detection device shown in FIG. 14.
FIG. 16 is a diagram showing the operation of the measuring device shown in FIG. 14.
FIG. 17 is a diagram showing the operation of the acoustic AR device shown in FIG. 14.

Embodiments are described below with reference to the drawings. The following describes a technique that uses, in combination, head impulse responses measured for each individual for directions included in a predetermined measurement range and common head impulse responses prepared in advance for other directions not included in the measurement range.

When head impulse responses for various directions, expressed as angles relative to the orientation of a person's head, are compared, the head impulse responses for the front of the head show larger individual differences than those for the sides and rear of the head. The following therefore describes a case that combines head impulse responses measured for each individual over a predetermined range including the direction the person's head faces with head impulse responses modeled in advance, for example by measurement using a dummy head, for the range in which no individual measurement is performed.

FIG. 1 shows an embodiment of an acoustic processing device. The acoustic processing device 10 shown in FIG. 1 includes a prediction unit 11 and a correction unit 12. The measuring device EQ shown in FIG. 1 measures head impulse responses specific to a person Q1 for first directions, which are a plurality of directions set inside a predetermined measurement range R described with reference to FIG. 2, and passes the measured head impulse responses of the person Q1 to the acoustic processing device 10. The storage device SD shown in FIG. 1 stores information indicating other head impulse responses obtained, for example, by measurement using a dummy head for second directions, which are a plurality of directions set outside the measurement range R of the measuring device EQ. The storage device SD may be provided as a component independent of the acoustic processing device 10 or may be included in the acoustic processing device 10.

In the following description, a head impulse response specific to the person Q1 obtained by measurement with the measuring device EQ is called an individual head impulse response. The other head impulse response indicated by the information stored in the storage device SD is called a common head impulse response. The common head impulse response is not limited to a head impulse response obtained by measurement using a dummy head or the like; it may be any head impulse response modeled in advance for sound from a second direction set outside the measurement range R. For example, the common head impulse response may be a head impulse response modeled by learning from head impulse responses measured for a large number of persons. The measurement of the individual head impulse responses by the measuring device EQ and the measurement of the common head impulse responses are described later with reference to FIG. 2.

In the acoustic processing device 10 shown in FIG. 1, the prediction unit 11 receives the individual head impulse responses measured by the measuring device EQ for each of the first directions. Based on the received individual head impulse responses, the prediction unit 11 performs the prediction process described later with reference to FIGS. 3 and 4, and predicts the delay time with which sound would reach the person Q1 from each of a plurality of second directions set outside the measurement range R of the measuring device EQ. The delay time predicted by the prediction unit 11 for each second direction is passed to the correction unit 12. By performing the correction process described later with reference to FIG. 5, the correction unit 12 matches the delay time of the common head impulse response stored in the storage device SD for each second direction to the delay time predicted for the same direction. The common head impulse responses whose delay times have been corrected by the correction unit 12 and the measured individual head impulse responses are passed to the acoustic AR (Augmented Reality) device ARC described next. The individual head impulse responses and the corrected common head impulse responses passed to the acoustic AR device ARC are used in combination in sound image localization processing, which is processing that localizes a sound image in an arbitrary direction.
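The delay correction outlined above can be pictured as a time shift of the common head impulse response. The following is a minimal sketch under stated assumptions, not the correction process of the embodiment itself (that process is the one described with reference to FIG. 5): responses are lists of samples, delays are given in whole samples, and the name `shift_to_delay` is hypothetical.

```python
def shift_to_delay(common_hrir, current_delay, predicted_delay):
    """Shift a response so its onset moves from current_delay to predicted_delay.

    Assumption of this sketch: both delays are integer sample counts.
    The output keeps the input length; samples shifted past either end
    are dropped and vacated positions are filled with zeros.
    """
    n = len(common_hrir)
    shift = predicted_delay - current_delay
    out = [0.0] * n
    for i, value in enumerate(common_hrir):
        j = i + shift
        if 0 <= j < n:
            out[j] = value
    return out


# Hypothetical common response whose onset is at sample 2.
common = [0.0, 0.0, 1.0, 0.5, 0.0, 0.0]
# Move the onset to sample 4, the delay predicted for this listener.
corrected = shift_to_delay(common, current_delay=2, predicted_delay=4)
```

Only the timing changes in this sketch; the shape of the common response is left as modeled.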

The acoustic AR device ARC includes a control unit CNT, an audio database DB1, and an acoustic processing unit SP built into a terminal device UE that the person Q1 can carry, such as a smartphone or a tablet terminal. The control unit CNT is connected to the audio database DB1 and can acquire the audio information stored in the audio database DB1. The control unit CNT and the acoustic processing unit SP are connected by a communication path using, for example, a wireless LAN (Local Area Network). The acoustic processing unit SP has a function of generating an acoustic signal based on audio information received from the control unit CNT.

The control unit CNT associates, for example, each direction in which a sound image is localized by the sound image localization processing with an individual head impulse response or a corrected common head impulse response passed from the acoustic processing device 10. For example, the control unit CNT stores, in an internal memory or the like, information indicating the correspondence between each of the first directions set inside the predetermined measurement range R and the individual head impulse response obtained by measurement for that first direction. The control unit CNT also stores, in the internal memory or the like, information indicating the correspondence between each of the second directions set outside the measurement range R and the corrected common head impulse response obtained by correcting the delay time of the common head impulse response for that second direction.

When passing audio information acquired from the audio database DB1 to the acoustic processing unit SP, the control unit CNT reads from the internal memory or the like the information indicating the individual head impulse response or corrected common head impulse response stored for the direction in which the sound image is to be localized. The control unit CNT then passes the information read from the internal memory or the like to the acoustic processing unit SP, together with the audio information acquired from the audio database DB1, as information indicating the head impulse response to be applied to the acoustic signal generated from the audio information.
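The correspondence kept by the control unit CNT can be illustrated as two lookup tables keyed by direction. The sketch below is an assumption-laden illustration rather than the stored format of the embodiment: angles are in degrees relative to the front direction, the tables are plain dictionaries, the nearest stored direction is chosen (nearest-angle selection is an assumption), and `select_response` and `angular_distance` are hypothetical names.

```python
def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees."""
    return abs((a_deg - b_deg + 180) % 360 - 180)


def select_response(angle_deg, phi_deg, individual, corrected_common):
    """Return the stored response for the direction nearest to angle_deg.

    individual:       {angle: response} for first directions, |angle| <= phi.
    corrected_common: {angle: response} for second directions, |angle| > phi.
    """
    # Normalize to [-180, 180) so the range test works for any input angle.
    normalized = (angle_deg + 180) % 360 - 180
    table = individual if abs(normalized) <= phi_deg else corrected_common
    nearest = min(table, key=lambda a: angular_distance(a, normalized))
    return table[nearest]


# Hypothetical tables; the strings stand in for stored impulse responses.
individual = {-30: "individual(-30)", 0: "individual(0)", 30: "individual(30)"}
corrected_common = {-90: "common(-90)", 90: "common(90)", 180: "common(180)"}

front = select_response(20, 60, individual, corrected_common)
rear = select_response(-170, 60, individual, corrected_common)
```

Here `front` comes from the individual table and `rear` from the corrected common table, matching the split between the inside and outside of the measurement range R.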

The acoustic processing unit SP generates an acoustic signal based on the audio information received from the control unit CNT. Using a built-in filter, the acoustic processing unit SP convolves the head impulse response passed from the control unit CNT with the acoustic signal generated from the audio information, and outputs the convolved acoustic signal through the earphones EPL and EPR worn in the ears of the person Q1.
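The convolution performed by the acoustic processing unit SP can be sketched as follows. This is a plain direct-form convolution written for illustration; the function names and the toy sample values are assumptions, and a practical implementation would more likely use an FFT-based or block-based filter.

```python
def convolve(signal, response):
    """Direct-form linear convolution of a signal with an impulse response."""
    out = [0.0] * (len(signal) + len(response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(response):
            out[i + j] += s * h
    return out


def render_binaural(signal, response_left, response_right):
    """Apply the left-ear and right-ear responses to one source signal."""
    return convolve(signal, response_left), convolve(signal, response_right)


# Toy data: the right-ear response is later and weaker than the left-ear
# response, as would be expected for a source on the listener's left.
signal = [1.0, 0.5]
left, right = render_binaural(signal, [0.0, 1.0], [0.0, 0.0, 0.5])
```

The two outputs correspond to the signals sent to the earphones EPL and EPR; the difference in onset and level between them is what carries the direction cue.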

That is, to generate sound that localizes a sound image in each of the first directions relative to the person Q1, the acoustic AR device ARC shown in FIG. 1 uses the individual head impulse response measured with sound reaching the person Q1 from that same first direction. To generate sound that localizes a sound image in a second direction relative to the person Q1, the acoustic AR device ARC uses the corrected common head impulse response obtained by correcting the delay time of the common head impulse response modeled for sound from that second direction.

The terminal device UE is not limited to a smartphone or tablet terminal; it may be any device that the person Q1 can carry and that includes the acoustic processing unit SP for causing the earphones EPL and EPR to output stereo sound, such as a mobile phone or a portable game machine. The control unit CNT of the acoustic AR device ARC may be included in the terminal device UE, and the acoustic processing device 10 may include the control unit CNT and the acoustic processing unit SP of the acoustic AR device ARC.

Next, before describing the functions and operation of the prediction unit 11 and the correction unit 12 included in the acoustic processing device 10, the method by which the measuring device EQ measures individual head impulse responses is described.

FIG. 2 shows an example of the range over which the measuring device EQ shown in FIG. 1 measures individual head impulse responses. Elements shown in FIG. 2 that are equivalent to elements shown in FIG. 1 are denoted by the same reference signs, and their description may be omitted.

Microphones MCL and MCR are attached to the ears EL and ER, respectively, of the person Q1 shown in FIG. 2. The outputs of the microphones MCL and MCR are connected to the measuring device EQ. The measuring device EQ has a function of generating a TSP (Time Stretched Pulse) signal, which is a signal for measuring impulse responses, and inputs the generated TSP signal to a speaker S1. In the example of FIG. 2, the speaker S1 is installed at a distance D from the person Q1 in the direction of an angle θ1 measured from the direction Dir, which indicates the front of the head of the person Q1. The circle S' drawn with a dotted line in FIG. 2 shows an example of a sound source used for measuring the common head impulse responses, which is described later with reference to FIG. 4 together with FIG. 2.

The measuring device EQ receives, for example, the sound that has reached the person Q1 from the speaker S1 as acoustic signals obtained by the microphones MCL and MCR. It then passes the impulse response indicated by the received acoustic signals to the acoustic processing device 10 as the individual head impulse response for the direction at angle θ1 from the direction Dir of the head of the person Q1. In this way, the measuring device EQ measures the individual head impulse response.

In the same way, the measuring device EQ measures individual head impulse responses while changing the angle θ1 at which the speaker S1 is installed, within the measurement range R represented by a sector with central angle 2φ whose central axis is the direction Dir of the head of the person Q1. For example, the measuring device EQ causes each of a plurality of speakers S1, installed on the arc of the sector representing the measurement range R at the positions given by the respective measurement angles θ1, to emit in sequence the sound corresponding to the TSP signal. The measuring device EQ then obtains the individual head impulse response for each of the first directions from the acoustic signal representing the sound that reached the person Q1 from the first direction corresponding to the position of each speaker S1 (for example, the direction of angle θ1). The measurement range R shown in FIG. 2 is an example of a measurement range in which the angle relative to the front direction Dir of the person Q1 lies within a predetermined range, and the direction indicated by the angle θ1 is an example of the plurality of first directions set inside the measurement range R. The central angle 2φ of the sector representing the measurement range R is, for example, an angle smaller than 180 degrees, and is desirably set to about 120 to 150 degrees.

Measuring the individual head impulse responses for the directions included in the measurement range R shown in FIG. 2 requires less space than measuring head impulse responses for all 360 degrees around the person Q1. Compared with the 360-degree case, the number of speakers S1 and the number of wires connecting the speakers S1 to the measuring device EQ can therefore be reduced, and the measurement time can also be shortened.

Next, a method is described for correcting the delay times of the common head impulse responses using the delay times of the individual head impulse responses measured over a predetermined range relative to the head of the person Q1.

Based on the individual head impulse responses received from the measuring device EQ, the prediction unit 11 shown in FIG. 1 predicts the delay time of the impulse response for sound arriving from each of the plurality of second directions set outside the measurement range R shown in FIG. 2.

Based on the individual head impulse response for each first direction, the prediction unit 11 specifies the delay time with which sound from each of the first directions reaches the head of the person Q1. For example, in an individual head impulse response received from the measuring device EQ, the prediction unit 11 takes as the delay time the time from when the sound was generated until the amplitude of the waveform becomes equal to or greater than a predetermined threshold. The threshold used to specify the delay time is desirably set, for example, to a value equivalent to the threshold used to discriminate noise components in an acoustic signal.

FIG. 3 shows an example of an individual head impulse response. The coordinate axis t in FIG. 3 indicates the time elapsed since the sound was generated, and the coordinate axis P indicates sound pressure. As the time at which the sound was generated, it is desirable to use the time at which the TSP signal was passed from the measuring device EQ shown in FIG. 2 to the speaker S1.

The example of FIG. 3 shows the waveform of the individual head impulse response obtained from the acoustic signal picked up by one of the microphones MCL and MCR when sound reached the head of the person Q1 from the direction indicated by the angle θ1 in FIG. 2. In the example of FIG. 3, the delay time of the individual head impulse response for the direction indicated by the angle θ1 in FIG. 2 is given by the time δp(θ1) from the origin of the coordinate axis t (t = 0) until the waveform of the individual head impulse response first exceeds the threshold Thp. FIG. 3 shows the individual head impulse response obtained for one ear of the person Q1; the individual head impulse response obtained for the other ear is not shown.
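The threshold-based delay described above amounts to finding the first sample whose amplitude reaches the threshold. The sketch below illustrates that idea under assumptions not stated in the source: the response is a list of samples starting at t = 0, the absolute value of each sample is compared with the threshold, and the sample rate and the name `onset_delay` are hypothetical.

```python
def onset_delay(response, threshold, sample_rate_hz):
    """Time in seconds from t = 0 to the first sample whose absolute
    amplitude is at least `threshold`; None if no sample reaches it."""
    for index, sample in enumerate(response):
        if abs(sample) >= threshold:
            return index / sample_rate_hz
    return None


# Toy response: low-level noise followed by the direct sound at sample 3.
response = [0.0, 0.01, -0.02, 0.4, -0.3]
delay = onset_delay(response, threshold=0.1, sample_rate_hz=48000)
```

With these toy values the onset is found at sample index 3, so `delay` equals 3 / 48000 seconds. Choosing the threshold close to the noise-discrimination level, as the text suggests, keeps noise samples from being mistaken for the onset.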

From the relationship between each first direction set inside the measurement range R shown in FIG. 2 (for example, θ1 in FIG. 2) and the delay time δp(θ1) of the individual head impulse response, the prediction unit 11 predicts the delay time of the impulse response to sound from other directions outside the measurement range R.

FIG. 4 shows an example of the relationship between delay time and sound source direction. In FIG. 4, the coordinate axis θ indicates the direction of the sound source relative to the direction Dir of the head of the person Q1 shown in FIG. 2, and the coordinate axis t indicates the delay time shown in FIG. 3. In the example of FIG. 4, an angle measured clockwise from the front direction Dir of the head of the person Q1 shown in FIG. 2 is shown as a positive value on the θ axis, and an angle measured counterclockwise from the direction Dir is shown as a negative value. That is, the measurement range R shown in FIG. 2 corresponds to the range from angle −φ to angle +φ on the θ axis of FIG. 4.

In the example of FIG. 4, each black circle indicates the delay time of the individual head impulse response measured when sound reaches one ear of the person Q1 from a sound source placed in one of the plurality of first directions set in the measurement range R shown in FIG. 2. For example, the black circle Pm(θ1) indicates the delay time appearing in the individual head impulse response measured with a sound source (for example, the speaker S1) located in the direction of the angle θ1 relative to the head direction Dir of the person Q1 shown in FIG. 2. FIG. 4 does not illustrate the relationship between the sound source direction and the delay time of the individual head impulse response measured for the other ear of the person Q1.

The prediction unit 11 shown in FIG. 1 estimates the relationship between the direction of the sound source and the delay time by, for example, obtaining a curve CV that approximates the distribution of the black circles shown in FIG. 4. Based on the curve CV representing the estimated relationship, the prediction unit 11 then predicts the delay time that an individual head impulse response would be expected to show if a measurement were made with a sound source placed in each of the second directions set outside the measurement range R shown in FIG. 2. The way in which the prediction unit 11 derives, from the delay times of the measured individual head impulse responses, the delay time predicted for sound arriving from each direction outside the measurement range R is not limited to obtaining the curve CV shown in FIG. 4. For example, the prediction unit 11 may estimate the relationship between the direction of the sound source and the delay time using the method described later with reference to FIGS. 7 to 9.
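As a rough illustration of fitting a curve to the measured delays and extrapolating it beyond the measurement range, the following Python sketch fits the model t(θ) = a + b·sin θ by ordinary least squares. The sinusoidal form of the curve, the function names, and the sample values are assumptions chosen for illustration; the description does not specify the functional form of the curve CV.

```python
import math


def fit_delay_curve(angles_deg, delays_s):
    """Least-squares fit of delay = a + b * sin(theta); needs >= 2 distinct angles."""
    s = [math.sin(math.radians(a)) for a in angles_deg]
    n = len(s)
    mean_s = sum(s) / n
    mean_t = sum(delays_s) / n
    var = sum((x - mean_s) ** 2 for x in s)
    cov = sum((x - mean_s) * (t - mean_t) for x, t in zip(s, delays_s))
    b = cov / var
    a = mean_t - b * mean_s
    return a, b


def predict_delay(a, b, angle_deg):
    """Evaluate the fitted curve at an arbitrary direction, inside or outside the range."""
    return a + b * math.sin(math.radians(angle_deg))


# Hypothetical delays measured inside the range -45..+45 degrees.
angles = [-45, -30, -15, 0, 15, 30, 45]
delays = [0.003 - 0.0002 * math.sin(math.radians(a)) for a in angles]
a, b = fit_delay_curve(angles, delays)
tau_outside = predict_delay(a, b, -90)  # extrapolated delay for a second direction
```

Any other smooth model could be substituted in `fit_delay_curve`; the point is only that the fitted relationship is evaluated at angles for which no measurement exists.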

The storage device SD shown in FIG. 1 stores information indicating a common head impulse response measured in advance, using a dummy head or the like, for each of the second directions set outside the measurement range R shown in FIG. 2. The common head impulse response is not limited to a head impulse response measured using a dummy head. For example, it may be a head impulse response measured with sound arriving at the head of a person other than the person Q1 shown in FIG. 2 from each of the second directions set outside the measurement range R. In FIG. 2, the circle S′ drawn with a dotted line indicates an example of a sound source used for measuring the common head impulse response.

Here, the distance D between the head of the person Q1 shown in FIG. 2 and the speaker S1, the sound source used for measuring the individual head impulse response, may not exactly match the distance D′ between the dummy head and the sound source S′ used for measuring the common head impulse response. This is because it is difficult to position both the midpoint of the line segment connecting the ears of the person Q1 and the midpoint of the line segment connecting the ears of the dummy head exactly at the center of the sector representing the measurement range R. Similarly, the distance between the ears of the person Q1 may not exactly match the distance between the two microphones attached to the dummy head for measuring the common head impulse response. Consequently, the delay time predicted for each of the second directions set outside the measurement range R shown in FIG. 2 may not match the delay time of the common head impulse response stored in the storage device SD for a sound source in the corresponding direction. For example, the delay time τ(θ2) predicted from the curve CV shown in FIG. 4 for the direction at the angle θ2 (θ2 < −φ) from the head direction Dir of the person Q1 does not necessarily match the delay time of the common head impulse response for a sound source S′ located in the direction indicated by the same angle θ2.

The correction unit 12 shown in FIG. 1 therefore corrects the common head impulse response stored in the storage device SD for each of the plurality of second directions set outside the measurement range R shown in FIG. 2 so that it exhibits the delay time obtained by the prediction unit 11 for the corresponding direction.

図５は、共通頭部インパルス応答の遅延時間の補正例を示す。図５において、座標軸ｔは、共通頭部インパルス応答における時間の経過を示し、座標軸Ｐは、音圧を示す。 FIG. 5 shows an example of correcting the delay time of the common head impulse response. In FIG. 5, the coordinate axis t represents the passage of time in the common head impulse response, and the coordinate axis P represents the sound pressure.

FIG. 5(A) is an example of a head impulse response measured with a dummy head placed at the position of the head of the person Q1 shown in FIG. 2, with the frontal direction of the dummy head aligned with the head direction Dir of the person Q1, and with sound arriving from the direction that intersects the head direction Dir at the angle θ2. That is, the head impulse response shown in FIG. 5(A) is an example of the common head impulse response stored in the storage device SD shown in FIG. 1 in association with the angle θ2. The direction indicated by the angle θ2 is one of the second directions set outside the measurement range R shown in FIG. 2. The delay time of the common head impulse response shown in FIG. 5(A) is given by the time δc(θ2) at which the waveform representing the common head impulse response first exceeds the threshold Thp.

また、図５（Ｂ）は、図１に示した補正部１２で得られる補正された共通頭部インパルス応答の例を示す。すなわち、補正部１２は、図５（Ａ）に示した共通頭部インパルス応答の遅延時間を補正することで、図５（Ｂ）に示す補正後の共通頭部インパルス応答を得る。 FIG. 5B shows an example of the corrected common head impulse response obtained by the correction unit 12 shown in FIG. That is, the correction unit 12 obtains the corrected common head impulse response shown in FIG. 5B by correcting the delay time of the common head impulse response shown in FIG.

The correction unit 12 shown in FIG. 1 moves the common head impulse response held in the storage device SD for each second direction along the time axis so that it matches the delay time predicted by the prediction unit 11 for that second direction.

For example, for the direction at the angle θ2 from the head direction Dir shown in FIG. 2, the correction unit 12 obtains the difference dτ between the delay time τ(θ2) predicted from the relationship shown in FIG. 4 and the delay time δc(θ2) of the common head impulse response shown in FIG. 5(A). The correction unit 12 then moves the common head impulse response for the direction indicated by the angle θ2 along the coordinate axis t so that the difference dτ is eliminated. Through this correction, the time elapsed until the waveform of the shifted common head impulse response exceeds the threshold Thp becomes substantially equal to the delay time predicted by the prediction unit 11.
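The time-axis shift can be sketched as follows in Python. The names `shift_hrir`, `d_tau_s`, and `sample_rate_hz` are hypothetical, and the choices to round the shift to a whole number of samples, pad with zeros, and keep the original length are assumptions; the description only says that the response is moved along the time axis until the difference dτ disappears.

```python
def shift_hrir(hrir, d_tau_s, sample_rate_hz):
    """Shift an impulse response along the time axis by d_tau_s seconds.

    A positive d_tau_s delays the response, a negative one advances it.
    The shift is rounded to whole samples and the length is preserved.
    """
    shift = round(d_tau_s * sample_rate_hz)
    n = len(hrir)
    if shift >= 0:
        return ([0.0] * shift + list(hrir))[:n]
    return (list(hrir[-shift:]) + [0.0] * (-shift))[:n]


# d_tau = tau(theta2) - delta_c(theta2): predicted delay minus the delay of the common response.
predicted_tau = 0.003
common_delay = 0.001
corrected = shift_hrir([0.0, 1.0, 0.5, 0.0, 0.0], predicted_tau - common_delay, 1000)
```

Rounding to whole samples limits the achievable precision to one sample period; a fractional-delay filter would be one way to refine this, at the cost of a more involved implementation.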

FIG. 6 shows the operation of the sound processing apparatus 10 shown in FIG. 1, which consists of the processing of steps S301 to S303. The processing of each step shown in FIG. 6 is also an example of an acoustic processing method and an acoustic processing program for realizing sound image localization in an arbitrary direction using individual head impulse responses measured for an individual and common head impulse responses prepared in advance. For example, the processing shown in FIG. 6 is realized by a processor mounted in the sound processing apparatus 10 executing the acoustic processing program. The processing shown in FIG. 6 may instead be executed by hardware mounted in the sound processing apparatus 10.

In step S301, the sound processing apparatus 10 shown in FIG. 1 receives, for example, the individual head impulse responses measured by the measurement device EQ for each of the plurality of first directions set inside the measurement range R shown in FIG. 2.

In step S302, the prediction unit 11 shown in FIG. 1 predicts, from the measured individual head impulse responses, the delay time that an individual head impulse response would show if it were measured for each of the plurality of second directions set outside the measurement range R shown in FIG. 2.

ステップＳ３０３において、図１に示した補正部１２は、第２方向のそれぞれに対応する共通頭部インパルス応答が示す遅延時間を、ステップＳ３０２の処理で予測された遅延時間に近づける補正を行う。 In step S303, the correction unit 12 illustrated in FIG. 1 performs a correction so that the delay time indicated by the common head impulse response corresponding to each of the second directions approaches the delay time predicted in the process of step S302.

The common head impulse responses whose delay times have been corrected in step S303 are passed to the acoustic AR device ARC shown in FIG. 1 together with the individual head impulse responses received from the measurement device EQ in step S301.

To generate sound that localizes a sound image in a first direction set inside the measurement range R shown in FIG. 2, the acoustic AR device ARC uses the individual head impulse response measured for sound from that direction. To generate sound that localizes a sound image in a second direction set outside the measurement range R, the acoustic AR device ARC uses the corrected common head impulse response obtained by correcting the delay time of the common head impulse response modeled for sound from that direction.
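The choice between the two kinds of impulse response can be sketched in Python as below. The dictionary layout (angle in degrees mapped to a stored response), the nearest-angle lookup, and the function name `select_hrir` are assumptions for illustration; the description only states that individual responses are used inside the range −φ to +φ and corrected common responses outside it.

```python
def select_hrir(angle_deg, phi_deg, individual, corrected_common):
    """Pick the impulse response for the requested localization direction.

    individual and corrected_common map an angle (degrees) to a stored response.
    Angles are assumed to lie in a single continuous interval (no wrap-around).
    """
    inside = -phi_deg <= angle_deg <= phi_deg
    table = individual if inside else corrected_common
    nearest = min(table, key=lambda a: abs(a - angle_deg))  # nearest stored direction
    return table[nearest]


# Hypothetical tables with placeholder labels instead of real sample data.
individual = {-30: "PIR(-30)", 0: "PIR(0)", 30: "PIR(30)"}
corrected_common = {-90: "AIR(-90)", 90: "AIR(90)", 180: "AIR(180)"}
chosen = select_hrir(10, 45, individual, corrected_common)
```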

In other words, in the acoustic AR device ARC shown in FIG. 1, the delay time of the head impulse response used to generate sound that localizes a sound image in a second direction is substantially equal to the delay time predicted by the prediction unit 11 for the same second direction. The delay time predicted by the prediction unit 11 is, in turn, substantially equal to the delay time of the head impulse response that would be measured with sound reaching the person Q1 from another speaker S1′ placed on the extension of the arc on which the speaker S1 shown in FIG. 2 is arranged.

That is, in the sound generated by the acoustic AR device ARC to localize a sound image in an arbitrary direction, the sound processing apparatus 10 shown in FIG. 1 can reproduce the delay time that the individual head impulse response for the corresponding direction would show. Therefore, with respect to the delay time of the sound presented to the person Q1, using the sound processing apparatus 10 shown in FIG. 1 achieves sound image localization as good as that obtained when individual head impulse responses are measured for all directions.

Humans perceive the direction of a sound source based on the time difference between the sounds heard at the two ears. Therefore, by reproducing the delay time of the individual head impulse response for the corresponding direction in sound that localizes a sound image in an arbitrary direction, the person Q1 can be presented with sound that does not give an unnatural impression even when the relative position between the person Q1 and the sound source changes.

As described with reference to FIG. 2, the space needed to measure the individual head impulse response for each direction included in the measurement range R is smaller than the space needed for measurement over the full 360 degrees around the person Q1. The space prepared for measuring the individual head impulse responses can be reduced further by measuring them separately for each of a plurality of ranges obtained by dividing the measurement range R. For example, the individual head impulse responses can be measured by placing a rotatable chair and a plurality of speakers facing each other in a box-shaped booth whose floor is a rectangle just large enough to circumscribe the figure obtained by dividing the interior angle of the sector representing the measurement range R into n parts (n being an integer of 2 or more). In this case, the individual head impulse responses for the directions included in the measurement range R shown in FIG. 2 can be measured by repeating the measurement process n times while changing the relative position between the person Q1 seated on the chair and the plurality of speakers.

As described above, the individual head impulse responses used by the sound processing apparatus 10 shown in FIG. 1 can be measured without the large-scale equipment used in conventional techniques. Therefore, for example, a booth for measuring individual head impulse responses can be set up in a corner of a venue such as an exhibition, and the individual head impulse responses can be measured for each of the many people gathered there. Using the individual head impulse responses obtained for each person, a service based on sound image localization technology can then be provided to each person. An acoustic AR system that uses the sound processing apparatus 10 shown in FIG. 1 to provide a service based on sound image localization technology to, for example, the person Q1 will be described later with reference to FIGS. 12 to 17.

Next, a method is described by which the prediction unit 11 shown in FIG. 1 estimates the relationship between the direction of the sound source relative to the head direction Dir of the person Q1 and the delay time of the individual head impulse response, by estimating the positional relationship between the microphones and the sound source used in the measurement.

図７は、図２に示したマイクロホンＭＣＬ，ＭＣＲとスピーカＳ１との位置関係の例を示す。なお、図７に示す要素のうち、図２に示した要素と同等のものは、同一の符号で示すとともに要素の説明を省略する場合がある。また、図７に示したスピーカＳ１は、個別頭部インパルス応答の計測に用いられた音源の一例である。 FIG. 7 shows an example of the positional relationship between the microphones MCL and MCR and the speaker S1 shown in FIG. Of the elements shown in FIG. 7, elements equivalent to those shown in FIG. 2 are denoted by the same reference numerals and description of the elements may be omitted. Moreover, the speaker S1 shown in FIG. 7 is an example of a sound source used for measurement of an individual head impulse response.

In FIG. 7, the line segments DL and DR are the two sides that meet at the vertex corresponding to the speaker S1 in the triangle formed by connecting the speaker S1 and the two microphones MCL and MCR. That is, the length |DL| of the line segment DL shown in FIG. 7 is the distance from the speaker S1 to the microphone MCL, and the length |DR| of the line segment DR is the distance from the speaker S1 to the microphone MCR. The line segment D connects the speaker S1 to the midpoint Qc of the line segment W that connects the two microphones MCL and MCR. The length Y of the line segment D is the distance from the midpoint Qc to the speaker S1, and the length X of the line segment W is the distance between the two microphones MCL and MCR. In other words, Y is the distance from the midpoint of the line segment connecting the ears of the person Q1 to the speaker S1, and X is the distance between the ears of the person Q1.

When the speaker S1 is in the direction at the angle θ relative to the frontal direction Dir of the head of the person Q1, the distances |DL| and |DR| from the speaker S1 to the microphones MCL and MCR are expressed as functions of the angle θ by Equation (1) and Equation (2), respectively. In Equations (1) and (2), Y denotes the distance from the midpoint Qc of the line segment W to the speaker S1, and X denotes the distance between the two microphones MCL and MCR.

When head impulse responses are obtained from the acoustic signals picked up by the microphones MCL and MCR for sound emitted by the speaker S1, the delay times TL and TR appearing in those head impulse responses are given by Equation (3) and Equation (4). In Equations (3) and (4), DL(θ) denotes the distance from the speaker S1 to the microphone MCL expressed by Equation (1) as a function of the angle θ, and DR(θ) denotes the distance from the speaker S1 to the microphone MCR expressed by Equation (2) as a function of the angle θ. V denotes the speed of sound in air, and C denotes a fixed offset time that includes, for example, the processing time required for the head impulse response measurement by the measurement device EQ shown in FIG. 1.
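Equations (1) to (4) themselves are not reproduced in this text. One form that is consistent with the geometry of FIG. 7 as described above is sketched below. It assumes that the microphones MCL and MCR lie at a distance X/2 on either side of the midpoint Qc on a line perpendicular to the head direction Dir, and that a positive (clockwise) angle θ places the speaker on the side of the microphone MCR; the published equations may be written differently.

```latex
\begin{align}
D_L(\theta) &= \sqrt{Y^2 + \frac{X^2}{4} + XY\sin\theta} \tag{1}\\
D_R(\theta) &= \sqrt{Y^2 + \frac{X^2}{4} - XY\sin\theta} \tag{2}\\
T_L(\theta) &= \frac{D_L(\theta)}{V} + C \tag{3}\\
T_R(\theta) &= \frac{D_R(\theta)}{V} + C \tag{4}
\end{align}
```

Under these assumptions the speaker sits at (Y sin θ, Y cos θ) and the microphones at (∓X/2, 0), so (1) and (2) follow from the Pythagorean theorem, and (3) and (4) add the fixed offset C to the propagation time D/V.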

Using the relationships in Equations (1) to (4), the distances X and Y shown in FIG. 7 and the offset time C in Equations (3) and (4) can be estimated from the delay times that appear in the individual head impulse responses measured for a plurality of directions with the measurement device EQ. The distances X and Y and the offset time C are parameters specific to the occasion on which the individual head impulse responses of the person Q1 were measured; they are the measurement conditions that characterize the measurement of the person Q1 by the measurement device EQ. That is, by using the relationships in Equations (1) to (4), the prediction unit 11 of the sound processing apparatus 10 can estimate, from the individual head impulse responses measured for some directions, the measurement conditions, including the positional relationship between the ears of the person Q1 and the sound source at the time of measurement. Based on the estimated measurement conditions, the delay time that the individual head impulse response of the person Q1 is expected to show can then be obtained for any direction for which no measurement was made by the measurement device EQ.
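Once X, Y and C are known, predicting the delay for an unmeasured direction is a direct evaluation of the model. The Python sketch below assumes a particular geometric form for the distances (microphones at ±X/2 on a line perpendicular to the frontal direction, positive angles toward the right microphone) and a speed of sound of 340 m/s; neither the form nor the constant is taken from the description, and the function names are hypothetical.

```python
import math

SPEED_OF_SOUND_M_S = 340.0  # assumed value for V


def ear_distances(theta_deg, x_m, y_m):
    """Distances from the source to the left and right microphones (assumed geometry)."""
    s = math.sin(math.radians(theta_deg))
    base = y_m ** 2 + x_m ** 2 / 4.0
    return math.sqrt(base + x_m * y_m * s), math.sqrt(base - x_m * y_m * s)


def predicted_delays(theta_deg, x_m, y_m, c_s, v=SPEED_OF_SOUND_M_S):
    """Delay times (T_L, T_R): propagation time plus the fixed offset C."""
    d_l, d_r = ear_distances(theta_deg, x_m, y_m)
    return d_l / v + c_s, d_r / v + c_s


# Hypothetical measurement conditions: X = 0.2 m, Y = 1.0 m, C = 2 ms.
t_l, t_r = predicted_delays(-120.0, 0.2, 1.0, 0.002)
```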

FIG. 8 shows another embodiment of the sound processing apparatus 10. Components shown in FIG. 8 that are equivalent to components shown in FIG. 1 are denoted by the same reference signs, and their description may be omitted.

図８に示した音響処理装置１０は、予測部１１および補正部１２に加えて、記憶装置ＳＤと生成部１３とを含んでいる。図８の例では、計測装置ＥＱによる第１方向のそれぞれについての計測で得られた個別頭部インパルス応答（個別ＨＲＩＲ：Head Related Impulse Response）ＰＩＲは、例えば、第１方向のそれぞれに対応して記憶装置ＳＤに格納される。また、記憶装置ＳＤは、第２方向のそれぞれに対応して予め用意された共通頭部インパルス応答(共通ＨＲＩＲ)ＣＩＲを格納しており、補正部１２は、記憶装置ＳＤにアクセスすることで、共通頭部インパルス応答ＣＩＲを取得する。そして、補正部１２によって補正された共通頭部インパルス応答(補正ＨＲＩＲ)ＡＩＲは、記憶装置ＳＤに格納される。 The sound processing apparatus 10 illustrated in FIG. 8 includes a storage device SD and a generation unit 13 in addition to the prediction unit 11 and the correction unit 12. In the example of FIG. 8, individual head impulse response (Individual HRIR) PIR obtained by measurement in each of the first directions by the measurement device EQ corresponds to, for example, each of the first directions. It is stored in the storage device SD. The storage device SD stores a common head impulse response (common HRIR) CIR prepared in advance corresponding to each of the second directions, and the correction unit 12 accesses the storage device SD, The common head impulse response CIR is acquired. The common head impulse response (corrected HRIR) AIR corrected by the correcting unit 12 is stored in the storage device SD.

The generation unit 13 shown in FIG. 8 includes, for example, a setting unit 131, a selection unit 132, an acoustic processing unit SP, and a storage unit MEM. The acoustic processing unit SP shown in FIG. 8 is, for example, hardware mounted in the terminal device UE. The storage unit MEM is realized using part of the memory built into the terminal device UE. The selection unit 132 is realized, for example, by a processor mounted in the terminal device UE executing an application program described later with reference to FIG. 17. The setting unit 131 is connected to the terminal device UE via a network NW such as a wireless LAN, for example, and can access the storage unit MEM.

図８に示した生成部１３において、設定部１３１は、図２に示した計測範囲Ｒの内側に設定された第１方向のそれぞれに対応して、当該第１方向についての計測で得られた個別頭部インパルス応答を記憶部ＭＥＭに記憶させる。また、設定部１３１は、計測範囲Ｒの外側に設定された第２方向のそれぞれに対応して、当該第２方向についての共通頭部インパルス応答の遅延時間を補正することで得られた補正後の共通頭部インパルス応答を記憶部ＭＥＭに記憶させる。 In the generation unit 13 illustrated in FIG. 8, the setting unit 131 is obtained by measurement in the first direction corresponding to each of the first directions set inside the measurement range R illustrated in FIG. 2. The individual head impulse response is stored in the storage unit MEM. Further, the setting unit 131 corresponds to each of the second directions set outside the measurement range R, and after correction is obtained by correcting the delay time of the common head impulse response in the second direction. Is stored in the storage unit MEM.

The selection unit 132 receives, for example via the network NW, information from the server device SV indicating the direction in which a sound image is to be localized, and reads out the individual head impulse response or the corrected common head impulse response stored in the storage unit MEM for the direction indicated by the received information. The selection unit 132 then passes the read individual head impulse response or corrected common head impulse response to the acoustic processing unit SP as the impulse response to be used for generating sound that localizes a sound image in the direction indicated by the information from the server device SV.

また、音声データベースＤＢ１に蓄積された音響情報は、例えば、サーバ装置ＳＶによって読み出され、ネットワークＮＷを介して、音響処理部ＳＰに渡される。そして、音響処理部ＳＰは、サーバ装置ＳＶから渡された音響情報から生成した音響信号と選択部１３２から渡されたインパルス応答との畳み込み処理を行うことで、サーバ装置ＳＶからの情報で示された方向に音像を定位させる音響を生成する。 In addition, the acoustic information stored in the voice database DB1 is read by, for example, the server device SV, and passed to the acoustic processing unit SP via the network NW. Then, the acoustic processing unit SP performs the convolution process between the acoustic signal generated from the acoustic information passed from the server device SV and the impulse response passed from the selection unit 132, thereby being indicated by the information from the server device SV. The sound that localizes the sound image in the selected direction is generated.
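The convolution step can be sketched in Python with a direct (time-domain) implementation. The function names and the idea of rendering the left and right channels with separate impulse responses are illustrative assumptions; a practical implementation would more likely use an FFT-based method for long signals.

```python
def convolve(signal, impulse_response):
    """Direct linear convolution of two sample sequences."""
    if not signal or not impulse_response:
        return []
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def render_binaural(signal, hrir_left, hrir_right):
    """Convolve one source signal with the left-ear and right-ear responses."""
    return convolve(signal, hrir_left), convolve(signal, hrir_right)


# Hypothetical two-sample signal and three-sample impulse responses.
left, right = render_binaural([1.0, 2.0], [1.0, 0.0, 0.5], [0.0, 1.0, 0.0])
```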

That is, the generation unit 13 shown in FIG. 8 generates sound that localizes a sound image in a second direction using the common head impulse response whose delay time has been corrected by the correction unit 12, and generates sound that localizes a sound image in a first direction using the individual head impulse response measured by the measurement device EQ. The server device SV and the selection unit 132 shown in FIG. 8 perform functions corresponding to those of the control unit CNT of the acoustic AR device ARC shown in FIG. 1.

図８に示した音響処理装置１０において、予測部１１は、特定部１１１と、算出部１１２とを含んでいる。特定部１１１は、人物Ｑ１の頭部と個別頭部インパルス応答の計測の際に各第１方向に設置された音源との位置関係として、各音源から到達する音響の遅延時間が、計測された個別頭部インパルス応答のそれぞれの遅延時間となる位置関係を特定する。算出部１１２は、特定された計測条件に基づいて、第２方向のそれぞれから人物Ｑ１の頭部に音響が到達する場合に予測される遅延時間を算出し、第２方向のそれぞれについて算出した遅延時間を補正部１２に渡す。 In the acoustic processing apparatus 10 illustrated in FIG. 8, the prediction unit 11 includes a specifying unit 111 and a calculation unit 112. The identifying unit 111 measures the delay time of the sound reaching from each sound source as the positional relationship between the head of the person Q1 and the sound source installed in each first direction when measuring the individual head impulse response. The positional relationship that is the delay time of each individual head impulse response is specified. The calculation unit 112 calculates a delay time predicted when sound reaches the head of the person Q1 from each of the second directions based on the specified measurement conditions, and calculates the delay calculated for each of the second directions. The time is passed to the correction unit 12.

The specifying unit 111 obtains, for example using the relationships in Equations (1) to (4) above, the distances X and Y shown in FIG. 7 and the offset time C in Equations (3) and (4) as the measurement conditions under which the individual head impulse responses were measured. The error contained in the delay time tL(θ) obtained for the left ear in the measurement for the first direction indicated by the angle θ in FIG. 7 is given by the difference between tL(θ) and the delay time TL(θ) obtained from Equation (3). Similarly, the error contained in the delay time tR(θ) obtained for the right ear in the measurement for the first direction indicated by the angle θ is given by the difference between tR(θ) and the delay time TR(θ) obtained from Equation (4). The specifying unit 111 therefore specifies the distances X and Y shown in FIG. 7 and the offset time C as the set of parameters that minimizes, for example, the sum of squared errors E given by Equation (5) as the angle θ in FIG. 7 varies over the range from −φ to φ. In Equation (5), the range from the value −φ, the lower limit of the angle θ, to the value φ, the upper limit of the angle θ, corresponds to the inside of the measurement range R shown in FIG. 2.
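The parameter identification can be illustrated with a small Python sketch. Equation (5) is not reproduced in this text; the sketch assumes it sums, over the measured angles, the squared left-ear and right-ear errors (tL − TL)² + (tR − TR)². The distance model, the speed of sound, the exhaustive grid search, and all names are likewise assumptions chosen to keep the example short; a gradient-based or other nonlinear least-squares solver could be substituted.

```python
import itertools
import math

V = 340.0  # assumed speed of sound in m/s


def model_delays(theta_deg, x, y, c):
    """Model delays (T_L, T_R) for a source at theta_deg under the assumed geometry."""
    s = math.sin(math.radians(theta_deg))
    base = y * y + x * x / 4.0
    return math.sqrt(base + x * y * s) / V + c, math.sqrt(base - x * y * s) / V + c


def squared_error(measured, x, y, c):
    """Sum of squared errors E over measured tuples (theta_deg, t_left, t_right)."""
    e = 0.0
    for theta, t_l, t_r in measured:
        m_l, m_r = model_delays(theta, x, y, c)
        e += (t_l - m_l) ** 2 + (t_r - m_r) ** 2
    return e


def grid_search(measured, xs, ys, cs):
    """Return the candidate (X, Y, C) with the smallest E."""
    return min(itertools.product(xs, ys, cs),
               key=lambda p: squared_error(measured, *p))


# Synthetic measurements generated from hypothetical conditions X=0.16 m, Y=1.2 m, C=2 ms.
truth = (0.16, 1.2, 0.002)
measured = [(th, *model_delays(th, *truth)) for th in range(-40, 41, 10)]
best = grid_search(measured, [0.14, 0.16, 0.18], [1.0, 1.2, 1.4], [0.001, 0.002, 0.003])
```

With a narrow measurement range, Y and C influence the delays in nearly the same way (both mostly add a constant), so with noisy data the individual estimates of Y and C may be poorly determined even when the fitted delays match well.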

Then, using the parameters specified by the specifying unit 111 and equations (1) to (4) above, the calculation unit 112 calculates the delay time predicted for each of the plurality of second directions set outside the measurement range R shown in FIG. 2. That is, the calculation unit 112 substitutes the specified parameters X and Y and the angle θ indicating a second direction into equation (1) to obtain the distance DL(θ) from a sound source installed outside the measurement range R to the left ear of the person Q1, assuming that the measurement conditions of the individual head impulse responses are reproduced. Similarly, the calculation unit 112 substitutes the specified parameters X and Y and the angle θ indicating the second direction into equation (2) to obtain the distance DR(θ) from that sound source to the right ear of the person Q1 under the same conditions. The calculation unit 112 then substitutes the distance DL(θ) obtained from equation (1) and the specified parameter C into equation (3) to calculate the delay time of the impulse response that would be obtained for sound from the second direction with the measurement conditions reproduced. Similarly, it substitutes the distance DR(θ) obtained from equation (2) and the specified parameter C into equation (4) to calculate the corresponding delay time for the right ear.
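The extrapolation step of the calculation unit 112 might look like the sketch below. It is not the patent's formulation: the distance formulas are a hypothetical stand-in for equations (1) and (2) (source at distance X from the head centre, ears at ±Y on the interaural axis), the delay formula assumes equations (3) and (4) have the form distance / speed of sound + C, and the names `ear_distances` and `predict_delays` are illustrative only.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the source


def ear_distances(x, y, theta):
    """Hypothetical stand-in for equations (1) and (2): returns (DL, DR) in metres."""
    d_left = math.sqrt(x * x + y * y + 2.0 * x * y * math.sin(theta))
    d_right = math.sqrt(x * x + y * y - 2.0 * x * y * math.sin(theta))
    return d_left, d_right


def predict_delays(x, y, c_off, second_directions):
    """Apply the assumed form of equations (3) and (4) to each second direction.

    Returns a list of (theta, TL, TR) tuples, with theta in radians.
    """
    predicted = []
    for theta in second_directions:
        d_left, d_right = ear_distances(x, y, theta)
        predicted.append((theta,
                          d_left / SPEED_OF_SOUND + c_off,
                          d_right / SPEED_OF_SOUND + c_off))
    return predicted
```

The inputs `x`, `y`, and `c_off` are the parameters specified from the measurements inside the range R; only the angles passed in `second_directions` lie outside it.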

FIG. 9 shows examples of the delay times calculated by the calculation unit 112 shown in FIG. 8. Among the elements shown in FIG. 9, those equivalent to elements shown in FIG. 4 are denoted by the same reference numerals, and their description may be omitted.

FIG. 9(A) shows an example of the delay times calculated by the calculation unit 112 when it uses the parameters that the specifying unit 111 specified on the basis of equations (1) to (5) and the delay times of the individual head impulse responses measured by the measurement device EQ shown in FIG. 8. FIG. 9(B) shows an example of the delay times calculated by the calculation unit 112 when it uses parameters specified by another specifying unit 111a, which is described later with reference to FIG. 10.

First, the example of FIG. 9(A) is described. Each black circle in FIG. 9(A) indicates the delay time of the individual head impulse response obtained for one of the plurality of first directions included in the measurement range R of the measurement device EQ. The curve CVa in FIG. 9(A) shows how the delay time calculated from equation (3) or equation (4), with the parameters that minimize the sum of squared errors E of equation (5) substituted, varies with the angle θ. Each white circle in FIG. 9(A) indicates the delay time calculated by the calculation unit 112 for the second direction corresponding to one of the common head impulse responses stored in the storage device SD shown in FIG. 8.

In the sum of squared errors E given by equation (5) above, every delay time obtained by measurement is given the same weight. The parameter set obtained by the specifying unit 111 using equation (5) is therefore the set that yields the curve CVa approximating the distribution of all the black circles in FIG. 9(A). However, when the delay time for a boundary direction, that is, a direction marking the boundary between the inside and the outside of the measurement range R, is calculated using the parameter set that yields the curve CVa and equations (1) to (4), the calculated delay time may differ from the measured one. In the example of FIG. 9(A), a difference d arises between the delay time Pm(−φ) appearing in the individual head impulse response measured for the angle θ = −φ, which indicates a boundary direction of the measurement range R, and the delay time Ta(−φ) calculated using the specified parameter set.

The difference d in the example of FIG. 9(A) is caused by errors contained in the individual head impulse responses measured for sound from the first directions set inside the measurement range R. When such a difference d exists, the delay times indicated by the measured individual head impulse responses and the delay times used to correct the common head impulse responses no longer connect smoothly near the boundary of the measurement range R. In the example of FIG. 9(A), a gap dA of about the same size as the difference d arises between the delay time Ta(−θ3) calculated for the angle θ3, which indicates a second direction set near the boundary of the measurement range R, and the delay time Pm(−φ) obtained by measurement for the boundary direction. With such a gap dA, when the direction in which the sound image is localized changes near the boundary of the measurement range R, the sound presented to the person Q1 may contain an unnatural silent interval, or sounds that should be heard in sequence may be heard overlapping one another.

The gap dA shown in FIG. 9(A) can be made smaller by using the parameter set specified by the specifying unit 111a shown in FIG. 10 than by using parameters specified with a least squares method that gives equal weight to all delay times obtained by measurement.

FIG. 10 shows another embodiment of the acoustic processing apparatus 10. Among the components shown in FIG. 10, those equivalent to components shown in FIG. 1 or FIG. 8 are denoted by the same reference numerals, and their description may be omitted.

The acoustic processing apparatus of FIG. 10 has a specifying unit 111a in place of the specifying unit 111 of the acoustic processing apparatus 10 shown in FIG. 8. The specifying unit 111a shown in FIG. 10 includes an analysis unit 113, which obtains the parameters X, Y, and C appearing in equations (1) to (4) above by regression analysis using, for example, the least squares method, and a weight setting unit 114, which sets the weights used in the regression analysis by the analysis unit 113.

For example, instead of equation (5) above, the analysis unit 113 obtains the parameters X, Y, and C that minimize the sum of squared errors E′ given by the following equation (6). In equation (6), W(θ) denotes the weight that the weight setting unit 114 sets for the error in the delay time of the individual head impulse response measured for sound from the direction at angle θ, taking the front direction Dir of the person Q1 as the reference.
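A weighted error sum of this kind can be sketched as below. Equation (6) itself is not reproduced in this passage, so the sketch assumes that W(θ) simply multiplies the squared left-ear and right-ear errors for each measured direction. The function name and the callable arguments `model` and `weight` are illustrative, not part of the source.

```python
def weighted_squared_error(measured, model, weight):
    """Assumed form of E': sum over theta of W(theta) * ((tL - TL)^2 + (tR - TR)^2).

    measured: iterable of (theta, tL, tR) measured delay times
    model:    callable theta -> (TL, TR), the delays given by the fitted parameters
    weight:   callable theta -> W(theta)
    """
    total = 0.0
    for theta, t_left, t_right in measured:
        tl, tr = model(theta)
        total += weight(theta) * ((t_left - tl) ** 2 + (t_right - tr) ** 2)
    return total
```

Passing a `weight` that always returns 1 reduces this to the unweighted sum E, so the same routine can serve both specifying units in a comparison.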

For example, the weight setting unit 114 sets a larger weight for the errors in the delay times of the individual head impulse responses measured for sound from the first directions set at the lower limit (θ = −φ) and the upper limit (θ = φ) of the angle θ than for the errors of the other first directions.

FIG. 11 shows examples of the weights set by the weight setting unit 114 shown in FIG. 10. In FIG. 11, the coordinate axis θ indicates the direction of the sound source relative to the head direction Dir of the person Q1 shown in FIG. 2, and the coordinate axis W indicates the magnitude of the value set as the weight. In the example of FIG. 4, an angle measured clockwise from the front direction Dir of the head of the person Q1 shown in FIG. 2 is shown as a positive value on the coordinate axis θ, and an angle measured counterclockwise from the direction Dir is shown as a negative value. The measurement range R shown in FIG. 2 thus corresponds to the range from the angle −φ to the angle +φ on the coordinate axis θ shown in FIG. 4.

FIG. 11(A) and FIG. 11(B) each show an example of the weight that the weight setting unit 114 shown in FIG. 10 sets as the function W(θ) of the angle θ in equation (6).

The weight W(θ) shown in FIG. 11(A) gives the weight W1 to the errors in the delay times of the individual head impulse responses obtained by measurement for the first directions set at the lower limit (θ = −φ) and the upper limit (θ = φ) of the angle θ. To the errors in the delay times obtained by measurement for the other first directions, it gives a weight W2 whose value is smaller than W1.

By obtaining the parameters X, Y, and C that minimize the sum of squared errors given by equation (6) with the weight W(θ) of FIG. 11(A), the analysis unit 113 shown in FIG. 10 obtains parameters that minimize the errors represented by the terms containing the delay times tL(−φ) and tR(−φ). As a result, as shown in FIG. 9(B), the difference between the delay time that the curve CVb, defined by the parameter set obtained by the analysis unit 113, gives at the angle θ = −φ and the delay time of the measured individual head impulse response can be made smaller than the difference d shown in FIG. 9(A).

In the example of FIG. 9(B), the delay time Pm(−φ) of the individual head impulse response measured for the direction at the angle θ = −φ and the delay time Tb(−φ) calculated by the calculation unit 112 using the specified parameters X, Y, and C are almost equal. Accordingly, the gap dB between the delay time Tb(−θ3), calculated by the calculation unit 112 for the angle θ3 shown in FIG. 9(B), and the delay time Pm(−φ) obtained by measurement is smaller than the gap dA shown in FIG. 9(A).

In other words, by using the parameters specified by the specifying unit 111a shown in FIG. 10, the delay times of the individual head impulse responses and the delay times of the corrected common head impulse responses can be connected smoothly even when the measurement of the individual head impulse responses contains errors.

Therefore, even when the measurement of the individual head impulse responses contains errors, the acoustic processing apparatus 10 having the specifying unit 111a shown in FIG. 10 can, by means of the generation unit 13, present to the person Q1 sound whose source direction changes smoothly near the boundary of the measurement range R.

The weight W(θ) set by the weight setting unit 114 is not limited to the one shown in FIG. 11(A). Any weight W(θ) may be used as long as it gives a larger weight to the error in the delay time of a measured individual head impulse response the closer its direction is to the boundary of the measurement range R. For example, as shown in FIG. 11(B), the weight setting unit 114 may set a weight W(θ) whose value changes stepwise according to the difference between the angle θ and the angle φ or the angle −φ.

The weight W(θ) shown in FIG. 11(B) sets a weight with a predetermined value W2 for the errors in the delay times of the individual head impulse responses measured for sound from first directions whose angle θ is greater than −φ + η and less than φ − η. For first directions whose angle θ is −φ or φ, it sets a weight with a value W1 that is larger than W2. For first directions whose angle θ is −φ + η or φ − η, it sets a weight with a value W3 that is smaller than W1 and larger than W2.
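The stepped weighting of FIG. 11(B) could be written as the small function below. This is a sketch: the function name, the use of degrees, and the tolerance parameter `tol` are assumptions, and because the source only defines the weight at the discrete measured directions, the choice to give W3 to any angle lying between the boundary and φ − η is also an assumption.

```python
def stepped_weight(theta_deg, phi_deg, eta_deg, w1, w2, w3, tol=1e-9):
    """Weight W(theta) with W1 at the boundary, W3 one step inside, W2 elsewhere.

    Angles are in degrees. Values of theta outside [-phi, phi] are rejected,
    because only first directions inside the measurement range R are weighted.
    """
    margin = phi_deg - abs(theta_deg)  # angular distance to the nearer boundary
    if margin < -tol:
        raise ValueError("theta lies outside the measurement range")
    if margin <= tol:
        return w1  # theta = -phi or theta = phi
    if margin <= eta_deg + tol:
        return w3  # theta = -phi + eta or phi - eta (and, by assumption, in between)
    return w2      # interior first directions
```

Calling it with `eta_deg=0` yields the two-level weight of FIG. 11(A), since only the boundary directions then receive W1 and every other direction receives W2.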

The weight setting unit 114 is also not limited to the examples of FIG. 11(A) and FIG. 11(B); it may set the weighting used in the analysis unit 113 with a weight W(θ) whose values are divided into four or more levels.

The method that the specifying unit 111a uses to obtain the parameters X, Y, and C is not limited to the weighted least squares method. The specifying unit 111a need only obtain the parameters X, Y, and C with a weighting that gives the delay times of the individual head impulse responses for first directions near the boundary of the measurement range R a larger weight than the delay times for first directions far from the boundary.

The acoustic processing apparatus 10 described above is useful, for example, for realizing a guidance system that lets visitors to an exhibition hall or similar venue perceive audio information describing an exhibit as if it were coming from the direction of that exhibit.

FIG. 12 shows another embodiment of the acoustic processing apparatus 10. Among the components shown in FIG. 12, those equivalent to components shown in FIG. 1 or FIG. 8 are denoted by the same reference numerals, and their description may be omitted.

The acoustic processing apparatus 10 and the acoustic AR device ARC shown in FIG. 12 are included in a guidance system GS that uses sound image localization technology to guide visitors to an exhibition hall or similar venue by means of audio information.

The acoustic AR device ARC shown in FIG. 12 includes an exhibition database DB2 and an orientation specifying unit DRD in addition to the server device SV, the audio database DB1, the selection unit 132, and the acoustic processing unit SP shown in FIG. 8.

The server device SV shown in FIG. 12 is connected to both the audio database DB1 and the exhibition database DB2 and can access the information stored in them. The exhibition database DB2 stores information indicating the position of each exhibit placed in the exhibition hall HL, which is described later with reference to FIG. 13. The audio database DB1 shown in FIG. 12 stores audio information describing each of the exhibits placed in the exhibition hall HL.

The orientation specifying unit DRD shown in FIG. 12 is realized, for example, by a processor mounted on the terminal device UE executing an application program described later with reference to FIG. 17. The orientation specifying unit DRD is connected to the position detection device HMD worn on the head of the person Q1, for example over a wireless communication path using short-range wireless communication technology, and receives the information obtained by the position detection device HMD. The orientation specifying unit DRD is also connected to the server device SV via the network NW and refers to the information stored in the exhibition database DB2 by querying the server device SV.

The position detection device HMD shown in FIG. 12 acquires information indicating the position of the head of the person Q1 and the front direction Dir of the head by performing the processing described later with reference to FIG. 13.

Next, the functions and operation of the position detection device HMD shown in FIG. 12 and of each component included in the acoustic AR device ARC are described with reference to FIG. 13.

FIG. 13 shows an example of the positional relationship between the person Q1 shown in FIG. 12 and the exhibits in the exhibition hall HL. Among the elements shown in FIG. 13, those equivalent to elements shown in FIG. 12 are denoted by the same reference numerals, and their description may be omitted.

The example of FIG. 13 shows a case in which two exhibits, represented by the capsule-shaped figures Exh1 and Exh2, and markers, represented by the circles Anc1 and Anc2, are installed in an exhibition hall represented by the rectangular area HL. The markers Anc1 and Anc2 are associated with the exhibits Exh1 and Exh2, respectively, and each has a function of transmitting identification information indicating the corresponding exhibit, for example by infrared light. In FIG. 13, the sectors AR1 and AR2 drawn with broken lines indicate the ranges reached by the infrared light or other signal carrying the identification information transmitted by the markers Anc1 and Anc2, respectively.

In the example of FIG. 13, the exhibit Exh1 and the marker Anc1 are placed in one corner of the exhibition hall HL, and the exhibit Exh2 and the marker Anc2 are placed in another corner. Three or more exhibits, each with an associated marker, may be placed in the exhibition hall HL.

The audio database DB1 shown in FIG. 13 stores, for example, audio information describing the content of each of the exhibits Exh1 and Exh2 in association with the identification information indicating that exhibit. The exhibition database DB2 stores, for example, information indicating the position of each of the exhibits Exh1 and Exh2 in the exhibition hall HL in association with the identification information of that exhibit.

The position detection device HMD shown in FIG. 13 has a function of receiving the identification information transmitted by the markers Anc1 and Anc2 and a function of detecting the position of the person Q1 and the front direction Dir of the head of the person Q1 using a gyro sensor or the like. The position detection device HMD also has a function of transmitting the received identification information, together with information indicating the position of the person Q1 and the front direction Dir of the head, to the terminal device UE using short-range wireless communication technology or the like. The communication path established between the position detection device HMD and the terminal device UE by the short-range wireless communication technology is not illustrated in FIG. 13. The functions and operation of the gyro sensor and other components included in the position detection device HMD are described later with reference to FIG. 15 and FIG. 17.

In the example of FIG. 13, the terminal device UE is connected to the network NW via an access point AP installed on a wall or elsewhere in the exhibition hall HL, and can exchange information with the server device SV and the acoustic processing apparatus 10 via the network NW.

The orientation specifying unit DRD included in the terminal device UE shown in FIG. 12 receives from the position detection device HMD, for example at predetermined time intervals, the identification information received by the position detection device HMD and the information indicating the position of the person Q1 and the head direction Dir of the person Q1 detected by the position detection device HMD. The identification information received by the position detection device HMD indicates which of the areas AR1 and AR2 shown in FIG. 13 the person Q1 is in, that is, which of the exhibits Exh1 and Exh2 is nearest to the person Q1. In the example of FIG. 13, the person Q1 is in the area AR1, so the position detection device HMD receives the identification information transmitted from the marker Anc1 and passes it to the orientation specifying unit DRD together with the information indicating the position and direction Dir of the head of the person Q1.

Based on the identification information received from the position detection device HMD, the orientation specifying unit DRD queries the server device SV and thereby acquires, for example, information indicating the position of the exhibit Exh1 from the exhibition database DB2. Then, based on the information indicating the position of the person Q1 and the head direction Dir received from the position detection device HMD and the information indicating the position of the exhibit Exh1, the orientation specifying unit DRD obtains the angle indicating the direction of the exhibit Exh1 relative to the head direction Dir of the person Q1. This direction of the exhibit Exh1 relative to the head direction Dir is the direction in which the sound image corresponding to the audio information describing the exhibit Exh1 is to be localized by the sound image localization processing of the acoustic AR device ARC.
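As a rough illustration of this angle calculation, the sketch below computes the direction of an exhibit relative to the head direction Dir. The coordinate conventions are assumptions made for the example: positions are 2-D floor coordinates, the heading is given in degrees clockwise from the +y axis, and the result is positive clockwise to match the sign convention described for the axis θ. The function names are illustrative.

```python
import math


def wrap_degrees(angle):
    """Map any angle in degrees into the interval [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0


def relative_angle(person_xy, heading_deg, exhibit_xy):
    """Angle of the exhibit relative to the head direction, clockwise positive.

    person_xy, exhibit_xy: (x, y) floor positions in the same coordinate system
    heading_deg: head direction Dir, in degrees clockwise from the +y axis
    """
    dx = exhibit_xy[0] - person_xy[0]
    dy = exhibit_xy[1] - person_xy[1]
    # atan2(dx, dy) gives the bearing measured clockwise from the +y axis.
    bearing = math.degrees(math.atan2(dx, dy))
    return wrap_degrees(bearing - heading_deg)
```

For a person at the origin facing +y, an exhibit directly to the right yields about +90 degrees and one directly to the left yields about −90 degrees.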

That is, each time the orientation specifying unit DRD receives from the position detection device HMD information indicating the exhibit nearest to the person Q1 and information indicating the head direction Dir of the person Q1, it obtains the angle θ indicating the direction in which the sound image is to be localized by the sound image localization processing. The orientation specifying unit DRD then passes the obtained angle θ to the selection unit 132 shown in FIG. 12 as information designating the head impulse response to be used in the convolution processing by the acoustic processing unit SP.

On receiving from the orientation specifying unit DRD the information indicating the angle θ of the direction in which the sound image is to be localized, the selection unit 132 reads the individual head impulse response or the corrected common head impulse response stored in the storage unit MEM for that angle θ. The selection unit 132 then passes the read response to the acoustic processing unit SP as the head impulse response to be used in the convolution for the sound image localization processing.

When the server device SV receives from the orientation specifying unit DRD a query based on the identification information indicating the exhibit Exh1, for example, it recognizes that the person Q1 is in the area AR1 corresponding to the exhibit Exh1. In this case, the server device SV reads the audio information stored in the audio database DB1 in association with the identification information indicating the exhibit Exh1, and passes the read audio information to the acoustic processing unit SP of the terminal device UE via the network NW.

Therefore, when the direction at the angle θ relative to the head direction Dir of the person Q1 is a first direction set inside the measurement range R shown in FIG. 2, the acoustic processing unit SP convolves the acoustic signal generated from the audio information of the exhibit Exh1 with the individual head impulse response. When the direction at the angle θ is a second direction set outside the measurement range R shown in FIG. 2, the acoustic processing unit SP convolves the acoustic signal generated from the audio information of the exhibit Exh1 with the corrected common head impulse response.
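The selection and convolution described above can be sketched as follows. This is a simplified illustration, not the apparatus itself: the stored responses are assumed to be held in two dictionaries keyed by angle in degrees, the nearest stored angle is used when there is no exact match, the boundary angles ±φ are treated as inside the measurement range, and the function names are invented for the example.

```python
def convolve(signal, impulse_response):
    """Direct-form convolution of two sequences of samples."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def select_hrir(theta_deg, phi_deg, individual, corrected_common):
    """Pick the (left, right) response pair stored for the angle nearest theta.

    individual:       {angle: (h_left, h_right)} for first directions, |angle| <= phi
    corrected_common: {angle: (h_left, h_right)} for second directions, |angle| > phi
    """
    table = individual if abs(theta_deg) <= phi_deg else corrected_common
    nearest = min(table, key=lambda angle: abs(angle - theta_deg))
    return table[nearest]


def localize(signal, theta_deg, phi_deg, individual, corrected_common):
    """Return the (left, right) ear signals for a sound image at theta."""
    h_left, h_right = select_hrir(theta_deg, phi_deg, individual, corrected_common)
    return convolve(signal, h_left), convolve(signal, h_right)
```

A real-time implementation would normally use block-based FFT convolution and cross-fade between responses as θ changes; the direct form is used here only to keep the example short.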

As described with reference to FIG. 9, the individual head impulse responses and the corrected common head impulse responses stored in the storage unit MEM for each angle relative to the head direction Dir of the person Q1 exhibit delay times that change smoothly as the angle changes. Consequently, even if the head impulse response used in the convolution processing of the acoustic processing unit SP is switched between an individual head impulse response and a corrected common head impulse response as the angle θ obtained by the orientation specifying unit DRD changes, the continuity of the delay time is maintained.

That is, the acoustic AR device ARC shown in FIG. 12 can smoothly reflect, for example, a change in the relative position between the exhibit Exh1 and the person Q1 moving within the exhibition hall HL shown in FIG. 13 in the position of the sound image localized for the person Q1. In other words, the guidance system GS shown in FIG. 12 can provide voice guidance by localizing a virtual sound image in an arbitrary direction, using the individual head impulse responses measured within a predetermined range in front of the head of the person Q1 together with the common head impulse responses prepared in advance.

As described with reference to FIG. 2, limiting the measurement by the measurement device EQ to a subset of directions that includes the frontal direction of the head of the person Q1 makes it feasible to measure individual head impulse responses for, for example, the large number of people who visit an exhibition hall. The guidance system GS shown in FIG. 12 can therefore provide the many visitors to an exhibition hall or the like with a service that localizes sound images with nearly the same naturalness as, and at a lower cost than, measuring individual head impulse responses in all directions for each person.

The acoustic processing device 10 of the present disclosure described above can be implemented using, for example, a computer device.

FIG. 14 shows an example of the hardware configuration of the acoustic processing device 10. Among the components shown in FIG. 14, those equivalent to components shown in FIG. 12 are denoted by the same reference signs, and their description may be omitted.

The computer device 20 includes a processor 21, a memory 22, a hard disk device 23, a network interface 24, an audio interface 25, and an acoustic signal generation unit 26. The processor 21, the memory 22, the hard disk device 23, the network interface 24, the audio interface 25, and the acoustic signal generation unit 26 shown in FIG. 14 are connected to one another via a bus. The computer device 20 is connected to the network NW via the network interface 24 and can exchange data with each of the server device SV and the terminal device UE over the network. The computer device 20 is also connected, via the audio interface 25, to a plurality of speakers SPK and to the microphones MCL and MCR worn in the respective ears of the person Q1.

In FIG. 14, the processor 21, the memory 22, the hard disk device 23, the network interface 24, and the audio interface 25 are included in the acoustic processing device 10. The processor 21, the memory 22, the acoustic signal generation unit 26, and the audio interface 25 are included in the measurement device EQ.

The memory 22 shown in FIG. 14 stores the operating system of the computer device 20. The memory 22 also stores an application program with which the processor 21 executes the acoustic processing shown in FIG. 6, as well as an application program for executing the measurement processing that measures the individual head impulse responses described with reference to FIG. 2. The application program for the acoustic processing shown in FIG. 6 and the application program for the measurement processing can be distributed on a storage medium such as an optical disc, or delivered via the network NW. For example, these application programs may be downloaded from the server device SV via the network interface 24. A downloaded application program becomes executable by the processor 21 once it is stored in the memory 22 or the hard disk device 23. The application program for acoustic processing desirably includes information indicating the common head impulse responses measured using a dummy head or the like.

By executing the application program for acoustic processing stored in the memory 22, the processor 21 performs the functions of the prediction unit 11 and the correction unit 12 shown in FIG. 1. By controlling the operation of the acoustic signal generation unit 26 and the audio interface 25 in accordance with the application program for measurement processing stored in the memory 22, the processor 21 also performs the function of the measurement device EQ shown in FIG. 1. The function of the acoustic signal generation unit 26 included in the measurement device EQ and the operation of the measurement device EQ are described later with reference to FIG. 16.

The terminal device UE shown in FIG. 14 includes a processor 31, a memory 32, a network interface 33, the acoustic processing unit SP, and a short-range wireless communication interface 34. The processor 31, the memory 32, the network interface 33, the acoustic processing unit SP, and the short-range wireless communication interface 34 shown in FIG. 14 are connected to one another via a bus. The terminal device UE is connected to the network NW via the network interface 33 and can exchange data with each of the server device SV and the acoustic processing device 10 over the network NW. The terminal device UE is also connected, via the short-range wireless communication interface 34, to the position detection device HMD worn on the head of the person Q1'. The acoustic processing unit SP is connected to the earphones EPL and EPR worn in the respective ears of the person Q1'. The acoustic signals generated by the acoustic processing unit SP are output as sound by the earphones EPL and EPR, and that sound is heard by the person Q1'. The person Q1' shown in FIG. 14 is the same person as the person Q1 whose individual head impulse responses were measured by the measurement device EQ. The hardware configuration of the position detection device HMD is described later with reference to FIG. 15.

In the terminal device UE shown in FIG. 14, the processor 31, the memory 32, the network interface 33, the acoustic processing unit SP, and the short-range wireless communication interface 34 are included in the acoustic AR device ARC.

The memory 32 shown in FIG. 14 stores, together with the operating system of the terminal device UE, an application program with which the processor 31 executes acoustic AR processing for providing the person Q1' with a service that uses sound image localization technology. The application program for acoustic AR processing can be distributed on a storage medium such as a memory card, or delivered via the network NW. For example, the application program for acoustic AR processing may be downloaded from the server device SV via the network interface 33. A downloaded application program becomes executable by the processor 31 once it is stored in the memory 32.

By executing the application program for acoustic AR processing stored in the memory 32, the processor 31 performs the functions of the selection unit 132 and the direction specifying unit DRD shown in FIG. 12.

FIG. 15 shows an example of the hardware configuration of the position detection device HMD shown in FIG. 14. Among the elements shown in FIG. 15, those equivalent to elements shown in FIG. 12 are denoted by the same reference signs, and their description may be omitted.

The position detection device HMD shown in FIG. 15 includes a processor 41, a short-range wireless communication interface 42, an infrared sensor 43, a gyro sensor 44, and an acceleration sensor 45. In the position detection device HMD shown in FIG. 15, the processor 41 is connected to each of the short-range wireless communication interface 42, the infrared sensor 43, the gyro sensor 44, and the acceleration sensor 45. The position detection device HMD is connected to the terminal device UE via the short-range wireless communication interface 42. The infrared sensor 43 has a function of receiving the identification information carried by the infrared light emitted from the signs Anc1 and Anc2 shown in FIG. 13.

The memory built into the processor 41 stores a program for position detection processing that detects the position of the head of the person Q1' and the frontal direction Dir of the head (FIG. 13) based on the measurement results obtained by the gyro sensor 44 and the acceleration sensor 45. By executing the position detection program stored in the built-in memory, the processor 41 detects the position and orientation of the head of the person Q1' and transmits the detected position and orientation to the terminal device UE via the short-range wireless communication interface 42. The operation of the position detection device HMD is described later with reference to FIG. 17.

FIG. 16 shows the operation of the measurement device EQ shown in FIG. 14. The processes of steps S321 to S326 shown in FIG. 16 are an example of the processing included in the application program for measurement processing stored in the memory 22 shown in FIG. 14. These processes of steps S321 to S326 are executed by the processor 21 of the computer device 20 shown in FIG. 14.

The example of FIG. 16 illustrates a method in which the interior angle of the sector representing the measurement range R in FIG. 2 is divided into n parts (n is an integer of 2 or more), and individual head impulse responses are measured sequentially for the plurality of first directions set inside each of the divided measurement ranges. In this case, the plurality of speakers SPK shown in FIG. 14 are installed, for example, at positions that divide into m equal parts (m is an integer of 2 or more) the arc of a sector with a central angle of 2φ/n centered on the rotation center of the rotatable chair on which the person Q1 is seated. In the example of FIG. 14, the chair on which the person Q1 is seated and the mechanism by which the processor 21 rotates the chair are not shown.

The processor 21 then rotates the chair on which the person Q1 is seated so that the arc along which the speakers SPK are arranged coincides with the arc of the sector corresponding to the selected range, which is one of the divided measurement ranges, and in this state starts the processing described below. In this case, the directions of the line segments connecting each of the m speakers SPK to the rotation center of the chair represent the m first directions set inside the selected range.
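
The geometry described above can be illustrated with a minimal sketch. It is not part of the disclosure and rests on two assumptions that the text does not state: the measurement range R is taken to span -φ to +φ about the frontal direction, and each of the m speakers is taken to sit at the center of one of the m equal sub-arcs. The function name `first_directions` is hypothetical.

```python
def first_directions(phi_deg, n, m):
    """Return n lists of m angles in degrees, measured from straight ahead.

    Assumes the range R spans [-phi, +phi] and that each speaker sits at
    the center of one of the m equal sub-arcs (both are assumptions).
    """
    if n < 2 or m < 2:
        raise ValueError("n and m must be integers of 2 or more")
    sector = 2.0 * phi_deg / n   # central angle of one divided range
    step = sector / m            # width of one of the m equal sub-arcs
    ranges = []
    for i in range(n):
        start = -phi_deg + i * sector  # edge of the i-th divided range
        ranges.append([start + (j + 0.5) * step for j in range(m)])
    return ranges
```

With φ = 60 degrees, n = 3, and m = 2, each divided range covers 40 degrees and holds two first directions spaced 20 degrees apart.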

In step S321 shown in FIG. 16, the processor 21 instructs the acoustic signal generation unit 26 to generate a TSP signal, thereby causing each of the plurality of speakers SPK in turn to output sound corresponding to the TSP signal. The TSP signal generated by the acoustic signal generation unit 26 is passed to each of the plurality of speakers SPK in turn, for example via the audio interface 25. A speaker SPK that receives the TSP signal from the audio interface 25 outputs sound corresponding to the TSP signal.

In step S322, the processor 21 receives the acoustic signals generated by the microphones MCL and MCR when the sound output in step S321 reaches the head of the person Q1, and holds the received acoustic signals in the memory 22 or the hard disk device 23. The acoustic signals obtained by the microphones MCL and MCR are passed to the processor 21 via the audio interface 25. The processor 21 holds, in the memory 22 or the like, the acoustic signals received from the microphones MCL and MCR via the audio interface 25 during the period from the time at which it instructed generation of the TSP signal in step S321 until a predetermined time has elapsed. Here, the processor 21 stores the acoustic signals received from the microphones MCL and MCR in the memory 22 or the like as the sound from the first direction that, among the plurality of first directions set inside the selected range, corresponds to the speaker SPK that output the sound.

In step S323, the processor 21 determines whether measurement of the selected range is complete, based on whether it holds acoustic signals representing the sound received from all of the first directions set inside the selected range.

If, among the plurality of first directions set inside the selected range, there is a first direction for which no acoustic signal is held yet, the processor 21 follows the negative determination route (NO) of step S323 and returns to step S321. In this case, in step S321 the processor 21 causes the speaker SPK corresponding to a new first direction to output sound corresponding to the TSP signal.

When sound output has been completed for all of the first directions set inside the selected range by repeating steps S321 to S323 (affirmative determination (YES) in step S323), the processor 21 proceeds to step S324.

In step S324, the processor 21 determines whether measurement has finished for all of the divided measurement ranges, that is, whether measurement of the measurement range R shown in FIG. 2 is complete.

If measurement of any of the divided measurement ranges has not yet finished (negative determination (NO) in step S324), the processor 21 proceeds to step S325.

In step S325, the processor 21 takes, as the new selected range, a divided measurement range for which measurement has not yet been completed among the n divided measurement ranges, and performs positioning for measuring that selected range. For example, the processor 21 rotates the chair on which the person Q1 is seated to change the relative position between the head of the person Q1 and the plurality of speakers SPK, and sets the range adjacent to the selected range whose measurement has been completed as the new selected range. The angle by which the processor 21 rotates the chair each time measurement of one divided measurement range is completed is, for example, the angle 2φ shown in FIG. 2 divided into n equal parts.

On the other hand, if it is determined that measurement has finished for all of the divided measurement ranges (affirmative determination (YES) in step S324), the processor 21 proceeds to step S326.

In step S326, the processor 21 obtains the individual head impulse response for each ear of the person Q1 based on the acoustic signals stored in the memory 22 or the like for each of the first directions. The processor 21 also stores information indicating the individual head impulse responses obtained for each of the first directions in the memory 22 or the hard disk device 23. The information indicating the individual head impulse responses stored in the memory 22 or the hard disk device 23 is used when the application program for the acoustic processing shown in FIG. 6 is executed.
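
The text does not specify how the impulse response is derived from the recorded signal. As a minimal sketch under stated assumptions, the following uses a unit-magnitude, quadratic-phase spectrum of the kind commonly used for TSP signals, so that the inverse filter is simply the complex conjugate of that spectrum. The function names, the `stretch` parameter, the naive DFT, and the circular-convolution model of the recording are all assumptions made for illustration.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform; adequate for a short demo."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def tsp_spectrum(n, stretch):
    """Unit-magnitude chirp spectrum with conjugate symmetry (n even)."""
    spec = [0j] * n
    for k in range(n // 2 + 1):
        spec[k] = cmath.exp(-4j * math.pi * stretch * k * k / (n * n))
    for k in range(n // 2 + 1, n):
        spec[k] = spec[n - k].conjugate()
    return spec


def impulse_response(recorded, spectrum):
    """Deconvolve one ear's recording with the conjugate (inverse) spectrum."""
    rec_spec = dft(recorded)
    h_spec = [r * s.conjugate() for r, s in zip(rec_spec, spectrum)]
    return [v.real for v in dft(h_spec, inverse=True)]
```

Because every bin of the assumed spectrum has magnitude 1, multiplying by its conjugate cancels the test signal exactly and leaves the response of the path from speaker to microphone. A practical implementation would use an FFT and a recording window long enough for the circular model to hold.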

The method of obtaining the individual head impulse response for each first direction is not limited to obtaining them all at once in step S326 shown in FIG. 16. For example, each time the processor 21 acquires the acoustic signal corresponding to one of the first directions in step S322, it may obtain the individual head impulse response for that first direction from the acquired acoustic signal.
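
The control flow of steps S321 to S326 can be sketched as two nested loops. This is an illustrative outline only: the callbacks `play_and_record`, `rotate_chair`, and `to_response` are hypothetical stand-ins for the audio interface, the chair mechanism, and the impulse response calculation, none of which the text defines as programming interfaces.

```python
def run_measurement(n, m, play_and_record, rotate_chair, to_response):
    """Outline of steps S321 to S326 using hypothetical hardware callbacks.

    play_and_record(speaker) -> (left, right) recordings for one TSP burst
    rotate_chair()           -> rotates the chair by 2*phi/n
    to_response(recording)   -> impulse response for one ear
    """
    recordings = {}
    for sector in range(n):            # S324: loop over divided ranges
        for speaker in range(m):       # S323: loop over first directions
            # S321 and S322: output the TSP sound, then hold both recordings
            recordings[(sector, speaker)] = play_and_record(speaker)
        if sector < n - 1:
            rotate_chair()             # S325: position the next range
    # S326: derive the per-ear responses in one batch
    return {
        key: (to_response(left), to_response(right))
        for key, (left, right) in recordings.items()
    }
```

The batch calculation at the end corresponds to step S326. Calling `to_response` inside the inner loop instead would correspond to the per-acquisition variant mentioned above.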

After the measurement processing described above is completed, the processor 21 executes the application program for the acoustic processing shown in FIG. 6. In the course of executing that program, the processor 21 predicts the delay time for each second direction from the individual head impulse responses obtained by the measurement processing for each first direction, for example as described with reference to FIGS. 7 to 9. The processor 21 then uses the predicted delay times to correct the delay times of the common head impulse responses held in the memory 22 or the hard disk device 23 for each second direction. After that, the processor 21 passes the information indicating the individual head impulse responses obtained for each first direction by the measurement processing and the information indicating the corrected common head impulse responses obtained for each second direction to the terminal device UE via the network NW, and has them stored in the memory 32. The information stored in the memory 32 of the terminal device UE in the course of executing the application program for acoustic processing as described above is used when the processor 31 of the terminal device UE executes the application program for acoustic AR processing.

FIG. 17 shows the operation of the acoustic AR device ARC shown in FIG. 14. The processes of steps S331 to S337 shown in FIG. 17 are an example of the processing included in the application program for acoustic AR processing. These processes of steps S331 to S337 are executed by the processor 31 of the terminal device UE each time a predetermined time, set to roughly several milliseconds to several tens of milliseconds, elapses after, for example, the person Q1' enters the exhibition hall HL shown in FIG. 13.

In step S331, the processor 31 collects information indicating the position and orientation of the head of the person Q1' detected by the position detection device HMD, and information indicating which of the exhibits Exh1 and Exh2 shown in FIG. 13 is nearest to the person Q1'. For example, the processor 31 requests the position detection device HMD, via the short-range wireless communication interface 34, to transmit the measurement results obtained by the gyro sensor 44, the acceleration sensor 45, and the infrared sensor 43. The request from the processor 31 is passed to the processor 41 of the position detection device HMD via the short-range wireless communication interface 42 shown in FIG. 15. In response to the request from the processor 31, the processor 41 receives the angular velocity measurement from the gyro sensor 44 and the acceleration measurement from the acceleration sensor 45. The processor 41 also receives the identification information carried by the infrared light that reached the infrared sensor 43 from one of the signs Anc1 and Anc2 shown in FIG. 13. The processor 41 then transmits the identification information obtained by the infrared sensor 43, together with information indicating the angular velocity and acceleration measurements, to the terminal device UE via the short-range wireless communication interface 42. The information transmitted by the processor 41 of the position detection device HMD is passed to the processor 31 via the short-range wireless communication interface 34 of the terminal device UE. The processor 31 then calculates the position and orientation of the head of the person Q1' based on the acceleration and angular velocity included in the received information. The processor 31 also uses the identification information included in the received information as the information indicating which of the exhibits Exh1 and Exh2 is nearest to the person Q1'. The processing that calculates the position and orientation of the head of the person Q1' may instead be executed by the processor 41 of the position detection device HMD.

In step S332, the processor 31 determines whether the identification information received in step S331 has changed from the identification information received when the processing shown in FIG. 17 was previously executed.

When the processing shown in FIG. 17 is executed for the first time, or when identification information different from the previous one is received in step S331, the processor 31 treats this as an affirmative determination (YES) in step S332 and proceeds to step S333. On the other hand, when the identification information received in step S331 is the same as the previously received identification information (negative determination (NO) in step S332), the processor 31 skips step S333 and proceeds to step S334.

In step S333, the processor 31 queries the server device SV shown in FIG. 14 based on the new identification information received in step S331, and thereby acquires the position of the exhibit indicated by the identification information as the position of the target for which voice guidance is to be provided.
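
Steps S332 and S333 amount to querying the server only when the identification information changes. A minimal sketch of that behavior follows; the class `GuidanceTarget` and the `query_position` callback are hypothetical names introduced for illustration, and the actual form of the inquiry to the server device SV is not specified in the text.

```python
_UNSET = object()  # sentinel so that the very first update always queries


class GuidanceTarget:
    """Caches the position of the exhibit currently being guided."""

    def __init__(self, query_position):
        self._query_position = query_position  # hypothetical server lookup
        self._last_id = _UNSET
        self.position = None

    def update(self, exhibit_id):
        """Return True if the server was queried (S332 YES, then S333)."""
        if exhibit_id == self._last_id:
            return False  # S332 NO: keep the cached position
        self.position = self._query_position(exhibit_id)
        self._last_id = exhibit_id
        return True
```

Since the processing of FIG. 17 repeats every few milliseconds to tens of milliseconds, skipping the inquiry while the nearest exhibit is unchanged avoids a network round trip on most iterations.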

In step S334, the processor 31 calculates the direction of the exhibit to be guided, relative to the orientation of the head of the person Q1', based on the position of that exhibit and the position and orientation of the head of the person Q1'. For example, based on information indicating the position of the exhibit Exh1 and information indicating the position and orientation of the head of the person Q1, the processor 31 calculates the angle θ at which the head direction Dir of the person Q1 shown in FIG. 13 intersects the line segment connecting the head of the person Q1 and the exhibit Exh1. The processor 31 then passes information indicating the calculated angle θ to the acoustic processing unit SP shown in FIG. 14 as information indicating the direction of the sound image to be localized relative to the frontal direction Dir of the head of the person Q1'.
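
The angle calculation in step S334 can be sketched in two dimensions as follows. The coordinate conventions are assumptions, since the text does not define them: positions are (x, y) pairs in a common floor plane, the head direction is given in degrees counterclockwise from the +x axis, and θ is returned as a signed angle in the range -180 to 180 degrees with counterclockwise positive. The function name is hypothetical.

```python
import math


def relative_angle_deg(head_xy, head_dir_deg, target_xy):
    """Signed angle from the head direction Dir to the target, in degrees."""
    dx = target_xy[0] - head_xy[0]
    dy = target_xy[1] - head_xy[1]
    bearing = math.degrees(math.atan2(dy, dx))  # absolute bearing of target
    # Wrap the difference into [-180, 180)
    return (bearing - head_dir_deg + 180.0) % 360.0 - 180.0
```

Wrapping the difference keeps θ continuous when the head direction crosses the ±180 degree boundary, which matters because the result selects the head impulse response used for localization.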

In step S335, the processor 31 receives from the server device SV a part of the audio information stored in the audio database DB1 for the exhibit to be guided. For example, each time the processing shown in FIG. 17 is executed, the processor 31 sequentially receives audio information divided into portions that each play back in a time equal to the execution interval, and passes the received audio information to the acoustic processing unit SP.

In step S336, the processor 31 causes the acoustic processing unit SP to convolve the per-ear head impulse response corresponding to the direction indicated by the information obtained in step S334 with the acoustic signal generated for each ear from the audio information received in step S335. For example, when the angle θ calculated in step S334 indicates one of the first directions, the acoustic processing unit SP convolves the acoustic signal with the individual head impulse response of each ear held in the memory 32 for the first direction indicated by the angle θ. On the other hand, when the angle θ calculated in step S334 indicates one of the second directions, the acoustic processing unit SP convolves the acoustic signal with the corrected common head impulse response held in the memory 32 for the second direction indicated by the angle θ.
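
The selection and convolution in step S336 can be sketched as below. Several details are assumptions made for illustration rather than taken from the text: the first directions are taken to be those with |θ| ≤ φ, each table maps a stored angle in degrees to a (left, right) pair of impulse responses, the nearest stored angle is used, and a single signal is filtered for both ears. The function names are hypothetical.

```python
def convolve(signal, response):
    """Direct-form linear convolution of two sequences."""
    out = [0.0] * (len(signal) + len(response) - 1)
    for i, s in enumerate(signal):
        for j, r in enumerate(response):
            out[i + j] += s * r
    return out


def render_binaural(signal, theta_deg, phi_deg, individual, common):
    """Pick the response table by direction, then filter for each ear.

    individual: responses measured for first directions (inside R)
    common:     corrected common responses for second directions
    """
    table = individual if abs(theta_deg) <= phi_deg else common
    nearest = min(table, key=lambda angle: abs(angle - theta_deg))
    left, right = table[nearest]
    return convolve(signal, left), convolve(signal, right)
```

A real-time implementation would process each audio portion with overlap between successive blocks and would typically use FFT-based convolution; the direct form is shown only because it makes the operation explicit.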

In step S337, the processor 31 causes the acoustic signals generated for each ear of the person Q1 in step S336 to be output from the acoustic processing unit SP through the earphones EPL and EPR, so that the person Q1 hears them.

As described above, the acoustic AR device ARC shown in FIG. 12 can be realized by having the processor 31 of the terminal device UE shown in FIG. 14 execute the processing of steps S331 to S337 at predetermined time intervals. That is, the acoustic AR device ARC shown in FIG. 14 can perform sound image localization using the individual head impulse responses or the corrected common head impulse responses that the acoustic processing apparatus 10 has stored, for each direction, in the memory 32 of the terminal device UE.
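One way to picture how a corrected common head impulse response might be obtained is to shift a reference response so that its onset falls at a predicted delay. The sketch below rests on assumptions that are not taken from the embodiment: the delay is measured as the first sample whose magnitude reaches a fraction of the peak, the shift is a whole number of samples, and the response keeps its original length. The names `onset_index` and `correct_delay` are hypothetical.

```python
def onset_index(ir, threshold_ratio=0.1):
    """Index of the first sample whose magnitude reaches threshold_ratio * peak."""
    peak = max(abs(x) for x in ir)
    for i, x in enumerate(ir):
        if abs(x) >= threshold_ratio * peak:
            return i
    return 0


def correct_delay(ir, predicted_delay_samples, threshold_ratio=0.1):
    """Shift ir so that its onset lands on predicted_delay_samples.

    The result has the same length as ir: samples shifted past either end
    are discarded and the vacated positions are filled with zeros.
    """
    n = len(ir)
    shift = predicted_delay_samples - onset_index(ir, threshold_ratio)
    if shift >= 0:
        # Delay the response by prepending zeros, then trim to length n.
        return ([0.0] * shift + list(ir))[:n]
    # Advance the response by dropping leading samples, then zero-pad.
    return (list(ir[-shift:]) + [0.0] * n)[:n]
```

A sub-sample shift, for example by fractional-delay filtering, would be a natural refinement when the predicted delay is not a whole number of samples.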

As a result, the acoustic AR device ARC can, for example, let the person Q1 moving through the exhibition hall HL shown in FIG. 13 hear the sound generated from the audio information for the exhibit Exh1 as sound arriving from the direction of the exhibit Exh1 relative to the orientation Dir of the head of the person Q1. That is, the acoustic AR device ARC shown in FIG. 14 can realize, as a service that applies sound image localization technology to the person Q1 moving through the exhibition hall HL, a guidance service that provides spoken information describing the exhibits Exh1, Exh2, and so on.

The features and advantages of the embodiments will be apparent from the foregoing detailed description. The claims are intended to cover these features and advantages of the embodiments to the extent that doing so does not depart from their spirit and scope. Moreover, a person having ordinary skill in the art should be able to readily conceive of any improvements and modifications. Accordingly, there is no intention to limit the scope of the inventive embodiments to those described above, and appropriate improvements and equivalents falling within the scope disclosed in the embodiments may be relied upon.

Regarding the above description, the following items are further disclosed.
(Supplementary Note 1) An acoustic processing apparatus comprising:
a prediction unit that predicts, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response when sound reaches the head from a second direction outside the predetermined range; and
a correction unit that corrects a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted by the prediction unit.
(Supplementary Note 2) The acoustic processing apparatus according to Supplementary Note 1, further comprising:
a generation unit that generates, using the reference impulse response whose delay time has been corrected by the correction unit, sound that localizes a sound image in the second direction.
(Supplementary Note 3) The acoustic processing apparatus according to Supplementary Note 1 or Supplementary Note 2, wherein the prediction unit includes:
a specifying unit that specifies, as a positional relationship between the head and a sound source placed in the first direction when the impulse response is measured, a positional relationship in which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and
a calculation unit that calculates, based on the positional relationship specified by the specifying unit, the delay time predicted when sound reaches the head from the second direction.
(Supplementary Note 4) The acoustic processing apparatus according to Supplementary Note 3, wherein,
in specifying the positional relationship, the specifying unit
applies a weighting in which the weight for the delay time of an impulse response measured when sound reaches the head from a first direction close to a boundary of the predetermined range, among the plurality of first directions, is larger than the weight for the delay time of an impulse response measured when sound reaches the head from a first direction away from the boundary.
(Supplementary Note 5) The acoustic processing apparatus according to Supplementary Note 3, wherein
the specifying unit
specifies the positional relationship by performing a regression analysis in which each of the delay times of the impulse responses obtained by measurement for a plurality of first directions, including a first direction close to the boundary of the predetermined range, is assigned a weight that is larger the closer the first direction is to the boundary.
(Supplementary Note 6) An acoustic processing method comprising:
predicting, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response when sound reaches the head from a second direction outside the predetermined range; and
correcting a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the predicted delay time.
(Supplementary Note 7) An acoustic processing program that causes a computer to execute a process comprising:
predicting, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response when sound reaches the head from a second direction outside the predetermined range; and
correcting a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the predicted delay time.
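The boundary-weighted regression referred to in the supplementary notes can be illustrated with a weighted least-squares fit. The sketch below is only an illustration under stated assumptions: the delay model τ(θ) = a + b·sin θ, the weight rule 1 + |θ|/boundary, and all function names are hypothetical stand-ins, and the geometric positional-relationship model of the embodiment is not reproduced.

```python
import math


def boundary_weights(angles_deg, boundary_deg):
    """Weights that grow linearly from 1 at the front to 2 at the boundary."""
    return [1.0 + abs(a) / boundary_deg for a in angles_deg]


def fit_delay_model(angles_deg, delays, weights):
    """Weighted least-squares fit of delay = a + b * sin(angle).

    Returns (a, b). Needs at least two distinct angles so that the
    normal equations are not singular.
    """
    xs = [math.sin(math.radians(t)) for t in angles_deg]
    sw = sum(weights)
    sx = sum(w * x for w, x in zip(weights, xs))
    sy = sum(w * y for w, y in zip(weights, delays))
    sxx = sum(w * x * x for w, x in zip(weights, xs))
    sxy = sum(w * x * y for w, x, y in zip(weights, xs, delays))
    b = (sw * sxy - sx * sy) / (sw * sxx - sx * sx)
    a = (sy - b * sx) / sw
    return a, b


def predict_delay(a, b, angle_deg):
    """Evaluate the fitted model at an angle outside the measured range."""
    return a + b * math.sin(math.radians(angle_deg))
```

With measurements at, say, -60° to 60°, the larger weights near ±60° pull the fit toward the measurements closest to the unmeasured directions, which is the intent of weighting by proximity to the boundary.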

DESCRIPTION OF SYMBOLS 10 ... acoustic processing apparatus; 11 ... prediction unit; 12 ... correction unit; 13 ... generation unit; 111, 111a ... specifying unit; 112 ... calculation unit; 131 ... setting unit; 132 ... selection unit; 20 ... computer device; 21, 31, 41 ... processor; 22, 32 ... memory; 23 ... hard disk device; 24, 33 ... network interface; 25 ... audio interface; 26 ... acoustic signal generation unit; 35, 42 ... short-range wireless communication interface; 43 ... infrared sensor; 44 ... gyro sensor; 45 ... acceleration sensor; EQ ... measuring device; SD ... storage device; ARC ... acoustic AR (Augmented Reality) device; MEM ... storage unit; CNT ... control unit; DB1 ... audio database; DB2 ... exhibit database; UE ... terminal device; SP ... acoustic processing unit; SV ... server device; DRD ... direction specifying unit; MCL, MCR ... microphone; EPL, EPR ... earphone; HMD ... position detection device; NW ... network; Q1 ... person; Exh1, Exh2 ... exhibit; Anc1, Anc2 ... marker; SPK ... speaker

Claims

An acoustic processing apparatus comprising:
a prediction unit that predicts, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response when sound reaches the head from a second direction outside the predetermined range; and
a correction unit that corrects a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted by the prediction unit,
wherein the prediction unit includes:
a specifying unit that specifies, as a positional relationship between the head and a sound source placed in the first direction when the impulse response is measured, a positional relationship in which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and
a calculation unit that calculates, based on the positional relationship specified by the specifying unit, the delay time predicted when sound reaches the head from the second direction.

The acoustic processing apparatus according to claim 1, further comprising:
a generation unit that generates, using the reference impulse response whose delay time has been corrected by the correction unit, sound that localizes a sound image in the second direction.

The acoustic processing apparatus according to claim 1 or 2, wherein,
in specifying the positional relationship, the specifying unit
applies a weighting in which the weight for the delay time of an impulse response measured when sound reaches the head from a first direction close to a boundary of the predetermined range, among the plurality of first directions, is larger than the weight for the delay time of an impulse response measured when sound reaches the head from a first direction away from the boundary.

An acoustic processing method comprising:
a prediction step of predicting, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response when sound reaches the head from a second direction outside the predetermined range; and
a correction step of correcting a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted in the prediction step,
wherein the prediction step includes:
a specifying step of specifying, as a positional relationship between the head and a sound source placed in the first direction when the impulse response is measured, a positional relationship in which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and
a calculation step of calculating, based on the positional relationship specified in the specifying step, the delay time predicted when sound reaches the head from the second direction.

An acoustic processing program that causes a computer to execute a process comprising:
a prediction step of predicting, based on impulse responses measured when sound reaches a head from each of a plurality of first directions within a predetermined range in front of the head, a delay time of an impulse response when sound reaches the head from a second direction outside the predetermined range; and
a correction step of correcting a delay time of a reference impulse response, modeled in advance for sound from the second direction, to match the delay time predicted in the prediction step,
wherein the prediction step includes:
a specifying step of specifying, as a positional relationship between the head and a sound source placed in the first direction when the impulse response is measured, a positional relationship in which the delay time of sound arriving from the sound source equals the delay time of the measured impulse response; and
a calculation step of calculating, based on the positional relationship specified in the specifying step, the delay time predicted when sound reaches the head from the second direction.