JPH01183232A

JPH01183232A - Presence-of-speech detection device

Info

Publication number: JPH01183232A
Application number: JP677188A
Authority: JP
Inventors: Takao Suzuki; 孝夫鈴木; Osamu Noguchi; 修野口
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1988-01-18
Filing date: 1988-01-18
Publication date: 1989-07-21

Abstract

PURPOSE:To precisely calculate a noise level and to attain presence-of-speech detection in which malfunction with respect to noise is less by setting a center clipping detection signal and a double talk detection signal to control signals which temporarily stop the update of a threshold. CONSTITUTION:An adaptive power threshold updating unit 264 in a presence-of- speech detection part 26 inputs the center clipping detection signal 512 and the double talk detection signal 514 from the coefficient calculator 126 in an echo cancel part 12. When either of the signals 512 or 514 is detected, an adaptive power threshold is not updated. A presence-of-speech decision unit 262 receives the short frame mean power output 700 of a power calculator 260 and the threshold output 702 of the threshold updating unit 264, compares them, and executes presence-of-speech decision. Thus, the noise level can precisely be calculated and presence-of-speech detection with less malfunction with respect to noise is attained.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は有音検出装置、とくに国際衛星回線または光海
底ケーブル回線などの回線価格が高価な長距離通信回線
に有利に適用されるデジタル音声挿入（Ｏ８１）システ
ムにおける有音検出装置に関する。DETAILED DESCRIPTION OF THE INVENTION (Industrial Application Field) The present invention is a sound detection device, particularly a digital voice detection device that is advantageously applied to long-distance communication lines where line prices are high, such as international satellite lines or optical submarine cable lines. The present invention relates to a sound detection device in an insertion (O81) system.

（従来の技術）通信網の発展に伴ない国際衛星回線や光海底ケーブル回
線などの長距離通信回線の需要が激増しており、これら
高価な回線を効率的に使用する方式が検討されている。(Conventional technology) With the development of communication networks, the demand for long-distance communication lines such as international satellite lines and optical submarine cable lines has increased dramatically, and methods for efficiently using these expensive lines are being considered. .

このような方式には、たとえば無音圧縮技術に基づ＜　
　Ｏ５１方式または高能−１，符号化方式などがある。Such methods include, for example, <
There are O51 method, high efficiency-1, encoding method, etc.

前者の方式を用いたものがＤＳ＋システムであり、また
後者の方式を用いタモノがトランスコータ・システムで
あり、さらに前者の方式と後者の方式を組合せたものが
ディジタル回線多重化システムであり、これらいずれも
実用化システムの時期を迎えつつある。A system that uses the former method is a DS+ system, a transcoder system that uses the latter method, and a digital line multiplexing system that combines the former method and the latter method. All of these systems are approaching the point where they can be put into practical use.

無音圧縮技術を適用するＤＳＩシステムに関しては、た
とえば太田、大野による「ディジタル通話音声そう大シ
ステム」電子通信学会論文Ａ。For DSI systems that apply silence compression technology, see, for example, ``Digital Call Voice System'' by Ohta and Ohno, Institute of Electronics and Communication Engineers Paper A.

第５６−Ａ巻、第８号、第４４８〜４５５頁（１９７３
年８月）に記載されたものがあり、以下Ｏ６Ｉシステム
について説明する。Volume 56-A, No. 8, pp. 448-455 (1973
The O6I system is described below.

電話会話における通話時間には、通話者か相手の話を聞
いている時間および通話者が話しているいるときの単語
−文節・文章の切れ目の休止時間である無音区間が含ま
れる。ＤＳＩシステムは、このような無音区間が通常の
電話会話に５０％以上あることを利用して回線の効率化
を図るものである。すなわちこのシステムは、多数の入
力チャネルの状ＩＥを監視し、通話者が話している有音
のチャネルのみを選択して出力チャネルに割当て伝達す
ることにより、通話チャネル数の半分の伝達チャネル数
を用いて通話を可能とするものである。The call time in a telephone conversation includes the time during which the caller is listening to what the other party is saying, and the silent period, which is the pause time between words, phrases, and sentences when the caller is speaking. The DSI system aims to improve line efficiency by taking advantage of the fact that a normal telephone conversation has 50% or more of such silent periods. In other words, this system monitors the status IE of a large number of input channels, selects only the active channel where the caller is talking, and assigns it to the output channel for transmission, thereby reducing the number of transmission channels by half the number of communication channels. This allows you to make phone calls.

したがって、　ＤＳＩシステムでは近端話者の音声部分
の検出感度が良好で、背景雑音による誤動作が少ない有
音検出器が要求される。何故なら検出感度の鈍い有音検
出器は話頭切断の増加により音声品質の劣化をもたらし
、また雑音による誤動作の多い有音検出器は不要な音声
平均動作率（回線に音声の存在する割合）の増加により
効率の劣化をもたらすためである。Therefore, the DSI system requires a utterance detector that has good sensitivity in detecting the voice part of the near-end speaker and has few malfunctions due to background noise. This is because a voice presence detector with low detection sensitivity causes deterioration of voice quality due to an increase in the number of disconnections at the beginning of speech, and a voice presence detector with many malfunctions due to noise reduces unnecessary voice average operation rate (ratio of voice presence on the line). This is because the increase causes a deterioration in efficiency.

また、ＤＳＩシステムにおける有音検Ｗ器は、長距離回
線を用いた電話通信におけるエコー（反２１りにより誤
動作するという問題がある。周知のように、一般の電話
回線では両端の電話に接続される加入者線は２線式であ
り、長距離回線の伝送路は４線式である。このため、２
線４線変換が交換機の加入者回路により行なわれるが、
回線の特性インピーダンスのばらつきによりインピーダ
ンス不整合が生じ、４線受信側に入った受信信号は４線
送信側に漏洩し送話者側に戻りエコーを引き起こす。　
ＤＳＩシステムはこのようなエコーを有音と見なすため
、これにより音声平均動作率が見かけ上５０％を越すと
、音声の締出しが極端に増加し効率の劣化をもたらすこ
とになる。このようなエコーによる弊害を防ぐため、ｌ
ｌ５Ｉシステムではエコー制御が不可欠である。Additionally, the sound detector in the DSI system has the problem of malfunctioning due to echoes in telephone communication using long-distance lines. The subscriber line is a 2-wire system, and the transmission path of a long-distance line is a 4-wire system.
Line 4-wire conversion is performed by the subscriber circuit of the exchange,
Impedance mismatch occurs due to variations in the characteristic impedance of the lines, and the received signal entering the 4-wire receiving side leaks to the 4-wire transmitting side and returns to the speaker side, causing an echo.
Since the DSI system regards such echoes as sound, if the average voice activity rate apparently exceeds 50%, the amount of voice cut-out increases significantly, resulting in a deterioration of efficiency. In order to prevent the harmful effects of such echoes, l
Echo control is essential in the I5I system.

エコー制御の方式には、たとえばエコーサプレッサ（反
響阻止）方式とエコーキャンセラ（反響消去）方式とか
ある。エコーサプレッサ方式は、送信レベルより受信レ
ベルが大きい場合に送信側の伝送路に大きな損失を与え
る方式である。Echo control methods include, for example, an echo suppressor method and an echo canceller method. The echo suppressor method is a method that causes a large loss to the transmission path on the transmitting side when the receiving level is higher than the transmitting level.

このため、送受両方向の同時通話時に音声の切断を生し
るという欠点があり満足すべき通話品質が得られない。For this reason, there is a drawback that the voice is cut off when a call is made in both the sending and receiving directions simultaneously, and a satisfactory call quality cannot be obtained.

エコーキャンセラ方式は、回線のインピータンス不整合
により生じるエコーのインパルス応答を推定し、このイ
ンパルス応答と受信信号から実際のエコーとほぼ等しい
疑似信号を作りだし、この゛　信号を実際のエコーから
差し引きエコーを打消す方式である。このため、本質的
に音声の切断がなくエコーを抑圧できるという利点があ
る。エコーキャンセラ方式は、複雑な演算処理を必要と
するが最近のディジタル信号処理技術と半導体集積回路
技術の進歩により高性能で経済的なものが実用化されつ
つある。このようにエコーキャンセラ方式は、エコーサ
プレッサ方式に比べ通話品質が高いため、通話品質を重
視する国際通信回線に適している。The echo canceller method estimates the impulse response of the echo caused by line impedance mismatch, creates a pseudo signal that is almost the same as the actual echo from this impulse response and the received signal, and then subtracts this signal from the actual echo to eliminate the echo. This is a method of canceling. Therefore, there is an advantage that echoes can be suppressed without essentially cutting off the audio. The echo canceller method requires complex arithmetic processing, but with recent advances in digital signal processing technology and semiconductor integrated circuit technology, high-performance and economical methods are being put into practical use. As described above, the echo canceller method has higher call quality than the echo suppressor method, and is therefore suitable for international communication lines where call quality is important.

ＤＳＩシステムにエコーキャンセラ方式を適用する場合
には、　ＤＳＩシステムの加入者側にエコーキャンセラ
を縦続に接続する。具体的にはたとえばティジタル電子
交換機は、２線４線変換を行なうハイブリッド回路に符
号化復号回路（Ｇｏｄｅｃ）がｊｆＺ　Ｍ、されるが、
エコーキャンセラ方式を用いるとハイブリッド回路と符
号化復号回路の間にエコーキャンセラを挿入する。When applying an echo canceller method to a DSI system, the echo cancellers are connected in series on the subscriber side of the DSI system. Specifically, for example, in a digital electronic exchange, an encoding/decoding circuit (Godec) is installed in a hybrid circuit that performs two-wire and four-wire conversion.
When using the echo canceller method, an echo canceller is inserted between the hybrid circuit and the encoding/decoding circuit.

（発明が解決しようとする問題点）ＤＳＩシステムとエコーキャンセラとの併置は装置の小
型化が困難であり、保守、運用上で問題点があった。ま
た、従来のＤＳＩ方式では、ＤＳＩシステムにエコーキ
ャンセラを単に縦続接続している。このため、エコー成
分だけでなく雑音成分も減衰されるので、雑音パワーに
基づく適応パワー閾値を用いた有音検出器は不必要に閾
値が更新され、雑音による誤動作が多いという問題点が
あった。(Problems to be Solved by the Invention) When a DSI system and an echo canceller are installed together, it is difficult to miniaturize the device, and there are problems in terms of maintenance and operation. Furthermore, in the conventional DSI system, echo cancellers are simply connected in cascade to the DSI system. For this reason, not only the echo component but also the noise component is attenuated, so a sound detector that uses an adaptive power threshold based on the noise power has the problem that the threshold is updated unnecessarily and there are many malfunctions due to noise. .

本発明はこのような従来技術の欠点を解消し、装置の設
計、製造上無駄のないエコーキャンセラ機能を内蔵させ
たＤＳＩシステムを提供することを目的とする。また１
本発明はエコーキャンセラ機能を内蔵させたＤＳＩシス
テムにおいてエコーキャンセラが具備している制御機能
を用いて雑音パワーに基づく適応パワー閾値の更新を正
確に行なう有音検出装置を提供することを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to eliminate such drawbacks of the prior art and provide a DSI system incorporating an echo canceller function that is efficient in device design and manufacturing. Also 1
SUMMARY OF THE INVENTION An object of the present invention is to provide a voice presence detection device that accurately updates an adaptive power threshold based on noise power using a control function included in an echo canceller in a DSI system with a built-in echo canceller function. .

（問題点を解決するための手段）本発明はと述の問題点を解決するために、入力した信号
に音声が含まれているか否かを検出する有音検出装置は
、前記信号を入力し、該信号の平均パワーを算出する算
出手段と、算出手段より前記信号の平均パワーを受け、
平均パワーにより雑音パワーの平均レベルである雑音パ
ワー平均レベルを算出更新する閾値更新手段と、前記信
号の平均パワーおよび雑音パワー平均レベルを受けて両
者を比較し、前記信号に音声か含まれているか否かを判
断する判定手段と、ダブルトーク検出信号およびセンタ
クリッピング検出信号を入力する入力端子とを有し、入
力端子にダブルトーク検出信号およびセンタクリッピン
グ検出信号のいずれか一方でも入力されると、閾値更新
手段は雑音パワー平均レベルの算出更新を入力の間中断
し、パワー平均レベルをダブルトーク検出信号およびセ
ンタクリッピング検出信号のいずれかが入力ざれる以前
の値に保つ。(Means for Solving the Problems) In order to solve the above-mentioned problems, the present invention provides a sound detection device that detects whether or not an input signal includes speech. , a calculation means for calculating the average power of the signal, and receiving the average power of the signal from the calculation means,
threshold updating means for calculating and updating a noise power average level, which is an average level of noise power, based on the average power; and receiving the average power of the signal and the noise power average level and comparing the two to determine whether the signal contains speech or not. and an input terminal for inputting a double talk detection signal and a center clipping detection signal, and when either the double talk detection signal or the center clipping detection signal is input to the input terminal, The threshold updating means suspends calculation and updating of the noise power average level during input, and maintains the power average level at the value before either the double talk detection signal or the center clipping detection signal is input.

また本発明によれば、入力信号に音声が含まれているか
否かを検出する有音検出装置は、該入力信号に含まれる
エコー成分の制御を行なうとともに、入力信号のダブル
トークおよびセンタクリッピングを検出するエコー制御
手段と、エコー制御手段よりエコー成分を制御した信号
を入力し、該信号の平均パワーを算出する算出手段と、
算出手段より前記信号の平均パワーを受け、平均パワー
により雑音パワーの平均レベルである雑音パワー平均レ
ベルを算出更新する閾値更新手段と、前記信号の平均パ
ワーおよび雑音パワー平均レベルを受けて両者を比較し
、前記信号に音声が含まれているか否かを判断する判定
手段とを有し、エコー制御手段は、前記入力信号にダブ
ルトークおよびセンタクリッピングのいずれかを検出す
ると、該検出を閾値更新手段に通知し、閾値更新手段は
、該通知を受けると雑音パワー平均レベルの算出更新を
該通知を受けている間行なわない。Further, according to the present invention, the sound detection device that detects whether or not an input signal includes speech controls echo components included in the input signal, and also controls double talk and center clipping of the input signal. an echo control means for detecting; a calculation means for inputting a signal whose echo component has been controlled from the echo control means and calculating the average power of the signal;
Threshold updating means receives the average power of the signal from the calculation means and calculates and updates a noise power average level which is the average level of noise power using the average power; and receives the average power of the signal and the noise power average level and compares the two. and determining means for determining whether or not the signal includes audio, and when the echo control means detects either double talk or center clipping in the input signal, the echo control means updates the detection with a threshold value updating means. Upon receiving the notification, the threshold updating means does not update the calculation of the average noise power level while receiving the notification.

また本発明によれば、回線に発生したエコーを制御する
エコー制御装置を有し、該回線で伝送された信号の無音
区間に他の音声信号を挿入するティジタル、音声挿入シ
ステムは、前記信号に音声信号が含まれるか否かを伝送
された信号の雑音パワーに基づいて更新される適応パワ
ー閾値と比較することによって検出する有音検出装置を
有し。Further, according to the present invention, there is provided a digital audio insertion system that includes an echo control device that controls echoes generated on a line and inserts another audio signal into a silent section of a signal transmitted on the line. It has a speech presence detection device that detects whether a speech signal is included by comparing it with an adaptive power threshold that is updated based on the noise power of the transmitted signal.

有音検出装置は、エコー制御装置より、エコー制御され
た信号を受信するとともにエコー制御信号としてセンタ
クリッピング検出信号およびダブルトーク検出信号を入
力可能に接続され、有音検出装置は、エコー制御装置よ
りセンタフリンピンク検出信号およびダブルトーク検出
信号のいす、れか一方でも入力しているときは、適応パ
ワーの閾イｉＱの更新を行なわない。The sound detection device is connected to receive an echo-controlled signal from the echo control device and to be able to input a center clipping detection signal and a double talk detection signal as echo control signals. When either the center fly pink detection signal or the double talk detection signal is input, the adaptive power threshold iQ is not updated.

（作　用）本発明によれば、有音検出装置は、エコー制御装置より
センタクリッピング検出信号およびダブルトーク検出信
号のいずれか一方でも入力しているときは、入力した信
号により平均パワーが変動しても雑音パワー平均レベル
の算出更新を行なわない。(Function) According to the present invention, when the sound presence detection device receives either the center clipping detection signal or the double talk detection signal from the echo control device, the average power varies depending on the input signal. The calculation of the average noise power level is not updated even if the noise power level is calculated.

（実施例）本発明の詳細な説明に先立って、本発明の理解を助ける
ために、第３図を参照してディジタル音声挿入（Ｄｉｇ
ｉｔａｌ　５ｐｅｅｃｈ　Ｉｎｔｅｒｐｏｌａｔｉｏｎ
、以下ＤＳＩ　と称す）システムにエコーキャンセラ部
を組込んだ従来技術を説明する。ＤＳＩシステムとは、
前述したように無音圧縮技術に基づいて通話の塀音区間
に他の通話の実際の音声である有音な挿入し、回線の効
率化を図るものである。同図に示すように従来のＤＳＩ
システムは、　　ＤＳＩ送信装置４およびＤＳＩ受信装
置３により構成され、エコーキャンセラＥＣが電話機毎
にハイブリッド回路Ｈの装置側に接続されている。(Example) Prior to detailed description of the present invention, in order to help understanding of the present invention, reference will be made to FIG.
ital 5peech Interpolation
A conventional technique in which an echo canceller section is incorporated into a system (hereinafter referred to as DSI) will be described. What is the DSI system?
As mentioned above, based on the silence compression technology, active voice, which is the actual voice of another call, is inserted into the wall tone section of a call to improve the efficiency of the line. As shown in the figure, conventional DSI
The system is composed of a DSI transmitting device 4 and a DSI receiving device 3, and an echo canceller EC is connected to the device side of the hybrid circuit H for each telephone.

電話機か゛ら送られてきた音声信号は、ハイブリッド回
路Ｈにより２線４線変換され、エコーキャンセラＥＣに
入力される。二ニーキャンセラＥＣは、回線の特性イン
ピーダンスのばらつきによりインピーダンス不整合が生
じ、ハ・イブリッド回路の４線受信側に入った受信信号
が４線送信側に漏洩することでエコー（反響）が生じる
ことによりＤＳＩシステムが誤動作するのを防ぐ回路で
ある。The audio signal sent from the telephone is converted into 2-wire and 4-wire by the hybrid circuit H, and is input to the echo canceller EC. The two-knee canceller EC is designed to prevent impedance mismatching due to variations in the characteristic impedance of the lines, and the received signal entering the 4-wire receiving side of the hybrid circuit leaks to the 4-wire transmitting side, causing an echo. This circuit prevents the DSI system from malfunctioning.

エコーキャンセラＥＣより出力された音声信号は、ＰＣ
Ｍエンコータに入力され、ここでＰＣＭ信号に変換され
マルチプレクサに送られる。マルチプレクサは入力した
ＰＣＭ信号を多重化して入力ＰＣＭ信号としてＤＳＩ送
信装置４に送る。The audio signal output from the echo canceller EC is sent to the PC.
The signal is input to the M encoder, where it is converted to a PCM signal and sent to the multiplexer. The multiplexer multiplexes the input PCM signals and sends them to the DSI transmitter 4 as input PCM signals.

マルチプレクサより送られてきたＰＣＭ信号は、ＤＳＩ
送信装置４の音声遅延部および有音検出部に入力される
。有音検出器は、背景雑音の雑音パワーに基づき適応パ
ワー閾値を更新することにより有音を検出する有音検出
器である。有音検出器は、入力ＰＣＭ信号の各チャネル
を監視して音声があれば割当プロセッサに送信割当要求
信号を送る。また、音声遅延部は入力したＰＣＭ信号を
一定時間遅延した後、音声メモリに出力する。The PCM signal sent from the multiplexer is
The signal is input to the voice delay section and voice detection section of the transmitting device 4. The utterance detector is a utterance detector that detects utterance by updating an adaptive power threshold based on the noise power of background noise. The voice detector monitors each channel of the input PCM signal and, if there is voice, sends a transmission assignment request signal to the assignment processor. Further, the audio delay unit delays the input PCM signal for a certain period of time, and then outputs the signal to the audio memory.

割当プロセッサは、送信割当要求信号を受信すると音声
メモリを制御し、割当要求されたチャネルの音声信号を
このメモリから選出し、出力インタフェースに送るよう
音声メモリを制御するとともに、出力インタフェースに
割当情報を出力する。出力インタフェースは、音声信号
および割当情報、を合成し出力ＤＳＩ信号として対向す
るＤＳＩ受信装誼３に送信する。When the allocation processor receives the transmission allocation request signal, the allocation processor controls the audio memory to select the audio signal of the allocated channel from this memory and send it to the output interface, and also sends allocation information to the output interface. Output. The output interface combines the audio signal and the allocation information and transmits the synthesized signal as an output DSI signal to the opposing DSI receiving device 3.

ＤＳＩ送信装置４より送信された出力ＤＳＩ信号が入力
ＤＳＩ信号としてＤＳＩ受信装置３の入力インタフェー
スに入力されると、入力インタフェースはこの信号を音
声信号と割当情報とに分解し、音声信号を音声メモリに
、また割当情報を割当プロセッサにそれぞれ出力する。When the output DSI signal transmitted from the DSI transmitter 4 is input as an input DSI signal to the input interface of the DSI receiver 3, the input interface decomposes this signal into an audio signal and allocation information, and stores the audio signal in the audio memory. In addition, the allocation information is output to the allocation processor.

割当プロセッサは、受信した割当情報に基づいて音声メ
モリを制御し、音声信号を各チャネルに割当てる。入力
した音声信号は有音部分のみの信号であるため、通話の
自然性を確保するため背景雑音に相当するものを無音部
分に挿入する必要がある。このめ割当プロセッサは、各
チャネルの無音部分に背景雑音が挿入されるよう雑音発
生器を制御する。The allocation processor controls the audio memory based on the received allocation information and allocates audio signals to each channel. Since the input audio signal is a signal containing only the voiced portions, it is necessary to insert something equivalent to background noise into the silent portions in order to ensure the naturalness of the conversation. To this end, the allocation processor controls the noise generator so that background noise is inserted into the silent portions of each channel.

背景雑音が挿入された音声信号は、ＤＳＩ受信装置３よ
り出力ＰＣＭ信号としてデーマルチプレクサに送られる
。デマルチプレクサは多重化された音声信号をチャネル
毎に分離する回路であり、分離された音声信号はＰＣＭ
デコーダに送られる。ＰＣＭデコーダはディジタル音声
信号をアナログ音声信号に変換する。アナログ音声信号
は、エコーキャンセラおよびハイブリッド回路Ｈを介し
て通話先の電話機に送られる。The audio signal into which the background noise has been inserted is sent from the DSI receiving device 3 to the demultiplexer as an output PCM signal. A demultiplexer is a circuit that separates multiplexed audio signals for each channel, and the separated audio signals are PCM
sent to the decoder. A PCM decoder converts digital audio signals into analog audio signals. The analog voice signal is sent via an echo canceller and hybrid circuit H to the called telephone.

このように従来技術によるＤＳＩシステムでは。In this way, in the DSI system according to the prior art.

エコーキャンセラが端末毎に必要なため、システムの小
型化および経済化が困難であり、保守、運用上において
問題があった。また、有音検出器は、雑音パワーに基づ
く適応パワー閾値を用いているため単にエコーキャンセ
ラを［ｌＳ１システムに接続するだけではエコー成分以
外に雑音成分も減衰される。このため、雑音パワーに基
づく適応閾値を用いた有音検出部は不必要に閾値、が更
新され、雑音による誤動作が多いという問題点がある。Since an echo canceller is required for each terminal, it is difficult to make the system smaller and more economical, and there are problems in terms of maintenance and operation. Furthermore, since the utterance detector uses an adaptive power threshold based on noise power, simply connecting an echo canceller to the [lS1 system will attenuate noise components in addition to echo components. For this reason, there is a problem in that a voice detection section using an adaptive threshold based on noise power has a threshold updated unnecessarily and often malfunctions due to noise.

次に添付図面を参照して本発明による有音検出制御方式
の実施例を詳細に説明する。Next, embodiments of the sound detection control method according to the present invention will be described in detail with reference to the accompanying drawings.

第１図を参照すると、本発明による有音検出装置を、た
とえば国際衛星回線または光海底ケーブルなどに接続さ
れるＤＳＩシステムに適用した実施例が示されている。Referring to FIG. 1, there is shown an embodiment in which a sound detection device according to the present invention is applied to a DSI system connected to, for example, an international satellite line or an optical submarine cable.

なお、同図には本発明に直接関係のある要素、すなわち
ＤＳＩシステムの送信装置側が示されており、直接関係
の嵩い受信装晋側の要素は略して記載している。Note that this figure shows elements directly related to the present invention, that is, the transmitting device side of the DSI system, and directly related bulky elements on the receiving device side are omitted.

ＤＳＩシステムに接続されている複数のたとえば電話機
より送られてくるアナログ音声信号は、ＰＣＭエンコー
ダ（図示せず）でディジタル音声信号に変換され、入力
ＰＣＭチャネルＣＨＩ−Ｃ）Ｉｎに割り当てられる。　
ＰＧＭチャネルＣＨＩ〜ＣＨｎのディジタル音声信号は
、出力５００としてマルチプレクサ１１に送られる。マ
ルチプレクサ１１は出力５００を時分割多重する多重化
装置である。マルチプレクサ１１は、送信入力（Ｓｉｎ
）信号線５０２を介しエコーキャンセル部１２に接続さ
れ、入力ＰＣＭ信号をエコーキャンセル部１２に出力す
る。Analog voice signals sent from a plurality of telephones, for example, connected to the DSI system are converted into digital voice signals by a PCM encoder (not shown) and assigned to an input PCM channel CHI-C)In.
The digital audio signals of PGM channels CHI to CHn are sent to multiplexer 11 as output 500. The multiplexer 11 is a multiplexing device that time-division multiplexes the outputs 500. The multiplexer 11 has a transmission input (Sin
) It is connected to the echo canceling unit 12 via a signal line 502 and outputs the input PCM signal to the echo canceling unit 12.

電話機から送られてくる音声信号は、たとえば交換機の
加入者回路のハイブリッド回路（図示せず）において２
線４１ｊｉｔ変換が行なわれるが、前述のように回線の
特性インピーダンスのばらつきによりインピーダンス不
整合が生じる。このため、４線受信側に入った受信信号
は４線送信側に漏洩し送話側に戻りエコーを引き起こす
。ｌｌ５Ｉシステムは、無音圧縮技術に基づいているた
め、このようなエコーがあると、音声平均動作率が見か
け上たとえば５０％を越えて音声の締出しが極端に増加
し効率の劣化をもたらすことになる。このため、エコー
キャンセル部１２は、エコーキャンセラ（反響消去）方
式により　ＰＣＭチャネルＣＨＩ−ＣＨｎに発生するエ
コーをチャネル毎に消去する。エコーキャンセル部１２
は、送信出力（Ｓｏｕｔ）信号線５０４７ｉ介し音声遅
延部２０に接続されるとともにｎチャネル有音検出部２
６に接続される。エコーキャンセル部１２はまた。セン
タクリッピング検出信号線５１２およびダブルトーク検
出信号線５１４を介し有音検出部２Ｂに接続されている
。The voice signal sent from the telephone is, for example, divided into two parts in a hybrid circuit (not shown) in the subscriber circuit of the exchange.
Line 41jit conversion is performed, but impedance mismatch occurs due to variations in the characteristic impedance of the line, as described above. Therefore, the received signal entering the 4-wire receiving side leaks to the 4-wire transmitting side and returns to the transmitting side, causing an echo. Since the ll5I system is based on silence compression technology, if there is such an echo, the average voice operation rate will apparently exceed, for example, 50%, which will dramatically increase the number of voices blocked out and cause a deterioration of efficiency. Become. For this reason, the echo canceling unit 12 uses an echo canceller (echo cancellation) method to cancel echoes occurring in the PCM channels CHI-CHn for each channel. Echo canceling section 12
is connected to the audio delay unit 20 via a transmission output (Sout) signal line 5047i, and is also connected to the n-channel sound detection unit 2
Connected to 6. The echo canceling unit 12 is also It is connected to the sound detection section 2B via a center clipping detection signal line 512 and a double talk detection signal line 514.

有音検出部２６は、有音検出入力である送信出力５０４
によりチャネルＣＨＩ〜ＣＨｎの音声部分を検出する。The sound detection unit 26 outputs a transmission output 504 which is a sound detection input.
The audio portions of channels CHI to CHn are detected.

このため有音検出部２６は、近端話者の音声部分の検出
感度が良好で背景雑音による誤動作が少ないことが要求
される。これは、検出感度の鈍い有音検出部２８は話頭
切断の増加により音声品質が劣化し、雑音による誤動作
の多い有音検出部２Ｂは不要な音声平均動作率の増加に
より効率の劣化をもたらすためである。有音検出部２６
は有音検出出力５１５を介し割当プロセッサ２８に接続
される。Therefore, the utterance detection unit 26 is required to have good detection sensitivity for the near-end speaker's voice part and to have few malfunctions due to background noise. This is because the voice presence detection unit 28, which has low detection sensitivity, deteriorates the voice quality due to an increase in speech cutting, and the voice presence detection unit 2B, which often malfunctions due to noise, causes a decrease in efficiency due to an increase in unnecessary voice average operation rate. It is. Sound detection unit 26
is connected to the assignment processor 28 via the voice detection output 515.

有音検出部２Ｂは、送信出力５０４に音声勢力が存在す
ればチャネル割当要求信号として有音検出出力５１５を
割当プロセッサ２８に出力する。The sound presence detection unit 2B outputs a sound presence detection output 515 to the allocation processor 28 as a channel allocation request signal if a voice signal is present in the transmission output 504.

音声遅延部２０は、有音検出部２６の処理遅延を補償す
るため入力した送信出力信号５０４を一定時間遅延させ
る。音声遅延部２０は出力５０６を介し音声メモリ２２
に接続されており、遅延された音声信号は出力５０８と
してメモリ２２に送られる。The audio delay unit 20 delays the input transmission output signal 504 for a certain period of time in order to compensate for the processing delay of the voice detection unit 26. The audio delay unit 20 is connected to the audio memory 22 via an output 506.
The delayed audio signal is sent to memory 22 as output 508.

割当プロセッサ２８は、たとえばＤＳＩ送信装置２のＤ
ＳＩ利得が２の場合には、実際に入力したチャネル数の
半分のチャネル数で音声信号を送信装置２から受信装置
３に送信するために、音声メモリ２２および出力インタ
フェース２４を制御する制御回路である。割当プロセッ
サ２８は、メモリ制御出力５１８を介し音声メモリ２２
に接続されるとともに割当情報出力５１Ｂを介し出力イ
ンタフェース２４に接続されている。プロセッサ２８は
、入力ＰＣＭチャネルとＤＳＩ処理を行なった出力ＯＳ
Ｉチャネルとの対応関係を表わす割当情報出力５１８を
インタフェース２４に送る。The allocation processor 28 is configured to, for example,
When the SI gain is 2, the control circuit that controls the audio memory 22 and the output interface 24 transmits the audio signal from the transmitter 2 to the receiver 3 using half the number of channels actually input. be. Allocation processor 28 connects audio memory 22 via memory control output 518.
It is connected to the output interface 24 via the allocation information output 51B. The processor 28 has an input PCM channel and an output OS that has undergone DSI processing.
Assignment information output 518 representing the correspondence with the I channel is sent to the interface 24.

音声メモリ２２は、制御出力５１６に従い対応するチャ
ネルの音声信号を有音としてメモリに格納する。音声メ
モリ２２は、出力５０８を介し出力インタフェース２４
に接続されており、メモリに格納した有音のみインタフ
ェース２４に送る。インタフェース２４は、音声メモリ
２２の出力５０Ｂとプロセッサ２８の割当情報出力５１
８とを入力し、これら出力を合成して出力ＤＳＩ信号５
１０として対向するＤＳＩ受信装行３に送る。The audio memory 22 stores the audio signal of the corresponding channel as sound in accordance with the control output 516. Audio memory 22 is connected to output interface 24 via output 508.
is connected to the interface 24, and only the sound signals stored in the memory are sent to the interface 24. The interface 24 connects the output 50B of the audio memory 22 and the allocation information output 51 of the processor 28.
8 and synthesizes these outputs to output DSI signal 5.
10 to the opposing DSI receiver row 3.

第２図には本実施例によるエコーキャンセル部１２と有
音検出部２Ｂの機能ブロック図が示されている。なお、
同図は理解を容易にするためエコーキャセル部１２およ
び有音検出部２Ｂの１チャネル分か示されている。FIG. 2 shows a functional block diagram of the echo canceling section 12 and the sound detecting section 2B according to this embodiment. In addition,
In the figure, one channel of the echo canceller 12 and the sound detection section 2B is shown for ease of understanding.

エコーキャンセル部１２は、加算器１２０、センタクリ
ツバ１２２　、Ｎ次係数ディジタルフィルタｉ２４およ
び係数算出器１２６により構成されている。エコーキャ
ンセル部１２は、受信出力５２２から送信入力５０２間
の端末側経由の伝送路、すなわちエコーパスのインパル
ス応答を推定し、このインパルス応答と受信人力５２２
から入力した実際のエコーとほぼ等しいエコー疑似信号
Ｅｎｉを作りだし、この信号Ｅｎｉを実際のエコー信号
Ｅｎから差し引いてエコーを打ち消す。The echo canceling section 12 includes an adder 120, a center critter 122, an N-order coefficient digital filter i24, and a coefficient calculator 126. The echo canceling unit 12 estimates the impulse response of the transmission path via the terminal side between the receiving output 522 and the transmitting input 502, that is, the echo path, and combines this impulse response with the receiving human power 522.
An echo pseudo signal Eni that is almost equal to the actual echo input from the input signal is generated, and this signal Eni is subtracted from the actual echo signal En to cancel the echo.

加算器１２０は送信入力５０２として音声信号Ｅｎ＋Ｚ
ｎを、またディジタルフィルタ１２４よりエコー疑似信
号−Ｅｎｉをそれぞれ入力し１合成信号Ｅｎ＋　Ｚｎ−
Ｅｎｉを作成する加算器である。加算器１２０の出力６
００はセンタクリッパ１２２および係数算出器１２６に
それぞれ接続され、合成信号Ｅｎ＋　Ｚｎ−Ｅｎｉを出
力６００として出力する。The adder 120 receives the audio signal En+Z as a transmitting input 502.
n and the echo pseudo signal -Eni from the digital filter 124 respectively to obtain one composite signal En+Zn-
This is an adder that creates Eni. Output 6 of adder 120
00 is connected to the center clipper 122 and the coefficient calculator 126, respectively, and outputs the composite signal En+Zn-Eni as an output 600.

算出器１２６は受信入力線５２０にも接続されており、
送信人力５０２、出力８００および受信人力５２０を入
力し、これらから疑似エコーの最適値を推定して係数算
出する。算出器１２６は、出力５１６を介しディジタル
フィルタ１２４に接続され、計数算出した値を出力５１
６によりこれに出力する。算出器１２６はまた、前述の
信号を入力することによりセンタクリンピングおよびダ
ブルトークを検出する機能を有する。算出器１２６は、
出力５２１を介しセンタクリッパ１２２に、またダブル
トーク検出信号線５１４およびセンタクリッピング検出
信号線５１２を介し後述する有音検出部２６の適応パワ
ー閾値更新器２６４に接続され、ダブルトークまたはセ
ンタクリンピングを検出するとそれぞれに検出報告を行
なう。Calculator 126 is also connected to receive input line 520,
The transmitter's power 502, the output 800, and the receiver's power 520 are input, and from these, the optimum value of the pseudo echo is estimated and the coefficients are calculated. The calculator 126 is connected to the digital filter 124 via an output 516, and outputs the counted value to the output 51.
6 to output to this. Calculator 126 also has the ability to detect center crimping and double talk by inputting the aforementioned signals. The calculator 126 is
It is connected to the center clipper 122 via the output 521, and to the adaptive power threshold updater 264 of the voice detection unit 26, which will be described later, via the double talk detection signal line 514 and the center clipping detection signal line 512, and detects double talk or center crimping. When detected, a detection report is made for each.

ダブルトークとはエコーに送信信号が重畳することであ
り、第５図には本実施例によるダブルトーク検出領域が
示されている。すなわち、係数算出器１２６は、送信人
力５０２のレベル閾値を一３１ｄＢｍＯ、受信入力５２
０のレベル閾値を一３１ｄＢｍＯとし、縦軸を送信人力
５０２のレベルＬｓ、横軸を受信人力５２０のレベルＬ
ｒとした場合に、直線Ｌｓ＝Ｌｒ−ｋ（ｋは定数）より
送信入力レベルＬｓが大きい場合にダブルトーク検出と
見なす。Double talk means that a transmission signal is superimposed on an echo, and FIG. 5 shows a double talk detection area according to this embodiment. That is, the coefficient calculator 126 sets the level threshold of the transmitting human power 502 to -31 dBmO, and the receiving input 52
The level threshold of 0 is -31 dBmO, the vertical axis is the level Ls of the sending human power 502, and the horizontal axis is the level L of the receiving human power 520.
When r is the transmission input level Ls greater than the straight line Ls=Lr−k (k is a constant), it is regarded as double talk detection.

ディジタルフィルタ１２４は、疑似エコーの最適値を推
定して係数算出された出力５１６および受信入力５２０
を入力し、これらから疑似エコー−Ｅｎｉを作成する。The digital filter 124 has an output 516 whose coefficients have been calculated by estimating the optimum value of the pseudo echo, and a reception input 520.
are input, and a pseudo echo-Eni is created from these.

フィルタ１２４は出力５１８を介し加算器１２０に接続
され、疑似エコー−Ｅｎｉを出力５１８として加算器１
２０に出力する。The filter 124 is connected to the adder 120 via an output 518, and the pseudo echo Eni is connected to the adder 120 as an output 518.
Output to 20.

センタクリッパ１２２は、センタクリッピング検出出力
５２０を入力すると出力６００の波形操作を行なうこと
により微小な残留エコーを除去する回路である。第６図
にはセンタクリッパ１２２の入出力特性図が示されてお
り、同図に示すようにクリップ電圧Ｖｃは受信入力レベ
ルに応じて変化する。The center clipper 122 is a circuit that receives the center clipping detection output 520 and removes minute residual echoes by manipulating the waveform of the output 600. FIG. 6 shows an input/output characteristic diagram of the center clipper 122, and as shown in the figure, the clipping voltage Vc changes depending on the received input level.

有音検出部２６は、第２図に示すようにパワー算出器２
６０、有音判定器２６２および適応パワー閾値更新器２
８４により構成されている。パワー算出器２６０は、セ
ンタクリッパ１２２から出力５０４を入力し、この出力
５０４をたとえば１０ｍ５の短フレーム毎に平均パワー
を算出する。算出器１２６は出カフ００を介し閾値更新
器２６４および有音判定器２８２に接続され、これらに
平均パワーの算出結果７００を出力する。The sound detection unit 26 includes a power calculator 2 as shown in FIG.
60, utterance determiner 262 and adaptive power threshold updater 2
84. The power calculator 260 inputs the output 504 from the center clipper 122 and calculates the average power of the output 504 for each short frame of, for example, 10 m5. The calculator 126 is connected to the threshold updater 264 and the utterance determiner 282 via the output cuff 00, and outputs the average power calculation result 700 to these.

閾値更新器２６４は平均パワーの算出結果７００を入力
することにより、無音区間中の条件、たとえば１００ｍ
５の長フレームで連続した無音区間の平均パワーすなわ
ち雑音パワーの平均レベルＰｎを計算し、適応パワー閾
値レベルＰａｔｈを計算する。Ｐａｔｈの算出は次式に
より行なわれる。By inputting the average power calculation result 700, the threshold value updater 264 changes the condition during the silent section, for example, 100m.
The average power of continuous silent sections in five long frames, that is, the average level of noise power Pn is calculated, and the adaptive power threshold level Path is calculated. Path is calculated using the following equation.

Ｐａｔｈ＝　Ｋ　ｍ　ＰｎここでＫは重み付は係数であり、雑音の分散値によって
決定される。適応パワー閾値Ｐａｔｈの更新は、たとえ
ば１００ｍ５以上の長フレームが連続した無音区間で実
施される。閾値更新器２６４は、出カフ０２を介し有音
判定器２６２に接続され、適応パワー閾値Ｐａｔｈの更
新は出カフ０２として有音判定器２６２に出力される。Path=K m Pn Here, K is a weighting coefficient and is determined by the variance value of the noise. The adaptive power threshold Path is updated in a silent section with continuous long frames of, for example, 100 m5 or more. The threshold updater 264 is connected to the utterance determiner 262 via the output cuff 02, and the update of the adaptive power threshold Path is outputted to the utterance determiner 262 as the output cuff 02.

閾値更新器２６４は；前述のように係数算出器１２６よ
りセンタクリッピング検出信号５１２およびダブルトー
ク検出信号５１４を入力する。そして閾値更新器２６４
にセンタクリッピング検出信号５１２またはダブルトー
ク検出信号５１４のいずれかが検出されているときは、
前述の適応パワー閾値の更新を行なわない。これはセン
タクリッピング検出状態では雑音レベルが極端に低減す
るのでこれに追従するのを防止し、またグブルトーク検
出状態では雑音レベルの時間変動が大きいのでこれに追
従するのを防止するためである。Threshold updater 264 receives center clipping detection signal 512 and double talk detection signal 514 from coefficient calculator 126 as described above. and threshold updater 264
When either the center clipping detection signal 512 or the double talk detection signal 514 is detected,
The above-mentioned adaptive power threshold is not updated. This is to prevent the noise level from following the center clipping detection state, since it is extremely reduced, and to prevent the noise level from following it, since the noise level fluctuates greatly in the gobble talk detection state.

有音判定器２６２は、パワー算出器２８０の類フレーム
平均パワー出力ＺＯＯおよび閾値更新２６４の閾値出カ
フ０２を受信してこれらを比較し有音判定を行なう。判
定器２６２は、有音検出出力５１５を介し割当プロセッ
サ２８に接続されており、平均パワー出カフ００が閾値
出カフ０２より大きい場合は有音検出部カフ０６を割当
プロセー２す２８に出力する。The utterance determiner 262 receives the similar frame average power output ZOO of the power calculator 280 and the threshold output cuff 02 of the threshold value update 264, compares these, and determines the utterance. The determiner 262 is connected to the allocation processor 28 via the sound detection output 515, and outputs the sound detection unit cuff 06 to the allocation processor 28 if the average power output cuff 00 is greater than the threshold output cuff 02. .

有音判定器２６２はまた平均パワー出カフ００が閾値出
カフ０２より小さいときは無音検出量カフ０４を閾値更
新器２６４に出力する。When the average power output cuff 00 is smaller than the threshold output cuff 02, the utterance determiner 262 outputs the silence detection amount cuff 04 to the threshold updater 264.

チャネルＣＨＩ〜ＣＨｎの電話音声のＰＣＭ信号５００
は、マルチプレクサ１１に入力され、これに時分割多重
化されて送信入力５０２としてエコーキャンセル部１２
に送られる。送信入力５０２は、前述したように近端話
者からの音声信号Ｚｎとエコー信号Ｅｎとを加算した信
号である。このため、ディジタルフィルタ１２４と係数
算出器１２Ｇとによりエコー疑似信号−Ｅ旧を算出し、
加算器１２０によりエコー疑似信号を加算する。そして
加算器１２０により加算された信号６００は、センタク
リッパ１２２に送られて微小な残留エコーを除去する。Telephone voice PCM signal 500 for channels CHI to CHn
is input to the multiplexer 11, time-division multiplexed thereto, and transmitted to the echo canceling unit 12 as a transmission input 502.
sent to. The transmission input 502 is a signal obtained by adding the voice signal Zn from the near-end speaker and the echo signal En, as described above. Therefore, the echo pseudo signal -E old is calculated by the digital filter 124 and the coefficient calculator 12G,
An adder 120 adds the echo pseudo signals. The signal 600 added by the adder 120 is sent to the center clipper 122 to remove minute residual echoes.

センタクリッパ１２２から送信出力５０４として出力さ
れた音声信号は、遅延部２０および有音検出部２６に送
られる。The audio signal output from the center clipper 122 as a transmission output 504 is sent to the delay section 20 and the sound detection section 26.

送信出力５０４は、有音検出部２６の算出器２６０に入
力されると、これにより送信出力５０４の平均パワー７
００が算出される。この平均パワー７００は判定器２６
２に送られるとともに閾値更新器２６４に出力される。When the transmission output 504 is input to the calculator 260 of the sound detection section 26, the average power 7 of the transmission output 504 is calculated.
00 is calculated. This average power 700 is determined by the judge 26
2 and output to the threshold updater 264.

背景雑音は、入力した音声信号毎にそのレベルが異なる
ため、信号毎にその雑音値を更新する必要がある。閾値
更新器２８４は雑音パワーの平均値Ｐｎを計算し、雑音
パワーの基準値７０２として判定器２６２に出力する。Since the level of background noise differs depending on the input audio signal, it is necessary to update the noise value for each signal. The threshold updater 284 calculates the average value Pn of the noise power and outputs it to the determiner 262 as the reference value 702 of the noise power.

判定器２６２は平均パワー７００と雑音パワー７０２と
を比較することにより、有音か無音かを判定し、有音で
あれば有音検出出力５１５をプロセッサ２８に出力し、
また無音であれば無音検出出カフ０４を閾値更新器２８
４に出力する。The determiner 262 determines whether there is a sound or no sound by comparing the average power 700 and the noise power 702, and if there is a sound, outputs a sound detection output 515 to the processor 28,
If there is no sound, the soundless detection output cuff 04 is updated by the threshold updater 28.
Output to 4.

なお、閾値更新器２６４は、ダブルトーク検出信号５１
４およびセンタクリッピング検出信号５１２のいずれか
一方でも検出されている場合には閾値更新を行なわない
。このため閾値更新器２６４は、適応パワー閾値の更新
を極端に低減する雑音レベルまたは時間変動が大きい雑
音レベルに追従して行なうことはない。したがって有音
検出器２６は有音検出を高精度に行なうことができる。Note that the threshold value updater 264 uses the double talk detection signal 51
4 and center clipping detection signal 512, the threshold value is not updated. For this reason, the threshold value updater 264 does not update the adaptive power threshold value following a noise level that is extremely reduced or a noise level that has large temporal fluctuations. Therefore, the sound detector 26 can detect the presence of sound with high accuracy.

このように本実施例によるＯ５１システムでは、エコー
キャンセラをマルチプレクサ１１より装置側に配設した
ため、従来のように端末毎に配設する必要がなくなった
。これによりシステムが小型番経済化し、保守、運用を
スムーズに行なうことができる。また、有音検出部２６
の適応パワー閾値更新において、エコーキャンセル部１
２からのセンタクリッピング検出信号５１２とダブルト
ーク検出信号５１４を適応パワー閾値更新器２６４に出
力することにより閾値更新を停止させる制御信号とする
。In this way, in the O51 system according to this embodiment, the echo canceller is disposed closer to the device than the multiplexer 11, so it is no longer necessary to dispose it for each terminal as in the past. As a result, the system can be made smaller and more economical, and maintenance and operation can be carried out smoothly. In addition, the presence detection unit 26
In the adaptive power threshold update of echo canceling unit 1
The center clipping detection signal 512 and the double talk detection signal 514 from 2 are output to the adaptive power threshold updater 264 as control signals for stopping the threshold update.

このため、雑音レベルの算出を高精度で行なうことがで
き、雑音に対する誤動作の少ない有音検出装置を実現で
きる。Therefore, it is possible to calculate the noise level with high accuracy, and it is possible to realize a sound detection device that is less likely to malfunction due to noise.

（発明の効果）このように本発明によれば、有音検出装置の適応パワー
閾値更新においてエコー制御装置からのセンタクリッピ
ング検出信号とダブルトーク検出信号を！２３値更新を
一時的に停止させる制御信号とする。このため、雑音レ
ベルの算出を高精度で行なうことができ、雑音に対する
誤動作の少ない有音検出を実現できる。すなわち、　Ｄ
ＳＩシステムに本発明を適用することにより高精度な有
音検出が可能となり音声平均動作率を低減でき、その結
果効率の良いシステムが実現できる。(Effects of the Invention) As described above, according to the present invention, the center clipping detection signal and the double talk detection signal from the echo control device are used in updating the adaptive power threshold of the voice presence detection device! This is a control signal that temporarily stops updating the V.23 value. Therefore, the noise level can be calculated with high precision, and the presence of sound can be detected with fewer malfunctions due to noise. That is, D
By applying the present invention to an SI system, highly accurate sound detection becomes possible, the average voice activity rate can be reduced, and as a result, an efficient system can be realized.

【図面の簡単な説明】第°１図は本発明による有音検出装置の実施例をＤＳＩ
システムに適用したシステム構成図、第２図は、第１図
のＤＳＩシステムに適用した有音検出装置の機能ブロッ
ク図。第３図は従来のＤＳＩシステムの構成を示すシステム構
成図、　　　　゛第４図は、第３図に示した従来のｌｌ５Ｉシステムのエ
コーキャンセラの機能ブロック図、第５図は本実施例に
おけるダブルトーク検出領域を示す検出領域図、第６図はセンタクリッパの入出力特性を示す入出力特性
図である。一要部分の符号の説明２、、、ＤＳＩ送信装置１１、、、マルチプレクサ１２、、、ｎチャネル二二−キャンセル部２０、、、音
声遅延部２２、、、音声メモリ２４、、、出力インタフェース２Ｅ１．　、　、’ｎチャネル有音検出部２Ｂ、、、割
当プロセッサ１２０　、　、加算器１２２　　、　、センタクリッパ１２４　　、　、　Ｎ次係数ディジタルフィルタ１２Ｂ
　、　、係数算出器２８０　　、　、パワー算出器２８２　、　、有音判定器２８４　、　、適応パワー閾値更新器特許出願人　沖電気工業株式会社代　理　人　香取　孝雄丸山　隆夫Ｄ／Ａ　ディジタル・７ナロク変換従来のエコーキャンセラ第４図受信入力レベル　（ｄＢｍｏ）ダブルトーク検出の領域第５図センタクリツノ譬の入出力特性第６図[Brief Description of the Drawings] Figure 1 shows an embodiment of the sound detection device according to the present invention.
A system configuration diagram applied to the system, FIG. 2 is a functional block diagram of a sound detection device applied to the DSI system of FIG. 1. Figure 3 is a system configuration diagram showing the configuration of a conventional DSI system, Figure 4 is a functional block diagram of the echo canceller of the conventional ll5I system shown in Figure 3, and Figure 5 is a system configuration diagram showing the configuration of a conventional DSI system. FIG. 6 is a detection area diagram showing the detection area, and FIG. 6 is an input/output characteristic diagram showing the input/output characteristics of the center clipper. Explanation of the symbols of important parts 2: DSI transmitter 11, multiplexer 12, n-channel 22-canceller 20, audio delay unit 22, audio memory 24, output interface 2E1 ．． , ,'n-channel sound detection unit 2B, ,assignment processor 120, ,adder 122, ,center clipper 124, ,N-order coefficient digital filter 12B
, ,Coefficient Calculator 280 , ,Power Calculator 282 , ,Speech Determiner 284 , ,Adaptive Power Threshold Updater Patent Applicant Oki Electric Industry Co., Ltd. Agent Takao Katori Takao Maruyama D/A Digital 7 Narok Conversion Conventional Echo canceller Fig. 4 Receiving input level (dBmo) Area of double talk detection Fig. 5 Input/output characteristics of center critter Fig. 6

Claims

[Claims] 1. A sound detection device for detecting whether or not an input signal includes speech, the device comprising: calculation means for inputting the signal and calculating the average power of the signal; , threshold updating means that receives the average power of the signal from the calculation means and calculates and updates a noise power average level that is an average level of noise power using the average power, and receives the average power of the signal and the noise power average level. the input terminal for inputting a double talk detection signal and a center clipping detection signal; When either the signal or the center clipping detection signal is input, the threshold value updating means suspends calculation and updating of the noise power average level during the input, and updates the noise power average level to the double talk detection signal and the center clipping detection signal. A sound detection device characterized in that a clipping detection signal is maintained at a value before input. 2. In a sound detection device that detects whether or not an input signal contains speech, the device controls echo components contained in the input signal, and also controls double talk and center clipping of the input signal. echo control means for detecting; calculation means for inputting a signal whose echo component has been controlled from the echo control means and calculating the average power of the signal; and receiving the average power of the signal from the calculation means and calculating the average power based on the average power. threshold updating means for calculating and updating a noise power average level that is an average level of noise power; receiving the average power of the signal and the noise power average level and comparing the two to determine whether or not the signal includes speech; and determining means for determining whether the input signal is double talk or center clipping, and when the echo control means detects either double talk or center clipping in the input signal, it notifies the threshold updating means of the detection; A sound detection device characterized in that upon receiving a notification, the calculation update of the average noise power level is not performed while the notification is being received. 3. In a digital audio insertion system that has an echo control device that controls echoes generated on a line and inserts another audio signal into the silent section of the signal transmitted on the line, the system includes: a voice presence detection device that detects whether or not the echo control device includes a voice presence detection device by comparing it with an adaptive power threshold that is updated based on the noise power of the transmitted signal; is connected so that it can receive an echo-controlled signal and input a center clipping detection signal and a double talk detection signal as echo control signals, and the sound detection device receives the center clipping detection signal and double talk detection signal from the echo control device. A digital voice insertion system characterized in that the adaptive power threshold is not updated when any one of the detection signals is input.