JPH03254223A

JPH03254223A - Analog data transmission system

Info

Publication number: JPH03254223A
Application number: JP5133490A
Authority: JP
Inventors: Seiji Omori; 大森　晴史; Yoshiaki Tanaka; 良明田中; Masao Nakagawa; 正雄中川; Masafumi Hagiwara; 将文萩原
Original assignee: Eastman Kodak Japan Ltd
Current assignee: Eastman Kodak Japan Ltd
Priority date: 1990-03-02
Filing date: 1990-03-02
Publication date: 1991-11-13

Abstract

PURPOSE:To simplify the constitution of a transmission line and a transmission equipment and to allow a reception side to receive a data with high fidelity by compressing an analog data with a band limit at a transmission side or a transmission line in the case of transmitting the analog data. CONSTITUTION:A transmission analog data 100 is made to a narrow band by a band limiter 10 composed of a low pass filter at the transmission side. A waveform 110 with a high band component eliminated therefrom is sent from a transmitter 12 to a transmission line 14 as a signal whose information quantity is considerably missing while it fundamental component only remains and received by a receiver 16. Then the reception signal is fed to a synthesizer 20 as it is and decoded by a neural network 20 studied for each data processed in advance to decode the lost high frequency component. Thus, the synthesizer 20 synthesizes the received data 200 with a predicted value obtained from the neural network 26 and outputs the result as a synthesis data 210. Thus, while the load of the transmission line or the transmission equipment is relieved, the data transmission with fidelity is attained.

Description

【発明の詳細な説明】［産業上の利用分野コ本発明はアナログデータ伝送方式、特に伝送信号の帯域
制限を行うことによってデータ圧縮し、伝送路の負担を
軽減可能な改良されたアナログデータ伝送方式に関する
。[Detailed Description of the Invention] [Industrial Application Fields] The present invention relates to an analog data transmission system, particularly an improved analog data transmission system that compresses data by band-limiting the transmission signal and reduces the burden on the transmission path. Regarding the method.

［従来の技術］近年、音声あるいは画像デー′夕を伝送する必要性が著
しく増大し、例えば電話回線網、テレビ通信システムあ
るいは各種データベースを利用した通信システムとして
極めて多岐に亘る伝送手段が実用化されている。[Prior Art] In recent years, the need to transmit audio or image data has increased significantly, and a wide variety of transmission means have been put into practical use, such as communication systems using telephone lines, television communication systems, or various databases. ing.

このような伝送情報としてアナログデータを扱う場合、
その帯域が問題となる。When handling analog data as such transmission information,
The problem is the bandwidth.

伝送路の要求からすれば、アナログデータはできるだけ
狭帯域であることが望ましく、実際上アナログデータは
その保有する帯域が広いにも拘らず伝送路による帯域制
限で一部の特に高帯域成分が捨てられてしまうことが知
られている。Considering the requirements of the transmission path, it is desirable for analog data to have as narrow a band as possible, and in reality, even though analog data has a wide band, some particularly high band components are discarded due to the band limitation of the transmission path. It has been known that it can get lost.

また、アナログデータを保存するための各種のメディア
として例えば磁気テープなどが用いられるか、このよう
な記憶あるいは伝送メディアに対してもその小型軽量化
の面からアナログデータは常に帯域制限を受けることと
なる。In addition, various types of media such as magnetic tape are used to store analog data, and analog data is always subject to bandwidth limitations due to the need to reduce the size and weight of such storage or transmission media. Become.

一方、このような帯域制限はアナログデータ自体に対し
ては極めて深刻な特性劣化となり、通常の場合前記帯域
制限は高周波成分に対してデータが欠落する結果を招く
。そして、このような高域遮断されたアナログデータで
は、信号波形の鈍化が生じ、ひいては音がこもるあるい
は立上りのスピード感がなくなるなどの著しい音質劣化
を招くことが知られている。On the other hand, such band limitation causes extremely serious characteristic deterioration for analog data itself, and in normal cases, band limitation results in data loss for high frequency components. It is known that analog data with such high frequencies cut off causes a slowing of the signal waveform, which in turn leads to significant deterioration in sound quality, such as muffled sound or loss of sense of speed at rise.

例えば、一般的な音声を例にとると、これらのアナログ
音声データは通常３．４ｋＨｚまでの高帯域を有するが
、通常の伝送路によれば例えば帯域０．８ｋＨｚに制限
されてしまい、この結果音声の本来前する細い高域成分
が失われて大幅になまった波形に歪む。このような波形
劣化により、音源のピッチを表す基本的な周波数情報は
残るものの、音質的には声色が濁って不愉快にこもった
音になってしまう。また、前述した如く、音の立上りが
鈍くなるので、スピード感も損われる欠点がある。For example, taking general audio as an example, these analog audio data usually have a high band of up to 3.4 kHz, but using a normal transmission path, the band is limited to, for example, 0.8 kHz, and as a result, The thin high-frequency components that normally precede the sound are lost, resulting in a distorted waveform that becomes significantly dull. Due to such waveform deterioration, although basic frequency information representing the pitch of the sound source remains, the tone becomes muddy and unpleasantly muffled. Furthermore, as mentioned above, the rise of the sound becomes slow, which impairs the sense of speed.

更に、子音部の認識が部分的に困難になるだけでなく、
時には母音部においても第２ホルマントの欠落が起り、
誤認識の原因にもなっていた。Furthermore, not only does recognition of consonants become partially difficult;
Sometimes the second formant is missing even in the vowel part,
It also caused misunderstandings.

従って、アナログデータ処理において、伝送路あるいは
伝送機器の要求とデータ品質の要求とが常に矛盾し、こ
れらの調整が極めて困難であった。Therefore, in analog data processing, the requirements of the transmission line or transmission equipment and the requirements of data quality always conflict, and it has been extremely difficult to adjust them.

従来において、前記帯域制限されたアナログデータの復
元には等花器が用いられている。Conventionally, an analog device has been used to restore the band-limited analog data.

すなわち、等花器は帯域制限された例えば音声アナログ
データの明瞭度を上げるために、制限された周波数帯域
の増幅度を上げて、アナログ波形を復元し、これによっ
て等花器の強調増幅作用によりデータ受信側において原
データに近い復元データを得ることができる。In other words, in order to improve the clarity of band-limited audio analog data, for example, Tokagi increases the degree of amplification in the limited frequency band and restores the analog waveform, thereby enhancing data reception through the emphasized amplification effect of Tokagi. On the other hand, restored data close to the original data can be obtained.

従来における前記等化器による復元技術は例えばそれ以
外にも線形的な手法を用いた信号再生方式としていくつ
か実用化され、いずれも帯域制限を受けている高域成分
の僅かな残留データを利用するものである。The conventional restoration technique using an equalizer has been put into practical use as a signal regeneration method using a linear method, for example, and all of them utilize a small amount of residual data of high-frequency components that are subject to band limitations. It is something to do.

［発明が解決しようとする課題］しかしながら、前記従来における復元技術では必ずしも
十分な品質の受信データを得ることができないという問
題があった。[Problems to be Solved by the Invention] However, the conventional restoration techniques described above have a problem in that received data of sufficient quality cannot necessarily be obtained.

すなわち、等花器を用いた場合、伝送路特性に応じて定
められた等花器の逆特性によって波形が復元できるはず
であるが、この場合には、所望の高帯域において僅かで
もデータが伝送されていることが必要である。すなわち
、完全に高帯域成分のデータが失われている場合には、
従来における等花器はほとんどその機能を果すことがで
きずデータ復元は不可能となる。In other words, when using an isoflower, the waveform should be able to be restored using the inverse characteristics of the isoflower determined according to the transmission path characteristics, but in this case, even a small amount of data is transmitted in the desired high band. It is necessary to be present. In other words, if high band component data is completely lost,
Conventional flower vases are almost unable to perform their functions, making data recovery impossible.

また、従来における線形的な手法を用いた信号再生方式
においても、例えば音声処理の分野では完全な極零型線
形モデルが未だ発見されておらず、もともとデータ量の
少ない音声しか受信側に得られない状況では実際上デー
タ復元はほとんど不可能であった。Furthermore, even in conventional signal reproduction methods that use linear methods, for example, in the field of audio processing, a perfect pole-zero linear model has not yet been discovered, and originally only audio with a small amount of data can be obtained on the receiving side. In situations where there is no such thing, data recovery is virtually impossible.

従来において、音声などに限定して、受信側でのデータ
が完全に欠落した場合においても、スペクトル推定技術
によって失われた高域成分データを修復する技術が提案
されているが、このようなスペクトル推定でも、音声の
子音情報まで推定することは困難であり、更にアナログ
データの保有している位相情報が全く無視されてしまう
ので、音質の変化が生じてしまうという欠点があった。In the past, technology has been proposed that uses spectrum estimation technology to restore lost high-frequency component data even when data on the receiving side is completely lost, limited to audio data. Even with estimation, it is difficult to estimate the consonant information of the voice, and furthermore, the phase information possessed by the analog data is completely ignored, resulting in a change in sound quality.

本発明は上記従来の課題に鑑みなされたものであり、そ
の目的は、高帯域データが完全に欠落した場合において
も、受信側において高い忠実度でデータ復元を可能とし
た改良されたアナログデータ伝送方式を提供することに
ある。The present invention was made in view of the above-mentioned conventional problems, and its purpose is to provide an improved analog data transmission that enables data recovery with high fidelity on the receiving side even when high-band data is completely lost. The goal is to provide a method.

［課題を解決するための手段］上記目的を達成するために、本発明は、劣悪なアナログ
データが基本的な情報を比較的低帯域に保有しており、
高帯域成分は低帯域成分に比して固有差が少なく、この
結果、低帯域成分から比較的容易に学習できることに着
目したことを特徴とする。[Means for Solving the Problem] In order to achieve the above object, the present invention solves the problem that poor analog data retains basic information in a relatively low band,
It is characterized by focusing on the fact that high-band components have fewer inherent differences than low-band components, and as a result, learning can be done relatively easily from low-band components.

このために、本発明では、アナログデータを伝送する際
に伝送されるデータを積極的に帯域制限してしまい、高
帯域成分を自発的に欠落させてしまう。For this reason, in the present invention, when transmitting analog data, the transmitted data is actively band-limited, and high-band components are spontaneously dropped.

この結果、データ伝送路あるいは伝送機器は狭帯域で良
いために小型軽量となりあるいは伝送密度を著しく改善
できるという利点を生ずる。As a result, the data transmission path or the transmission equipment needs only a narrow band, so it has the advantage of being smaller and lighter, and the transmission density can be significantly improved.

そして、本発明においては、受信側においてニューラル
ネットワークを用いた復元処理を行い、これによって前
記欠落した高帯域成分を受信時に付加することを特徴と
する。The present invention is characterized in that a restoration process using a neural network is performed on the reception side, thereby adding the missing high-band components at the time of reception.

前記ニューラルネットワークは勿論予め処理するアナロ
グデータ毎に学習されており、これによって、受信した
低帯域領域のアナログデータからこれに対応した高帯域
成分をニューラルネットワークで作成して忠実度の高い
復元を行うことができる。The neural network is, of course, trained in advance for each analog data to be processed, so that the neural network creates high-band components corresponding to the received low-band analog data and performs high-fidelity restoration. be able to.

すなわち、本発明によれば、前述した如く、アナログデ
ータ中の高帯域成分が比較的固有差か少なく、低帯域成
分から比較的容易に推定可能であることを利用し、高帯
域成分が完全に欠落している状態でもニューラルネット
ワークの学習によって忠実度の高い復元作用を可能とし
た。That is, according to the present invention, as described above, the high band components in analog data have relatively small inherent differences and can be estimated relatively easily from the low band components, and the high band components are completely Even in the missing state, neural network learning enables high-fidelity restoration.

勿論、本発明の特徴である前述した高帯域成分の除去は
帯域制限器を用いても良く、あるいは伝送路自体の狭帯
域特性で実現しても良い。Of course, the above-mentioned removal of high band components, which is a feature of the present invention, may be achieved by using a band limiter or by using the narrow band characteristics of the transmission path itself.

［作用］従って、本発明によれば、音声あるいは画像などのアナ
ログデータは予め送信側あるいは伝送路において積極的
に帯域制限されて送られる。[Operation] Therefore, according to the present invention, analog data such as audio or images is actively band-limited beforehand on the transmitting side or on the transmission path before being sent.

そして、受信側に設けられているニューラルネットワー
クは予め処理対象となるアナログデータに対してパック
プロパゲーションなどを用いて学習され、ニューラルネ
ットワークの入出力特性が定められる。The neural network provided on the receiving side is trained in advance using pack propagation or the like on the analog data to be processed, and the input/output characteristics of the neural network are determined.

このよにして学習されたニューラルネットワークに帯域
制限されたアナログデータが供給されると、ニューラル
ネットワークからは低帯域情報から高帯域データを作り
出し、これによって原データに近い受信データを復元す
ることが可能となる。When a neural network trained in this way is supplied with band-limited analog data, the neural network generates high-band data from low-band information, which makes it possible to restore received data that is close to the original data. becomes.

前記ニューラルネットワークによるデータ復元は受信デ
ータそのものを復元データに変換しても良く、あるいは
失われた高域成分のみを時間差分から復元してこれに受
信データを合成することも可能である。Data restoration using the neural network may be performed by converting the received data itself into restored data, or by restoring only the lost high frequency component from the time difference and combining the received data with this.

［実施例］以下図面に基づいて本発明の好適な実施例を説明する。[Example] Preferred embodiments of the present invention will be described below based on the drawings.

第１図には本発明が適用された伝送システムの全体構成
が示されている。FIG. 1 shows the overall configuration of a transmission system to which the present invention is applied.

送信されるアナログデータ１００は例えば図において帯
域３．４　ｋＨｚを有する音声「女性の「あ」」として
示され、この送信アナログデータ１００は送信側におい
てローパスフィルタからなる帯域制限器１０により帯域
０．８ｋｆ（ｚに狭帯域化される。勿論、この帯域制限
周波数は伝送路あるいは伝送機器によって任意に選択可
能であり、後に詳述する如く、実験によれば帯域周波数
が変化しても、本発明における復元作用に大きな影響は
見られない。For example, the analog data 100 to be transmitted is shown in the figure as a voice "female's 'a'" having a band of 3.4 kHz, and this analog data 100 to be transmitted is reduced to a band of 0.5 kHz by a band limiter 10 consisting of a low-pass filter on the transmitting side. The band is narrowed to 8 kf (z). Of course, this band-limiting frequency can be arbitrarily selected depending on the transmission line or transmission equipment, and as will be detailed later, experiments have shown that even if the band frequency changes, the present invention No significant effect was seen on the restoring effect.

高帯域成分が除去された波形１１０は図示の如く基本ピ
ッチのみを残した大幅に情報量の欠落した信号として示
されている。As shown in the figure, the waveform 110 from which the high band components have been removed is shown as a signal with a large amount of information missing, leaving only the basic pitch.

このような狭帯域信号１１０は実施例において送信機１
２により伝送路１４に送り込まれ、受信側では受信機１
６にて受信される。Such a narrowband signal 110 is transmitted to the transmitter 1 in the embodiment.
2 to the transmission line 14, and on the receiving side, the receiver 1
Received at 6.

本発明において、このような伝送系に関しては、アナロ
グあるいはデジタル処理でもあるいは有線あるいは無線
を問わず任意に利用可能であり、本発明の特徴は送信側
あるいは伝送器においてアナログ信号の段階で狭帯域制
限を施すことによってアナログデータそのもののデータ
圧縮を可能とすることにある。In the present invention, such a transmission system can be arbitrarily used regardless of whether it is analog or digital processing, wired or wireless, and the feature of the present invention is that narrow band limitation is applied at the analog signal stage on the transmitting side or transmitter. The objective is to enable data compression of the analog data itself by applying .

前記受信機１６の出力は第１図において狭帯域化された
信号１１０とほぼ同一であると考える。Consider that the output of the receiver 16 is approximately the same as the narrowband signal 110 in FIG.

そして、この受信信号は実施例においてそのまま合成器
２０へ供給されると共に、失われた高域成分を復元する
ためにニューラルネットワークにて復元が行われる。In this embodiment, this received signal is supplied as it is to the synthesizer 20, and is restored by a neural network in order to restore the lost high-frequency components.

受信データ２００はまずサンプル回路２２へ供給され、
サンプリングクロックによってデータのサンプリングが
行われる。実施例において、サンプリングクロックは１
０ｋＨｚの周波数を有し、４０個で１フレームを形成す
る。すなわち、この４０個は、後述するニューラルネッ
トワークが４層パーセプトロン構成を有し、各層が４０
個のユニットを有しており、前記４０個のサンプリング
データがそれぞれ各入力層のニューラルユニットに割り
付けられる。The received data 200 is first supplied to the sample circuit 22,
Data is sampled by the sampling clock. In the example, the sampling clock is 1
It has a frequency of 0 kHz, and 40 pieces form one frame. That is, these 40 neural networks have a 4-layer perceptron configuration, and each layer has 40 perceptrons.
The 40 sampling data are allocated to the neural units of each input layer, respectively.

以上のようにしてサンプリングされたデータは次に差分
器２４へ供給され、実施例においては時間差分値が出力
される。The data sampled as described above is then supplied to the differentiator 24, and in the embodiment, a time difference value is output.

勿論、本発明においてサンプリングされた波形データそ
のものをニューラルネットワークに供給することも可能
であるが、実施例においては、高帯域を強調するために
前述した如く時間差分値がニューラルネットワークの人
力データとして用いられる。Of course, it is also possible to supply the sampled waveform data itself to the neural network in the present invention, but in the embodiment, the time difference value is used as manual data for the neural network as described above in order to emphasize high bands. It will be done.

そして、前記差分データはニューラルネットワーク２６
によって所定の変換処理を受ける。Then, the difference data is stored in the neural network 26.
undergoes a predetermined conversion process.

すなわち、二二−ラルネットワーク２６は自ら学習によ
って生成した変換規則に従って、受信データと送信デー
タとの誤差を予測し、その値を出力する。That is, the 22-ral network 26 predicts the error between the received data and the transmitted data according to the conversion rule generated by learning itself, and outputs the value.

実施例においては、この予測値は前述した合成器２０の
他方の入力に供給され、この結果合成器２０は前記受信
データ２００をニューラルネットワーク２６から得られ
る予測値と合成して図の符号２１０で示される合成デー
タとして出力する。In the illustrative embodiment, this predicted value is provided to the other input of the synthesizer 20 described above, so that the synthesizer 20 combines the received data 200 with the predicted value obtained from the neural network 26 and generates the predicted value at 210 in the figure. Output as the composite data shown.

以上のようにして、本発明によれば送信されるデータは
予めアナログ状態でデータ圧縮を受け、これをニューラ
ルネットワークを用いて受信側にて復元することを特徴
とし、伝送路あるいは伝送機器の負担を軽減しながらか
つ高忠実度のデータ伝送を可能とする利点がある。As described above, according to the present invention, data to be transmitted is compressed in advance in an analog state, and this is restored on the receiving side using a neural network, thereby burdening the transmission line or transmission equipment. This has the advantage of enabling high-fidelity data transmission while reducing data transmission.

前述したニューラルネットワーク２６は周知の如く学習
と実行の２段階のプロセスを有し、本発明によれば、処
理対象となるアナログデータによって復元に必要な変換
規則を学習し、次に前述した如く受信データを入力して
前記学習した変換規則に従った予測値の出力を実行段階
として行う。As is well known, the neural network 26 described above has a two-step process of learning and execution, and according to the present invention, it learns the conversion rules necessary for restoration based on the analog data to be processed, and then performs the reception as described above. As an execution step, data is input and a predicted value according to the learned conversion rule is output.

すなわち、学習段階においては、帯域制限を受けて劣化
した受信データから、帯域制限を受ける以前の本来の送
信データと受信データとの誤差を推定する変換規則を経
験的にニューラルネットワークに学習させる。That is, in the learning stage, the neural network is made to empirically learn a conversion rule for estimating the error between the original transmission data before the band limitation and the received data from the received data degraded due to the band limitation.

そして、実行段階では、未知の受信データ入力に対して
、本来の送信データとの推定誤差を出力し、実施例によ
れば前述した如く受信データにこの誤差予測値を加算す
ることによって合成波形を得ることとなる。Then, in the execution stage, the estimated error with respect to the original transmitted data is output for the unknown received data input, and according to the embodiment, the synthesized waveform is created by adding this predicted error value to the received data as described above. You will get it.

以下には前記ニューラルネットワーク２６の内容を詳細
に説明する。The contents of the neural network 26 will be explained in detail below.

第２図には一般的なニューロンのモデルが示され、通常
ユニットと呼ばれるニューロンの動作を情報処理の機能
素子としてモデル化した工学モデルを示す。FIG. 2 shows a general neuron model, and shows an engineering model in which the operation of a neuron, usually called a unit, is modeled as an information processing functional element.

ユニットは入力側からの信号を重み付けによる多数決論
理で処理し、１つの出力信号を発生する。The unit processes the signals from the input side using weighted majority logic and produces one output signal.

そして、この出力信号を次層の複数のユニットの入力信
号として用い、多層階層による学習及び実行処理を行う
。このモデルは、入力信号と重みとの積和演算と、その
積和演算結果に対する閾値処理の２つの処理をニューロ
ンの基本性質として捕えたものである。シグモイド関数
を用いると、ユニットの人出力特性が次のように定式化
される。Then, this output signal is used as an input signal for a plurality of units in the next layer to perform learning and execution processing using a multilayer hierarchy. This model captures two processes as the basic properties of a neuron: a product-sum operation of an input signal and a weight, and a threshold value process for the result of the product-sum operation. Using the sigmoid function, the human output characteristics of the unit can be formulated as follows.

ｏ　　、−ｆ−（ｎｅｔ　　−）　　　　　　　　　　
　（２）１）Ｊ　　　　　Ｊ　　　　　　　　　ｐＪｆ
、　　（ｘ）＝１／　　（１＋ｅｘｐ　　（−ｘ＋θ）
）（３）前述した多数のユニットにニューロン）を相互に結合す
ることによってニューラルネットワークが構成され、そ
の結合形態によってさまざまなバリエーションが生まれ
る。o, -f-(net-)
(2)1) J J pJf
, (x)=1/ (1+exp (-x+θ)
) (3) A neural network is constructed by interconnecting the aforementioned large number of units (neurons), and various variations are created depending on the form of the connection.

第３図には本発明に係るニューラルネットワークの原理
的な構造が示されており、入力層、２層の中間層及び出
力層の４層からなるネットワーク構造を有する。FIG. 3 shows the basic structure of the neural network according to the present invention, which has a network structure consisting of four layers: an input layer, two intermediate layers, and an output layer.

階層ネットワークにおける各ユニットは、人力層から中
間層、中間層から出力層という向きに結合しており、各
層内での結合及び出力層から人力層へと向う結合は存在
しない。Each unit in the hierarchical network is connected in the direction from the human-powered layer to the intermediate layer and from the intermediate layer to the output layer, and there are no connections within each layer or from the output layer to the human-powered layer.

人力層に人カバターンが与えられると、入力層の各ユニ
ットはその人力信号を中間層へ渡し、中間層の各ユニッ
トは前述した（１）式及び（２）式に基づいて動作し、
出力層の各ユニットに出力信号を供給する。When a human cover turn is given to the human power layer, each unit of the input layer passes the human power signal to the middle layer, and each unit of the middle layer operates based on the above-mentioned equations (1) and (2),
An output signal is supplied to each unit of the output layer.

そして、最終的には、ネットワークの出カバターンが出
力層の各ユニットから出力される。Finally, the output pattern of the network is output from each unit of the output layer.

従って、このネットワークは、入力を常に前向きに処理
することによって、ユニット間の結合の重みＷから決定
される一定の変換規則（データ処理法）を入カバターン
に適応し、出力層にそのパターンを出力する。Therefore, by always processing the input forward, this network applies a certain transformation rule (data processing method) determined from the weight W of connections between units to the input cover pattern, and outputs that pattern to the output layer. do.

前述した一般的なニューラルネットワークを動作させる
場合、前述した変換規則を与えなければならず、このた
めにネットワークの学習が必要である。このネットワー
クの学習とは、与えられる個々の入カバターンに対して
、ネットワークが実際に出力する出カバターンと望まし
い出カバターン（教師信号）とが一致するまたは十分に
似ているようなユニット間の結合の重みＷの集合を見つ
け出すことである。When operating the general neural network described above, the transformation rules described above must be provided, and the network needs to be trained for this purpose. This network learning is the process of determining connections between units such that, for each given input pattern, the output pattern actually output by the network and the desired output pattern (teacher signal) match or are sufficiently similar. The purpose is to find a set of weights W.

このような階層ネットワークの学習アルゴリズムとして
はルーメルハートらが提案したバックプロパゲーション
法が知られている。バックプロパゲーション法は、階層
ネットワークに実際の出カバターンと望ましい出カバタ
ーン（教師信号）との誤差をフィードバックし、これに
基づいて結合の重みＷとバイアス値θとで以下の（４）
式に基づいて適応的に自動調整する。As a learning algorithm for such a hierarchical network, the backpropagation method proposed by Rumelhart et al. is known. In the backpropagation method, the error between the actual output pattern and the desired output pattern (teacher signal) is fed back to the hierarchical network, and based on this, the connection weight W and bias value θ are determined as shown in (4) below.
Adaptively auto-adjust based on the formula.

Δ　　ｗｏ、寥ηδ　、ｉ　、　　　　　　　　　　　
　　　（４）１）Ｊｌ　　　　　　ｐＪ　　　ｐ１出力層のユニットに関するδ　、はｌ δ　、　−（ｔ　　、−ｏ　　、）　　ｆ　　　、　　
（ｎｅｐｊ　　　　　　ｐＪ　　　　Ｉ）ｊ　　　　　
　Ｊ中間層のユニットに関するδｐｔは δ−＝ｆ−，（ｎｅｔ　　、）ΣδｐｋＷｋｊｐＪ　　
　　　　Ｊ　　　　　　　　　ｐｊのように計算される
。Δ wo, ηδ , i ,
(4)1) Jl pJ p1 δ for the output layer unit is l δ , −(t , −o ,) f ,
(nepj pJ I)j
δpt for the unit of J middle layer is δ−=f−, (net,)ΣδpkWkjpJ
It is calculated as J pj.

以上の結果、ネットワークは入カバターンと出カバター
ンとの間の連合を学習、し、適応的なデータ処理を実現
できるようになる。As a result of the above, the network learns the association between input and output patterns, and can realize adaptive data processing.

本発明において、前述した第１図から明らかなように、
ニューラルネットワークには帯域制限を受けた受信デー
タの時間差分値が与えられ、その差分値を一定の長さの
フレーム（実施例において４０サンプル分）毎にニュー
ラルネットワーク２６の人力層の各ユニットに与え、出
力層より出された信号と、之の信号と帯域制限された受
信データの差との数値的隔りを再消化する方向でネット
ワークの結合の重みＷの学習が進められる。In the present invention, as is clear from FIG. 1 mentioned above,
The neural network is given the time difference value of the received data subject to the band limit, and the difference value is given to each unit of the human layer of the neural network 26 for every frame of a certain length (40 samples in the embodiment). , learning of the network connection weights W proceeds in a direction that re-digests the numerical gap between the signal output from the output layer and the difference between this signal and the band-limited received data.

従って、以上の学習を経た後、ニューラルネットワーク
２６は帯域制限され高帯域が除去されたデータを復元す
るために最適な結合の重みＷを有することとなる。Therefore, after the above learning, the neural network 26 will have the optimum combination weight W for restoring the band-limited data from which high bands have been removed.

ｔ　　、）　　（５）ｐＪ（６）第４図には本実施例におけるニューラルネットワーク２
６のデータ復元作用が示されており、帯域制限された受
信データは以下の条件下で学習及び復元動作が行われる
。t, ) (5) pJ (6) Figure 4 shows the neural network 2 in this embodiment.
6, the learning and restoring operations are performed on band-limited received data under the following conditions.

サンプリング周波数　　：　１０ｋＨｚフレーム長　　
　　　　：４ｍ５（４０個）フレームシフト　　　　：
２ｍ５（２０個）サンプルポイント数　　二６０００個
（母音）：２０００個（子音十母音）蓄積フレーム数　　　　：５０個（母音）＝２５個（子
音十母音）学習効率定数（η）　　　＋０．００２（母音）：０．
００４（子音十母音）安定化係数（α）：ｏ、ｃ＋構成　　　　、４層バーセプトロン。Sampling frequency: 10kHz frame length
:4m5 (40 pieces) Frame shift:
2m5 (20 pieces) Number of sample points 26000 pieces (vowels): 2000 pieces (consonants and 10 vowels) Number of accumulated frames: 50 pieces (vowels) = 25 pieces (consonants and 10 vowels) Learning efficiency constant (η) +0.002 (vowels) ):0.
004 (consonant ten vowels) Stabilization coefficient (α): o, c+ configuration, 4-layer berseptron.

（４０，４０，４０，４０）入力信号　　：　０１３ｋｌ（ｚ帯域信号の差分希望出
力信号：　０．１ｉｋＨｚ帯域信号と３．４ｋＨｚ帯域
信帯域差ここでサンプルポイント数とは、ニューラルネットワー
クの学習に用いたポイントの数を表す。(40, 40, 40, 40) Input signal: 013kl (difference between z-band signal Desired output signal: 0.1ikHz band signal and 3.4kHz band difference) Here, the number of sample points is the difference between the z-band signal and the 3.4kHz band signal. Represents the number of points earned.

ここでは母音のみの場合は初めの０．６秒間を、子音を
含む音節の場合には初めの０，２秒間を学習に用いてい
る。また、蓄積フレーム数とは、学習の際にユニットの
誤差をある一定のフレーム数分たけ蓄積した後、誤差伝
播し、ネットワークの重みの更新を行うので、そのフレ
ームの数を表したものである。ユニットの誤差を蓄積す
ることにより、ネットワークは個々のフレームのデータ
に固執せずに、全データについて広く一般化を行い、最
適解を求めることができる。この値が小さすぎると、ネ
ットワークは個々のデータに依存し易くなり、全データ
に対する最適解の探索に多くの時間を要するようになる
。逆にこの値が大きいと、波形の細かい特徴がとらえに
くくなる。Here, the first 0.6 seconds are used for learning in the case of only vowels, and the first 0.2 seconds in the case of syllables containing consonants. In addition, the number of accumulated frames refers to the number of frames since the unit error is accumulated for a certain number of frames during learning, and then the error is propagated and the network weights are updated. . By accumulating unit errors, the network can broadly generalize across all data to find the optimal solution, rather than fixating on individual frame data. If this value is too small, the network becomes dependent on individual data, and it takes a lot of time to search for the optimal solution for all data. Conversely, if this value is large, it becomes difficult to capture the fine features of the waveform.

表３・表４に主観評価結果（ＭＯＳ　：ＭＥＡＮＯＰＩ
ＮＩＯＮ　　５ＣＯＲＥ）を示す。Tables 3 and 4 show the subjective evaluation results (MOS: MEANOPI
NION 5CORE).

表３文章発声における主観評価結果（学習話者）表４　文章発声における主観評価結果（未学習話者）この評価では、３．４ｋＨｚ帯域信号を原音（４点に相
当）と考え、０点（非常に悪い音質）から４点（原音と
同じ）の５段階で１０人に音質の評価をしてもらい、そ
の平均の点数をつけである。各帯域の信号と合成音との
比較を行うと、帯域０．８ｋＨｚの信号を使って合成し
た信号は、学習話者、未学習話者ともに受信信号である
０、８ｋＨｚ帯域の信号を上回るのは勿論、２　、０ｋ
Ｈｚ帯域の信号よりも良いスコアが得られた。これによ
り、損失情報を予測して信号帯域を拡張した効果が示さ
れた。Table 3 Subjective evaluation results for sentence utterances (learning speakers) Table 4 Subjective evaluation results for sentence utterances (untrained speakers) In this evaluation, the 3.4kHz band signal is considered to be the original sound (equivalent to 4 points), and 0 points ( Ten people were asked to rate the sound quality on a five-point scale from 4 points (very poor sound quality) to 4 points (same as the original sound), and the average score was given. Comparing the signals in each band and the synthesized sound, the signal synthesized using the 0.8 kHz band signal exceeds the received signal in the 0 and 8 kHz bands for both trained and untrained speakers. Of course, 2,0k
A better score was obtained than the signal in the Hz band. This demonstrated the effectiveness of predicting loss information and expanding the signal band.

本実施例においては、０　、１１ｋＨｚ帯域信号を入力
として仮定しているが、伝送路の周波数特性の変動に対
する検討として、学習話者において帯域０．７ｋＨｚに
制限された信号をもとに合成した信号についても評価を
行った。その結果を表３の最下掴に併記する。前述した
０、８ｋＨｚ帯域信号をもとに合成した音に比べ、多少
の劣化はみられるものの、２．０ｋＨｚ帯域信号に迫る
スコアが得られた。従って、伝送路の周波数特性の多少
の変動にも、本発明で提案した方式が適応できることが
判明した。In this example, it is assumed that a signal in the 0 and 11 kHz band is input, but in order to consider fluctuations in the frequency characteristics of the transmission path, a signal synthesized based on a signal whose band is limited to 0.7 kHz in the learning speaker is used. Signals were also evaluated. The results are also listed at the bottom of Table 3. Although there was some deterioration compared to the sound synthesized based on the 0.8 kHz band signal mentioned above, a score close to that of the 2.0 kHz band signal was obtained. Therefore, it has been found that the method proposed by the present invention can be applied even to slight variations in the frequency characteristics of the transmission path.

［発明の効果］以上説明したように、本発明によれば、アナログデータ
を伝送する際に、送信側あるいは伝送路においてアナロ
グデータの圧縮を帯域制限にて行い、これによって伝送
路及び伝送機器の構成を極めて簡単にすることが可能と
むる。[Effects of the Invention] As explained above, according to the present invention, when transmitting analog data, the analog data is compressed by band limiting on the transmitting side or on the transmission path, thereby reducing the speed of the transmission path and transmission equipment. It is possible to make the configuration extremely simple.

また、受信側では、ニューラルネットワークによる予め
さだめられた変換規則に則ったデータ復元が行われ、こ
れによって極めて忠実度の高いデータ受信が可能となる
。Furthermore, on the receiving side, data is restored in accordance with preset conversion rules using a neural network, thereby making it possible to receive data with extremely high fidelity.

[Brief explanation of drawings]

１ｆｉ１図は本発明に係るアナログデータ伝送方式の好
適な実施例を示す概略構成図、第２図は本発明に用いられるニューロンの原理を示す説
明図、第３図は本発明に用いられるニューラルネットワークの
階層構造を示す説明図、第４図は本発明に係るニューラルネットワークの動作を
説明する図である。１０　・・・　帯域制限器１４伝送路６ニューラルネットワークFigure 1fi1 is a schematic configuration diagram showing a preferred embodiment of the analog data transmission system according to the present invention, Figure 2 is an explanatory diagram showing the principle of a neuron used in the present invention, and Figure 3 is a neural network used in the present invention. FIG. 4 is a diagram illustrating the operation of the neural network according to the present invention. 10...Band limiter 14 Transmission line 6 Neural network

Claims

[Claims]

An analog data transmission method characterized by transmitting analog data after removing high-band components, and restoring the high-band components on the receiving side using a neural network trained in advance according to data characteristics.