JPH03210685A

JPH03210685A - Neural network device

Info

Publication number: JPH03210685A
Application number: JP2005189A
Authority: JP
Inventors: Yoshio Izui; 良夫泉井
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1990-01-12
Filing date: 1990-01-12
Publication date: 1991-09-13

Abstract

PURPOSE:To speed up learning speed, and to improve probability to converge to an optimum solution in a wide view by constituting a device so as to minimize locally energy defined by output data and the square of calculated output. CONSTITUTION:The device is constituted so that input data and the output data are set beforehand in the neural element 11a of an input layer and the neural element 11c of an output layer respectively as learning data, and the operating state of the neural element 11a of the input layer is allotted actually equally in the same group, and the neural element 11b of an intermediate layer obtains the calculated output at the neural element 11c of the output layer while it total-calculates a signal through a coupled element 12 coupling with the neural element 11b of a preceding layer respectively, and minimizes locally the energy defined by the output data and the calculated output. Thus, the learning speed can be speeded up, and the probability to converge to the optimum solution in a wide view can be improved.

Description

【発明の詳細な説明】［産業上の利用分野］この発明は、生体の神経細胞とその間の結合を模擬して
記憶、推論１判断、予測、計画、制御。[Detailed Description of the Invention] [Industrial Application Field] This invention simulates biological nerve cells and the connections between them to improve memory, reasoning, judgment, prediction, planning, and control.

パターン認識、Ｗｉ通化などを行なう神経回路網装置に
関するものである。This invention relates to a neural network device that performs pattern recognition, Wi-Fi communication, etc.

［従来の技術］第４図は、例えば、「ニューラルコンピュータ」合原−
幸著、東京電機大字出版局、　１９８８年、第１１０頁
〜第１１４頁に掲載された従来のフィードフォワード型
の神経回路網装置の構成を示す説明図である０図におい
て、（ｌりは生体の神経細胞を模擬する素子（以下、神
経素子と呼ぶ）で、（ｌｌａ）は入力層を構成する神経
素子で例えば３つの入力層神経素子、　（ｉｉｂ）は中
間層を構成する神経素子で例えば２つの中間層神経素子
、　（Ｉｌｃ）は出力層を構成する神経素子で例えば１
つの出力層神経素子である。　（１２）は神経素子（１
１１間のシナプスを模擬する素子（以下、結合素子と呼
ぶ）で、その結合の強さを結合重みと呼ぶ。[Prior art] Figure 4 shows, for example, the "neural computer" Aihara-
In Figure 0, which is an explanatory diagram showing the configuration of a conventional feedforward type neural network device published in Sachi, Tokyo Denki Oaza Publishing Bureau, 1988, pages 110 to 114, An element (hereinafter referred to as a neural element) that simulates the neurons of Two intermediate layer neural elements (Ilc) are neural elements that constitute the output layer, for example 1
There are two output layer neural elements. (12) is the neural element (1
11 (hereinafter referred to as a coupling element), and the strength of the coupling is called a coupling weight.

この回路網において、神経素子（！鳳）は層状に結合さ
れており、グイナミクスとしては、入力層（ｌｌａ）か
ら入った入力信号は中間層（ｔｕｂ）を介して出力Ｊｆ
ｌ（ＩＩＧ＋に伝搬されていく、定量的には以下のよう
になる。学習データとして入力データと出力データの対
をあらかじめ与えるｅｄ＋’を出力層における第２番目
の学習データの第ｉ番目の出力データ、ｕｈ、を第り層
の５番目の神経素子（１１）の内部状態、■１．を第り
層のｊｌ）目の神経素子＋１１１の出力値、ＷｈＪ、を
第り層の第ｉ番目の神経素子１ｌｌ）と第ｈ＋１層にお
ける第一番目の神経素子（ｌｌ）との間の結合重みとす
る。学習データの入力データは、入力層（！　Ｉａｌで
の出力（ｉ　Ｖ　’　ｊと同じである。In this circuit network, the neural elements (!Otori) are connected in layers, and in terms of Guinamitics, the input signal that enters from the input layer (lla) is output via the intermediate layer (tub).
l(Propagated to IIG+. Quantitatively, it is as follows. ed+', which gives a pair of input data and output data in advance as learning data, is the i-th output of the second learning data in the output layer. The data, uh, is the internal state of the fifth neural element (11) in the second layer, ■1. is the output value of the jl)th neural element + 111 in the second layer, and WhJ is the i-th neural element in the second layer. Let it be the connection weight between the neural element 1ll) and the first neural element (ll) in the h+1th layer. The input data of the learning data is the same as the output (i V' j in the input layer (!Ial).

第４図に示す例において、各添字は第１表に示すような
構成としている。また、各変数の関係は式（１）３式（
２）のようになる、ここで、関数ｇは微分可能で非減少
な関数であれば良く、−例として式（３）を用いる。In the example shown in FIG. 4, each subscript has a structure as shown in Table 1. In addition, the relationship between each variable is expressed by Equation (1) and Equation 3 (
2), where the function g only needs to be a differentiable and non-decreasing function, using equation (3) as an example.

ｕＩＩＪ＝Σｗ”−’Ｊｔｖ”−’、　　　　−−−（
＋）Ｖ”Ｊ＝ｇ　（ｕ”ｉ）　　　　　　　　・・・（
２）ｇ　（ｘ）　＝　１／　（１＋ｅｘｐ　（−ｘｌ）
　・・（３１第１表さらに、結合重み（Ｗ）は式（４）に示す学習則で逐次
的に決定される。即ち、出力層（ｌ　ｌｃ）における学
習データと、神経回路網によって冥際に得られた演算出
力で定義される二乗誤差に関する最急降下法を用いて決
定される。神経回路網の層の数をＨとすると、二乗誤差
（以下でエネルギーということもある）は式（４）に様
になる。uIIJ=Σw"−'Jtv"−', ---(
+)V”J=g (u”i) ・・・(
2) g (x) = 1/ (1+exp (-xl)
...(31 Table 1) Furthermore, the connection weights (W) are sequentially determined by the learning rule shown in equation (4). That is, the learning data in the output layer (l lc) and the neural network are used to It is determined using the steepest descent method regarding the squared error defined by the calculation output obtained in ).

Ｅ＝　（１／２）Ｘ：Ｉ　（Ｖ’、−ｄｉ’）　”　　
−−＋４１又、結合重みの逐次変更はα、βを適当なパ
ラメータ、δ　ｌｈｌを第り層、ｉ番目の神経素子にお
ける誤差（式（５）９式（６））とし、モーメント法を
使用した場合には式（７）の学習方程式で実行できる０
式（５）１式（６）において、ｈ　＝　Ｈの時は出力層
における値を示している。なお、この例では１■＝３で
ある。Ｎ′１は第ｈ＠における神経素子の数である。E= (1/2)X:I (V', -di')"
--+41 Also, to sequentially change the connection weights, α and β are appropriate parameters, δ lhl is the error in the th layer and the i-th neural element (Equation (5), 9 Equation (6)), and the method of moments is used. In this case, 0 can be implemented using the learning equation of equation (7).
In equations (5) and (6), when h=H, the value in the output layer is shown. Note that in this example, 1■=3. N'1 is the number of neural elements in h@th.

δ、”’＝　（ｃｔ　、’−Ｖ’、）　Ｖ’、　（１−
Ｖ”、）ここで、ｉ　＝　ｌ、　Ｎ　ｔｌ′ｌｌ　　　
　　・・・（５）δ１”””Ｖ”１（１−Ｖ”＋）Ｘ５
４′ｈ”’Ｗ”１ここで、ｉ＝ｌ〜ＮＬｈ１．ｈ＝１■
〜ｌ・・（６）ｄ　”　ｗｈＪｔ　／ｄｔ、”　＋　（
１−α）　ａ　Ｗ　”Ｊ　ｌ　／　ａｔ＝−βａＥ／ａ
Ｗｌ″１　　　　　・・（７）例えば、式（７）の学習
方程式を現在のノイマン型計算機で実行しようとすると
、計算機はディジタルなので、方程式を差分化して逐次
実行する必要があるが、この処理を行なうフローチャー
トの一例を第５図に示す、ステップ（３０）でＬ＝１に
おける結合重みＷ″１と結合重みの更新量ΔＷｈＪ、の
初期値を一様乱数又は正規分布する乱数によって設定す
る。この時、ｈ＝１〜２でｈ＝ｔの時ｉ＝１〜３．ｊ＝
１〜２、ｈ＝２の時ｉ＝１〜２．Ｊ＝ｌの全てについて
設定するのであるが、第５図には簡単のため、ｈ、ｊ、
ｉと記述する。ステップ（３■）でし＝１として初期設
定する。ステップ（３２）では誤差信号の初期値として
出力層（１１ｃ）における誤差δ　ＩＩ＋を式（５）に
よって求める０次に逆伝播法を用いて、−層上の誤差信
号から下の層の誤差信号を式（６）に基いて求める（ス
テップ（３３））、この逆伝播を第６図に示す、ステッ
プ（３４）では、モーメント法により結合重みを更新す
る。このため、まず第り番の重みのし＋１における更新
ＩΔＷ”ＪＩ（七十１）を式（８）で求める。δ,”'= (ct,'-V',) V', (1-
V”,) where i = l, N tl′ll
...(5) δ1"""V"1 (1-V"+)X5
4'h"'W"1 where i=l~NLh1. h=1■
~l...(6)d ” whJt /dt,” + (
1-α) a W ”J l /at=-βaE/a
Wl″1...(7) For example, if you try to execute the learning equation in equation (7) on a current Neumann computer, since the computer is digital, it is necessary to differentiate the equation and execute it sequentially. An example of the flowchart is shown in FIG. 5. In step (30), initial values of the connection weight W''1 and the connection weight update amount ΔWhJ at L=1 are set by uniform random numbers or normally distributed random numbers. At this time, when h=1 to 2 and h=t, i=1 to 3. j=
1-2, when h=2, i=1-2. Settings are made for all of J=l, but for the sake of simplicity in Figure 5, h, j,
It is written as i. In step (3), initialize as shi = 1. In step (32), the error δ II+ in the output layer (11c) is determined by equation (5) as the initial value of the error signal using the zero-order backpropagation method, and the error signal on the − layer is converted to the error signal on the layer below. is obtained based on equation (6) (step (33)), and this back propagation is shown in FIG. 6. In step (34), the connection weights are updated by the method of moments. For this reason, first, the update IΔW''JI (71) at the weight number +1 is calculated using equation (8).

ΔＷ”ＪＩ　　（ｔ　＋　１　）　＝　（Ｊ　６　Ｊ”
”’Ｖ”＋０６ｗ”Ｊｔ　　（ｔ）　　・・（８）この
式におけるα、βはあらかじめ定めたパラメータである
０次に、ΔＷ　”ａ＋　　（シ＋　１　）を用いて新た
な重みＷ”ＪＩ　　（ｔ、　＋　１　）を式（９）によ
り求める。ΔW”JI (t + 1) = (J 6 J”
"'V"+06w"Jt (t) (8) α and β in this equation are predetermined parameters. Next, a new weight W"JI ( t, + 1 ) is determined by equation (9).

Ｗ”ＪＩ　　（ｔ−ｚ）＝Ｗ”ＪＩ　　（ｔ）＋Δｗ”
、ｔ　　（ｔ＋１）　　・・（９）ステップ（３５）で
は、前向きの伝播により式（１）式（２）を用いて各層
の内部状態ｕ　ｈ　Ｊ、出力値ＶｈＪを演算し、出力層
Ｈにおける神経回路網の集合体としての出力ＶＨ，を求
める。この出力ｖＨＪと７習データｄ＋’を用いて式（
４）を演算し、二乗誤差Ｅを求める。このＥが充分小さ
いとき、例えばあらかじめ許容誤差Ａを与えておき、Ｅ
とＡとの大小関係を判定しくステップ＋３６））　、　
ＥがＡ以下の時は学習は終了とし、ＥがＡよりも大きい
時はＬ＝ｔ＋１　（ステップ（３７））としてステップ
（３２）からの処理を繰り返す。W”JI (t-z)=W”JI (t)+Δw”
, t (t+1) (9) In step (35), the internal state u h J and output value VhJ of each layer are calculated using equations (1) and (2) by forward propagation, and the output value VhJ in the output layer H is calculated. The output VH as a collection of neural networks is determined. Using this output vHJ and the 7 learning data d+', the formula (
4) to find the squared error E. When this E is small enough, for example, by giving a tolerance A in advance, E
Step +36)) to determine the magnitude relationship between and A.
When E is less than or equal to A, learning is terminated, and when E is greater than A, L=t+1 (step (37)) and the process from step (32) is repeated.

［発明が解決しようとする課題］従来の神経回路網装置は以上のように構成されており、
集合体としての誤差Ｅが誤差の許容値Ａ以下になるのが
遅く、学習の速度が遅かった。さらに、誤差Ｅが許容値
Ａ以下にならず、局所最適解に収束してしまうという問
題点があった。[Problem to be solved by the invention] The conventional neural network device is configured as described above.
It was slow for the error E as a collection to fall below the error tolerance value A, and the speed of learning was slow. Furthermore, there is a problem in that the error E does not become less than the allowable value A and converges to a locally optimal solution.

この発明は上記のような問題点を解決するためになされ
たもので、学習速度を速くでき、さらに局所最適解に収
束するかわりに、大局的最適解に収束する確率を向上す
ることのできる神経回路網装置を得ることを目的として
いる。This invention was made to solve the above problems, and it is possible to increase the learning speed and improve the probability of converging to the global optimal solution instead of converging to the local optimal solution. The purpose is to obtain a network device.

［課題を解決するための手段］この発明は、入力層、中間層、及び出力層を構成する複
数の神経素子を所定数毎に群分けした神経素子群と、神
経素子の出力の各々に結合係数を乗じて次の層の神経素
子へ出力する結合素子とを備え、入力層の神経素子に入
力データ、及び出力層の神経素子に出力データを学習デ
ータとしてあらかじめ設定し、入力層の神経素子の動作
状態は同一群内では実質的に同一に割り当て、中間層の
神経素子は各々而の層の神経素子と結合する結合素子を
介しての信号を総和演算しつつ出力層の神経素子におい
て演算出力を得、上記出力データと演算出力の二乗で定
義されるエネルギーを局所的に最小にするように構成し
たものである。[Means for Solving the Problems] The present invention provides a neural element group in which a plurality of neural elements constituting an input layer, an intermediate layer, and an output layer are divided into groups of a predetermined number, and a neural element group that is connected to each of the outputs of the neural elements. It is equipped with a coupling element that multiplies the result by a coefficient and outputs it to the neural element of the next layer.Input data is set in advance to the neural element of the input layer, and output data is set to the neural element of the output layer as learning data, and the neural element of the input layer is The operating states of the neural elements in the output layer are assigned substantially the same within the same group, and the neural elements in the intermediate layer perform summation calculations on the signals via the coupling elements that connect with the neural elements in the respective layers, while performing calculations in the neural elements in the output layer. The configuration is such that the energy defined by the output data and the square of the calculation output is locally minimized.

［作用］この発明における神経回路網装置は、少なくとも入力層
の神経素子を冗長にし、これに共なって結合重みを冗長
にして結合することによって、等価的にニューロンの伝
達関数であるシグモイド関数の勾配を急にして学習を高
速にし、かつ、冗長にした結合重みが、大数の法則によ
って等価的に正規分布となり、大局的最適解に収束する
確率を向上する。[Operation] The neural network device according to the present invention makes at least the neural elements of the input layer redundant, and makes the connection weights redundant and connects them, thereby equivalently achieving a sigmoid function which is a neuron transfer function. The gradient is made steeper to speed up learning, and the redundant connection weights become equivalently normally distributed according to the law of large numbers, improving the probability of convergence to a globally optimal solution.

［実施例］第１図は、この発明の一実施例による神経回路網装置の
構成を示す説明図である１図において、＋ＩＩ）は生体
の神経細胞を模擬する素子（以下、神経素子と呼ぶ）で
、（ｌｌａ）は入力層を構成する神経素子で例えば６つ
の入力層神経素子、（ｌｌｂ）は中間層を構成する神経
素子で例えば４つの中間層神経素子、　（ｌｌｃ）は出
力層を構成する神経素子で例えば１つの出力層神経素子
である。　（１２）は神経素子＋Ｉ　Ｉ）間のシナプス
を模擬する素子（以下、結合素子と呼ぶ）で、その結合
の強さを結合重みと呼ぶ。[Embodiment] FIG. 1 is an explanatory diagram showing the configuration of a neural network device according to an embodiment of the present invention. In FIG. ), (lla) is the neural element that constitutes the input layer, for example, six input layer neural elements, (llb) is the neural element that constitutes the intermediate layer, for example, four intermediate layer neural elements, and (llc) is the output layer. The constituent neural elements are, for example, one output layer neural element. (12) is an element (hereinafter referred to as a coupling element) that simulates a synapse between neural elements +II), and the strength of the coupling is called a coupling weight.

この回路網において、入力層の神経素子（ｌ　ｌａ）に
は２重の冗長性を持たしており、神経素子Ｖ′（■　と
神経素子Ｖｌ、ｉｍｌは同一の神経素子群を構成する。In this circuit network, the input layer neural element (l la) has double redundancy, and the neural element V' (■) and the neural elements Vl and iml constitute the same neural element group.

神経素子Ｖｌ＋Ｉ＋　と神経素子ｙ１１！１神経素子ｖ
′（１）と神経素子Ｖｌｌｌ′も同様に同一の神経素子
群を構成する。この冗長化したニューロンの出力は全て
同一とし、即ち、入力層の神経素子（ｌｌａ）の動作状
態は同一群内では実質的に同一に割り当てている。この
実施例では中間層を構成する第２９　（ｌｌｂ）のニュ
ーロンにも第１層と同様に二重の冗長性を持たしている
。但し、第２層目（１ｌｂ）では、入力層（ｌｌａ）の
ように冗長化したニューロンの出力は同一とは限らない
、出力層（Ｉ　Ｉｃ）では冗長化は不要である。又、冗
長化した入力層（ｌ　ｌａ）に共゛なって、結合重みも
冗長化している０図において、入力層（１１ａ）から出
力層（ｌ　ｌｃ）への信号の流れは、神経素子（Ｉ　ｌ
）と結合重み（１２）が冗長になったほかは従来と同様
である。即ちｍを冗長化の数とすると、ｕｆｉ、１ｍｌ
　を第り層の５番目の神経素子の内部状態、Ｖｌｌ、１
１１１を第り層の５番目の神経素子の出力値、Ｗｌ＋、
、１ｍｌを第り層の第１番目の神経素子と第ｈ＋１層に
おける第ｊ番目の神経素子との間の結合重みとする。学
習デー夕の入力データは、入力層（ｌｌａ）での出力ｔ
ｔｉｖ’１１　と同じである。Neural element Vl+I+ and neural element y11!1 neural element v
'(1) and neural element Vllll' similarly constitute the same neural element group. The outputs of these redundant neurons are all the same, that is, the operating states of the neural elements (lla) in the input layer are assigned substantially the same within the same group. In this embodiment, the 29th (llb) neuron constituting the intermediate layer also has double redundancy like the first layer. However, in the second layer (1lb), the outputs of neurons made redundant like the input layer (lla) are not necessarily the same, and redundancy is not necessary in the output layer (I Ic). In addition, in Figure 0, where the connection weights are also redundant along with the redundant input layer (l la), the signal flow from the input layer (11a) to the output layer (l lc) is controlled by the neural element ( I l
) and connection weight (12) are now redundant, but are the same as before. That is, if m is the number of redundancies, ufi, 1ml
is the internal state of the fifth neural element in the second layer, Vll, 1
111 is the output value of the fifth neural element of the th layer, Wl+,
, 1ml is the connection weight between the first neural element in the th layer and the jth neural element in the h+1th layer. The input data of the learning data is the output t of the input layer (lla)
Same as tiv'11.

第１図に示す例では、第２表に示すような構成としてい
る。また、各変数の関係は式（１１）、式（１２）のよ
うになる、ここで、関数ｇは微分り能で非減少な関数で
あれば良く、−例として式（Ｉｓ）を用いる。In the example shown in FIG. 1, the configuration is as shown in Table 2. Moreover, the relationship between each variable is as shown in Equation (11) and Equation (12), where the function g may be a non-decreasing function with differentiability, and the equation (Is) is used as an example.

第２表ｕｈ、ｌ＠ｌ＝＝ΣＷ１１−１．、　ｌａｌ　Ｖ　ｈ−
ｔ、　１６１　　・・（ｌｌ）Ｖｌｌ　　ｌａｌ　　＝
：ｇ　　（ｕｈＪｌ”ｌ　　）　　　　　−・、　　・
＋＋２）ｇ　　（ｘ）　　＝１／　（１＋ｑｘｐ　　（
−ｘ）ｌ　　　−−（＋３１さらに、結合重み（Ｗ）は
式（１４）に示す学習則で逐次的に決定される。即ち、
出力層（ｌ　Ｉｃ）における学習データと、神経回路網
によって実際に得られた演算出力で定義される二乗誤差
に関する最急降下法を用いて決定される。神経回路網の
層の数をＨとすると、二乗誤差（以下でエネルギーとい
うこともある）は式（１４）に様になる。Table 2 uh, l@l==ΣW11-1. , lal V h-
t, 161...(ll) Vll lal =
:g (uhJl”l) −・、・
++2)g (x) =1/ (1+qxp (
−x)l −−(+31 Furthermore, the connection weight (W) is sequentially determined by the learning rule shown in equation (14). That is,
It is determined using the steepest descent method regarding the squared error defined by the learning data in the output layer (l Ic) and the calculation output actually obtained by the neural network. When the number of layers of the neural network is H, the squared error (hereinafter also referred to as energy) is expressed as in equation (14).

Ｅ＝（１／２１ΣΣΣ（ＶＨ，１６１−ｄ　、Ｉｌｌ＠
ｌ）Ｉ　・（１４）又、結合重みの逐次変更はａ、βを
適当なパラメータ、δ　ｌｈｌ　ｌａｌ１　を第り層、
ｉ番目の神経素子における誤差（式（１５）　、式（１
６１）とし、モーメント法を使用した場合には式（１７
）の学習方程式で実行できる８式（Ｉｓ）、式（１６）
において、ｈ＝Ｈの時は出力層における値を示している
。なお、この例ではＩ（＝　３である。E=(1/21ΣΣΣ(VH, 161-d, Ill@
l) I (14) Also, to sequentially change the connection weights, a and β are appropriate parameters, δ lhl lal1 is the second layer,
Error in the i-th neural element (Equation (15), Equation (1)
61), and when using the method of moments, the equation (17
) can be executed using the learning equation (Is), equation (16)
In, when h=H, the value in the output layer is shown. Note that in this example, I (= 3).

５　、１１１１　ｌａｌ、（ｄ　、１１１＋＋＋１−ｖ
ｌｌ、１ｍ１）ｙＨ，（１−ｙＨ１′ｍｌ）ここで、′
ｉ　＝　ｌ、　Ｎ　１１１１　ｌａｌ　　　　・・・（
１５）δ　ｌｌ＋ｌ　ｌａｌ　　＝　Ｖ　ｌ＋、　ｌｓ
ｌ　（１−Ｖ　ｈ、　１ｍ１）本Ｉ　Ｘ：　δ、　ｌｈ
ｌｌｌ　ｌａｌ　ｗｈＪ、　ｌ＋ｍｌここで、ｉ＝Ｉ〜
Ｎ　ｌｈｌ　ｌａｌ　、　ｈ＝　）（〜ｌ・（Ｉｃ）ｄ
　　”　　Ｗ”、１１”’／ｄｔ”　　÷（ｌ　−α）
　　ｄ　　Ｗ　”Ｊｒ　”’　／　ｄｔ＝−βｃｌＥ／
ａＷＩ″Ｊｔ”’　　　　　−−（１７）例えば、式（
Ｉ７）の学習方程式の演算処理を行なうフローチャート
の一例を第２図に示す、ステップ（４０）でｔ＝ｉにお
ける結合重みＷ”Ｊ−−’と結合重みの更新量ΔＷｈ　
Ｊ、　ｌ　ｍ　ｌの初期値を一様乱数又は正規分布する
乱数によって設定する。この時、ｈ＝１〜２でｂ＝１の
時ｉ＝１〜３．ｊ＝１〜２、ｈ＝２の時ｉ＝１〜２．ｊ
＝１の全てについて設定するのであるが、第２図には簡
単のため、ｈ。5, 1111 lal, (d, 111+++1-v
ll, 1m1)yH, (1-yH1'ml) where,'
i = l, N 1111 lal...(
15) δ ll+l lal = V l+, ls
l (1-V h, 1m1) Book I X: δ, lh
lll lal whJ, l+ml where i=I~
N lhl lal, h= )(~l・(Ic)d
"W", 11"'/dt" ÷ (l - α)
d W ``Jr ''' / dt=-βclE/
aWI″Jt”′ --(17) For example, the formula (
An example of a flowchart for calculating the learning equation in step (40) is shown in FIG.
The initial values of J, l ml are set using uniform random numbers or normally distributed random numbers. At this time, when h=1 to 2 and b=1, i=1 to 3. When j=1-2, h=2, i=1-2. j
= 1, but for simplicity, h is shown in Figure 2.

ｊ、ｉと記Ｍし、ｍについては入力層（ｌｌａ）　、中
間層（ｌｌｂ）共にｍ　＝　２、出力層（ｌｌｃｌでは
ｍ＝１とする。ステップ（４１）でｔ＝１として初期設
定する。ステップ（４２）では誤差信号の初期値として
出力Ｈ（Ｉｌｃ）　ニオケルＫＭ　５　ｔ　”’　””
ｋ　式（１５）　Ｇｌ：　Ｊ：って求める１次に逆伝播
法により、−層上の誤差信号から下の層の誤差信号を式
（１６）に基いて求める（ステップ＋４３））　、第３
図に逆伝播を示す。ステップ（４４）では、モーメント
法により結合重みを更新する。このため、まず第り番の
重みの１＋１における多値化したｍ番目の更新量△Ｗ”
、、１”″）（ｔ＋Ｂを式（１８）で求める。j, i are written as M, and for m, m = 2 for both the input layer (lla) and the intermediate layer (llb), and m = 1 for the output layer (llcl). Initialize as t = 1 in step (41). In step (42), the output H (Ilc) is set as the initial value of the error signal.
k Equation (15) Gl: J: Using the linear backpropagation method, the error signal of the lower layer is obtained from the error signal on the - layer based on Equation (16) (step +43)), 3rd
The figure shows backpropagation. In step (44), the connection weights are updated by the method of moments. For this reason, first, the m-th update amount △W'' is multivalued at 1+1 of the weight of the
,,1'''')(t+B is determined by equation (18).

ΔＷ　ｈＪｌ　”’　（シ＋’）　＝βδ　＋ｈ＊＋＋
　ｌａｌ　　Ｖ　ｈ、　１ｌｊｌ＋αΔＷ”ｊ、１”　
（ｔ）　　・・（１８）この式におけるα、βはあらか
じめ定めたパラメータである１次に、ΔＷ”、（”ｌ　
（ｔ　＋　ｔ　）を用いて新たな重みｗ”、、’″’（
し＋ｉ）を式（１９）により求める。ΔW hJl ”'(shi+') =βδ +h*++
lal V h, 1ljl+αΔW"j, 1"
(t) ... (18) α and β in this equation are predetermined parameters of the first order, ΔW”, (”l
(t + t) to create new weights w",,'"'(
+i) is calculated using equation (19).

Ｗ”Ｊ、ｌ”’　（ｔ、＋ｔ）＝ｗ”ｊＩ”’　（ｔ）
＋ΔＷ”、”’　（ｈ＋ｔ）　　・・（１９）ステップ
（４５）では、前向きの伝播により式（Ｉ　Ｉ）　。W"J, l"' (t, +t) = w"jI"' (t)
+ΔW","' (h+t)...(19) In step (45), equation (II) is obtained by forward propagation.

式（１２）を用いて各層の内部状態ｕ　ｈ　ｌ　＠　１
．出力値Ｖｌｌ、１１６１　を演算し５、出力層Ｈにお
ける神経回路網の集合体としての出力Ｖ１４Ｊ′１を求
める。この出力ｖ８，１ｍｌと学習データｄ＋””　を
用いて式（１４）を演算し、二乗誤差Ｅを求める。この
Ｅが充分小さいとき１例えばあらかじめ許容誤差Ａを与
えておき、ＥとＡとの大小関係を判定しくステップ（４
６］）　、　ｒＥがＡ以下の時学習は終了とし、ＥがＡ
よりも大きい時はＥ＝ｔ＋１　（ステップ＋４７）　）
としてステップ（４２）からの処理を繰り返す。Using equation (12), the internal state of each layer u h l @ 1
．． The output value Vll, 1161 is calculated 5, and the output V14J'1 as a collection of neural networks in the output layer H is determined. Equation (14) is calculated using this output v8, 1 ml and the learning data d+"" to obtain the squared error E. When this E is sufficiently small, step 1, for example, give a tolerance A in advance and judge the magnitude relationship between E and A.
6]) When rE is less than or equal to A, learning ends, and E becomes A.
If it is larger than E=t+1 (step+47))
The process from step (42) is repeated.

このように、上記実施例では少なくとも入力層の神経素
子（ｌｌａ）を冗長にし、これに共なって結合重みを冗
長にして結合することによって、等価的にニエーロンの
伝達関数であるシグモイド関数の勾配を急にして学習を
高速にし、かつ、冗長にした結合重みが、大数の法則に
よって等価的に正規分布となり、大局的最適解に収束す
る確率を向上する。In this way, in the above embodiment, at least the neural elements (lla) of the input layer are made redundant, and the connection weights are made redundant and connected, thereby effectively reducing the gradient of the sigmoid function, which is equivalently Nieron's transfer function. The connection weights, which are made steeper to speed up learning and made redundant, have an equivalent normal distribution according to the law of large numbers, improving the probability of converging to a global optimal solution.

天際にＸｏＲＯ問題として知られているものを実行した
。これは、学習データの入力データとしテＶ　’ｌ＝　
Ｏ、Ｖ　’ｘ＝　Ｏｔ７）時、出力データトシテＶ３、
；０、’ｖ’、＝ｏ、ｖ’、＝ｏの時、出力データトシ
テＶ　’　ｔ　＝　Ｏｌｖ　＋、　＝　０　、　Ｖ　’
−＝　１　（７）時、出力データとｔ、てｖ’、＝ｉ、
Ｖ’ｌ＝１．Ｖ’、二〇（７１１時、出力データとして
Ｖ’ｌ＝１．Ｖ’、＝１．Ｖ’□＝１の時、出力データ
としてｖ’、＝ｏを与えて学習させると、従来装置では
ＩＯ分程度かかっていたものが、この実施例による装置
では２分程度で学習できた。冗長性の度合いをＤとすれ
ば、データの性質にもよるが、従来装置の速さのｌ、／
Ｄ＋、’ｘ程度で学習できるようになった。さらに、中
間層の神経素子に冗長性を持たせた際に同一群内では同
一の動作状態にしているが、同じでなく、完全に乱数で
定めた場合はこれより速くなる。We ran what is known as the XoRO problem. This is the input data of the learning data and TeV'l=
O, V 'x = Ot7), the output data is V3,
;0, 'v', = o, v', = o, output data value V' t = Olv +, = 0, V'
-= 1 (7) When the output data and t, tev', = i,
V'l=1. V', 20 (at 711, when output data V'l = 1.V', = 1.V'□ = 1, when learning is given v', = o as output data, the conventional device What used to take about IO minutes could be learned in about 2 minutes with the device according to this example.If the degree of redundancy is D, then the speed of the conventional device is l, //, depending on the nature of the data.
I was able to learn at around D+ and 'x. Furthermore, when providing redundancy to the neural elements in the middle layer, the operating states within the same group are the same, but if they are not the same and are determined completely using random numbers, the processing speed will be faster.

し発明の効果］この発明は、入力層、中間層、及び出力層を構成する複
数の神経素子を所定数毎に群分けした神経素子群と、神
経素子の出力の各々に結合係数を乗じて次の層の神経素
子へ出力する結合素子とを備え、入力層の神経素子に入
力データ、及び出力層の神経素子に出力データを学習デ
ータとしてあらかじめ設定し、入力層の神経素子の動作
状態は同一群内では実質的に同一に割り当て、中間層の
神経素子は各々前の層の神経素子と結合する結合素子を
介しての信号を総和演算しつつ出力層の神経素子におい
て演算出力を得、」１記出力データと演算出力の二乗で
定義されるエネルギーを局所的に最小にするように構成
することにより、学習速度を速くでき、かつ、大局的最
適解に収束する確率が向上できる神経回路網装置を得る
ことができる効果がある。[Effects of the Invention] This invention provides neural element groups in which a plurality of neural elements constituting an input layer, an intermediate layer, and an output layer are divided into groups of a predetermined number, and each of the outputs of the neural elements is multiplied by a coupling coefficient. It is equipped with a coupling element that outputs to the neural element of the next layer, and input data to the neural element of the input layer and output data to the neural element of the output layer are set in advance as learning data, and the operating state of the neural element of the input layer is Substantially the same allocation is made within the same group, and each neural element in the intermediate layer performs a summation operation on signals via a coupling element that connects with the neural element in the previous layer, while obtaining a calculation output in the neural element in the output layer; ” A neural circuit that can increase the learning speed and improve the probability of converging to the global optimal solution by configuring it to locally minimize the energy defined by the output data and the square of the calculation output. There is an effect that a network device can be obtained.

[Brief explanation of drawings]

Ｉｔ図はこの発明の一実施例による神経回路網装置の構
成を示す説明図、第２図はこの一実施例に係る学習方程
式を演算するフローチャート、第３図はこの一実施例に
係る誤差の逆伝播を示す説明図、第４図は従来の神経回
路網装置の構成を示す説明図、第５図は従来の装置に係
る学習方程式を演算するフローチャート、第６図は従来
の装置に係る誤差の逆伝播を示す説明図である。（Ｉｌｌ・・・神経素子、（１２）・・・結合素子。なお、図中、同一符号は同一、又は相当部分を示す。The It diagram is an explanatory diagram showing the configuration of a neural network device according to an embodiment of the present invention, FIG. 2 is a flowchart for calculating a learning equation according to this embodiment, and FIG. An explanatory diagram showing backpropagation, Fig. 4 is an explanatory diagram showing the configuration of a conventional neural network device, Fig. 5 is a flowchart for calculating the learning equation related to the conventional device, and Fig. 6 is an error related to the conventional device. It is an explanatory diagram showing back propagation of. (Ill...neural element, (12)...coupling element. In the figures, the same reference numerals indicate the same or equivalent parts.

Claims

[Claims]

A neural element group in which a plurality of neural elements constituting the input layer, intermediate layer, and output layer are divided into groups of a predetermined number, and the output of each of the neural elements is multiplied by a coupling coefficient and output to the neural element of the next layer. The input data to the neural elements in the input layer and the output data to the neural elements in the output layer are set in advance as learning data, and the operating states of the neural elements in the input layer are substantially the same within the same group. The neural elements in the intermediate layer each perform summation calculations on the signals via the coupling elements that connect with the neural elements in the previous layer, and obtain the calculation output in the neural elements in the output layer, and a neural network device configured to locally minimize energy defined by the square of the calculation output.