JPH03186963A - Connection method for computer - Google Patents
Connection method for computerInfo
- Publication number
- JPH03186963A JPH03186963A JP1325559A JP32555989A JPH03186963A JP H03186963 A JPH03186963 A JP H03186963A JP 1325559 A JP1325559 A JP 1325559A JP 32555989 A JP32555989 A JP 32555989A JP H03186963 A JPH03186963 A JP H03186963A
- Authority
- JP
- Japan
- Prior art keywords
- links
- pes
- hypercube
- distance
- aggregate
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Multi Processors (AREA)
Abstract
Description
【発明の詳細な説明】
〔産業上の利用分野〕
本発明は、複数の処理要素(以下、PEと呼ぶ)をリン
クにより相互に結合して成るいわゆるメツセージパッシ
ング型の並列計算機結合網に係り、特にPE間の結合数
(リンクの数)を少なく保つま
た上で高性能化するのに好適な計算機の結合方法に関す
る。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a so-called message passing type parallel computer connection network in which a plurality of processing elements (hereinafter referred to as PEs) are interconnected by links. In particular, the present invention relates to a computer connection method suitable for keeping the number of connections (number of links) between PEs small and increasing performance.
従来、PE間を一対一緒合であるリンクにより相互に結
合する形態の並列計算機結合網には、PEを2進本状に
結合したトリー結合、2次元格子状に結合した2次元ア
レイ結合、PEの番号を2進表示したときそのピントパ
ターンが1ビツトだけ異なるPE同志を全て結合するい
わゆるハイパキューブ結合等がある。Conventionally, parallel computer connection networks in which PEs are interconnected by pairs of links include tree connections in which PEs are connected in a binary line, two-dimensional array connections in which PEs are connected in a two-dimensional grid, and PEs in a two-dimensional array. There is a so-called hypercube combination in which all PEs whose focus patterns differ by one bit when expressed in binary are combined.
このような結合網においては、直接リンクにより結合さ
れていないPEに対する通信はいくつかのPEを中継し
て行われるため、この中継数が少ない事が要求される。In such a connected network, communication with PEs that are not connected by direct links is performed by relaying through several PEs, so it is required that the number of relays be small.
又、更に結合が対称形であり、通信が特定のリンクに集
中しないことが要求される。こういった要求から、最近
はハイパキューブ結合が注目され、いくつかのマシンが
実用化されてきている。第4図にPE数が8のときのハ
イパキューブ結合を示す。It is also required that the coupling be symmetrical and that communications not be concentrated on any particular link. Due to these demands, hypercube coupling has recently attracted attention, and several machines have been put into practical use. FIG. 4 shows a hypercube connection when the number of PEs is 8.
ハイパキューブ結合では、PE数をNとしたとき、各P
Eはlog2N個のPEと直接リンクで結合され、又、
Pu間通信における中継数の最大値である最大I) E
開路HD (直径とも呼ばれる)も]、og2Nとなる
。このように、ハイパキューブ結合は、比較的少ないリ
ンク数(PE当りlog2N本)で最大PE間距離りを
小さくできる特徴を有する。In hypercube join, when the number of PEs is N, each P
E is connected to log2N PEs by direct links, and
Maximum I) E which is the maximum number of relays in communication between Pus
The open circuit HD (also called diameter) is og2N. In this way, hypercube coupling has the feature that the maximum distance between PEs can be reduced with a relatively small number of links (log2N links per PE).
しかし、PE数Nが例えば103〜104あるいはそれ
以上といった高並列計算機では、PE当りのリンク数し
、最大PE間距離り共に小さいとは言えなくなる。すな
わち、リンク数の増加により実装上のPE間の接続処理
が難しくなり、また最大PE間距離の増大により通信遅
延がシステム全体の性能を制限することになるという問
題がある。However, in a highly parallel computer where the number N of PEs is, for example, 103 to 104 or more, both the number of links per PE and the maximum distance between PEs cannot be said to be small. That is, there are problems in that the increase in the number of links makes it difficult to process connections between PEs on implementation, and the increase in the maximum distance between PEs causes communication delays that limit the performance of the entire system.
尚、これに対し、 K、Hwang等はHyperne
tと呼ばれる結合構成を提案しく K 、Hwang、
et al、 : “Hypernet : A
Communjcation−Effjcient A
rchj、tecture For Construc
tingMassively Parallel Co
mputers”、 I E E E Trans、
Comput。In contrast, K, Hwang, etc.
Let us propose a combination configuration called tK, Hwang,
et al.: “Hypernet: A
Communication-Effjcient A
rchj, texture for construct
tingMassively Parallel Co
mputers”, I E E E Trans,
Compute.
Vol、C−36,N[112,PP、1450−14
663
(上987))、また、F、 P 、 Prepara
ta等はCCCと呼ばれる結合構成を提案しているが(
F。Vol, C-36, N [112, PP, 1450-14
663 (upper 987)), also F, P, Prepara
ta et al. have proposed a coupling configuration called CCC (
F.
P 、 Praparata+ at al−、: “
’I’he Cube−Connected Cycl
es : A Versatile Network
for Parallel Computations
、 Commum、 ACM、 pp。P, Praparata+ at al-,: “
'I'he Cube-Connected Cycle
es: A Versatile Network
for Parallel Computations
, Commun, ACM, pp.
300−309 (1981))、これらは共にリンク
数を削減しているものの最大PE間距離については削減
できていない。300-309 (1981)), both of which reduce the number of links but cannot reduce the maximum distance between PEs.
このように従来技術では、一般にリンク数、最大PE間
距離を同時に小さくすることは困難であり、これらが比
較的小さいハイパキューブ結合でもPE数が大きい場合
にはこれらが充分小さいとは言えず、システム実現上あ
るいはシステム性能」二問題があった。In this way, with conventional technology, it is generally difficult to simultaneously reduce the number of links and the maximum distance between PEs, and even if these are relatively small hypercube connections, if the number of PEs is large, it cannot be said that these are sufficiently small. There were two problems: system implementation and system performance.
本発明の目的は、より少ないリンク数でより小さな最大
PE間距離を達成できる並列計算機のPE間結合方法を
提供することにある。An object of the present invention is to provide a method for connecting PEs in a parallel computer that can achieve a smaller maximum distance between PEs with a smaller number of links.
上記l」的を達成するために、本発明では、複数のPE
をリンクによりハイパキューブ状に結合す−
ると共に、その最大PE間距離を有するPE間同志にリ
ンクを付加したPE群を−っのPE集合体として、該P
E集合体を複数配置し、各PE集合体間を相互に完全結
合して並列計算機を構成する。In order to achieve the above objective, the present invention uses a plurality of PEs.
A group of PEs in which the PEs are connected in a hypercube shape by links and links are added to the PEs having the maximum inter-PE distance is defined as a PE aggregate.
A parallel computer is constructed by arranging a plurality of E aggregates and completely coupling the PE aggregates to each other.
このような並列計算機結合網では、最大PE間距111
1Dは、ベースとなるハイパキューブ結合でのPE数を
Mとすると、全PE数Nには依存せず、D = 1 +
log、M (log2Mが奇数のときは、D=2+
log2M )となる。又、PE当りに必要なリンク数
りは、L = 2 +log2Mとなる。すなわち、従
来の単なるハイパキューブ結合の最大PE間距離および
リンク数に比べ大幅に小さな値にすることができる。In such a parallel computer connection network, the maximum distance between PEs is 111
1D does not depend on the total number of PEs N, and D = 1 +
log, M (when log2M is an odd number, D=2+
log2M). Also, the number of links required per PE is L = 2 + log2M. In other words, the value can be made much smaller than the maximum inter-PE distance and the number of links of a conventional simple hypercube connection.
以下、本発明の一実施例について図面により説明する。 An embodiment of the present invention will be described below with reference to the drawings.
第1図は本発明による並列計算機結合網の一実施例の概
略構成を示したものである。第1図中、1はPE集合体
、2はPE集合体間結合用リンク−
である。実施例では、一つのPE集合体は8個のPEで
構成されるとする。第1図は、このようなPE集合体1
を9個配置し、相互にPE集合体間結合用リンク2によ
り完全結合して並列計算機結合網を構成したものである
。従って、該並列計算機結合網は合計72個のPEから
なる。FIG. 1 shows a schematic configuration of an embodiment of a parallel computer coupling network according to the present invention. In FIG. 1, 1 is a PE aggregate, and 2 is a link for coupling between PE aggregates. In the embodiment, it is assumed that one PE aggregate is composed of eight PEs. FIG. 1 shows such a PE aggregate 1
A parallel computer connection network is constructed by arranging nine PE aggregates and completely connecting them with each other through PE aggregate connection links 2. Therefore, the parallel computer network consists of a total of 72 PEs.
第2図は第1図中の一つのPE集合体1の詳細構成図で
、8個のPEIIをリンクエ2によりハイパキューブ状
に結合し、更に、最大PE間isを有するPE間同志に
、バイパス用すンク王3を付加した構成となっている。FIG. 2 is a detailed configuration diagram of one PE aggregate 1 in FIG. It has a configuration with the addition of the use of the King King 3.
第2図において、(OOO)〜(111)は8個のPE
の番号を2進表示で示したもので、バイパス用リンク1
3は(000)と(111)、(001)と(110)
、(010)と(101)、(olz)と(100)の
PE間に設けられる。すなわち、あるPEの番号を2進
表示したとき(ao+ alr a2+・・・an−1
)とすると、このPEは(a 01 a 11 a 2
1 ”’ a n−0)の番号のPEにバイパス用リン
クで結合される。In Figure 2, (OOO) to (111) are 8 PEs.
Bypass link number 1 is shown in binary format.
3 is (000) and (111), (001) and (110)
, (010) and (101), and (olz) and (100). In other words, when the number of a certain PE is expressed in binary (ao+alr a2+...an-1
), this PE is (a 01 a 11 a 2
It is coupled to the PE numbered 1"'a n-0) by a bypass link.
第3図は一つのPEの概略構成であり、CPU6−
1 ]、 1、メモリ112、及び複数の入出力(Il
o)ポートエ1−3よりなる。本例の場合、入出力ボー
ト113は5つ必要となる。FIG. 3 shows a schematic configuration of one PE, including a CPU 6-1], a memory 112, and a plurality of input/outputs (Il
o) Consists of ports 1-3. In this example, five input/output ports 113 are required.
第2図に示すように、ベースとなるPE集合体のハイパ
キューブにバイパス用リンクを付加すると、各PEのリ
ンクの数は1本増加するが、そのハイパキューブの最大
PE間距離は1/2に削減され、一般にはハイパキュー
ブ内のPE数をMとすると(]、og2M ) / 2
となる(]、og2Mが奇数のときは、 (1+log
2M) / 2となる)。なぜなら、通信元PEと通信
先PEとのPE間距離(それらを2進表示したとき異な
るビットの数に等しい)が(LogzM ) / 2以
下であれば、ハイパキューブのリンクを使って順に中継
していけば良く、それ以」二のときにはバイパス用リン
クを用いて一担最遠のPEを中継し、それから順次ハイ
パキューブ用のリンクを用いて中継すればよいからであ
る。As shown in Figure 2, when a bypass link is added to the hypercube of the base PE aggregate, the number of links for each PE increases by one, but the maximum distance between PEs of that hypercube is halved. Generally, if the number of PEs in a hypercube is M, then (], og2M ) / 2
becomes (], when og2M is an odd number, (1+log
2M) / 2). This is because if the distance between the communication source PE and communication destination PE (equal to the number of different bits when they are expressed in binary) is less than or equal to (LogzM)/2, the communication is relayed sequentially using hypercube links. In other cases, the bypass link can be used to relay to the farthest PE, and then the hypercube links can be used for relaying in sequence.
例えば、16個のPEからなるハイパキューブで(00
00)のP Eから(01−1,1)のPEに通信する
場合には、(OOOO)のPEつバイパスリング→(1
工11)のPE−>ハイパキューブのリンク−>(01
41)のPE、というルートで通信でき、最大PE間距
離は(1,ogz工6)/2=2となる。第2図の場合
には、M、=8でLogzMが奇数なので、PE集合体
内の最大PE間距離は2である。すなわち、本発明では
、PE集合体内では各PEはいずれのPEとも距Ml
(1,ogzM) / 2(または(1+log2M)
/ 2)で通信でき、ベースとしたハイパキューブの
最大■〕E間距離を1/2に削減している。For example, in a hypercube consisting of 16 PEs (00
When communicating from the PE of (00) to the PE of (01-1,1), one PE of (OOOO) bypass ring → (1
Engineering 11) PE->Hypercube link->(01
41) PE, and the maximum distance between PEs is (1, ogz6)/2=2. In the case of FIG. 2, since M=8 and LogzM is an odd number, the maximum distance between PEs in the PE aggregate is 2. That is, in the present invention, each PE has a distance Ml from any PE within the PE aggregate.
(1,ogzM) / 2 (or (1+log2M)
/ 2) can communicate, reducing the maximum distance between E to 1/2 of the hypercube based on it.
以上のように構成したPE集合体を複数個配置し、第1
図に示すように、各々のPE集合体を一つのPEとみな
したとき、それらが完全結合を構成するように結合する
。この完全結合を実現するため、第2図に示すように、
PE集合体内の各PEt1にはP E集合体間結合用リ
ンク2を設けておく。A plurality of PE aggregates configured as described above are arranged, and the first
As shown in the figure, when each PE aggregate is regarded as one PE, they are combined to form a complete combination. In order to achieve this complete connection, as shown in Figure 2,
Each PEt1 in the PE aggregate is provided with a link 2 for coupling between PE aggregates.
このように構成することにより、あるPE集合体から他
の全てのPE集合体へ距離上で到達できる。従って、I
? E間の通信は最悪でも、通信元PEが属するPE集
合体内での中継、PE集合体間の中継、通信先PEが属
するPE集合体内での中継の組合せで実現される。すな
わち、本発明による結合網の最大PE間開路lIDは、
PE集合体内の各PE間の距離が最大(log、 M
) / 2であるから、D= (LogzM)/2+1
+ (LogzM)/2=1(logzM (Logz
Mが奇数のときは2 十]、og、 M )となる。例
えば、第工図および第2図の実施例ではD=5となる。With this configuration, it is possible to reach all other PE aggregates over a distance from a certain PE aggregate. Therefore, I
? At worst, communication between E is realized by a combination of relaying within the PE aggregate to which the communication source PE belongs, relaying between PE aggregates, and relaying within the PE aggregate to which the communication destination PE belongs. That is, the maximum PE-to-PE open circuit lID of the coupling network according to the present invention is:
The distance between each PE in a PE aggregate is maximum (log, M
) / 2, so D= (LogzM)/2+1
+ (LogzM)/2=1(logzM (Logz
When M is an odd number, it becomes 2 0], og, M). For example, in the embodiment shown in FIG. 2 and FIG. 2, D=5.
又、PE当りのリンク数りはベースとなるハイパキュー
ブにおけるPEのリンク数、バイパス用のリンク数、P
E集合体間結合用のリンク数の合計であり、L = l
og7M十王+↓=2+log2Mとなる。例えば、実
施例の場合にはL=5となる。すなわち、第1図および
第2図の実施例では、72個をPEt−PE当り5本の
リンクにより最大PE間距離5で結合している。これは
64個のPEからなるハイパキューブの場合のリンク数
6、最大PE間開路6と比べ小さな値である(実施例で
PE集合体の数を8個にすれば64個のシステムを9
構成でき、この時のDとLはやはり5である)。Also, the number of links per PE is the number of PE links in the base hypercube, the number of bypass links, and P
E is the total number of links for connection between aggregates, L = l
og7M Juoh+↓=2+log2M. For example, in the case of the embodiment, L=5. That is, in the embodiment shown in FIGS. 1 and 2, 72 PEt-PEs are connected by five links per PE with a maximum inter-PE distance of 5. This is a small value compared to the number of links of 6 and the maximum number of open circuits between PEs of 6 in the case of a hypercube consisting of 64 PEs. (D and L in this case are still 5).
尚、以上の実施例では、P E集合体の数を、ベースと
なるハイパキューブのPE集合体間結合用リンクが全て
使われるように設定したが(PE集合体の数=ベースと
なるハイパキューブのIJ E数+1)、一般にPE集
合体の数はこれ以下でも以上(尚、以上とするときには
各PE当りのPE集合体結合用リンクは一本でなく複数
にする必要がある)でも良いことは言うまでもない、但
し、その場合には全PE間の結合が完全な対称形とはな
らない事になる。In the above embodiment, the number of PE aggregates was set so that all links for connecting PE aggregates of the base hypercube were used (number of PE aggregates = base hypercube). IJE number + 1), and generally the number of PE aggregates can be less than or greater than this (in addition, if it is more than this, it is necessary to have multiple links for connecting PE aggregates for each PE instead of one). Needless to say, however, in that case, the connections between all PEs will not be completely symmetrical.
以上説明したように、本発明によれば、複数のPEから
なるハイパキューブ結合にバイパス用リンクを付加して
1つのPE集合体を構成し、このPE集合体を複数個設
け、その間を完全結合して並列PE結合網を構成するた
め、PE当りのリンク数を少なく保ったうえで最大PE
間距離を小さくできる。この効果は、PE数が多いシス
テム程顕著である。例えば、4096個のPEからなる
0
並列計算機は、ハイパキューブ結合の場合にはPE当り
のリンク数、最大PE間距離はともに12であるが、本
発明では1つのPE数を64、PE集合体の数を64と
すると、PE当りのリンク数は8、最大PE間距離は7
となる。As explained above, according to the present invention, a bypass link is added to a hypercube connection consisting of a plurality of PEs to form one PE aggregate, a plurality of PE aggregates are provided, and a complete connection is established between them. In order to configure a parallel PE connection network, the number of links per PE is kept small and the maximum
The distance between them can be reduced. This effect is more pronounced in systems with a larger number of PEs. For example, in a zero-parallel computer consisting of 4096 PEs, in the case of hypercube connection, the number of links per PE and the maximum distance between PEs are both 12, but in the present invention, the number of PEs is 64, and the maximum distance between PEs is 12. If the number of links is 64, the number of links per PE is 8, and the maximum distance between PEs is 7.
becomes.
すなわち1本発明によれば、ハイパキューブ結合より少
ないリンク数で最大PE間距離をそれ以下にできる利点
があり、ひいては高並列計算機の実現性を高め、かつそ
の通信遅延を小さくできることになる。That is, according to the present invention, there is an advantage that the maximum distance between PEs can be reduced to less than that with a smaller number of links than in a hypercube connection, and as a result, it is possible to improve the feasibility of a highly parallel computer and reduce the communication delay thereof.
第1図は本発明による並列計算機結合網の一実施例の概
略構成図、第2図は第1図における1つのPE集合体の
詳細構成図、第3図は1つのPEの概要を示す図、第4
図はハイパキューブ結合の構成例を示す図である。
l・・・PE集合体、
2・・・PE集合体間結合用リンク、
11・・・PE(処理要素)。
12・・・ハイパキューブ用リンク、
11−
13・・・バイパス用リンク。
=12=FIG. 1 is a schematic configuration diagram of an embodiment of a parallel computer coupling network according to the present invention, FIG. 2 is a detailed configuration diagram of one PE aggregate in FIG. 1, and FIG. 3 is a diagram showing an outline of one PE. , 4th
The figure is a diagram showing an example of the configuration of hypercube connections. 1... PE aggregate, 2... Link for coupling between PE aggregates, 11... PE (processing element). 12...Link for hypercube, 11-13... Link for bypass. =12=
Claims (1)
よりハイパキューブ状に結合すると共に、その最大PE
間距離を有するPE間同志にリンクを付加したPE群を
一つのPE集合体として、該PE集合体を複数配置し、
各PE集合体間を、各々のPE集合体を一つのPEとみ
なしたとき相互に完全結合を構成するように結合し、該
完全結合を実現するリンクは各PE集合体内のPEに均
一に結合することを特徴とする計算機の結合方法。(1) Combine multiple processing elements (hereinafter referred to as PEs) into a hypercube shape using links, and
A PE group in which links are added to PEs having a distance between them is regarded as one PE aggregate, and a plurality of PE aggregates are arranged,
Each PE aggregate is connected to form a complete connection when each PE aggregate is regarded as one PE, and links that realize the complete connection are uniformly connected to PEs in each PE aggregate. A computer combination method characterized by:
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP1325559A JPH03186963A (en) | 1989-12-15 | 1989-12-15 | Connection method for computer |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP1325559A JPH03186963A (en) | 1989-12-15 | 1989-12-15 | Connection method for computer |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| JPH03186963A true JPH03186963A (en) | 1991-08-14 |
Family
ID=18178250
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP1325559A Pending JPH03186963A (en) | 1989-12-15 | 1989-12-15 | Connection method for computer |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPH03186963A (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2010079912A (en) * | 1997-10-10 | 2010-04-08 | Altera Corp | Processor array and its formation method |
| WO2012056547A1 (en) * | 2010-10-28 | 2012-05-03 | 富士通株式会社 | Information processing system, routing method and program |
-
1989
- 1989-12-15 JP JP1325559A patent/JPH03186963A/en active Pending
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2010079912A (en) * | 1997-10-10 | 2010-04-08 | Altera Corp | Processor array and its formation method |
| JP2012133793A (en) * | 1997-10-10 | 2012-07-12 | Altera Corp | Processor array and methods of forming the same |
| WO2012056547A1 (en) * | 2010-10-28 | 2012-05-03 | 富士通株式会社 | Information processing system, routing method and program |
| JPWO2012056547A1 (en) * | 2010-10-28 | 2014-03-20 | 富士通株式会社 | Information processing system, routing method and program |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP4676463B2 (en) | Parallel computer system | |
| US9880972B2 (en) | Computer subsystem and computer system with composite nodes in an interconnection structure | |
| US5170482A (en) | Improved hypercube topology for multiprocessor computer systems | |
| US5630162A (en) | Array processor dotted communication network based on H-DOTs | |
| GB2140589A (en) | An array of a plurality of processing elements | |
| EP4293984A1 (en) | Data transmission system and method, and related device | |
| JPH0877128A (en) | Interconnection method of scalable parallel processing network and node | |
| Finkel | Processor interconnection strategies | |
| EP3973406A1 (en) | Embedding rings on a toroid computer network | |
| US5765015A (en) | Slide network for an array processor | |
| JPH03186963A (en) | Connection method for computer | |
| CN110210102A (en) | Bionic Self-healing Hardware Distributed Global Dynamic Routing System | |
| CN108768864B (en) | A Data Center Network Topology System with Easy Expansion and High Fault Tolerance | |
| CN116383126A (en) | Network-on-chip-based transmission method of deep neural network accelerator | |
| JP2976675B2 (en) | Array processor routing method | |
| JP3532102B2 (en) | Indirect rotator graph network and transmission path setting method in indirect rotator graph network | |
| Li | Multicast router with fault tolerance mechanism for large-scale neuromorphic computing system | |
| CN120017616B (en) | Interconnection system | |
| KR0170496B1 (en) | Cluster connecting structure using cross bar switch in a parallel processing computer system | |
| CN119945970B (en) | A parallel data transmission method based on 3-ary n-cube interconnection network | |
| JPH07114515A (en) | Decentralized memory computer with network for synchronous communication | |
| CN110445574A (en) | A kind of optical network transmission method and system based on hypergraph structure | |
| Chalasani et al. | Fault-tolerance with multimodule routers | |
| CN114185832B (en) | Acceleration component, acceleration device and electronic equipment | |
| JPS60175175A (en) | Parallel data processor |