JPH052607A

JPH052607A - High-speed search method using tree-structured data structure

Info

Publication number: JPH052607A
Application number: JP3154377A
Authority: JP
Inventors: Morio Kinoshita; 盛夫木下
Original assignee: Hitachi Software Engineering Co Ltd; Hitachi Ltd
Current assignee: Hitachi Software Engineering Co Ltd; Hitachi Ltd
Priority date: 1991-06-26
Filing date: 1991-06-26
Publication date: 1993-01-08

Abstract

(57)【要約】【目的】情報探索時の重複する無駄な比較を行わなくて
良く、且つ情報探索時の比較結果を有効に活用して高速
に情報探索を行えるデータ構造を構築することにより、
情報探索時の性能を向上させる。【構成】探索を目的としたデータ構造において、データ
を識別するための識別子（キー）を複数登録する場合
に、それぞれのキーを比較しやすい大きさのブロックに
分割する。そして、キーの比較を先頭の方のブロックか
らブロック毎に行い、キーの先頭の方のブロックが等し
く、キーの後ろの方のブロックが異なる場合、等しい部
分のブロックを一つだけ保持し、異なる部分のブロック
を木構造のデータ構造になるようにポインタでつないで
保持する。 (57) [Abstract] [Purpose] By constructing a data structure that does not require redundant and redundant comparisons at the time of information search, and that can effectively utilize the comparison results at the time of information search to perform high-speed information search. ,
Improves performance when searching for information. [Structure] When a plurality of identifiers (keys) for identifying data are registered in a data structure for searching, each key is divided into blocks of a size that facilitates comparison. Then, the key comparison is performed for each block from the first block, and when the first block of the key is the same and the second block of the key is different, only one block of the same part is held and different. The partial blocks are connected by a pointer and held so as to form a tree-structured data structure.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、データ構造に関し、特
に高速な情報の登録および探索が可能なデータ構造に関
するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data structure, and more particularly to a data structure capable of registering and searching information at high speed.

【０００２】[0002]

【従来の技術】従来より、情報の登録、探索を効率よく
実現するデータ構造として、ハッシングや木構造が用い
られている。ハッシングを用いたデータ構造では、発生
するデータ量の推定値が事前に分かっている場合、良好
な性能を得るデータ構造を作成することができる。しか
し、データ量が予測しにくく、またかなり変動する場
合、木構造の方が好ましいことが知られている。従来の
木構造のデータ構造には、２分木、ＡＶＬ木、最適木、
Ｂ木、２−３木、ＳＢＢ木等さまざまなものがあるが、
いずれも、木の枝の分かれ目や木の枝の先端（以下、節
点と記す）にキーを完全な形で持っていなければならな
い。非常に多くの場合、情報探索のキーは整数のように
一回の比較で大小の判定ができるものではなく、文字列
のように複数回に分割して比較しなければ大小の判定が
できないものである。しかし、従来の木構造ではポイン
タをたどり新しい節点に来る度にキーを先頭から比較し
直さなければならず、一回の比較で比較しきれないよう
なキーの比較回数の削減については全く考えられていな
い。また従来の木構造では、節点でのキーの比較で得ら
れる情報のうち、大きいか小さいかの二つの情報によっ
て、二つのポインタの中からどのポインタをたどるかを
決定しており、節点でのキーの比較で得られる情報の、
どこまでが等しくどこからが違うかという、より詳しい
情報を利用して、二つ以上のポインタの中から一つのポ
インタを選んでたどる方法については全く考えられてい
ない。2. Description of the Related Art Conventionally, hashing or a tree structure has been used as a data structure for efficiently implementing information registration and search. In the data structure using hashing, when the estimated value of the amount of generated data is known in advance, it is possible to create a data structure that obtains good performance. However, it is known that the tree structure is preferable when the amount of data is difficult to predict and varies considerably. The conventional tree data structure includes a binary tree, an AVL tree, an optimal tree,
There are various trees such as B tree, 2-3 tree, SBB tree,
In each case, the key must be completely held at the branch of the tree branch or the tip of the tree branch (hereinafter referred to as a node). In many cases, the information search key is not something that can be compared to determine the size by a single comparison like an integer, but a character that cannot be determined if it is divided into multiple times and compared like a character string. Is. However, in the conventional tree structure, each time the pointer is traced and a new node is reached, the keys must be compared again, and it is completely conceivable to reduce the number of key comparisons that cannot be compared in one comparison. Not not. Also, in the conventional tree structure, which of the two pointers is to be traced is determined by two pieces of information, which are larger or smaller, among the information obtained by comparing the keys at the nodes. Of the information obtained by comparing the keys,
There is no idea of how to select and trace one pointer from two or more pointers by using more detailed information on how much is equal and where is different.

【０００３】なおこの種の技術として関連するものに
は、たとえばＡＮＳＩ（アメリカン・ナショナル・スタ
ンダード・フォ・インフォメーション・システムズ：Ａ
merican Ｎational Ｓtandard for information s
ystems）データベース・ランゲージ：database langua
ge−ＳＱＬ（規格番号Ｘ３．１３５−１９８６）などが
ある。[0003] Related to this kind of technology, for example, is ANSI (American National Standard for Information Systems: A).
merican National Standard for information s
ystems) database language: database langua
ge-SQL (standard number X3.135-1986).

【０００４】[0004]

【発明が解決しようとする課題】木構造で各節点にそれ
ぞれのキーを完全な形で持つデータ構造では、木構造の
根（ｒｏｏｔ）から目的のキー又はキーの挿入場所を探
すときに、ポインタをたどり目的のキーにたどり着くま
での間の各節点で、その節点にあるキーと探しているキ
ーの比較を行う。キーは通常長いものもあれば短いもの
もある。そのため、節点にあるキーと探しているキーの
比較は一回で行えないものが多く、キーを分割し複数回
に分けて比較し、比較結果（キーの大小）によってどの
ポインタをたどるかを決定する場合がほとんどである。
そのため、キーの先頭の方は同じで後ろの方が違うキー
が複数登録されている場合（多くのキーを登録する場
合、必ずこのような状態が発生する）、これらの節点で
の比較で、毎回キーの先頭の同じ部分の比較をやりなお
さなければならないという問題が生じる。In a data structure having a key in each node in a tree structure in a complete form, a pointer is used when searching for a target key or a key insertion position from the root of the tree structure. At each node until the desired key is reached, the key at that node is compared with the key you are looking for. Some keys are usually long and some are short. Therefore, it is not possible to compare the key at the node with the key you are looking for in a single operation. The key is divided and compared multiple times, and which pointer is traced is determined by the comparison result (size of key). In most cases.
Therefore, when multiple keys are registered with the same beginning and different endings (when registering a lot of keys, this situation will always occur), the comparison at these nodes The problem arises that the same part at the beginning of the key has to be recompared each time.

【０００５】また、節点でのキーの比較でキーを複数の
ブロックに分割し、先頭の方のブロックから比較した場
合、得られる情報としては、どこまでのブロックは等し
いが、どこ以降のブロックが大きい又は小さい、という
情報が得られる。しかし、従来の木構造では、節点での
キーの比較結果によりポインタをたどるときにキーが大
きいか小さいかの情報しか利用しておらず、節点での比
較で得られる情報を十分に利用していないという問題が
ある。When the key is divided into a plurality of blocks by comparing the keys at the nodes and the comparison is made from the first block, the obtained information is where the blocks are equal but where the subsequent blocks are large. Or information that it is small is obtained. However, in the conventional tree structure, only the information about whether the key is large or small is used when the pointer is traced based on the comparison result of the keys at the nodes, and the information obtained by the comparison at the nodes is fully used. There is a problem that there is no.

【０００６】本発明の目的は、このような問題を解決
し、キーの先頭の方が同じで後ろの方が違うキーが有る
場合でも無駄な比較をできるだけ少なくし、また節点で
のキーの比較で得られる情報を十分に利用して、情報探
索の性能を向上させることが可能なデータ構造を提供す
ることにある。An object of the present invention is to solve such a problem, to minimize unnecessary comparison even when there is a key having the same key at the beginning and a different key at the rear, and comparing keys at nodes. It is to provide a data structure capable of improving the performance of information search by fully utilizing the information obtained in.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するため
に、本発明のデータ構造（木構造）では、キーを複数登
録する場合に、それぞれのキーを比較しやすい大きさの
ブロックに分割し、キーの比較を先頭の方のブロックか
らブロック毎に行い、先頭の方のブロックが等しく後ろ
の方のブロックが異なる場合に、等しい部分のブロック
は一つだけ保持し、異なる部分以降のブロックを木構造
のデータ構造になるようにポインターでつないで保持す
ることに特徴がある。In order to achieve the above object, in the data structure (tree structure) of the present invention, when a plurality of keys are registered, each key is divided into blocks of a size easy to compare. , The key comparison is performed for each block from the first block, and when the first block is the same and the second block is different, only one block of the same part is retained and blocks after the different part are retained. It is characterized by holding it by connecting it with a pointer so that it becomes a tree-structured data structure.

【０００８】[0008]

【作用】本発明のデータ構造では、節点からポイントさ
れた先の節点には、ポイント元の節点にセットされてい
るキーと先頭の方から比較し、異なっている所より後ろ
の部分のブロックのみがセットされている。このため、
目的のキーと節点にセットされているキーの先頭の部分
が同じで後ろの部分が違う場合、その節点からポイント
された次の節点でのキーの比較では、前の節点でのキー
の比較で異なった部分が現れた所より後ろの比較のみを
行えば良い。このため、先頭の部分が同じで後ろの部分
が異なるキーが複数登録されている場合のキーの探索時
に各節点で無駄な比較を行わなくても良い。In the data structure of the present invention, the node pointed to from the node is compared with the key set in the node that is the point of origin from the beginning, and only the blocks after the point where they differ are compared. Is set. For this reason,
If the beginning part of the target key and the key set in the node are the same, but the back part is different, in the key comparison at the next node pointed from that node, the key comparison at the previous node is Only the comparison after the point where the different parts appear should be performed. Therefore, when a plurality of keys having the same head portion but different tail portions are registered, it is not necessary to perform unnecessary comparison at each node when searching for a key.

【０００９】また、本発明のデータ構造では、節点に複
数のブロックを持ち、その各ブロック毎にポインタを持
っている。そして、キーの探索を行う場合、節点でキー
の比較を行い、どのブロックまでが等しくどのブロック
から異なるかにより、二つ以上のポインタの中からどの
ポインタをたどるかを決定できる。この結果、情報探索
の性能が向上する。Further, in the data structure of the present invention, each node has a plurality of blocks, and each block has a pointer. Then, when searching for a key, it is possible to determine which pointer is to be traced from two or more pointers by comparing the keys at the nodes and determining which block is the same and which block is different. As a result, the information search performance is improved.

【００１０】[0010]

【実施例】以下、本発明の実施例を図面により詳細に説
明する。図１は、本発明の一実施例であるデータ構造に
７つのキーを登録したときのデータ構造の図である。図
２は、図１のデータ構造の節点の構成の詳細を示した図
である。図２において１は１つ以上の連続したブロック
をセットするエリアであり、２は各ブロック毎に持つポ
インタのエリアで、そのブロックより前のブロックは等
しいが、そのブロック以降のブロックが異なり、且つそ
のブロックの値よりも小さい値のブロックを持つキーを
示す節点へのポインタ（以下、左のポインタと記す）の
エリアである。３は各ブロック毎に持つポインタのエリ
アで、そのブロックより前のブロックは等しいが、その
ブロック以降のブロックが異なり、且つそのブロックの
値よりも大きい値のブロックを持つキーを示す節点への
ポインタ（以下、右のポインタと記す）のエリアであ
る。Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 1 is a diagram of a data structure when seven keys are registered in the data structure according to an embodiment of the present invention. FIG. 2 is a diagram showing details of the configuration of nodes in the data structure of FIG. In FIG. 2, 1 is an area in which one or more continuous blocks are set, 2 is an area of a pointer held for each block, blocks before the block are equal, but blocks after the block are different, and It is an area of a pointer (hereinafter, referred to as a left pointer) to a node indicating a key having a block having a value smaller than the value of the block. Reference numeral 3 denotes an area of a pointer held for each block. A pointer to a node indicating a key having a block which is equal to the block before the block but different from the block after the block and has a value larger than the value of the block. This area (hereinafter referred to as the right pointer) is the area.

【００１１】図１は、Ａ，Ｂ，Ｃ，Ｄ，Ｅ，Ｆ、及びＧ
の７つのキーを登録した場合のデータ構造を示してい
る。４のｒｏｏｔは木構造の根の節点を示すポインタで
ある。Ａ［ｎ］は、キーＡのｎ番目のブロックを示し、
同様にＢ［ｎ］，Ｃ［ｎ］，…Ｇ［ｎ］はそれぞれのキ
ーのｎ番目のブロックを示す。ＮＩＬはキーの終端を示
すブロックである。ＮＩＬは可変長のキー登録を可能と
するために設けたものであり、固定長のキー登録のみを
行う場合ＮＩＬは必要ない。図１のデータ構造に登録さ
れているキーの長さ（ブロック数）を表１に示す。表２
にそれぞれのキーの大小関係を示す。FIG. 1 shows A, B, C, D, E, F, and G.
7 shows the data structure when the seven keys are registered. Root 4 is a pointer indicating the node of the root of the tree structure. A [n] indicates the nth block of the key A,
Similarly, B [n], C [n], ... G [n] indicate the nth block of each key. NIL is a block indicating the end of the key. The NIL is provided to enable variable-length key registration, and NIL is not necessary when only fixed-length key registration is performed. Table 1 shows the lengths (number of blocks) of the keys registered in the data structure of FIG. Table 2
Shows the magnitude relationship of each key.

【００１２】[0012]

【表１】 [Table 1]

【００１３】[0013]

【表２】 [Table 2]

【００１４】まず、本データ構造へのキーの登録手順を
示しながら、本データ構造を説明する。キーがなにも登
録されていない状態では、木の根を示すｒｏｏｔ４には
０がセットされている。この状態でキーＡを登録する場
合、ｒｏｏｔ４に節点５のアドレスをセットし、節点５
にキーＡの全てのブロックをセットする。Ａ［５］のブ
ロックの後には、キーＡの終端を示すＮＩＬをセットし
ておく。First, this data structure will be described by showing the procedure for registering a key in this data structure. In the state where no key is registered, 0 is set in the root4 indicating the root of the tree. When registering key A in this state, set the address of node 5 to root4 and
Set all blocks of key A to. After the block of A [5], NIL indicating the end of the key A is set.

【００１５】次に、この状態にキーＢを登録する場合、
まずｒｏｏｔ４が指す節点５にセットされているキーＡ
と、これから登録するキーＢを先頭から比較する。キー
ＡとキーＢは、先頭のブロックから異なっており、Ａ
［１］よりＢ［１］の方が小さい。そこでＡ［１］の左
のポインタをチェックする。ポインタに何もセットされ
ていないので新しい節点６を作成し、節点６のアドレス
をＡ［１］の左のポインタにセットし、節点６にキーＢ
をセットする。Next, when registering the key B in this state,
First, the key A set at the node 5 pointed to by root4
Then, the key B to be registered is compared from the beginning. Key A and key B are different from the first block,
B [1] is smaller than [1]. Therefore, the pointer on the left of A [1] is checked. Since nothing is set in the pointer, a new node 6 is created, the address of node 6 is set in the pointer on the left of A [1], and key B is set in node 6.
Set.

【００１６】次に、この状態にキーＣを登録する場合、
まずｒｏｏｔ４が指す節点５にセットされているキーＡ
と、これから登録するキーＣを先頭から比較する。キー
ＡとキーＣは、先頭のブロックから異なっており、Ａ
［１］よりＣ［１］の方が大きい。そこでＡ［１］の右
のポインタをチェックする。ポインタに何もセットされ
ていないので新しい節点７を作成し、節点７のアドレス
をＡ［１］の右のポインタにセットし、節点７にキーＣ
をセットする。Next, when registering the key C in this state,
First, the key A set at the node 5 pointed to by root4
And the key C to be registered is compared from the beginning. Key A and key C are different from the first block,
C [1] is larger than [1]. Therefore, the pointer to the right of A [1] is checked. Since nothing has been set in the pointer, a new node 7 is created, the address of node 7 is set in the pointer to the right of A [1], and key C is set in node 7.
Set.

【００１７】つまり、キーＡ、キーＢ、及びキーＣは１
番目のブロックが異なるもの同志で２分木のデータ構造
を構成している。That is, the keys A, B, and C are 1
The second block is different, but the data structure of the binary tree is composed of comrades.

【００１８】次に、この状態にキーＤを登録する場合、
まずｒｏｏｔ４が指す節点５にセットされているキーＡ
と、これから登録するキーＤを先頭から比較する。キー
ＡとキーＤは、１番目、２番目、及び３番目のブロック
は等しいが、４番目のブロックが異なりＡ［４］よりＤ
［４］の方が大きい。そこでＡ［４］の右のポインタを
チェックする。ポインタに何もセットされていないので
新しい節点８を作成し、節点８のアドレスをＡ［４］の
右のポインタにセットする。節点８にキーＤをセットす
るが、そのときにキーＡのブロックと等しいＤ［１］，
Ｄ［２］、及びＤ［３］はセットせずに、キーＡと異な
るＤ［４］以降のブロックのみをセットする。Next, when registering the key D in this state,
First, the key A set at the node 5 pointed to by root4
And the key D to be registered is compared from the beginning. The key A and the key D are the same in the first, second, and third blocks, but are different in the fourth block from A [4] to D.
[4] is larger. Therefore, the pointer to the right of A [4] is checked. Since nothing is set in the pointer, a new node 8 is created and the address of the node 8 is set in the pointer to the right of A [4]. The key D is set at the node 8, and at that time D [1], which is equal to the block of the key A,
D [2] and D [3] are not set, and only blocks after D [4] different from the key A are set.

【００１９】次に、この状態にキーＥを登録する場合、
まずｒｏｏｔ４が指す節点５にセットされているキーＡ
と、これから登録するキーＥを先頭から比較する。キー
ＡとキーＥは、１番目、２番目、及び３番目のブロック
は等しいが、４番目のブロックが異なりＡ［４］よりＥ
［４］の方が大きい。そこでＡ［４］の右のポインタを
チェックする。Ａ［４］の右のポインタはアドレスがセ
ットされているので、そのポインタが指す節点８にセッ
トされているキーＤとこれから登録するキーＥを比較す
る。ここで、Ａ［４］の右のポインタには、１番目から
３番目のブロックがキーＡと等しいキーのみしかセット
されていないので、節点８でのキーの比較では、１番目
から３番目のブロックの比較は不要である。Ｅ［４］と
Ｄ［４］を比較するとＤ［４］よりＥ［４］のほうが小
さい。そこでＤ［４］の左のポインタをチェックする。
ポインタに何もセットされていないので新しい節点９を
作成し、節点９のアドレスをＤ［４］の左のポインタに
セットする。節点９にキーＥをセットするが、そのとき
にキーＡ、及びキーＤのブロックと等しいＥ［１］，Ｅ
［２］、及びＥ［３］はセットせずに、キーＡ、及びキ
ーＤと異なるＥ［４］以降のブロックのみをセットす
る。Next, when registering the key E in this state,
First, the key A set at the node 5 pointed to by root4
Then, the key E to be registered is compared from the beginning. The key A and the key E are the same in the first, second, and third blocks, but are different in the fourth block from E [4] to E.
[4] is larger. Therefore, the pointer to the right of A [4] is checked. Since the address is set in the pointer on the right of A [4], the key D set in the node 8 pointed to by the pointer is compared with the key E to be registered. Here, since the pointers on the right side of A [4] are set only to the keys in which the first to third blocks are equal to the key A, the comparison of the keys at the node 8 indicates that the first to third blocks are the same. No block comparison is necessary. Comparing E [4] and D [4], E [4] is smaller than D [4]. Therefore, the pointer on the left of D [4] is checked.
Since nothing is set in the pointer, a new node 9 is created, and the address of the node 9 is set in the pointer to the left of D [4]. The key E is set at the node 9, and at that time, E [1], E equal to the blocks of the key A and the key D are set.
[2] and E [3] are not set, and only blocks after E [4] different from keys A and D are set.

【００２０】つまり、本データ構造ではｎ番目までのブ
ロックが等しくｎ＋１番目以降のブロックが異なるキー
同志で２分木を構成する。また、このときにｎ番目まで
のブロックは１つだけ保持し、共有している。というこ
とに特長がある。That is, in this data structure, a binary tree is constructed by keys having the same up to the n-th block and different n + 1-th and subsequent blocks. At this time, only one block up to the nth block is held and shared. There is a feature in that.

【００２１】次に、可変長のキー登録でのみ発生する特
殊なケースを説明する。Ａ，Ｂ，Ｃ，Ｄ、及びＥのキー
が登録されている状態にキーＦを登録する場合、まずｒ
ｏｏｔ４が指す節点５にセットされているキーＡと、こ
れから登録するキーＦを先頭から比較する。キーＡとキ
ーＦは、先頭のブロックから異なっており、Ａ［１］よ
りＦ［１］のほうが大きい。そこで、Ａ［１］の右のポ
インタをチェックする。Ａ［１］の右のポインタはアド
レスがセットされているので、そのポインタが指す節点
７にセットされているキーＣとこれから登録するキーＦ
を比較する。キーＣとキーＦは１番目、及び２番目のブ
ロックは等しいが、キーＣは２つのブロックしかなく、
キーＦは４つのブロックからなる。そこで、キーＣの終
端を示すブロックＮＩＬとＦ［３］を比較し、（ＮＩＬ
はいずれのブロックの値よりも小さい値をもつものとし
た場合）ＮＩＬよりもＦ［３］の方が大きいので、キー
Ｃの終端を示すＮＩＬの右のポインタをチェックする。
このＮＩＬの右のポインタに何もセットされていないの
で新しい節点１０を作成し、節点１０のアドレスをキー
Ｃの終端を示すＮＩＬの右のポインタにセットする。節
点１０にキーＦをセットするが、そのときにキーＣのブ
ロックと等しいＦ［１］、及びＦ［２］はセットせず
に、キーＣと異なるＦ［３］以降のブロックのみをセッ
トする。Next, a special case that occurs only when a variable-length key is registered will be described. When registering the key F while the keys A, B, C, D, and E are registered, first, r
The key A set at the node 5 pointed by the boot 4 and the key F to be registered now are compared from the beginning. The key A and the key F are different from the first block, and F [1] is larger than A [1]. Therefore, the pointer to the right of A [1] is checked. Since the address is set in the pointer on the right of A [1], the key C set at the node 7 pointed to by the pointer and the key F to be registered from now on.
To compare. The keys C and F are the same in the first and second blocks, but the key C has only two blocks.
Key F consists of four blocks. Therefore, the block NIL indicating the end of the key C is compared with F [3], and (NIL
Is F [3] is larger than NIL (assuming that each block has a value smaller than the value of any block), the pointer to the right of NIL indicating the end of key C is checked.
Nothing is set in the pointer to the right of this NIL, so a new node 10 is created, and the address of node 10 is set to the pointer to the right of NIL indicating the end of key C. The key F is set at the node 10, but F [1] and F [2], which are equal to the block of the key C, are not set at that time, and only the blocks of F [3] and subsequent blocks different from the key C are set. ..

【００２２】次に、上記の状態にキーＧをセットする場
合、まずｒｏｏｔ４が指す節点５にセットされているキ
ーＡと、これから登録するキーＧを先頭から比較する。
キーＡとキーＧは、先頭のブロックから異なっており、
Ａ［１］よりＧ［１］のほうが小さい。そこで、Ａ
［１］の左のポインタをチェックする。Ａ［１］の左の
ポインタはアドレスがセットされているので、そのポイ
ンタが指す節点６にセットされているキーＢと、これか
ら登録するキーＧを比較する。キーＢとキーＧは１番目
のブロックは等しいが、キーＢは４つのブロックがあ
り、キーＧは１つのブロックしかない。そこで、キーＧ
の終端を示すブロックＮＩＬとＢ［２］を比較し、Ｂ
［２］よりもＮＩＬの方が小さいので、Ｂ［２］の左の
ポインタをチェックする。このＢ［２］の左のポインタ
になにもセットされていないので新しい節点１１を作成
し、節点１１のアドレスをＢ［２］の左のポインタにセ
ットする。節点１１にキーＧをセットするが、そのとき
にキーＢのブロックと等しいＧ［１］はセットせずに、
キーＧの終端を示すＮＩＬのみをセットする。Next, when the key G is set in the above state, first, the key A set at the node 5 pointed by the root 4 and the key G to be registered are compared from the beginning.
Key A and Key G are different from the first block,
G [1] is smaller than A [1]. So A
Check the pointer to the left of [1]. Since the address is set to the pointer on the left of A [1], the key B set at the node 6 pointed to by the pointer is compared with the key G to be registered. Key B and key G have the same first block, but key B has four blocks and key G has only one block. So key G
The block NIL indicating the end of B is compared with B [2], and B
Since NIL is smaller than [2], the pointer to the left of B [2] is checked. Since nothing has been set in the left pointer of B [2], a new node 11 is created, and the address of the node 11 is set in the left pointer of B [2]. The key G is set at the node 11, but G [1] equal to the block of the key B is not set at that time,
Only NIL indicating the end of the key G is set.

【００２３】以上のように本データ構造では、キーの終
端を示すブロックＮＩＬを使用することにより可変長の
キーも登録することができる。また、ｎ番目までのブロ
ックが等しく、ｎ＋１番目以降のブロックが異なるキー
同志で２分木を構成しているので、目的のキーを探すと
きに重複している部分の無駄な比較が一切必要ない。更
に、２分木を構成するこれらのキーで、ｎ番目までのブ
ロックを共有し１つしか保持しないので、重複するデー
タを複数セットする必要がなく、データ構造を作成する
ときの時間が短縮される。As described above, in this data structure, a variable length key can also be registered by using the block NIL indicating the end of the key. In addition, since the n-th block is the same and the n + 1-th block and thereafter are different keys, a binary tree is formed, so that no unnecessary comparison of overlapping parts is required when searching for a target key. .. Furthermore, these keys that make up the binary tree share up to the nth block and hold only one, so there is no need to set multiple duplicate data and the time to create the data structure is shortened. It

【００２４】本データ構造において、既に登録されてい
るキーの中から目的のキーを探索するときの手順は、前
に示したキーの登録のときに、新規の節点を追加する所
を探すときと同様の手順でポインタをたどることによ
り、目的のキーの探索が行える。この場合も、重複する
部分の無駄な比較がなく、キーの探索時間が短縮され
る。In this data structure, the procedure for searching for a target key from the already registered keys is as follows when searching for a new node to be added when registering the key shown above. The target key can be searched by following the pointer in the same procedure. Also in this case, there is no wasteful comparison of overlapping portions, and the key search time is shortened.

【００２５】次に、本データ構造から、目的のキーを削
除するときの手順を説明する。キーを削除する場合、ま
ず前に示したキー探索の手順で削除するキーがセットさ
れている節点を探す。そして、その節点にセットされて
いるポインタの状態によってキーの削除手順が異なる。
まず、削除するキーがセットされている節点にポインタ
が１つもセットされていない場合の削除手順を示す。こ
の場合、削除するキーがセットされている節点を指して
いるポインタをクリアすれば良い。例えば、図１のデー
タ構造からキーＥを削除する場合、キーの探索を行い、
キーＥは節点９にセットされていることが分かる。節点
９は、節点８のＤ［４］の左のポインタから指されてい
るので、節点８のＤ［４］の左のポインタをクリアすれ
ば節点９がデータ構造から削除され、キーＥが削除でき
る。Next, a procedure for deleting a target key from this data structure will be described. When deleting a key, first find the node in which the key to be deleted is set by the key search procedure shown above. The key deletion procedure differs depending on the state of the pointer set at the node.
First, the deletion procedure when no pointer is set at the node where the key to be deleted is set will be described. In this case, the pointer pointing to the node where the key to be deleted is set may be cleared. For example, when deleting the key E from the data structure of FIG. 1, a key search is performed,
It can be seen that the key E is set at the node 9. Since the node 9 is pointed to by the left pointer of D [4] of the node 8, if the left pointer of D [4] of the node 8 is cleared, the node 9 is deleted from the data structure and the key E is deleted. it can.

【００２６】次に、削除するキーがセットされている節
点にポインタがセットされており、且つポインタがセッ
トされているブロックの内、最も後のブロック（以下、
このブロックを置き換え先頭ブロックと記す）のポイン
タが、左のポインタか右のポインタのいずれか一方しか
セットされていない場合の削除手順を示す。この場合、
削除するキーがセットされている節点の置き換え先頭ブ
ロック以降のブロック及びポインタに、置き換え先頭ブ
ロックの左のポインタ又は右のポインタが指す節点のブ
ロック及びポインタの内容を全て移せば良い。例えば、
図１のデータ構造からキーＡを削除する場合、キーの探
索を行い、キーＡは節点５にセットされていることが分
かる。節点５の中でポインタがセットされているブロッ
クは、Ａ［１］とＡ［４］だが、Ａ［４］の方が後のブ
ロックなのでＡ［４］が置き換え先頭ブロックとなる。
Ａ［４］のブロックには右のポインタしかなく、節点８
を指しているので、節点５のＡ［４］以降のブロック及
びポインタに節点８のブロック及びポインタの内容をコ
ピーする。そうすると図３の状態となる。Ａ［１］，Ａ
［２］、及びＡ［３］は、Ｄ［１］，Ｄ［２］、及びＤ
［３］と等しいから、節点５にセットされているキーは
ＡではなくＤとなり、このことからキーＡの削除が行わ
れていることが分かる。また、その他の登録状態は変わ
っていない。Next, the pointer is set at the node where the key to be deleted is set, and the last block (hereinafter, referred to as
This block is referred to as a replacement top block), and the deletion procedure is shown when only one of the left pointer and the right pointer is set. in this case,
All the contents of the node block and the pointer pointed by the left pointer or the right pointer of the replacement head block may be transferred to the blocks and pointers after the replacement head block of the node in which the key to be deleted is set. For example,
When the key A is deleted from the data structure of FIG. 1, it is found that the key is searched and the key A is set at the node 5. The blocks in which the pointer is set in the node 5 are A [1] and A [4], but since A [4] is a later block, A [4] becomes the replacement first block.
The block of A [4] has only the right pointer, and the node 8
The contents of the block and the pointer at the node 8 are copied to the block and the pointer after A [4] of the node 5. Then, the state shown in FIG. 3 is obtained. A [1], A
[2] and A [3] are D [1], D [2], and D
Since it is equal to [3], the key set at the node 5 is D instead of A, and it can be seen that the key A is deleted. The other registration statuses have not changed.

【００２７】次に、削除するキーがセットされている節
点にポインタがセットされており、且つポインタがセッ
トされているブロックの内、最も後のブロック（置き換
え先頭ブロック）の左のポインタ及び右のポインタの両
方がセットされている場合の削除手順を示す。この場
合、まず削除するキーがセットされている節点の置き換
え先頭ブロックの右のポインタ（左のポインタをたどる
方法も考えられる）が指す節点を根（ｒｏｏｔ）とした
部分木構造を考える。この部分木構造で、各節点の先頭
のブロックの左のポインタだけをたどり、節点の先頭の
ブロックの左のポインタがセットされていない節点（以
下、置き換え元節点と記す）を探す。置き換え元節点が
見つかったら置き換え元節点を指しているポインタを、
置き換え元節点の先頭のブロックの右のポインタの内容
で置き換える。次に、削除するキーがセットされている
節点の置き換え先頭ブロック以降のブロックの内容を置
き換え元節点にセットされているブロックの内容で置き
換える。また削除するキーがセットされている節点の置
き換え先頭ブロックより１つ後のブロック以降のポイン
タを置き換え元節点の２番目以降のブロックのポインタ
で置き換えれば良い。例えば、図４のデータ構造からキ
ーＨを削除する場合、キーの探索を行い、キーＨは節点
１２にセットされていることが分かる。節点１２の中で
ポインタがセットされているブロックは、Ｈ［４］だけ
なのでＨ［４］が置き換え先頭ブロックとなる。Ｈ
［４］のブロックには右のポインタ及び左のポインタの
両方がセットされている。そこで、Ｈ［４］の右のポイ
ンタが指す節点１４を根（ｒｏｏｔ）とする部分木構造
で、各節点の先頭のブロックの左のポインタだけをたど
り、節点の先頭のブロックの左のポインタがセットされ
ていない節点（置き換え元節点）を探す。まず節点１４
の先頭のブロックの左のポインタにはアドレスがセット
されているのでポインタをたどり、次に節点１５をチェ
ックする。節点１５の先頭のブロックの左のポインタは
セットされていないので、節点１５が置き換え元節点と
なる。置き換え元節点が見つかったので置き換え元節点
を指している節点１４のＪ［４］の左のポインタを置き
換え元節点１５の先頭のブロックＫ［４］の右のポイン
タの内容で置き換える。次に、削除するキーがセットさ
れている節点１２の置き換え先頭ブロックＨ［４］以降
のブロックを、置き換え元節点１５にセットされている
ブロックの内容で置き換え、また削除するキーがセット
されている節点１２の置き換え先頭ブロックＨ［４］よ
り１つ後のブロック以降のポインタを置き換え元節点１
５の２番目以降のブロックのポインタで置き換える。す
ると図５の状態となる。Ｈ［１］，Ｈ［２］、及びＨ
［３］は、Ｋ［１］，Ｋ［２］、及びＫ［３］と等しい
はずであるから、節点１２にセットされているキーはＨ
ではなくキーＫとなり、このことからキーＨの削除が行
われていることが分かる。また、その他のキーの登録状
態は変わっていない。Next, the pointer is set at the node where the key to be deleted is set, and the leftmost pointer and the rightmost pointer of the last block (replacement head block) of the blocks where the pointer is set. The deletion procedure when both pointers are set is shown. In this case, first, consider a subtree structure in which the node pointed by the right pointer (following the method of tracing the left pointer) of the replacement head block of the node in which the key to be deleted is set is the root. In this subtree structure, only the pointer on the left of the block at the beginning of each node is traced to find a node for which the pointer on the left of the block at the beginning of the node is not set (hereinafter referred to as replacement source node). When the replacement source node is found, set the pointer pointing to the replacement source node to
Replace with the contents of the pointer on the right of the block at the beginning of the replacement source node. Next, the contents of the blocks after the replacement first block of the node in which the key to be deleted is set are replaced with the contents of the block set in the replacement source node. Further, it is sufficient to replace the pointers of the blocks after the first block after the replacement of the node in which the key to be deleted is set with the pointers of the second and subsequent blocks of the replacement original node. For example, when the key H is deleted from the data structure of FIG. 4, it is found that the key H is searched and the key H is set at the node 12. Since the block in which the pointer is set in the node 12 is only H [4], H [4] becomes the replacement top block. H
Both the right pointer and the left pointer are set in the block [4]. Therefore, in the subtree structure in which the node 14 pointed by the right pointer of H [4] is a root, only the left pointer of the head block of each node is traced, and the left pointer of the head block of the node is Search for unset nodes (replacement original nodes). First, node 14
Since the address has been set in the pointer on the left of the first block of, the pointer is traced, and then the node 15 is checked. Since the pointer to the left of the head block of the node 15 is not set, the node 15 becomes the replacement source node. Since the replacement source node is found, the left pointer of J [4] of the node 14 pointing to the replacement source node is replaced with the content of the right pointer of the leading block K [4] of the replacement source node 15. Next, the block after the replacement head block H [4] of the node 12 in which the key to be deleted is set is replaced with the contents of the block set in the replacement source node 15, and the key to be deleted is set. Replacement of the node 12 The pointer after the block one block after the leading block H [4] is replaced with the node 1
Replace with the pointers of the second and subsequent blocks of block 5. Then, the state shown in FIG. 5 is obtained. H [1], H [2], and H
[3] should be equal to K [1], K [2], and K [3], so the key set at node 12 is H
Instead of the key K, it can be seen that the key H has been deleted. Also, the registration status of other keys has not changed.

【００２８】以上のように、本データ構造からのキーの
削除の手順は、２分木のデータ構造からキーを削除する
ときの手順と類似している。ただし、削除するキーがセ
ットされている節点を探索するときの手順は前に示した
ものが使用できるので、この場合も、重複する部分の無
駄な比較がなく、この分の時間が短縮できる。As described above, the procedure for deleting a key from this data structure is similar to the procedure for deleting a key from the binary tree data structure. However, since the procedure shown above can be used when searching for the node in which the key to be deleted is set, in this case as well, there is no wasteful comparison of overlapping parts and the time can be shortened by this amount.

【００２９】また、本データ構造は２分木の考えを使用
しており、各節点はより大きいものへのポインタ、又は
より小さいものへのポインタでつながれているので、本
データ構造に登録されたキーはおのずとソートされてい
ることは明らかである。Since this data structure uses the idea of a binary tree and each node is connected by a pointer to a larger one or a pointer to a smaller one, it is registered in this data structure. Clearly the keys are naturally sorted.

【００３０】このように、本実施例に示したデータ構造
においては、可変長のキーの登録、探索、削除、及びソ
ートを行うことができる。本実施例では、ｎ番目までの
ブロックが等しくｎ＋１番目以降のブロックが異なるキ
ー同志で通常の２分木を構成するものであるが、この他
にｎ番目までのブロックが等しくｎ＋１番目以降のブロ
ックが異なるキー同志でＡＶＬ木を構成する方法、最適
木を構成する方法、２−３木を構成する方法等さまざま
な方法が考えられるので、キー探索の最悪の場合の時間
を短縮する必要がある場合は、これらの方法を使用する
こともできる。As described above, in the data structure shown in this embodiment, it is possible to register, search, delete, and sort a variable-length key. In the present embodiment, a normal binary tree is formed by keys having the same n-th block and different n + 1-th and subsequent blocks, but in addition to this, the n-th block is equal and the n + 1-th and subsequent blocks are the same. Since various methods such as a method of forming an AVL tree with different keys, a method of forming an optimal tree, and a method of forming a 2-3 tree can be considered, it is necessary to shorten the time in the worst case of key search. If desired, these methods can also be used.

【００３１】[0031]

【発明の効果】以上説明したように、本発明によれば、
可変長のキーの登録、探索、削除、及びソートを行うこ
とができるデータ構造を作ることができる。このとき、
本データ構造ではキーの比較を先頭の方のブロックから
ブロック毎に行い、先頭の方のブロックが等しく後ろの
方のブロックが異なる場合に、等しい部分のブロックは
一つだけ保持し、異なる部分以降のブロックを木構造の
データ構造になるようにポインターでつないで保持して
いる。このため、目的のキーが登録されている節点、ま
たは目的のキーを追加する所を探索するときに、キーの
内容が重複する部分の無駄な比較がない。また、キーの
追加の場合は、キーの重複する部分のデータのセットは
１回だけでよい。更に、通常の２分木の場合、１つの節
点での比較により、・等しい・大きい・小さいの三つの情報しか得られない。このため、一つの節点で
の比較結果により、２つのポインタの中から１つのポイ
ンタを選んでたどることしかできない。しかし、本発明
の方式では、一つの節点での比較により、・等しい・どこのブロックまで等しく、どこ以降のブロックが大
きい・どこのブロックまで等しく、どこ以降のブロックが小
さいのように、より詳しい情報を得ることができる。このた
め、一つの節点での比較結果により、２つ以上のポイン
タの中から１つのポインタを選んでたどることができ
る。従って、目的のキーを探索し目的のキーがセットさ
れている節点にたどり着くまでに辿らなければならない
ポインタの数が削減される。そのため、従来の木構造の
データ構造でキーの登録、探索、削除、及びソートを実
施する場合より短い時間でこれらの機能を実現できる。
また、多くのキーを登録し、キーの重複する部分が多い
場合は、バイナリサーチと比較しても、より短い時間で
探索を実行することができる。As described above, according to the present invention,
Data structures can be created that allow for variable length key registration, search, deletion, and sorting. At this time,
In this data structure, key comparison is performed for each block from the first block, and when the first block is the same and the second block is different, only one block of the same part is retained and after the different part The blocks are stored by connecting them with pointers so that they become a tree-structured data structure. Therefore, when searching for a node in which the target key is registered or a place where the target key is added, there is no wasteful comparison of the portions where the key contents overlap. In addition, in the case of adding a key, the data set in the overlapping portion of the key needs to be set only once. Furthermore, in the case of a normal binary tree, the comparison at one node yields only three pieces of information: equal, large, and small. Therefore, only one pointer can be selected from the two pointers and traced according to the comparison result at one node. However, in the method of the present invention, by comparison at one node, it is more detailed such as: equal, to which block is equal, where to be large block, to which block is equal, and after which block is small. You can get information. Therefore, one pointer can be selected and traced from two or more pointers according to the comparison result at one node. Therefore, the number of pointers required to search for the target key and reach the node where the target key is set is reduced. Therefore, these functions can be realized in a shorter time than when performing key registration, search, deletion, and sorting with a conventional tree-structured data structure.
In addition, when many keys are registered and there are many overlapping keys, the search can be executed in a shorter time than the binary search.

[Brief description of drawings]

【図１】本発明の一実施例であるデータ構造にＡ，Ｂ，
Ｃ，Ｄ，Ｅ，Ｆ、及びＧの７つのキーを登録した場合の
データ構造を示す図である。FIG. 1 is a block diagram showing a data structure of A, B,
It is a figure which shows the data structure at the time of registering seven keys of C, D, E, F, and G.

【図２】節点の構成を示す図である。FIG. 2 is a diagram showing a configuration of nodes.

【図３】図１のデータ構造からキーＡを削除した後のデ
ータ構造を示す図である。FIG. 3 is a diagram showing a data structure after key A is deleted from the data structure shown in FIG.

【図４】本発明の一実施例であるデータ構造にＨ，Ｉ，
Ｊ，Ｋ，Ｌ、及びＭの６つのキーを登録した場合のデー
タ構造を示す図である。FIG. 4 shows H, I, and
It is a figure which shows the data structure at the time of registering six keys of J, K, L, and M.

【図５】図４のデータ構造からキーＨを削除した後のデ
ータ構造を示す図である。5 is a diagram showing a data structure after deleting a key H from the data structure shown in FIG. 4;

[Explanation of symbols]

１…ブロックをセットするエリア、２…左のポインのタエリア、３…右のポインタのエリア、４…ｒｏｏｔポインタ、５〜１７…節点。 1 ... area for setting block, 2 ... left pointer area, 3 ... right pointer area, 4 ... root pointer, 5-17 ... node.

Claims

What is claimed is: 1. When a plurality of identifiers (hereinafter referred to as keys) for identifying data are registered in a data structure for the purpose of searching, each key is divided into one or more. (Divided keys are referred to as blocks below), the keys are compared from the first block to the last block, and if the first block of the key is the same and the second block of the key is different, the same. A high-speed search method using a tree-structured data structure, in which only one block is retained, and blocks following different portions are connected by a pointer so that the block has a tree-structured data structure.