JPH0440554A

JPH0440554A - Character data processor

Info

Publication number: JPH0440554A
Application number: JP2147926A
Authority: JP
Inventors: Toshimitsu Onoki; 小野木　敏光
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-06-06
Filing date: 1990-06-06
Publication date: 1992-02-10

Abstract

PURPOSE:To efficiently execute the processing of character data in which character codes of plural code systems are mixed by constituting character-string data by dividing it into two parts of a character code train and a code flag train, and inserting a control code into the character-string data in the case of necessity. CONSTITUTION:The processor is provided with an input means 1 and an output means 2, the input means 1 generates character-string data 4 from prescribed input data 3, the character-string data 4 is constituted of a character code train 5 and a code flag train 6, and the character code train 5 is a list of character codes of plural different prescribed code systems. Also, the code flag train 6 is a list of code flags, and each code flag corresponds to every prescribed number of the character codes of the character code train and shows a code system of the corresponding character code. The output means 2 generates output data 7 by inserting a necessary control code at every continuous list of the character codes of the same code system of the character code train 5 based on the correspondence of the code flag of the code flag train 6 and the character code of the character code train 5. In such a way, the character data in which the character codes of plural code systems are mixed can be processed efficiently.

Description

【発明の詳細な説明】〔手肌　　要〕計算機における、文字データの処理に関し、複数のコー
ド系の文字コードが混在する文字データの処理を容易に
する文字データ処理装置を目的とし、入力手段と出力手段を有し、該入力手段は、所与の入力
データから文字列データを生成し、該文字列データは、
文字コード列とコードフラグ列によって構成され、該文
字コード列は、複数の異なる所定のコード系の文字コー
ドの並びであり、該コードフラグ列は、コードフラグの
並びであって、各該コードフラグが、該文字コード列の
該文字コードの所定数ごとに対応して、該対応する文字
コートの該コード系を示し、該出力手段は、該コードフ
ラグ列の該コードフラグと該文字コード列の該文字コー
ドとの対応とに基づいて、該文字コード列の同一の該コ
ード系の文字コードの連続する並びごとに、所要の制御
コードを挿入して出力データを生成するように構成する
。[Detailed Description of the Invention] [Required] The object of the present invention is to provide a character data processing device that facilitates the processing of character data in which character codes of a plurality of code systems coexist in a computer. It has an output means, the input means generates character string data from given input data, and the character string data is
It is composed of a character code string and a code flag string, the character code string is a sequence of character codes of a plurality of different predetermined code systems, and the code flag string is a sequence of code flags, and each code flag indicates the code system of the corresponding character code for each predetermined number of character codes of the character code string, and the output means outputs the code flag of the code flag string and the character code string of the character code string. Based on the correspondence with the character code, a required control code is inserted into each consecutive sequence of character codes of the same code system in the character code string to generate output data.

[Industrial application field]

本発明は、計算機における、文字データの処理、特に複
数のコード系の文字コードが混在する文字データの文字
データ処理装置に関する。The present invention relates to character data processing in a computer, and particularly to a character data processing apparatus for character data in which character codes of a plurality of code systems coexist.

〔従来の技術と発明が解決しようとする課題〕第２図は
日本語文字を含む文字データの例であり、第２図（ａ）
に示すように、半角の英数字と全角の日本語文字が混在
する場合の文字データは、例えば第２図（ｂ）のように
、１バイトで表される半角文字のコードと、２バイトで
表される全角文字のコードと、それらのコード種別の切
換（シフト）を示す制御コード（図の「２８」、ｒ２９
」）から構成される。[Prior art and problems to be solved by the invention] Figure 2 is an example of character data including Japanese characters, and Figure 2 (a)
As shown in Figure 2(b), the character data when half-width alphanumeric characters and full-width Japanese characters are mixed is the code of the half-width character expressed in 1 byte and the code expressed in 2 bytes, as shown in Figure 2 (b). The code of the double-byte character represented and the control code that indicates the switching (shift) of the code type (“28” in the figure, r29
”).

文字データはすべてこのような形式のデータとして構成
されるので、このようなデータを生成し、処理する各業
務処理プログラムごとに、データの生成において制御コ
ードの挿入を行い、又制御コードがデータの中に挿入さ
れていることを意識して処理を行わなければならない。All character data is structured as data in this format, so a control code is inserted during data generation for each business processing program that generates and processes such data. Processing must be carried out while being aware that it is inserted inside.

この処理は、以上のように制御コードの挿入によって、
文字の桁位置とデータ上の文字コードの位置とに桁ずれ
があるために、両者の対応が複雑となり、文字データの
再設定処理を複雑化する。This process is done by inserting the control code as described above.
Since there is a digit shift between the character digit position and the character code position on the data, the correspondence between the two becomes complicated, which complicates the character data resetting process.

又、制御コード群には複数種類の変形があって、必要な
場合にはそれらの種類ごとに対応するプログラムが必要
になる。Furthermore, there are multiple types of variations in the control code group, and if necessary, a program corresponding to each type is required.

本発明は、複数のコード系の文字コードが混在する文字
データを容易且つ効率よく処理する文字データ処理装置
を目的とする。An object of the present invention is to provide a character data processing device that easily and efficiently processes character data in which character codes of a plurality of code systems coexist.

[Means to solve the problem]

第１図は、本発明の構成を示すブロック図である。 FIG. 1 is a block diagram showing the configuration of the present invention.

図は文字データ処理装置の構成であって、入力手段１と
出力手段２を有し、入力手段１は、所与の入力データ３
から文字列データ４を生成し、該文字列データ４は、文
字コード列５とコードフラグ列６によって構成され、文
字コード列５は、複数の異なる所定のコード系の文字コ
ードの並びであり、コードフラグ列６は、コードフラグ
の並びであって、各該コードフラグが、該文字コード列
の該文字コードの所定数ごとに対応して、該対応する文
字コードの該コ′−ド系を示し、出力手段２は、コード
フラグ列６の該コードフラグと文字コード列５の該文字
コードとの対応とに基づいて、文字コード列５の同一の
該コード系の文字コードの連続する並びごとに、所要の
制御コードを挿入して出力データ７を生成する。The figure shows the configuration of a character data processing device, which has an input means 1 and an output means 2. The input means 1 receives given input data 3.
generates character string data 4 from The code flag string 6 is a sequence of code flags, and each code flag indicates the code system of the corresponding character code for each predetermined number of character codes of the character code string. Based on the correspondence between the code flag in the code flag string 6 and the character code in the character code string 5, the output means 2 outputs each consecutive sequence of character codes of the same code system in the character code string 5. A required control code is inserted into the output data 7 to generate output data 7.

[For production]

この処理装置により、例えば入力情報又は既成の出力デ
ータを入力データ３として、入力手段１により文字列デ
ータ４が作成され、文字列データ４は文字コード列５と
コードフラグ列６の２部分に分けた構成であって、文字
を表すコードの並びに制御コードが挿入されることが無
いので、前記のような処理上の複雑さが無くなる。且つ
各種業務処理はこの文字列データ４を処理対象とし、帳
票出力等の必要な場合に、出力手段２によって文字列デ
ータ４を、制御コードが挿入された従来の形式の出力デ
ータ７を作成することにより、制御コード群の種類に個
別に対応を要する部分を出力手段のみに集約することが
できる。With this processing device, character string data 4 is created by input means 1 using input information or existing output data as input data 3, and character string data 4 is divided into two parts: a character code string 5 and a code flag string 6. This configuration eliminates the need for inserting code sequences representing characters and control codes, thereby eliminating the processing complexity described above. In addition, various business processes use this character string data 4 as a processing target, and when it is necessary to output a form, etc., the output means 2 creates the character string data 4 and output data 7 in a conventional format in which a control code is inserted. By doing so, parts that require individual handling for each type of control code group can be integrated into only the output means.

〔Example〕

第２図（Ｃ）及び（ｄ）は、第１回の文字列データ４の
構成例を説明する図であり、入力手段１は入力データの
各文字について、文字コードのみを（Ｃ）のように並べ
て文字コード列５とし、各文字コードのコード系を示す
コードフラグ（例は全角をＪ、半角をＥとす）を例えば
１バイトの情報として、文字コードと同じ順序で（ｄ）
のように並べてコードフラグ列６とする。FIGS. 2(C) and 2(d) are diagrams illustrating an example of the structure of the first character string data 4. The input means 1 inputs only the character code for each character of the input data as shown in (C). The code flags indicating the code system of each character code (for example, full-width is J and half-width is E) are set as 1-byte information, and in the same order as the character codes (d).
They are arranged as code flag string 6.

ここで、第２図の（Ｃ）と（ｄ）を破線で結んで示すよ
うに、各コードフラグＪは２バイトの文字コードに対応
し、コードフラグＥは連続するｌハイドの文字コード２
個の対に対応するようにする。Here, as shown by connecting (C) and (d) in FIG. 2 with a broken line, each code flag J corresponds to a 2-byte character code, and the code flag E corresponds to the character code 2 of consecutive
correspond to pairs of individuals.

この例は、前記の説明と同様に第２図（ａ）の文字列に
対応する文字列データの例であり、図から明らかなよう
に、（Ｃ）の文字コード列の各コードと、（ａ）の文字
列の各文字とは桁位置が一致する。This example is an example of character string data corresponding to the character string in FIG. The digit position matches each character of the character string in a).

第１図の出力手段２は、例えば第２図（Ｃ）と（ｄ）か
らなる文字列データについて、（ｄ）のコードフラグ列
を走査しながら、（Ｃ）の文字コード列の対応する文字
コードを出力し、コードフラグ列のコードフラグが変化
した場合には、変化の前後に対応する文字コードの間に
所定の制御コード（図の例では、Ｊ−＋Ｅの場合に制御
コード「２９」、Ｅ→Ｊの場合に制御卸コードｒ２８Ｊ
）を出力する。このように処理することにより、第２図
（Ｃ）と（ｄ）の文字列データから、（ｂ）のようなデ
ータが出力される。For example, the output means 2 in FIG. 1 scans the code flag string in (d) for the character string data consisting of (C) and (d) in FIG. When a code is output and the code flag in the code flag string changes, a predetermined control code (in the example shown in the figure, control code "29" for J-+E) is inserted between the corresponding character codes before and after the change. , in the case of E→J, control wholesale code r28J
) is output. By processing in this manner, data as shown in FIG. 2(b) is output from the character string data of FIG. 2(C) and (d).

第３図は出力手段の処理の流れの一例を示す図であり、
先ず処理ステップ１０で文字列データのコードフラグ列
の先頭を記憶する等の初期制御情報の設定を行って処理
を開始する。FIG. 3 is a diagram showing an example of the processing flow of the output means,
First, in processing step 10, initial control information such as storing the beginning of a code flag string of character string data is set, and processing is started.

処理ステップ１１でコードフラグ列を先頭がら順次１個
取り出して、記憶しであるコードフラグ（以下において
前コードフラグという）と比較し、一致すれば処理ステ
ップ１２で文字コード列の対応する文字コード２バイト
を出力し、処理ステップ１３で識別して後続のコードフ
ラグがあれば処理ステップ１１に戻る。In processing step 11, one code flag string is taken out one by one from the beginning and compared with the stored code flag (hereinafter referred to as the previous code flag).If they match, in processing step 12 the corresponding character code 2 of the character code string is extracted. The byte is output, identified in process step 13, and returns to process step 11 if there is a subsequent code flag.

処理ステップ１１で、前コードフラグと一致しない場合
には、処理ステップ１４でコードフラグの種類を識別し
、ｒＪ、なら処理ステップ１５で制御コード「２８」を
出力し、ｒＢ、なら処理ステップ１６で制御卸コード「
２９」を出力する。If it does not match the previous code flag in processing step 11, the type of code flag is identified in processing step 14, and if rJ, then control code "28" is output in processing step 15, and if rB, then in processing step 16, the control code "28" is output. Control wholesale code
29" is output.

次に処理ステップ１７で前コードフラグを今調べたコー
ドフラグに更新した後、前記と同様に処理ステップ１２
でコードフラグに対応する文字コード２ハ゛イトを出力
する。Next, in processing step 17, after updating the previous code flag to the code flag just checked, processing step 12
Outputs the 2-byte character code corresponding to the code flag.

なお、出力手段２は、制御コードの種類指定を受けて、
各種の制ｉ１Ｈコードに対応できるように構成してもよ
い。Note that the output means 2 receives the control code type designation, and
It may be configured to be compatible with various restricted i1H codes.

〔Effect of the invention〕

以上の説明から明らかなように本発明によれば、計算機
の文字データの処理において、複数のコード系の文字コ
ードが混在する文字データの処理を容易且つ効率よく行
うことができるという著しい工業的効果がある。As is clear from the above description, according to the present invention, in processing character data of a computer, character data in which character codes of multiple code systems coexist can be easily and efficiently processed, which is a remarkable industrial effect. There is.

[Brief explanation of the drawing]

第１図は本発明の構成を示すブロック図、第２図は文字
データの説明図第３図は本発明の処理の流れ図である。図において、１は入力手段、３は入力データ、５は文字コード列、７は出力データ２は出力手段、４は文字列データ、６はコードフラグ列、１０〜１７は処理ステップFIG. 1 is a block diagram showing the configuration of the present invention, FIG. 2 is an explanatory diagram of character data, and FIG. 3 is a flowchart of the processing of the present invention. In the figure, 1 is input means, 3 is input data, 5 is character code string, 7 is output data 2 is output means, 4 is character string data, 6 is code flag string, 10 to 17 are processing steps

Claims

[Claims] It has an input means (1) and an output means (2), and the input means (1) converts character string data (
4), the character string data (4) is composed of a character code string (5) and a code flag string (6), and the character string data (4) is composed of a character code string (5) and a code flag string (6).
) is a sequence of character codes of a plurality of different predetermined code systems, and the code flag string (6) is a sequence of code flags, and each code flag corresponds to the character code of the character code string (5). Corresponding to each predetermined number of character codes, the code system of the corresponding character code is indicated, and the output means (2) outputs the code flag of the code flag string (6) and the character code string (5). is configured to generate output data (7) by inserting a required control code into each consecutive sequence of character codes of the same code system in the character code string based on the correspondence with the character code. A character data processing device characterized by: