JPS6254365A

JPS6254365A - Data processing of chinese using voice chinese language and word processing method and apparatus

Info

Publication number: JPS6254365A
Application number: JP61139106A
Authority: JP
Inventors: ビクター・シー・ヤー
Original assignee: Individual
Current assignee: Individual
Priority date: 1985-06-14
Filing date: 1986-06-13
Publication date: 1987-03-10

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】産業上の利用分野本発明は、中国語のデータ・プロセシングおよびワード
・プロセシングの方法とその装置に向けられており、特
に、同音異義語（ｈｏｍｏｎｙｍｓ）および同声調語（
ｈｏｍｏｔｏｎｅｓ）ゆえに生じる曖昧さを避けるため
、定義づけされた音声中国語言語を使用するものである
。DETAILED DESCRIPTION OF THE INVENTION FIELD OF INDUSTRIAL APPLICATION The present invention is directed to a method and apparatus for data processing and word processing of the Chinese language, and in particular for the processing of homonyms and tonal words (
In order to avoid the ambiguity caused by Chinese language homotones, a defined phonetic Chinese language is used.

現代中国語単語は、主として多音節である。伝統的には
、中国語書き言葉の各単語は、１つまたは複数の表意文
字、つまり、ある概念または事物の絵画的表現から成る
。各表意文字は、単音節で発音される。しかし、単音節
の単語使用だけでは、話し言葉として不十分である。と
いうのは、中国語には多数の同音異義語すなわち、異っ
た形で書かれ、または異った意味をもちながら、同じ音
で発音される単語（この場合表意文字）があるからであ
る。換言すれば、中国語で発音された■つの単音節は、
多数の異った表意文字を表わし、従って多数の異った意
味をもつ。そのため、口頭での伝達に単音節の単語だけ
を使用するのは、実用的ではない。Modern Chinese words are primarily polysyllabic. Traditionally, each word in written Chinese consists of one or more ideograms, ie, pictorial representations of some concept or thing. Each ideogram is pronounced monosyllable. However, the use of monosyllabic words alone is insufficient for spoken language. This is because Chinese has a large number of homonyms, or words that are written differently or have different meanings but are pronounced with the same sound (in this case ideograms). . In other words, the ■ monosyllables pronounced in Chinese are
It represents many different ideograms and therefore has many different meanings. Therefore, it is impractical to use only monosyllabic words for oral communication.

この問題を克服するために、話し言葉は主として多音節
語として発達してきたのである。多数の表意文字か連な
って１つの多音節語を生むにいたり、それかそのような
単語のもつ意味を可成り狭めているのである。この結果
、中国語話し言葉の約８０％は、多音節語（うち７５％
は２音節語）になっている。現代中国語書き言葉にも、
話し言葉にならって、多数の表意文字の組み合わせ、す
なわち多音節語が使われている。To overcome this problem, spoken languages have developed primarily as polysyllabic words. Either a number of ideograms are combined to produce a single polysyllabic word, or the meaning of such a word is narrowed considerably. As a result, approximately 80% of spoken Chinese words are polysyllabic words (75% of which are
is a two-syllable word). In modern written Chinese,
Following spoken language, many ideographic combinations, or polysyllabic words, are used.

現代中国語には約８，０００　の表意文字が使用されて
いる。表意文字の総数は５万を超えるが、その殆どはご
く稀にしか使われず、もはや日常用語ではない。１９８
１年、中国は、６，７６３　の表意文字を当用文字とし
、これらが中国の電気通信システム用に使われることに
なっている。そんなわけで、約８．０００　の基礎表意
文字があれば、最も実用的な中国語はあやつれるのであ
る。Approximately 8,000 ideograms are used in modern Chinese. The total number of ideograms exceeds 50,000, but most of them are used only very rarely and are no longer part of everyday language. 198
In 2017, China adopted 6,763 ideograms as official scripts, which are to be used in China's telecommunications system. Therefore, with about 8,000 basic ideograms, the most practical Chinese language can be written.

中国では、表意文字使用により、強い文化的傾向を享受
し、これが民意の統一にも役立っている。China enjoys a strong cultural tendency through the use of ideograms, which also serves to unify the will of the people.

それゆえ、ワード・プロセシングまたはデータ・プロセ
シングのシステムか、その出力として、中国語表意文字
を生成できるようにすることが必要なのである。しかし
、表意文字を直接の入力媒体として使うためには、キー
ボードを必要とする表意文字の数が多すぎ（約８，００
０）、実用的ではない。また、表意文字はアルファベッ
ト順になっていないため、それを処理して順位づけるの
は非常にむずかしく、やっかいである。表意文字を出力
することは、データ・プロセシングやワード・プロセシ
ングにとって重要ではあるが、このような出力はワード
・プロセシングには十分であっても、データ・プロセシ
ングにはまだ不十分である。Therefore, there is a need for a word processing or data processing system that can produce Chinese ideograms as its output. However, the number of ideograms that require a keyboard is too large (approximately 8,000) to use them as a direct input medium.
0), not practical. Also, since ideograms are not alphabetically ordered, it is very difficult and cumbersome to process and rank them. Although outputting ideograms is important for data and word processing, such output is sufficient for word processing, but still insufficient for data processing.

表意文字はアルファベット化できないので、データ・プ
ロセシング・システムの表意文字出力をアルファベット
順に並べることは不可能である。それゆえ、気の効いた
辞典や電話帳、人名簿、その他のアルファベット化され
た目録作りは容易にできない。そこでこの中国語を分類
し、アルファベット類、または池の方式で配列し、非表
色的な方法で表示する必要かある。Because ideograms cannot be alphabetized, it is not possible to alphabetize the ideogram output of a data processing system. Therefore, it is not easy to create elaborate dictionaries, telephone directories, directories, or other alphabetized catalogues. Therefore, it is necessary to classify this Chinese language, arrange it in alphabetical order or the Ike system, and display it in a non-colorful way.

ｎｆｊ記問題の解決のために、中国政府は中国語表意文
字をアルファベットで表示する方法を開発した。これは
、漢語併音（Ｈａｎｙｕ　Ｐｉｎｙｉｎ）として知られ
、Ｍａｎｄａｒｉｎ（北京語）の発音で表示されている
。北京語には、約４００の異った単音節の音がある。枡
音は、英語アルファベット２５文字（Ｖは使わない）に
より、これら４００の全音を発音的に表示することがで
きる。刈音は、発音のみに基づいて、巧みにこれを成し
とげることができる。In order to solve the NFJ problem, the Chinese government developed a method to display Chinese ideograms in the alphabet. This is known as Hanyu Pinyin and is displayed in Mandarin pronunciation. Mandarin has approximately 400 different monosyllabic sounds. Masuon can represent these 400 whole tones phonetically using 25 letters of the English alphabet (V is not used). Karion is able to skillfully accomplish this based solely on pronunciation.

中国語には、２１の子音と１５の母音があり、（さらに
“ｉ”、“ｕ“、ｄ“の音を他の母音と組み合わせて、
１８の結合母音を作ることができる）この各音は、枡音
を１つまたは複数組み合わせることにより、特異に書き
表わせる。こうして、楢音を入力、出力の双方に用いる
システムにより、ワード・プロセシングの能率および便
宜面が改善されたのである。Chinese has 21 consonants and 15 vowels (in addition, the sounds “i”, “u”, and d can be combined with other vowels,
(18 combined vowels can be created) Each sound can be written uniquely by combining one or more square sounds. Thus, the efficiency and convenience of word processing was improved by a system that used syllables as both input and output.

しかし、表意文字生成出力のための主な障害は、中国語
の重大な数にのぼる表意文字を区分しなければならない
点にある。８，０００の基礎表意文字から成る辞典を想
定しても、中国語の各音節（１表意文字相当）には平均
２０の同音異義語がある（中国語には約４００の異なっ
た音節がある）ので、１つの枡音音節は、平均２０の異
なった表意文字を識別してしまう。場合によっては、１
つの音が１５０らの同音異義語に通じることさえある。However, the main obstacle to ideographic output is the need to segment the significant number of Chinese ideograms. Even assuming a dictionary of 8,000 basic ideograms, each syllable (equivalent to one ideogram) in Chinese has an average of 20 homonyms (there are approximately 400 different syllables in Chinese). ), so one Masu syllable identifies an average of 20 different ideograms. In some cases, 1
One sound can even lead to 150 homophones.

中国語の８０パーセントは多音節であり、多音筒編を作
る表意文字の組み合わせの数は限られているので、コン
ピュータの記憶装置に多音節の梢音辞典をストアしたコ
ンピュータの利用により、この問題の一部は解決できる
。多音節の併音単語が入力されると、対応する限られた
数の組み合わせによる表意文字が識別され、しばしば、
たった１つの表意文字の組み合わせが、特異にも多音節
単語により識別できるのである。しかし、多音筒編辞典
を使うと、単音ｍ１語（表意文字）辞典を使う場合より
も可成り大きな記憶容量を要し、また、併音大力を表意
文字出力に変換するのにも可成りの時間を要する。たと
え多音筒編大辞典をストアしたとしても、中国語にはお
びただしい数の同音異義語がある（２音節語の約４０パ
ーセントに同音異ａ語がある）ため、併音と表意文字と
の間に、独特かつ明確なマツピングをする妨げとなる。80% of the Chinese language is polysyllabic, and the number of combinations of ideograms that create polysyllabic characters is limited, so by using a computer with a polysyllabic dictionary stored in the computer's storage device, this Some of the problems can be solved. When a multisyllabic diphonetic word is input, a limited number of corresponding combinations of ideograms are identified, often
Only one ideographic combination can uniquely be identified by a multisyllabic word. However, using a polyphonic cylinder-edited dictionary requires considerably more storage capacity than using a monophonic M1 word (ideograph) dictionary, and it also takes considerably more time to convert polyphonic words into ideographic output. It takes time. Even if you store a polyphonic encyclopedia, there are a large number of homophones in Chinese (approximately 40% of bisyllabic words have homophones), so it is difficult to distinguish between polyphones and ideograms. In the meantime, it becomes a hindrance to unique and clear mapping.

多くの表意単語は、同じ発音を伴うものがあり、−一そ
れゆえに所定の発音をもつ構音単語の部にマツプされる
のでｍ−活字化された併音にも多数の同音異義語がある
。入力語としてノ井音を利用するシステムは、普通特殊
な綴り方を要したり、）隻数の同音異義語を区別するた
めに、２音節語の語尾に何か１文字を追加したりしなけ
ればならない。Since many ideographic words have the same pronunciation and are therefore mapped to the part of the articulatory word with a given pronunciation, there are also many homonyms in the m-printed articulations. Systems that use the Noi sound as an input word usually require special spelling, or add a letter to the end of a disyllabic word to distinguish between a number of homophones. Must be.

他の発音的変換システムは、操作員が表示された表意文
字の同音異義語や複合語の中から、該当するものを手で
選び出さなければならない。Other phonetic conversion systems require the operator to manually select the appropriate ideographic homonym or compound word.

併音には、さらに中国語の最も根本的な特徴でもある声
Ｆｌ（ｔｏｎｅ）を無視するという重大な障害がある。Furthermore, there is a serious problem with conjugation, which is that it ignores the most fundamental feature of Chinese language, the voice Fl (tone).

併音は、異なった母音または子音の音、すなわち音素（
ｐｈｏｎｅｍｅｓ）を指定する。中国語の各音節には声
調、すなわちアクセントがある。声調には、第１図に示
された通り４つの高低アクセントの型がある。４つの声
調（四声）には、（１）高く始まり高く終わる第一声、
（２）中位から出て高く終わる第二声、（３）中の下位
から出て、一度低めてからさらに上昇する第三声、（４
）高く出して低く消える第四声とがある。Conjunctions are the sounds of different vowels or consonants, i.e. phonemes (
phonemes). Each syllable in Chinese has a tone, or accent. There are four pitch accent types in tone, as shown in Figure 1. The four tones include (1) the first tone that starts and ends high;
(2) The second voice that starts from the middle and ends high; (3) The third voice that starts from the lower middle and then goes low and then rises further; (4)
) There is a fourth tone that is raised high and then disappears low.

音の音節と、それと抱き合わせて使われる声調のことを
以後「声調・音節」と呼ぶことにする。中国語の各表意
文字、従って中国語の各音節は、声調と音節の組み合わ
せによって発音される。The syllable of a sound and the tone used in conjunction with it will be referred to as ``tone/syllable'' from now on. Each Chinese ideogram, and therefore each Chinese syllable, is pronounced by a combination of tone and syllable.

従って、声調に基づいたシステムには、多大な１１１点
がある。音情報を提供するだけでは、表意文字を正しく
発音するに必要な完全情報とはならず、まだ不十分であ
る。さらに、上記で説明した通り、音に基づいたシステ
ムは、所定の中国語音節の同音異義語とも係わりをらた
なければならず、これは決して満足のゆくものではない
が可能ではある。Therefore, there are a whopping 111 points in a tone-based system. Providing phonetic information alone is still insufficient, as it does not provide the complete information needed to correctly pronounce the ideogram. Furthermore, as explained above, a sound-based system must also deal with homonyms of a given Chinese syllable, which is possible, although by no means satisfactory.

しかし、声調に基づいたシステムは、同声調語（ｈｏｍ
ｏｔｏｎｅ−同じ声調と同じ音をもつ音節）のみと係わ
りをもてばよい。同音異義語のレベルよりは、むしろ同
声調語のレベルで解決することにより、所定の声調・音
節で表示される複数の表意文字のために生ずる平均曖昧
度は可成り減少する。減少の度合いは、約３倍である（
中国語では、可能な全声調・音節巾約４分の３のみが使
われている）。However, systems based on tone do not use homotonal words (hom
otone (syllables with the same tone and sound). By resolving at the level of tones rather than the level of homonyms, the average ambiguity resulting from multiple ideograms expressed in a given tone/syllable is significantly reduced. The degree of decrease is approximately three times (
In Chinese, only about three-quarters of all possible tones and syllable widths are used).

同音異義語の問題を認識し、いくつかの先行技術に関す
る出版物は、求める特定の表意文字を識別するために、
ある意味をもつ記号を各膚音音節に追加することを示唆
している。併音アルファベットには２５文字あるので、
２５のうちいずれか１文字を追加することにより（また
、場合によっては１文字ら追加せずに）２６の表意文字
を識別することができる。が、このシステムは、追加さ
れた文字が、表示されるべき特定の表意文字と合理的に
関連づけられてはおらず、どの文字をどの表意文字に対
応させるかを記憶するのが不可能ではないにしても難し
く、十分に使用されてはいない。Recognizing the problem of homonyms, several prior art publications have attempted to identify the specific ideogram sought.
It suggests adding a symbol with a certain meaning to each dermatosyllabic syllable. There are 25 letters in the diaphragm alphabet, so
The 26 ideograms can be identified by adding any one of the 25 characters (or, in some cases, without adding any character at all). However, this system does not reasonably associate the added characters with the specific ideograms that should be displayed, and it is difficult if not impossible to remember which characters correspond to which ideograms. However, it is difficult and underused.

音に基づく言葉の欠陥は、１９２８年、Ｙ、Ｒ。Sound-based language deficiencies, 1928, Y, R.

Ｃｈａｏ氏も認めており、氏はローマ字使用の発音的シ
ステムを提案している。このシステムでは、音節の声調
を表わずため、各音ごとの音節に声調表示記号を挿入し
て使用した。このシステムの主な問題点は、おびただし
い声調表示記号のため、生成語の有意義なアルファベッ
ト類のリスト作りを妨げていることである。また、読み
取りも難しく、発音された単語と各表意文字との間の特
異な識別もできない。This is acknowledged by Chao, who proposes a phonetic system using the Roman alphabet. This system does not represent the tone of a syllable, so a tone indicating symbol is inserted into each syllable. The main problem with this system is that the large number of tone markings prevents the creation of a meaningful alphabetical list of generated words. It is also difficult to read, and it is difficult to differentiate between the pronounced word and each ideogram.

上記を要約すると、併音は２つの点で不備である。すな
わち、（１）声調を考慮に入れていないこと、（２）同
音異義語の区別ができないことである。To summarize the above, Junon is flawed in two ways. That is, (1) it does not take tone into account, and (2) it cannot distinguish between homophones.

声調および表意記号を含む才井音または他の先行技術シ
ステムを修正することにより、これらの問題はある程度
軽減することはできるが、一方では、言葉のもつアルフ
ァベット性を破壊し、適切な辞典その他の目録作りが非
常に困難になるという独自の問題点をもつ。また、先行
技術によって提案された中国語の修正に関するもう１つ
の問題点は、ある特定の表意文字を識別するのに必要な
記号の数が可成り増え、そのため、言葉の読み取りお上
び学習を非常に困難にしている。Modifying Saii-on or other prior art systems that include tones and ideograms can alleviate these problems to some extent, but it also destroys the alphabetic nature of words and makes it difficult to find a suitable dictionary or other system. It has its own problems that make cataloging very difficult. Another problem with the Chinese modifications proposed by the prior art is that the number of symbols needed to identify a particular ideogram increases considerably, thus making it difficult to read and learn the language. making it very difficult.

実用的なアルファベット方式では、中国語の各単語（１
つまたは複数の表意文字から成る）は、文字を列状に連
鎖してタイプしなければならない。The practical alphabet system uses each Chinese word (1
(consisting of one or more ideographs) must be typed as a chain of characters.

単語と単語との間にはスペースをおく。先行技術のシス
テムにおいては、多音節単語を個々の構成文字に分割す
る方法がないので、多音節辞典をストアさせなければな
らず、そのためデータ・プロセシングまたはワード・プ
ロセシングの記憶容量や処理時間が増すことになった。Leave a space between words. In prior art systems, there is no way to split multisyllabic words into their individual constituent characters, so multisyllabic dictionaries must be stored, which increases data or word processing storage capacity and processing time. is what happened.

たとえ多音筒編を各構成音節に分割する手段が与えられ
たとしても、先行技術のアルファベット方式では表意文
字の発音記号と個々の中国語表意文字との間に１対１の
対応ができなかった。従って、アルファベットによる表
現では、操作員がさらに手で区分けしなければならない
ような複数の表意文字をしばしば同一視してしまう。Even if a means were provided to divide the polyphonic tube into its constituent syllables, the prior art alphabetic systems did not allow for a one-to-one correspondence between the phonetic symbols of the ideograms and the individual Chinese ideograms. Ta. Therefore, alphabetic representations often equate multiple ideograms, which the operator must further distinguish manually.

本発明の詳細な説明本発明は、音声中国語アルファベット（ＰＣＡ）を使っ
た音声中国語言語（ＰＣＬ）を利用するものである。こ
こで中国語アルファベットは、音声中国語単語（ＰＣＷ
）を形成するもので、各音声中国語単語はそれぞれ１つ
の表意文字に対応する。さらにこの音声中国語単語は、
連鎖状になっていて、多音節の音声中国語単語（Ｐ　Ｐ
　ＣＷ）を形成する。DETAILED DESCRIPTION OF THE INVENTION The present invention utilizes the Phonetic Chinese Language (PCL) using the Phonetic Chinese Alphabet (PCA). Here, the Chinese alphabet is phonetic Chinese word (PCW)
), where each phonetic Chinese word corresponds to one ideogram. Furthermore, this phonetic Chinese word is
Chained, multisyllabic phonetic Chinese words (P P
CW).

各ｐｐｃｗは、複数の表意文字から成る中国語の多音節
の複合語に対応する。本発明の音声中国語言語は、以下
のような特徴をもつ。Each ppcw corresponds to a Chinese polysyllabic compound word consisting of multiple ideograms. The audio Chinese language of the present invention has the following characteristics.

■１本言語は、声調に忠実に基づいたアルファベットを
利用するもので、複数の（アルファベットの）文字の独
立した１組が中国語言語（Ｍａｎｄａｒｉｎ）の全音節
を発音するための全ての音声、声調情報を提供する。■One language utilizes an alphabet based faithfully on tones, in which an independent set of letters (of the alphabet) produces all the sounds for pronouncing all the syllables of the Mandarin language, Provides tonal information.

２、本言語は、優勢語根原則（ｄｏｍｉｎａｎｔ　−ｒ
ｏｏｔｐｒｉｎｃｉｐｌｅ）または意義的類別原則（ｓ
ｅｍａｎｔｉｃｃｌａｓｓｉｒｉｅｒ　ｐｒｉｎｃｉｐ
ｌｅ）を利用するもので、これによってＰＰＣＷが、独
特かつ明確に１つの表意文字を識別するように、ＰＣＷ
と中国語表意文字とをまぎれもなくｌ対ｌで対応させる
ため、あるＰＣＷに追加すべき言語を選択する。2. This language uses the dominant root principle (dominant -r).
ootprinciple) or the semantic classification principle (s
emantic classier principal
le), which allows the PPCW to uniquely and unambiguously identify one ideogram.
Select a language to be added to a certain PCW in order to have an unmistakable 1-to-1 correspondence between

３、本言語は、連結して多音節の複合語となる不可分な
連鎖状のＰ　ＣＬ文字から成る多音節の音声中国語単語
（ＰＣＷ）を、個々の（表意文字に対応する）ｐｃｗに
自動的に分割する分離論理の使用を可能にする。3. The language automatically converts polysyllabic phonetic Chinese words (PCWs), which consist of an indivisible chain of PCL characters that can be concatenated into polysyllabic compound words, into individual (ideograph-corresponding) PCWs. Enables the use of separation logic that divides the

Ｐ　ＣＬにおいては、各音節を４つの異なった声調で表
わすため、１つの音節は４通りに書き出される。本アル
ファベットの声調上の性格ゆえに、本言語は読み取りが
非常に楽で、音のみに晧づいた方式に比して、解答度は
自動的に３倍も増す。In PCL, each syllable is expressed in four different tones, so one syllable is written out four ways. Due to the tonal nature of the alphabet, this language is very easy to read, and the accuracy is automatically increased by three times compared to a system based only on sounds.

ＰＣＬ入力を受は取るデータ・プロセッサまたはワード
・プロセッサは、先行技術で２０以上の同音異義語（表
意文字が８，０００と仮定して）を扱うのに対して、当
方式では平均６つの同声調語をＩ及うだ１すである。A data processor or word processor that accepts PCL input handles an average of 6 homophones (assuming 8,000 ideograms), compared to 20 or more in the prior art (assuming 8,000 ideograms). The tone words are I and 1.

声調に基づいたアルファベットは、音に基づいたアルフ
ァベットよりら解答度が３倍も高く、後述するＰＣＡの
特徴により、多数の同音異義語がある場合でさえ、ＰＣ
Ｗと中国語表意文字とを１対ｌで対応させることができ
る。さらに追って詳述するが、本発明のＰＣＬは、２５
５の同声調語（２２５ｘ４＝ｌ、０２０の同音異義語に
相当する）を、　合音の“ｉ″、“ｕ″、“ｄ′の母音
のみを含む声調・音節に、１７０の同声調語（６８０の
同音異義語に相当）を語尾が１井合音の“ｌｏ、“ｕ”
、または“ら”で終わる声調・音節に、また、８５の同
声調語（３４０の同音異義語に相当）をその他全ての声
調・音節に区分することができる。ＰＣＷ（所定の声調
・音節の発音に必要な全ての音情報および声調情報を併
せもつ）と表意文字とのＩ対Ｉの対応は、先行技術では
不可能であった。Alphabets based on tones have three times higher response rates than alphabets based on sounds, and due to the characteristics of PCA described below, even when there are many homonyms, PC
It is possible to have a one-to-l correspondence between W and Chinese ideograms. As will be described in detail later, the PCL of the present invention is 25
5 homophones (225x4=l, corresponding to 020 homophones) are converted into tones and syllables containing only the vowels "i", "u", and "d' of the conjunction, and 170 homophones. (equivalent to 680 homonyms), the ending of the word is “lo”, “u” with a single sound
, or into tones and syllables ending in "ra", and 85 homophone words (equivalent to 340 homonyms) can be divided into all other tones and syllables. An I-to-I correspondence between PCW (combining all the phonetic and tone information necessary for the pronunciation of a given tone/syllable) and ideograms was not possible in the prior art.

本発明の重要な利点は、多音節語辞典を予めストアする
ことなしに、コンピュータのプログラムが連鎖したＰＰ
ＣＷを個々のＰＣＷに分割するような方法で、音声中国
語アルファベットから不可分に連鎖した多音節の音声中
国語単語を書き出す能力である。本発明のこの部分は非
常に大切である。この特徴ゆえに、ＰＣＷと表意文字を
Ｉ対１で対応させて組み合わせるので、コンピュータの
記憶装置に多音節語辞典をストアする必要はない。An important advantage of the present invention is that a computer program can be used to create a chained PP without pre-storing a polysyllabic dictionary.
It is the ability to write out inseparably chained multisyllabic spoken Chinese words from the spoken Chinese alphabet in such a way that the CW is divided into individual PCWs. This part of the invention is very important. Because of this feature, it is not necessary to store a polysyllabic word dictionary in the computer's storage device, since PCWs and ideograms are combined in an I-to-one correspondence.

むしろ、全てのＰＰＣＷは、連鎖したＰＣＬ文字として
入力し、その後でＰＰＣＷを個々のＰＣｗに分割する分
離法にゆだねる。コンピュータは、次に各ＰＣＷをその
対応する表意文字に変換する単音頭語辞典に照会する。Rather, all PPCWs are entered as chained PCL characters and then submitted to a separation method that splits the PPCW into individual PCws. The computer then queries a phonetic initial dictionary which converts each PCW to its corresponding ideogram.

これにより、本発明を利用したデータ・プロセシングま
たはワード・プロセシングの記憶容量および処理時間は
可成り削減される。This significantly reduces storage capacity and processing time for data or word processing using the present invention.

分離論理の使用と、ＰＣＷと表意文字の独特な１対１の
対応によるもう１つの重要な効果は、先行技術のシステ
ムでは不可能な方法で、ストアされたＰＣＷからアルフ
ァグラミックなリスト（ＰＧＬ）が自動的に得られるこ
とである。アルファグラミック・リストとは、ＰＣＷは
普通アルファベット順に並べられるが、このアルファベ
ット類では分離されてしまう同声調語と、同じ表意文字
の頭文字で始まる多音節の複合語とを一括することを保
証するものである。純粋にアルファベット類のＰＣＬリ
ストは、ある単語には在り、他の単語には存在しない意
義的類別記号により、同じ表意文字の頭文字で始まる単
語または熟語をばらばらに切り離してしまう恐れがある
。アルファグラミック・リストは、この可能性を回避し
、同じ表意文字の頭文字をもつ単語はひとまとめにでき
るしのである。ＰＧＬに関する詳細は後述する。Another important effect of the use of separation logic and the unique one-to-one correspondence between PCWs and ideograms is that alphagrammatic lists (PGLs) can be extracted from stored PCWs in a way not possible with prior art systems. ) is automatically obtained. The alphagrammic list guarantees that the PCW is usually arranged in alphabetical order, but homotonous words that are separated in this alphabet group and multisyllabic compound words that start with the initial letter of the same ideogram are grouped together. It is something to do. A purely alphabetical PCL list may separate words or phrases that begin with the same ideographic initial letter, due to the semantic classifiers present in some words and absent in others. Alphagrammatic lists avoid this possibility and allow words with the same ideographic initial letter to be grouped together. Details regarding PGL will be described later.

ＰＣＬの声調に基づく性格、さらに後述する優勢語根お
よび意義的類別記号ゆえに、ＰＣＬは５万もの表意文字
を特異に識別することができる。Because of the tonal nature of PCL, as well as the dominant roots and semantic classifiers discussed below, PCL can uniquely identify as many as 50,000 ideograms.

主要の表意文字セラ）８，０００のうち、定義づけされ
た優勢語根原則に従った各ＰＣＷ″詣根”の綴り中のた
った３種類の変化を使うことにより、約３，９００字が
識別できるのである。これは、中国語で使用されている
言葉の約９７パーセントに当たる。主要セットの残りの
表意文字のうち、表意文字の基本とらなるべき中国語の
語根と類似または同一の意義的類別記号を使って、８０
パーセントは識別できる。従って、ＰＣＬは、簡潔かつ
非常に読み取りやすい。中国語の他の全ての表意文字も
また、１つの意義的類別記号を使って識別することがで
きる。それで、ＰＣＬは、全ての中国語表意文字を特異
に識別することができる。Of the 8,000 major ideograms (sera), approximately 3,900 characters can be identified by using only three variations in the spelling of each PCW "Mai root" according to defined dominant root principles. It is. This accounts for approximately 97% of the words used in Chinese. Of the remaining ideograms in the main set, 80 are used with similar or identical semantic classification symbols to the Chinese root on which the ideograms are based.
Percentages are discernible. Therefore, PCL is concise and very easy to read. All other ideograms of Chinese can also be identified using one semantic classification symbol. So PCL can uniquely identify all Chinese ideograms.

そんなわけで、ＰＣＬは、併音を用いた場合に必要な最
高７文字（８文字ともいえる）、使用頻度平均４文字に
比して、最高４文字、使用頻度平均２．４文字の表意文
字を使うだけである。中国語の表意文字または表意文字
の一部に類似した音声中国語アルファベットを選択する
ことにより、ＰＣＡ文字（意義的類似記号として使われ
た時にでさえ）は、中国語表意文字を知っている人には
誰でも容易に理解され得る。この技術は、意義的類別記
号が伝統的な中国語の中でも、基本的な表意文字の形を
とる表意文字の語根を直接識別するのに非常に有利であ
る。For this reason, PCL is an ideogram with a maximum of 4 characters and an average frequency of 2.4 characters, compared to the maximum of 7 characters (sometimes 8 characters) and the average frequency of use of 4 characters required when using diphones. Just use . By selecting phonetic Chinese alphabets that are similar to Chinese ideograms or parts of ideograms, PCA characters (even when used as semantically similar symbols) can be used by people who know Chinese ideograms. can be easily understood by anyone. This technique is very advantageous for directly identifying the roots of ideograms, which take the form of basic ideograms even in Chinese, where semantic classifiers are traditional.

また、ＰＣＬがビデオのディスプレイやプリントアウト
に対応する表意文字とテキスト中隣り合わせになってい
たり、隔行に並置されている場合、各表意文字は対応す
るＰＣＷに関連づけることにより容易に読み取れる。こ
れにより、表意文字とその発音方法が簡潔に提示され、
ＰＣＬが表意的な中国の言語を教えるために理想的な道
具となる。Additionally, when a PCL is juxtaposed in text with a corresponding ideogram on a video display or printout, or juxtaposed on alternate lines, each ideogram can be easily read by associating it with its corresponding PCW. This provides a concise presentation of ideograms and how to pronounce them,
PCL becomes an ideal tool for teaching ideographic Chinese language.

また、ＰＣＬは、中国語のコンピュータ処理に必要なハ
ードウェアおよびソフトウェアを簡素化する。「情報交
換用中国語象形文字セットの符号−主要セット」として
指定された上記標準中国語は、各中国語表意文字に対し
２−バイトのディジクル・コードを使う。これに似ては
いるが、「万国共通中国語表意文字標準符号」と弥する
１３．０５３の表意文字用の大型セットが、１９８６年
３月台湾によって公表された。これも各表意文字に２−
バイトのコードを使う。PCL also simplifies the hardware and software required for Chinese language computer processing. The standard Chinese language designated as "Chinese Hieroglyphic Character Set Codes for Information Interchange - Main Set" uses a 2-byte dicicle code for each Chinese ideogram. Although similar to this, a large set of 13.053 ideograms called the ``Universal Chinese Ideographic Standard Code'' was published by Taiwan in March 1986. This also has 2-
Use byte code.

ここに説明されている音声中国語言語において、全８５
文字の音声中国語アルファベットをフード化するには、
たった７−バイトのコードが要るだけである。この７−
バイトのコードは、ここでは「情報交換用米国標準符号
Ｊ（ＣＳＣ１１）と呼ばれ、第１３図に例示されている
。これは、ＡＳＣＩＩ（情報交換用米国標準符号）に似
ており、両方とも重要な７つのビットを使う。しかし、
第１３図にも示されている通り、ＡＳＣＩＩは０−１２
７（Ｏ０）Ｉ−７１ｌ−１）のレンジを使うのに対し、
本Ｃ９Ｃ１１は、句読点を含めても１２９−２２２（８
１Ｈ−ＤＨＥ）レンジしか使わない。いずれにしてもＣ
５Ｃ１１は、先行の“ビビットを加えると、ＡＳＣＩＩ
に似ているので、コンピュータに採択され易いアルファ
ベット表示装置とディジタル・コード体系をともに使用
する英語／中国語２か国語の情報交換に使用するのには
大変便利である。In the phonetic Chinese language described here, a total of 85
To hood the phonetic Chinese alphabet of characters,
Only a 7-byte code is required. This 7-
The code of the byte is referred to herein as the American Standard Code for Information Interchange J (CSC11) and is illustrated in Figure 13. It is similar to ASCII (American Standard Code for Information Interchange), both of which are Use 7 important bits. However,
As shown in Figure 13, ASCII is 0-12
7(O0)I-71l-1),
Book C9C11 is 129-222 (8
1H-DHE) Use only the microwave. In any case C
5C11 is ASCII by adding the leading “bibit”
, it is very convenient to use for bilingual English/Chinese information exchange that uses both an alphabetic display device and a digital code system, which are easily adopted by computers.

このようにして、ＰＣＬは、表意文字の絵画的コーディ
ングの必要性を回避し、むしろ各表意文字が、独特な７
−ビットのＰＣＡ文字符号の組み合わせとしてコードさ
れるＰＣＷとして表意文字を声調で綴ることによって表
現される。従って、各表意文字は、４文字以下−一しば
しば使用頻度平均２．４文字−一の、標準７−ビットの
ＰＣＡ文字符号の組み合わせとしてコードされ、これに
より、中国語テキストをコンピュータ処理するために必
要なハードウェアおよびソフトウェアを可成り簡素化す
ることとなる。In this way, PCL avoids the need for pictorial coding of ideograms, but rather each ideogram has a unique seven
- Represented by tonal spelling of the ideogram as PCW coded as a combination of PCA character codes of bits. Therefore, each ideogram is coded as a combination of no more than four characters - often with an average usage frequency of 2.4 characters - of the standard 7-bit PCA character code, which allows for computer processing of Chinese text. This significantly simplifies the required hardware and software.

面記の特徴ゆえに、本発明は、英語のプロセシングと同
じ技術をＰＣＬに当てはめて、データを全く自由に単語
処理および情報処理すると同時に、中国語表意文字を明
確に出力し、アルファグラミック・リストの作成をも可
能にする。Because of the characteristics of face writing, the present invention applies the same technology as English processing to PCL to completely freely process words and information on data, while clearly outputting Chinese ideograms and creating an alphagramic list. It also makes it possible to create

実施例Ａ、音声中国語言語本発明は、第２図に例示された声調を基調としたアルフ
ァベットに基づくものである。このアルファベットは、
発明者が現時点で好ましいと思うものの実施様態の表示
であるが、同じまたは根本的には同じ声調情報および音
情報をもつ他の表示記号も使い得る。いかなる特殊な表
示記号が使われようとも、同じ音ながら異なった声調の
母音を表示するのに使われる記号は、明確かつ互いに関
連のあるものを使うのが好ましい。Embodiment A, Spoken Chinese Language The present invention is based on the tone-based alphabet illustrated in FIG. This alphabet is
Although the designation is of an embodiment that the inventors currently believe is preferred, other designations having the same or fundamentally the same tonal and tonal information may also be used. No matter what special symbols are used, it is preferable that the symbols used to represent vowels of the same sound but different tones be clear and related to each other.

また、第１３図に示されている通り、たった７つの重要
なビットをもつディジタル・コードの組み合わせとして
コード化することができる。これにより、ハードウェア
およびソフトウェアの必要条件は、先行技術より実質的
に簡素化することができる。It can also be encoded as a combination of digital codes with only seven significant bits, as shown in FIG. This allows the hardware and software requirements to be substantially simplified over the prior art.

第２図に示した通り、出願考の音声中国語アルファベ−
／　）には２５の子音と６０の母音声調（母音声調とは
、母音の音とそれが発音される時の特別な声調とを兼ね
表わす文字である）、合計８５文字がある。各文字には
、データ・プロセンノブ用に即使用できるような通し番
号が指定されている。第２図では、ＰＣＡ文字Ｉこ相当
する折合音（らしあれば）をＰＣＡ文字の下に付記した
。母音は、ウムラウトを含む“ｕ”と”６”音で発音さ
れる表意文字の区別をしないため、その表意文字をどう
発音すべきかについて、混乱を招き得る。しかし、ＰＣ
ｌ３においては、その読解性を増すために、明確な区別
がされている。母音文字には声調情報が含まれていない
ので、ＰＣＡの母音声調に相当する音文字を、第一声（
第１図参照）で発音される母音声調の下だけに付記した
。同音ながら声調の違うものは、第２図の各欄の関連し
た各母音声調に利用される。従って、２３−２６欄の各
母音声調は“ａ”の音をもつ。以後、音声中国語アルフ
ァベットは、母音文字や指定番号と互換関係で呼ばれ、
実際のＰＣＡ文字とさえ互換関係で呼ばれる。As shown in Figure 2, the phonetic Chinese alphabet of the application proposal is
/ ) has 85 characters, 25 consonants and 60 vowel tones (a vowel tone is a letter that represents both the sound of a vowel and the special tone in which it is pronounced). Each letter is designated with a serial number for ready-to-use data processing knobs. In Fig. 2, the consonant sound corresponding to the PCA letter I has been added below the PCA letter. Vowels do not differentiate between ideograms pronounced with the "u" and "6" sounds, including the umlaut, which can lead to confusion as to how the ideogram should be pronounced. However, P.C.
In 13, clear distinctions are made to improve readability. Since vowel letters do not contain tone information, the phonetic letters corresponding to the vowel tone of PCA are changed to the first tone (
(See Figure 1) is added only below the vowel tone pronounced. The same sounds but different tones are used for each related vowel tone in each column of Figure 2. Therefore, each vowel tone in columns 23-26 has an "a" sound. From then on, the phonetic Chinese alphabet was referred to interchangeably with vowel letters and designated numbers.
Even the actual PCA characters are called interchangeably.

中国語には、２１の子音音と■５の母音音がある。２１
の子音音は、短子音と長子音の音に区分され、それぞれ
対応する列にリストされている。In Chinese, there are 21 consonant sounds and 5 vowel sounds. 21
The consonant sounds are divided into short consonant sounds and long consonant sounds, each listed in its corresponding column.

各長子合音は、その中に組み込まれた種々の基本的な母
音音を１つもっている。ある表意文字は長子音の音に対
応するので、それに対応するＰＣＷに含まれている声調
表示記号をもたなければならない。これは、２７〜３０
欄または７９〜８２１Ｉｌ［ｌの母音声調の１つを加え
ることによってなされ、この場合、■声調のみ追加する
だけで、これ自体が母音音になることはない。一方、短
子音には母音音は含まれておらず、母音音と起用される
声調を兼ね表わすＰＣＷの母音声調を伴わなければなら
ない。Each major consonant has one of the various basic vowel sounds incorporated within it. Since an ideogram corresponds to a long consonant sound, it must have a corresponding tone marking symbol included in the PCW. This is 27-30
This is done by adding one of the vowel tones in column or 79 to 821Il [l. In this case, only the ■ tone is added, and this itself does not become a vowel sound. On the other hand, a short consonant does not include a vowel sound, and must be accompanied by a PCW vowel tone, which represents both the vowel sound and the tone used.

１〜２ＮＩＩｆｌの伝統的な２Ｉの子音に加えて、ＰＣ
Ａにはさらにゼロ子音２２と半子音８３〜８５が含まれ
ている。ゼロ子音２２（０の記号で表わされている）は
無声で、下記に説明する特別の場合の多音頭語を個々の
音節に分割するための音節の区切り記号として使われる
。In addition to the traditional 2I consonant of 1-2NIIfl, PC
A further includes a zero consonant 22 and semiconsonants 83 to 85. The zero consonant 22 (represented by the symbol 0) is unvoiced and is used as a syllable delimiter to divide polyphonic initials into individual syllables in the special case described below.

半子音Ｓ３．８４．８５は、母音音と共に発音されるが
、声調をもっていないので子音として作用する。むしろ
、ＰＣＷの中に声調を追加しなければならない。半子音
８３．８４．８５は、母音声調２７−３０．３９−４２
．４７−５０とそれぞれ同一であり、後者の各母音声調
は、それに関連した半子音につけて声調を出すことがで
きよう。The semi-consonant S3.84.85 is pronounced with a vowel sound, but since it has no tone, it acts as a consonant. Rather, tones must be added within the PCW. The semi-consonant 83.84.85 is the vowel tone 27-30.39-42
．． 47-50, respectively, and each vowel tone of the latter could be toned along with its associated semiconsonant.

これにより、Ｐ　ＣＬには可成りの柔軟性がつき、多数
の同声調語の解答が可能となる。６っと重要なこととし
て、８３．８４または８５と、１つの母音を組み合わせ
ると、１８の掛合結合母音ができることである。“１”
、“ｕ”、“Ｖ“のそれぞれ２組を含めると（８３−８
５対２７−３０．３９−４２．４７−５０）、分離論理
を終局的に可能とする重要な基礎が得られる。This gives PCL considerable flexibility and allows it to answer a large number of tonal words. 6. Most importantly, when you combine 83.84 or 85 with one vowel, you get 18 conjunctive vowels. “1”
, “u” and “V”, (83-8
5 vs. 27-30.39-42.47-50), an important basis is obtained that will ultimately enable separation logic.

中国語には１５の母音があり、そのそれぞれが第１図に
例示されているような四辺の１つをもっているので、結
果的には６０の母音声調かあることになる。ＰＣＬでは
、それぞれの母音は４つの関連した母音声調に分割され
、その個々が同じ音でありながら別の声調をもつ。Since there are 15 vowels in Chinese, each with one of the four sides illustrated in Figure 1, there are 60 vowel tones. In PCL, each vowel is divided into four related vowel tones, each of which has the same sound but a different tone.

例として、母音声：Ａ２３−２６の全てが同じ“ａ″の
音をらちながら、表示のような（第１図第−〜四声に対
応する）第１声から第四声までの声調をもつ。母音声調
族の各文字は、同じ基本文字でありながら、第二、第三
、第四声を識別するために、基本文字内のどこかに１本
の線を入れることにより区別されている。例えば、２３
〜２６１ＩＩＩＩの母音声調族を特に参照してみると、
第二声を識別するために、基本文字の下に１本の線を入
れる、第三声を識別するために基本文字の上に１本の線
を入れる、そして第四声を識別するために、基本文字の
上から４分の１くらいのところに１本の線を引く。表示
された個々の母音声調族に類似した区別がなされる。As an example, vowel sounds: A23-26 all sound the same "a" sound, and the tones from the first to the fourth tones (corresponding to the four tones in Figure 1) are expressed. Motsu. Although each letter of the vowel tone family is the same basic letter, it is differentiated by inserting a line somewhere within the basic letter to identify the second, third, and fourth tones. For example, 23
With particular reference to the vowel tone family of ~261III,
To identify the second tone, place a line below the base letter; to identify the third voice, place a line above the base letter; and to identify the fourth voice. , draw a line about a quarter from the top of the basic character. A similar distinction is made for the individual vowel tone families displayed.

母音声Ｑ２７−３０は２つの目的をかなえている。短子
音につけば“ｉ”と発音し、音情報と声調情報の両方を
包含している。長子音または半子音８３につけば、無声
母音として作用し、声調だけになる。これは、第２図で
はダッシュ（−）で表示されている。後者の場合、無声
母音の音は、長子音または半子音の中に特育に包含され
ている。Vowel Q27-30 serves two purposes. The short consonant is pronounced as "i" and contains both sound information and tone information. If attached to a long consonant or a semiconsonant 83, it acts as a voiceless vowel and becomes only a tone. This is indicated by a dash (-) in FIG. In the latter case, the voiceless vowel sound is subsumed specifically within the long or semiconsonant.

母音声調３５−３８らまた、２重の目的をかなえている
。短子音１−４、または半子音８３−８５につくと、°
０”と発音される。その他残りの文字につくと、“ｅ”
と発音される。この重複使用は、中国語の“ｂｏ、“ｐ
′、“ｍ”、　”ｆ”、　”ｙ”、　”ｗ”、　”ｙｕ
’のあとに“ｅ”音がつく場合、中国語には声調・音節
がなく、また、“０”音が残りの子合音につく場合、中
国語には声調・音節がないという事実によってＩ′ＩＩ
能とされている。この母音用ｇ３５−３８の有効使用は
、ＰＣＡで必要とする全文字数のうち４文字減らずこと
になる。Vowel tones 35-38 also serve a dual purpose. When it comes to short consonants 1-4 or half-consonants 83-85, °
0”.The rest of the letters are pronounced “e”.
is pronounced. This redundant use is similar to the Chinese “bo,” p
′, “m”, “f”, “y”, “w”, “yu
If an "e" sound is added after ', there are no tones/syllables in Chinese, and if "0" sounds are added to the remaining consonants, there are no tones/syllables in Chinese. I'II
It is considered to be Noh. This effective use of vowel g35-38 does not reduce 4 characters out of the total number of characters required for PCA.

母音用８７９−８２は３重の目的をかなえている。これ
らの母音声調が独自で、あるいはゼロ子音２２について
いる時はいつでも、“ｅｒ”と発音される。短子音につ
く時はいつでも、“ｉ”と発音される。長子音または半
子音につく時はいつでも、無声音で、声調情報のみを提
供する（母音音は、長子台土たは半子音自体によって提
供されろ）。879-82 for vowels serves a triple purpose. Whenever these vowel tones are unique or accompany a zero consonant 22, they are pronounced "er." Whenever a short consonant is used, it is pronounced as "i." Whenever a long consonant or a semi-consonant is followed, it is voiceless and provides only tonal information (the vowel sound is provided by the long consonant or semi-consonant itself).

中国語の６表α文字は、次の形のうちいずれか１つをと
り得る１つの声調・音節によって定義づけられる。ずな
わち、ｃｖ、ｃｓｖ、ｓｖおよびＶにおいて、Ｃは子音
、Ｓは半子音（母音音をらちながら声調のない文字）、
■は母音声調（母音音と声調を併せもつ文字）である。A Chinese Table 6 Alpha character is defined by a tone/syllable that can take one of the following forms: That is, in cv, csv, sv, and V, C is a consonant, S is a semiconsonant (a letter with a vowel sound but no tone),
■ is a vowel tone (a character that has both a vowel sound and a tone).

第２図に例示された文字を使い、音声中国語アルファベ
ットは中国語の全ての声調・音節（従って全ての表意文
字）を発音するのに必要な音情報と声調情報を提供する
。この必要な情報を得るための文字の組み合わせ方は、
第４Δ−４１図に詳しく解説されているが、これは中国
語に出てくる全ての声調・音節の１）　ＣＬ表現を示す
声調図である。この図では、ＰＣＡの子音は縦に、また
母音声調は横に配列されている。ＰＣＡ文字に対応する
耕合音は、ＰＣＡ文字に指定された番号とともに、ＰＣ
Ａ文字の隣りに示されている。Using the characters illustrated in Figure 2, the Phonetic Chinese Alphabet provides the phonetic and tonal information necessary to pronounce all tones and syllables (and therefore all ideograms) of Chinese. The combination of letters to obtain this necessary information is
As explained in detail in Figure 4Δ-41, this is a tone diagram showing the 1) CL expressions of all tones and syllables that appear in Chinese. In this figure, the consonants of the PCA are arranged vertically, and the vowel tones are arranged horizontally. The Kogo sound corresponding to the PCA character is the PC
It is shown next to the letter A.

第４　Ａ−４Ｄ図は、ｃｖ、ｓｖおよび■の形をとる声
調・音節の全てを例証している。母音声調２７−３０と
７９−８２は、短子音、長子音のいずれにつくかによっ
て発音が異なってくるので、短子音と長子音とを区別す
るため、子音１１と１２との間には太い横線が引いであ
る（上記参照）。Figures 4A-4D illustrate all of the tones/syllables that take the form cv, sv and ■. Vowel tones 27-30 and 79-82 have different pronunciations depending on whether they are short or long consonants, so in order to distinguish between short and long consonants, there is a thick sound between consonants 11 and 12. The horizontal line is drawn (see above).

同様に、第４図にも、子音４と５の間と、ゼロ子音と半
子音８３の間と、母音声調３５−３８欄の下には、どの
子音につくかによって異なった音が母音用ｆＡ３５−３
８に指定されることを示すため、太線が引いである。Similarly, in Figure 4, between consonants 4 and 5, between the zero consonant and semiconsonant 83, and under the vowel tone 35-38 column, there are different sounds for vowels depending on which consonant they are attached to. fA35-3
The thick line is drawn to indicate that it is designated as 8.

ＰＣＡは、約３，０００の声調・音節を表示することが
できる。多数の声調・音節は、ＰＣＡを使って複数の方
法で書き得る。これは、第４’Ａ−４重図に示しである
が、さらに後述する。中国語には、これらの声調・音節
のうちたった１、２９２のみが含まれている。中国語に
使われていない声調・音節は、空白またはダッシュによ
って第４Ａ−４，１図に示されている。PCA can display approximately 3,000 tones/syllables. Many tones/syllables can be written in multiple ways using PCA. This is shown in quadruple diagram 4'A-4 and will be discussed further below. Chinese contains only 1,292 of these tones/syllables. Tones and syllables not used in Chinese are indicated in Figure 4A-4.1 by spaces or dashes.

ＰＣＡは、中国語の１，２９２の全声調・音節を表示は
できるが、標準的な持合は、中国語の４ＩＯ音の音節し
か表示できない。才杵音の全音表は、第３図に提示され
ている。併音と比較して、音声中国語言語による解答度
の増大ぶりは、第３図と第４Ａ−４重図の声調表と音表
を比較すればすぐに分かる。ＰＣＬにより増大された解
答度は、才叶音方式よりも少数の音節使用によって達成
され、それにより４斗音使用の時よりも多くの情報を提
供する一方、音声中国語言語の読解度をも増大する。PCA can display all 1,292 tones and syllables of Chinese, but standard Mochiai can only display syllables of the 4IO tones of Chinese. The complete tone table for Saiki-on is presented in Figure 3. The degree to which the response rate for spoken Chinese language has increased compared to that for parallel sounds can be easily seen by comparing the tone charts and tone charts in Figure 3 and Figure 4A-Quadruple. The increased answerability with PCL is achieved by using fewer syllables than with the Saiguan method, thereby providing more information than when using the 4-Douin method, while improving the reading comprehension of spoken Chinese languages. increase

音声中国語アルファベットを用い、ｃｖ、ｃｓＶ、ＳＶ
および■の可能な形をとる声調・音節を発音するのに必
要な全ての情報を、発音面および声調面から提供するこ
とができる。しかし、表意文字を発音するために必要な
音情報および声調情報は、それ自体で同声調語同志を区
別するのに十分な情報を提供はしない。そのため、必要
なら、同声調語を区別するために、ＰＣＬは声調・音節
に余分の分類記号を追加する。声調・音節に追加される
特別記号は、優勢語根システムまたは意義的類別ンステ
ムによって決定される。Using the phonetic Chinese alphabet, cv, csv, sv
It is possible to provide all the information necessary to pronounce tones and syllables that take the possible forms of and ■ from the aspects of pronunciation and tone. However, the phonetic and tonal information required to pronounce ideograms does not by itself provide sufficient information to distinguish between tonal words. Therefore, if necessary, PCL adds extra classification symbols to tones/syllables to distinguish between tonal words. Special symbols added to tones/syllables are determined by the dominant root system or semantic classification system.

優勢語根システムは、最も一般に存在する３つの同声調
語（実際上の使用頻度に基づいた）を各声調・音節に区
別するために使われる。このシステムに従って、音声中
国語単語（特異な表意文字を識別する）は、声調・音節
（ＴＳ）のみ（同声調語を区別する必要がなければ）か
ら成る第１型、母音をくり返す声調・音節（ＴＳ＋Ｖ）
から成る第２型、ゼロ子音のつく声調・音節（Ｔ　Ｓ　
＋　Ｚ）から成る第３型に書き分けられる。例えば、声
調・音節”ｓｈａ”を書き分ける第１、第２、第３型は
、−μ、（第１型）、浸林＝（第２型）、−ソＪ−（第
３型）である。この簡単なシステムを利用し、各声調・
音節は解答度を３度も増し、各音の音節は、１２（３Ｘ
４）の解答度を得た。第１、第２、第３型で書かれた声
調・音節の組み合わせは、使用頻度からして中国語の約
９７バーセントを表示するに十分である。従って、ＰＣ
Ａは、簡単な優勢語根の原則のみに従って、使用頻度に
基づいた中国語の表意文字の９７バーセントを識別する
。The dominant root system is used to distinguish between the three most commonly occurring tones (based on frequency of use in practice) for each tone/syllable. According to this system, phonetic Chinese words (identifying unique ideograms) are divided into Type 1, consisting only of tone-syllables (TS) (unless there is a need to distinguish between tone words), tone-syllables (TS) that repeat vowels, Syllable (TS+V)
The second type consists of tones and syllables with a zero consonant (T S
It can be divided into the third type consisting of + Z). For example, the first, second, and third types of the tone/syllable "sha" are -μ, (type 1), immersion = (type 2), and -so J- (type 3). . Using this simple system, you can
Syllables increase the answer level by 3 times, each sound has 12 syllables (3X
4) The answer level was obtained. The combinations of tones and syllables written in Types 1, 2, and 3 are sufficient to represent approximately 97 percent of Chinese in terms of frequency of use. Therefore, P.C.
A identifies 97 percent of Chinese ideograms based on frequency of use, following only the simple dominant root principle.

個人が、各声調・音節を見分けるため、３つの最ら頻繁
に起こる同声調語を記憶するのは比較的に易しいので、
これは非常に実用的な入力システムとなる。音声中国語
単語を、（キーホードまたはその他の入力機器に）入力
する人か、どの同声調語か第１、第２、第３型のうち最
も頻繁に起こるかを覚えていなくても、ＰＣＷの適切な
形を憶測し、ディスプレイ・スクリーンに映し出された
対応表意文字を見、らし示された表意文字が求める表意
文字に対応しなければ、入力を変更することは簡単な作
業である。It is relatively easy for individuals to memorize the three most frequently occurring tonal words in order to distinguish each tone/syllable.
This makes it a very practical input system. PCW, whether the person is typing phonetic Chinese words (on a keyboard or other input device) or not remembering which tones occur most often among types 1, 2, and 3. It is a simple task to guess at the appropriate shape, look at the corresponding ideograms shown on the display screen, and change the input if the ideograms shown do not correspond to the desired ideogram.

当用言語の残り３パーセントに当たる中国語の同声調語
は、意義的類別システムを使うことにより区別できる。The remaining 3% of the language in use, Chinese tonal words, can be distinguished using a semantic classification system.

ＰＣＡの各文字は、特別の意味のカテゴリー（例えば、
虫、山、木）を表わす意義的類別記号として、求める同
声調語の論理的標識となる。（これは、音情報および声
調情報の支持標としての使用からも明確である。）１つ
の例外は、母音用Ｎ７９で、これは、さらに後述はする
が、「反転音表意文字」と呼ばれ、中国語の特別な表意
文字を識別する時のみに使われる。意義的類別記号とし
て使われる時は、ＰＣＡ文字が声調・音節の後について
その依味を読者に伝えはするが、音と声調までは伝えな
い。Each letter in PCA has a special meaning category (e.g.
As a semantic classification symbol representing insects, mountains, trees), it serves as a logical marker for the desired tonal word. (This is also evident from its use as a support marker for phonetic and tonal information.) One exception is N79 for vowels, which, as discussed further below, is called the "inverted ideograph." , used only to identify special Chinese ideograms. When used as a semantic classification symbol, the PCA letter follows a tone or syllable to convey its meaning to the reader, but not the sound and tone.

例として、７２．８４．６８欄の３文字は、伝統的な表
意語根である虫（７２）、山（８４）、土（６８）、木
（３）にそれぞれ同一またはほぼ同一である。第９Ａ図
の最上列では、４つの異なったＰＣＷを形成するために
、声調・音節二収−にこれらの文字が追加されている。As an example, the three characters in column 72.84.68 are identical or nearly identical to the traditional ideographic roots insect (72), mountain (84), earth (68), and tree (3), respectively. In the top row of FIG. 9A, these letters have been added to the tone/syllables to form four different PCWs.

関連表意文字（これはほぼ同じ語根を統合する）は、ｐ
ｃｗの下に表示されている。The related ideogram (which unites roughly the same root) is p
It is displayed under cw.

第９Ｂ図は、徴義的類別記号が同声調語の区別に当たっ
てどう使われ得るかを示す別例である。Figure 9B is another example of how symbolic classifiers can be used to distinguish between tonal words.

この図は、ＰＣＷをそれに対応する表意文字とともに左
から右へ、アルファグラミック順に並べた辞典用リスト
である。各表意文字は、“木”の語根に統合され、各Ｐ
ＣＷには似通った文字がある。This figure is a dictionary list in which PCWs and their corresponding ideograms are arranged from left to right in alphagrammatic order. Each ideogram is integrated into the root word “tree” and each P
CW has similar characters.

さらに点線で囲み９ｂと印されたブロックへの４つの入
力は併音では区別できない同声調語であることに注目さ
れたい。Also note that the four inputs to the block surrounded by dotted lines and marked 9b are tonal words that cannot be distinguished by juxtaposition.

優勢語根システムおよび意義的類別システムを組み合わ
せて使い、各声調・音節を８５の同声調語（３５０の同
音異義語に相当する）に区分することができる。これは
、殆どの声調・音節には十分すぎるが、ある声調・音節
には８５もの同声調語がある。これらの声調・音節は、
（１）　’ｉ’　、　’ｕ’　、または°ム°の母音の
みの含まれている声調・音節と、（２）“ｉ′母音で終
わる声調・音節との２種順に分かれる。音声中国語アル
ファベットの独特な特徴を利用することにより、音声中
国語言語は、１７０の同声調語（６８０の同音異義語に
相当する）を’ｉ’ｆＢ音で終わる全ての声調・音節に
、また２５５の同声調語（１，０２０の同音異義語に相
当する）を°ｉ’　、　’ｕ’またはα゛の母合音のみ
を含む声調・音節に解くことができる。これは、以下の
方法でなされる。Using a combination of the dominant root system and the semantic classification system, each tone/syllable can be divided into 85 tones (equivalent to 350 homonyms). This is more than enough for most tones and syllables, but some tones and syllables have as many as 85 tonal words. These tones and syllables are
There are two types of tones and syllables: (1) tones and syllables that contain only 'i', 'u', or °mu° vowels, and (2) tones and syllables that end with the vowel 'i'. Spoken Chinese By taking advantage of the unique features of the alphabet, the spoken Chinese language has 170 homophones (equivalent to 680 homonyms) for all tones and syllables ending in the 'i'fB sound, and 255 Homophones (equivalent to 1,020 homophones) can be resolved into tones/syllables containing only vowels of °i', 'u' or α゛.This is done in the following way. Ru.

第２図に示された通り、ｌ゛は、半子音８３、または母
音声調２７−３０を使って書くことができる。同様にし
て、“ｕ°音は、半子音８４、または母音用８３９−４
２を使って書くことができる。As shown in FIG. 2, l' can be written using the semiconsonant 83 or the vowel tone 27-30. Similarly, the "u° sound is a semiconsonant 84, or a vowel 839-4
It can be written using 2.

最後に°ａ′音は、半子音８５、または母音声調４７＝
５０を使って書くことができる。半子音８３−８５には
声調はないが、上記の母音声調は、同じ音の情報をもつ
半子音につく時の声調を示すために使うことができる。Finally, the °a' sound is a semi-consonant 85, or a vowel tone 47 =
It can be written using 50. Although semiconsonants 83-85 have no tone, the vowel tone described above can be used to indicate the tone when attached to a semiconsonant that has the same sound information.

また、母音声調７９−８２は１、上述された通り、半子
音８３−８５についた時の声調を示すのに使うことがで
きる。これにより、第５Ａ−５Ｃ図に示されている様に
、’ｉ’　、　’ｕ’　、および°ム′の声調・音節の
それぞれは、１２種類の方法で書くことができるように
なる。これらの各図の第１列に、音情報を提供するため
の半子音が使われ、また、同じ音をもつ母音声調が声調
情報を提供するために使われる。第５図の２列目には、
音情報を提供するために半子音が使われ、声調情報を提
供するために無声母音７９−８２が使われる。第５図の
３列目には、音情報および声調情報を提供するために母
音声調のみが使われる。このＰＣＡ特有の能力は、先行
技術システムに比して、ＰＣＬの柔軟性と処理力を可成
り増大させる。Also, the vowel tone 79-82 can be used to indicate the tone when the semiconsonant 83-85 is reached, as described above. This allows each of the 'i', 'u', and 'mu' tones/syllables to be written in 12 different ways, as shown in Figures 5A-5C. In the first column of each of these diagrams, semi-consonants are used to provide tonal information, and vowel tones with the same sound are used to provide tonal information. In the second column of Figure 5,
Semi-consonants are used to provide sound information and voiceless vowels 79-82 are used to provide tonal information. In the third column of FIG. 5, only vowel tones are used to provide phonetic and tonal information. This unique capability of the PCA significantly increases the flexibility and processing power of the PCL compared to prior art systems.

ＰＣＬの°ｉ′音で終わる声調・音節解答についてら、
先行技術システムのものよりは遥かに太きい。これは、
１１音声調２７−３０および７９〜８２が、その後につ
く特別の子音により、全て”１と発音されるという事実
に起因する。母音声１７９−８２が短子音のあとにつく
と、°ｊ゛と発音される。事実、子音Ｔ’　、　’ｇ’
　、　’に’　、“ｈ′、または°ｒ′の後に“ｉ°音
のつくような声調・音節は中国語にはない。従って、母
音声調７９−８２は、子音４．９．１０，１１または１
８の後について使われることはなく、これらの組み合わ
せは同声調語の区別のみに使われる。母音声調７９−８
２が長子音＋　２−２１、または半子音８３−８５の後
につく時はいつでも（それぞれ無声音ながら母音を含ん
でいる）、音は伴なわないが声調・音節の声調を示す無
声母音として作用する。母音声調２７−３０は、短子音
の後につく時はいつでも、°ｉ°と発音される。長子音
の後につく時はいつでも、母合音はらたないが声調・音
節の声調を示し、無声母音として作用する。Regarding the tone and syllable answers that end with the °i′ sound in PCL,
Much thicker than those of prior art systems. this is,
This is due to the fact that the 11 tones 27-30 and 79-82 are all pronounced "1" by the special consonant that follows them. When the vowel 179-82 follows the short consonant, °j゛In fact, the consonants T', 'g'
There are no tones or syllables in Chinese that have an "i°" sound after , 'ni', 'h', or °r'. Therefore, the vowel tone 79-82 is the consonant 4.9.10,11 or 1
It is never used after 8, and these combinations are used only to distinguish between tonal words. vowel tone 79-8
Whenever 2 follows a long consonant + 2-21, or a semi-consonant 83-85 (each of which is voiceless but contains a vowel), it acts as a voiceless vowel that does not make a sound but indicates the tone/tonality of the syllable. . Vowel tones 27-30 are pronounced °i° whenever they follow a short consonant. Whenever it follows a long consonant, it does not have a vowel, but indicates the tone of the syllable and acts as a voiceless vowel.

母音声調２７−３０および７９−８２の上記特徴ゆえに
、音声中国語言語は、“ｉ°音で終わる同声調語を１７
０も書き出すことができ、基調となる声調・音節が、母
音声９２７−３０の中のどれがで終わるものでは８５字
、また基調となる声調・音節が母音声、１１７９−８２
で終わる場合も８５字で、結局ＰＣＬは、この音で終わ
る６８０の同音異義語の区分ができる。Because of the above characteristics of vowel tones 27-30 and 79-82, the phonetic Chinese language has 17 tonal words ending in the i° sound.
0 can also be written, and if the base tone/syllable ends in a vowel 927-30, it will be 85 characters, and the base tone/syllable will be a vowel, 1179-82.
Even if it ends with this sound, it is 85 characters long, and in the end, PCL can classify 680 homonyms that end with this sound.

第１Ｌ図は、多数の同声調語および同音異義語、この場
合では、２４の同音異義語をらっ“ｓｈａ’音と、８６
の同音異義語をもつｓｈｉ’音との背景にある表意文字
の解答法を２例挙げている。Figure 1L shows a large number of tonal words and homophones, in this case 24 homophones, with the ``sha'' sound and 86 homophones.
Two examples are given of how to answer the ideograms behind the shi' sound, which has a homophone.

各列は、所定の声調・音節の全ての同声調語を示す。例
えば、第１列は（右側に１４と印された箇所）、第一声
で発音される声調・音節’ｓｈａ’の１４の同声調語を
示す。各ＰＣＷの下には対応する表意文字がある。最初
の３つのＰＣＷは、優勢語根システムによる第１１第２
、第３型ＰＣＷである。Each column shows all tonal words of a given tone/syllable. For example, the first column (marked 14 on the right) shows 14 tones of the tone/syllable 'sha' pronounced in the first voice. Below each PCW is a corresponding ideogram. The first three PCWs are the 11th, 2nd, and
, the third type PCW.

残り＋１のＰＣＷについては、３番目のＰＣＬが意義類
別記号である。For the remaining +1 PCWs, the third PCL is the significance classification symbol.

第１１図最下部（右側に４０と印されている箇所）を見
ても、第四声で発音される声調・音節″ｓｈｉ’の同声
調語は４０もあることが分かる。最初の３３の同声調語
において、’ｒ’母音声調３０で表わされる。最後の７
つの同声調語においては、“ｉ′母音は母音声調８２で
表わされる。If you look at the bottom of Figure 11 (the part marked 40 on the right), you can see that there are as many as 40 tonal words for the tone/syllable "shi" pronounced in the fourth tone.The first 33 In homotonal words, 'r' is represented by the vowel tone 30. The last 7
In the two homotonal words, the "i' vowel is represented by the vowel tone 82.

Ｂ３分離論理中国語の理想的な表示法には、以下の３通りある。B3 separation logic There are three ideal ways to express Chinese:

１、中国語の声調・音節を、音と声調の双方で発音する
のに必要な音情報および声調情報の全てを提供する。1. Provides all the sound information and tone information necessary to pronounce Chinese tones and syllables in both sounds and tones.

２、同声調語を区別する簡潔かつ有効な方法を提供する
。2. Provide a concise and effective method for distinguishing tonal words.

３、多音頭語の文字列を、多音頭語辞典にたよることな
く、１つの表意文字に対応する個々の構成要素に分離す
る基礎を提供する。3. To provide a basis for separating a polyphonic initial character string into individual components corresponding to one ideogram without relying on a polyphonic initial dictionary.

上記に詳細に説明した通り、本発明の音声中国語言語は
、まさに上記１項、２項の特質を備えている。また、以
下に説明する通り、３項の特質をら兼ね備えている。As explained in detail above, the audio Chinese language of the present invention has exactly the characteristics described in items 1 and 2 above. Furthermore, as explained below, it also has the characteristics of the three items.

音声中国語アルファベットを利用して形成された全ての
音声中国語単語は、以下２つの形のうちの１っをとる。All phonetic Chinese words formed using the phonetic Chinese alphabet take one of two forms:

ＰＣＷ＝ＴＳ＋Ｇ　　　　第１方程式ＰＣＷ＝ＴＳ　　　　　　第２方程式ここでＴＳとは、声調・音節（ｃｖ、ｃｓｖ、ｓｖ。PCW=TS+G First equation PCW=TS Second equation Here, TS refers to tone/syllable (cv, csv, sv).

またはＶという４つの形のうらの１つをとる）のことで
あり、Ｇとは、ＰＣＡの１文字のことであり、これを声
調・音節に追加して同声調語を区分けする。この追加文
字は、上述された通り、優勢語根座間または意義的類別
原則のいずれかを使って選択できる。この文字は、その
選択に優勢語根原［川または意義的類別原則のいずれを
使おうとも、一般化された意義的類別記号Ｇと呼ばれる
。G is a letter in PCA, which is added to tones and syllables to separate homotonal words. This additional character can be selected using either the dominant root zama or the semantic classification principle, as described above. This character is called the generalized semantic classifier G, regardless of whether the dominant root word or the semantic classifier principle is used for its selection.

従って、第１および第２方程式の関係は、より一般的に
は、ＰＣＷ＝ＴＳ＋Ｑ　　　　第３方程式で表わされる。ここでＱとは、一般化された声調・音節
の修正記号であり、一般化された意義的類別記号Ｇ及び
無効セットφの両方を含むしのと定義される（すなわち
、どれか１文字を抜かすことである）。従って、一般化
された声調・音節修正記号Ｑは、１文字抜いて、または
、ＰＣＡのどれか１文字を加えて表現（後に詳述するが
、母音声、ｄ、！＋　７９以外は行表的類別記号Ｇとし
て使われることはない）することができる。Therefore, the relationship between the first and second equations is more generally expressed by the third equation: PCW=TS+Q. Here, Q is a generalized tone/syllable modification symbol, and is defined as a symbol that includes both the generalized semantic classification symbol G and the invalid set φ (i.e., if any one character is ). Therefore, the generalized tone/syllable correction symbol Q is expressed by removing one letter or adding one letter from PCA (as will be explained in detail later, except for the vowel, d, !+ 79, the line table is It is not used as a classification symbol G).

上述した通り、声調・音節は、ｃｖ、ｃｓｖ、ｓｖ、お
上び■の４つのうち、どの杉をもとりうる。As mentioned above, the tone/syllable can be any cedar among the four types: cv, csv, sv, and ojobi ■.

一般化された声調・音節修正記号Ｑは、φ、Ｃ，Ｚ。Generalized tone/syllable modification symbols Q are φ, C, and Z.

■、またはＳの５つのうち、いずれの形をらとり得る（
Ｚはゼロ子音２２を表わす）。従って、ＰＣＷは、第６
図に示されている２０の異なった形のうち、どの形をも
とり得る。It can take any of the five shapes of ■, or S (
Z represents the zero consonant 22). Therefore, PCW is the sixth
It can take any of the 20 different shapes shown in the figure.

列状に連鎖された場合、最初の２欄の杉（ＣＶとＣ５Ｖ
）は、全て区別し得る。が、第３、第４欄（※印は差し
当り無視して）は、もし第３、第４欄のｐｃｗがＰＰＣ
Ｗの一部を形成し、しかも１つ而に置かれているＰＣＷ
が子音で終われば、第１、第２ａＩと混同され得る。殊
に、もし、第３欄のＰＣＷがＣＶＣまたはｃｓｖｃの形
をとるＰＣＷの後につけば、第３１１ＩＩのｐｃｗは、
第２ＩＩｉ１１のＰＣＷと混同され得る。同様に、もし
第４欄のＰＣＷがＣＶＣまたはＣ９ＶＣの形をとるＰＣ
Ｗの後につけば、第１欄のＰＣＷと混同され得る。When chained in rows, the first two rows of cedar (CV and C5V
) can all be distinguished. However, in the 3rd and 4th columns (ignore the * marks for the moment), if the pcw in the 3rd and 4th columns is PPC
PCW that forms part of W and is located in one place
If it ends in a consonant, it can be confused with the first and second aI. In particular, if the PCW in column 3 follows a PCW in the form of CVC or csvc, the PCW in column 311II is
It can be confused with the 2nd IIi11 PCW. Similarly, if the PCW in column 4 takes the form of CVC or C9VC, then
If placed after W, it can be confused with PCW in the first column.

この可能性を避けるために、これらＰＣＷｈ＜ＰＰＣＷ
の１部を形成し、その商館にくるＰＣＷが、ＣＶＣまた
はｃｓｖｃの形をとる時はいっでら、第３、第４欄のＰ
ＣＷの頭に、ＰＣＬテキスト作成者によってゼロ子音２
２が追加される。これは、各ＰＣＷの第３、第４欄の前
に※印で表示される。To avoid this possibility, these PCWh<PPCW
When the PCW that forms part of the
Zero consonant 2 at the beginning of CW by PCL text author
2 is added. This is displayed with a * mark in front of the third and fourth columns of each PCW.

この簡単な入力方式に従うことにより、ＰＰＣＷを明確
に個々のＰＣＷ構成要素に分割し、次いで、ばらばらに
されたＰＣＷに対応する特別な中国語表章文字を識別す
る簡単なコンピュータ・プログラムを作成することがで
きる。By following this simple input method, we create a simple computer program that unambiguously splits the PPCW into its individual PCW components and then identifies the special Chinese alphabet characters that correspond to the disjointed PCWs. be able to.

もう１つの特別な方法は、反転音表意文字の性格上必要
とされる。反転音表意文字（反転音母音とも呼ばれる）
は、その１つ前にくる表色文字の語尾が°ｅｒ’音で終
わるように修正する唯一の中国語表意文字である。これ
は、２つ続いた表意文字が組み合わさって１つの音節（
’ｅｒ’で終わる）を形成する唯一のケースである。そ
の結果、反転音表色文字は、常に多音節の文字列の最後
、すなわちＰＰＣＷの語尾についている。上述の通り、
母音声調７９は、それ自体で孤立していたり、あるいは
ゼロ子音２２の後につく時は、“ｅｒ’と発音されるも
のの１つである。反転音は、中国語では°ｅｒ”と発音
されるため、母音声調７９は、反転音表意文字を表わす
ものと定義される。この指定は、コンピュータのプログ
ラムで、ｐｐｃｗをＰＣＷの各構成要素に分割し、その
後で各ＰＣＷに対応する１つの特別な中国語表意文字を
識別させるのに大切である。以下に述べる通り、プログ
ラムは、反転音表意文字を残りの表意文字とは別扱いし
ている。このプログラムは、ＰＩ’ＣＷを、さらなくば
個々のＰＣＷに分解する前に、この表意文字を探し出す
ことにより識別する。Another special method is required due to the nature of inverted ideograms. Inverted ideograph (also called inverted vowel)
is the only Chinese ideogram that modifies the preceding color character so that it ends in an °er' sound. This is because two consecutive ideograms are combined to form one syllable (
ending in 'er'). As a result, the inverted phonetic color character always appears at the end of a multisyllabic character string, that is, at the end of a PPCW word. As mentioned above,
The vowel tone 79 is one of those pronounced "er" when isolated by itself or after the zero consonant 22. The inverted tone is pronounced "°er" in Chinese. Therefore, vowel tone 79 is defined as representing an inverted ideogram. This designation is important for a computer program to divide the ppcw into each component of the PCW and then identify one special Chinese ideogram corresponding to each PCW. As described below, the program treats inverted ideograms differently from the rest of the ideograms. This program identifies the PI'CW by looking for this ideogram before further decomposing it into individual PCWs.

ＰＣＷの分離法を示すフロー・チャートは、第７Ａ−７
Ｄ図に例解されている。この方法は、コンピュータ・プ
ログラムとして作り、その後で一般目的用のコンピュー
タによって実行しうる。例示したフロー・チャートは、
分離論理と、各ＰＣＷを各表意文字に関連づける単音筒
編辞典を利用して入力されたＰＰＣＷが、中国語の表意
文字に変換してゆく方法を提示している。このプログラ
ムは、より大型のデータ・プロセシングまたはワード・
プロセシングのいずれか求めている方と関連させて使う
ことができる。The flow chart showing the PCW separation method is shown in Section 7A-7.
This is illustrated in Figure D. The method may be written as a computer program and then executed by a general purpose computer. The example flow chart is
It presents a method in which input PPCWs are converted into Chinese ideograms using separation logic and a monophonic dictionary that associates each PCW with each ideogram. This program is useful for larger data processing or word
It can be used in conjunction with those seeking any kind of processing.

ここには１つの特定のプログラムが例示されているだけ
だが、本発明はこのプログラムに限られることなく、普
通の技能レベルのプログラマなら、同じ原則を使い、本
発明の実施態様と同じ結果が得られるような、他の多く
のプログラムもデザインできよう。さらに、このプログ
ラムは、表意文字を識別してから、それを出力装置の上
に表示する。表意文字の表示は、絶対に必要ということ
ではなく、それを表示せずただ単に表意文字を識別する
ために、ＰＣＬと分離論理を使うことらできる。広義に
言えば、本発明は、多音節の文字列を分離するための分
離論理も包含していると考えられる。Although only one particular program is illustrated here, the invention is not limited to this program; programmers of ordinary skill level can use the same principles and achieve the same results as embodiments of the invention. Many other programs could be designed that would Additionally, the program identifies the ideogram before displaying it on the output device. Display of ideograms is not absolutely necessary, and PCL and separation logic can be used to simply identify ideograms without displaying them. Broadly speaking, the present invention is also considered to encompass separation logic for separating multisyllabic strings.

ここで第７Ａ−７部図に移るとしよう。本プロダラムは
、指示ブロックｌＯから始まる。ここては、７Ｌｚ’　
　Ｓ’ｌ’ｆｔ１ＮＧ（Ｊ）、ＳＥＧ（Ｍ）、およびＰ
ＣＷ（Ｘ）がクリアされ、フラッグＲＶ、Ｚ、Ｅ。Let us now turn to Figure 7A-7. The program begins with an instruction block IO. Here, 7Lz'
S'l'ft1NG (J), SEG (M), and P
CW (X) is cleared, flags RV, Z, E.

および変数ＪＭＡＸがゼロにセットされる。アレイ　５
ＴｒｔｌＮＧ（Ｊ）は、ＰＰＣＷの連続文字をストアす
るために使われる。ＰＰＣＷの頭文字は、５ＴＲＩＮＧ
（１）のエレメントにストアされＰＰＣＷの２番目の文
字は、５ＴＲＩＮＧ（２）などのエレメントにストアさ
れる。アレイ　５ＴＲＩＮＧ（Ｊ）は、そのシステムが
処理するためにデザインされた最大数のＰＰＣＷをスト
アするに十分な収容能力をもつ。殆どの場合、２０エレ
メントのアレイなら十分である。希望により、アレイ　
５ＴＩＩ　ＮＧ（Ｊ）は、Ｐ　Ｐ　ＣＷ（中国語の複合
語）を分離するスペース・バーを押すことなしに、ＰＣ
Ａの長い文字列（複数のＰＰＣＷから成る）を入力する
ために、超大型にしてもよい。and variable JMAX is set to zero. array 5
TrtlNG(J) is used to store consecutive characters of PPCW. The initials of PPCW are 5TRING.
The second character of PPCW stored in element (1) is stored in element 5TRIG(2) and so on. Array 5TRING(J) has sufficient capacity to store the maximum number of PPCWs the system is designed to process. In most cases, a 20 element array is sufficient. Array upon request
5TII NG (J) is a PC without pressing the space bar to separate P P CW (Chinese compound word).
It may be made very large in order to input a long string of A (consisting of multiple PPCWs).

アレイ　Ｓ　Ｅ　Ｇ　（Ｍ）は、その文字列の中の何文
字で１つのＰＣＷを形成するかを確定するＰＰＣＷ文字
列の１部を一時的にストアする５エレメントのアレイで
ある。アレイ　ＰＣＷ（Ｘ）は、その対応する表意文字
が識別できるよう、ＰＣＷを一時的にストアするために
使われる。アレイ　５ＴＲＩＮＧ（Ｊ）、ＳＥＧ（Ｍ）
、およびＰＣＷ（Ｘ）の全てかクリアされると、おのお
ののエレメントはゼロにセットされる。Array S E G (M) is a five-element array that temporarily stores a portion of a PPCW string that determines how many characters in the string form one PCW. Array PCW(X) is used to temporarily store PCWs so that their corresponding ideograms can be identified. Array 5TRING (J), SEG (M)
, and PCW(X) are all cleared, each element is set to zero.

フラッグＲＶは、反転音母音信号であり、ＰＰＣＷの語
尾が反転音母音７９を表示する時はいつもビにセットさ
れる。フラッグＲＶがゼロにセットされている時はいつ
も、ｐｐｃｗの語尾が反転音母音を表示していないこと
を意味する。Flag RV is an inverted vowel signal and is set to Bi whenever the ending of the PPCW indicates an inverted vowel 79. Whenever flag RV is set to zero, it means that the ending of ppcw does not display an inverted vowel.

ゼロ子音フラッグＺは、ＰＣＷの頭文字がゼロ子音２２
かどうかを表示する。もし頭文字がゼロ子音なら、フラ
ッグＺがｌ°にセットされる。The zero consonant flag Z is the initial letter of PCW, which is the zero consonant 22.
Show whether. If the initial letter is a zero consonant, flag Z is set to l°.

フラッグＥは、エラー信号で、分離論理によりＰＣＷの
文字列が誤って並べられていると判断した時はいつら、
１°にセットされる。Flag E is an error signal, and when the separation logic determines that the PCW strings are incorrectly arranged,
It is set to 1°.

変数ＪＭＡＸは、ｐｐｃｗの長さを探知するために、Ｐ
ＰＣＷが５ＴＲＩＮＧ（Ｊ）に稼動されるや、計数器Ｊ
の数が増す。The variable JMAX is used to detect the length of ppcw.
As soon as the PCW is activated to 5 TRING (J), the counter J
The number of

アレイがクリアされ、信号がゼロにセットされるや、分
離論理で最初にする操作は、１つのＰＰＣＷを識別し、
それをアレイ　Ｓ　ＴＲＩ　ＮＧ（Ｊ、）にストアする
ことである。これは、第７Ａ図の論理・ブロック＋２−
２３でおこなう。Once the array is cleared and the signal is set to zero, the first operation in the isolation logic is to identify one PPCW and
The next step is to store it in the array STRING(J,). This is the logic block +2- in Figure 7A.
Do it at 23.

先ず、指示ブロックＩ２に進むと、このプログラムは変
数Ｊを“ｌ゛にセットする。次に、入力データ・バッフ
ァ・レジスタＲＥＧＡ（ブロック１４）に文字が入って
いるか確かめる。本発明の目的のために、所定の瞬間、
レジスタＲＥＧ　　Ａにはいつら１字しか入っていない
ようにするため、コンピュータの処理時間よりも遅い速
度で入力文字を１度に１字ずつ、バッファ・レジスタＲ
ＥＧＡに入れておくものと仮定する。希望により、複数
のＰＰＣＷ（各単語間にスペースをとるかとらないかは
任意）を含む、以前にストアされたリストら加えるよう
、プログラムを修正することらできる。この場合、プロ
グラムは先ず、リストを個々のＰＰＣＷに分離し、次に
各ＰＰＣＷを後述の方法で処理する。First, proceeding to instruction block I2, the program sets variable J to "l". Next, it checks to see if the input data buffer register REGA (block 14) contains a character. For the purposes of the present invention, At a given moment,
To ensure that register REG A contains only one character at any given time, input characters are transferred one character at a time to buffer register R at a rate slower than the computer's processing time.
Assume that it is stored in EGA. If desired, the program can be modified to add from previously stored lists containing multiple PPCWs (with or without spaces between each word). In this case, the program first separates the list into individual PPCWs and then processes each PPCW in the manner described below.

決定ブロックｉ４に戻り、このプログラムは、ｐｐｃｗ
の文字列の頭文字がレジスタに現われるまで、レジスタ
ＲＥＧ　　Ａをボールし続ける。その時点でプログラム
は、決定ブロックＩ６に進み、レジスタＲＥＧ　　Ａの
字の出る箇所が空白（アルファベット文字で出ておらず
）かどうかを確かめる。もし空白でなければ、プログラ
ムは決定ブロック１８に進み、ＲＥＧ　　ＡにおけるＰ
ＣＡ文字に対応する数値を、アレイ　５ＴＲＩ　ＮＧ（
Ｊ）（Ｊは最初゛１゛にセットされている）の第１エレ
メントにセットする。次いでレジスタＩ’？ＥＧ　　Ａ
がクリア（指示ブロック２０）され、変数Ｊはｌだけ増
えろ（指示ブロック２２）。５ＴＲＩＮＧ（Ｊ）に最終
的にはロードされるｐｐｃｗの長さを探知するために、
変数ＪＭＡＸの数も増える。次にプログラムは、決定ブ
ロック１４に戻り、レジスタＲＥＧＡに入れられる２番
目の文字を持つ。もしこの文字の部分が空白でなければ
、Ｊがブロック２２で２になっているので、５ＴＲＩ　
ＮＧ（Ｊ）の２番目のエレメントに入れられる。プログ
ラムは、レジスタＲＥＧ　　Ａの文字がすっかり空にな
るまで、エレメント１４−２３までをループし続けろ。Returning to decision block i4, the program reads ppcw
Continue to roll the register REGA until the first letter of the string appears in the register. At that point, the program proceeds to decision block I6 and determines whether the register REG A is blank (no alphabetic character appears). If not blank, the program advances to decision block 18 and the P in REG A
Array 5TRING(
J) (J is initially set to ``1''). Then register I'? EG A
is cleared (instruction block 20), and the variable J is increased by l (instruction block 22). To find out the length of ppcw that will eventually be loaded into 5TRING(J),
The number of variables JMAX also increases. The program then returns to decision block 14 with the second character placed in register REGA. If this character part is not blank, J is 2 in block 22, so 5TRI
It is placed in the second element of NG(J). The program continues to loop through elements 14-23 until register REGA is completely empty of characters.

ひと度空白になると、ＰＰＣＷ単語は、ＳＴＲＩＮＧ（
Ｊ）の連続した各エレメントに１つずつストアされ、全
、てが５ＴＲＩＮＧ（Ｊ）に納められたことになる。変
数ＪＭＡＸの数値、すなわちＰＰＣＷの長さらストアさ
れる。５ＴＲＩＮＧ（Ｊ）に各ＰＰＣＷを入ツノし了え
ると、プログラムは決定ブロック２４に進む。Once blank, the PPCW word becomes STRING (
One by one is stored in each consecutive element of J), and all of them are stored in 5 TRING(J). The numerical value of the variable JMAX, that is, the length of PPCW is stored. Once each PPCW has been entered into 5TRIG(J), the program proceeds to decision block 24.

アレイ　５ＴｒｔＩ　ＮＧ（Ｊ）に１つのＰＰＣＷ単語
を入れると、そのｐｐｃｗ単語の語尾が反転音表意文字
（または反転音母音とも呼ばれる）かどうかを確かめる
。これは、論理ブロック２４−３０でなされる。論理ブ
ロック２４に進み、プログラムは先ず、５ＴＲＩＮＧ（
Ｊ）の最後の字が母音声調７９かどうかを確かめる。も
しそうでなければ、ｐｐｃｗの語尾は反転音表意文字を
表示せず、プログラムは直ちに決定ブロック３２に進む
ことができる。When we put one PPCW word in the array 5TrtI NG (J), we check whether the end of the ppcw word is an inverted ideogram (or also called an inverted vowel). This is done in logic blocks 24-30. Proceeding to logic block 24, the program first selects 5TRIG(
Check whether the last letter of J) is in vowel tone 79. If not, the end of ppcw does not display an inversion ideograph and the program can immediately proceed to decision block 32.

５ＴＲＩＮＧ（Ｊ）の最後の字が母音声調７９なら、そ
れが反転音表意文字を表示するかどうか、再調査を要す
る。上記の規則により、母音声調７９は意義的類別記号
として使うことはできない。それゆえ、声調・音節の１
部としての母音声調の後につくことはできない。もし母
音声調７９が別の母音声調の後につくとすれば、反転音
母音を表示しなければならない。同様に、第４Ｄ図に示
されている通り、声調・音節の１部として、子音１１３
．４．７−■１、または＋８の後にもつくことはできな
い。（３−７９と８−７９の組み合わせは、中国語の声
調・音節を形成する。曖昧さを避けるために、これらは
、可能な声調・音節を形成する文字の組み合わせからは
特に除外されている。If the last letter of 5TRING (J) is vowel tone 79, we need to re-examine whether it represents an inverted ideogram. According to the above rules, vowel tone 79 cannot be used as a semantic classifier. Therefore, tone/syllable 1
It cannot follow the vowel tone as a part. If vowel tone 79 follows another vowel tone, an inverted vowel must be displayed. Similarly, as shown in Figure 4D, as part of the tone/syllable, consonant 113
．． 4.7-■ Cannot be placed after 1 or +8. (The combinations 3-79 and 8-79 form Chinese tones/syllables. To avoid ambiguity, these are specifically excluded from the list of possible tone/syllable character combinations. .

第４Ｄ図参照）。従って、もし母音声調７９が母音、ま
たは子音！、３．４．７〜１１．１８のうち１つのいず
れかの後につくと、母音声調７９が反転音母音を表示す
ることが明確に判かる。プログラムは、決定ブロック２
６の５ＴＲＩＮＧ（Ｊ）の最後から２番目の字が母音（
Ｖ）か、それとも子音Ｃ−＝１，３．４．７−１１．１
８のうちの１つかどうかを確かめる検査をする。乙しそ
うでなければ、母音声調７９は、反転音母音を表示せず
、プログラムは、決定ブロック３２に進む。５ＴＲＩＮ
Ｇ（Ｊ）の最後から２番目の字が母音または子音Ｃ゛の
１つであれば、母音声調７９は反転音母音を表示しない
。この場合、５ＴＲｒＮＧ（Ｊ）の語尾は、ゼロにセッ
トされ、反転音母音フラッグＲｖは、ｌにセットされる
（ブロック２８−３０参照）。(See Figure 4D). Therefore, if vowel tone 79 is a vowel or a consonant! , 3.4.7 to 11.18, it is clearly seen that vowel tone 79 indicates an inverted vowel. The program starts with decision block 2
The second to last letter of 6 no 5 TRING (J) is a vowel (
V) or consonant C-=1, 3.4.7-11.1
We'll do a test to see if it's one of the 8. If not, the vowel tone 79 does not display an inverted vowel and the program proceeds to decision block 32. 5 TRIN
If the penultimate letter of G(J) is one of the vowels or consonants C', the vowel tone 79 does not represent an inverted vowel. In this case, the ending of 5TRrNG(J) is set to zero and the inverted vowel flag Rv is set to l (see blocks 28-30).

５ＴＲＩＮＧ（Ｊ）の語尾が反転音母音を表示している
かどうか確認したら、ＳＴ［１（ＮＧ（Ｊ）にストアさ
れたＰＰＣＷ文字列の最初のＰＣＷ単語を識別する。こ
れは、論理ブロック３２−７６（第７Ｂ図）から成るサ
ブルーチンによってなされる。5 Once we have determined whether the ending of TRING(J) represents an inverted vowel, we identify the first PCW word of the PPCW string stored in ST[1(NG(J)). This is done in logic block 32- This is done by a subroutine consisting of 76 (FIG. 7B).

上記の通り、ＰＣＷ単語は、ＴＳ＋Ｑの一般化された形
をとる。声調・音節は、ｃｓｖ、ｃｖ、ｓＶ、またはＶ
の形をとるので、長さとしては、■、２、または３文字
のうちのいずれかになりうる。As mentioned above, PCW words take the generalized form TS+Q. Tone/syllable: csv, cv, sV, or V
The length can be one of ■, 2, or 3 characters.

−膜化された声調・音節修正記号Ｑはゼロまたは１文字
の長さとなるので、ＰＣＷの全長は、２、３、または４
文字のいずれかになる。ＳＴＲＩ　ＮＧ（Ｊ）の最初の
声調・音節の実際の長さは、論理ブロック３２−４２の
サブルーチンによって決まる。- The membraned tone/syllable modifier Q can be zero or one character long, so the total length of the PCW is 2, 3, or 4.
Become any of the characters. The actual length of the first tone/syllable of STRI NG (J) is determined by the subroutine of logic blocks 32-42.

これが決まるや、ＰＣＷの長さは、声調・音節の直後に
くる２字を調べることにより、はっきりと決まる。これ
は、ブロック４４−７６のサブルーチンに従ってなされ
る。より詳しくいえば、これらの文字は、可能な声調・
音節の形であるＣ８ｖ、ｃｖ、ｓｖ、またはＶＰの最初
の２字に対応する形であるｃｓ、ｃｖ、ｓｖ、またはＶ
Ｐ（Ｐ＝φ。Once this is determined, the length of the PCW can be clearly determined by examining the two letters that immediately follow the tone/syllable. This is done according to the subroutine of blocks 44-76. More specifically, these characters have a range of possible tones and
cs, cv, sv, or V, which is the form that corresponds to the first two letters of the syllable form C8v, cv, sv, or VP.
P(P=φ.

Ｃ，Ｚ、またはＳ）のうちのどれか１つをとるかどうか
を確かめる検査をする。もしｃｓ、ｃｖ、ｓｖ。A test is performed to see if any one of C, Z, or S) is selected. If cs, cv, sv.

またはＶＰの形をとれば、この２字が５ＴＲＩＮＧ（Ｊ
）の２番目の声調・音節の頭文字を決め、Ｑはゼロにセ
ットされ、ＰＣＷの長さは声調・音節と同じ長さになる
。もしこれらのうちの１つの形をとらなければ、Ｑは一
般化され意義的類別記号Ｇであり、ＰＣＷの長さは声調
・音節＋１の長さに等しい。Or, if you take the form of VP, these two characters become 5TRING (J
), the first letter of the second tone/syllable is determined, Q is set to zero, and the length of the PCW is the same as the tone/syllable. If it does not take one of these forms, then Q is a generalized semantic classifier G, and the length of the PCW is equal to the length of the tone/syllable+1.

第７Ｂ図に移り、Ｓ’ｒ１１１ＮＧ（Ｊ）（７）最初の
声調・音節の長さを決めるサブルーチンは、決定ブロッ
ク３２で始まる。コンピュータは、先ず５Ｔｒｔ　Ｉ　
ＮＧ（Ｊ）に入っているＰＰＣＷの頭文字が半子音かど
うかを確かめる。もしそうなら、声調・音節はＳｖの形
をとらなければならないので２字となる。それゆえ、プ
ログラムは、ブロック３４に進み、変数ｎ＝２にセット
する。変数ｎは、声調・音節の文字数を表わす。Turning to FIG. 7B, the subroutine for determining the initial tone and syllable length of S'r111NG(J)(7) begins at decision block 32. The computer first uses 5Trt I
Check whether the initial letter of PPCW in NG (J) is a semiconsonant. If so, the tone/syllable must take the form of Sv, resulting in two letters. The program therefore proceeds to block 34 and sets the variable n=2. The variable n represents the number of characters in the tone/syllable.

５ＴＲＩＮＧ（Ｊ）の最初のエレメントが半子音でなけ
れば、プログラムは決定ブロック３６に進み、５ＴＲＩ
ＮＧ（Ｊ）の最初のエレメントが母音かどうかを確かめ
る。もしそうなら、声調・音節はＶから成り、変数ｎは
ｌ（ブロック３８）にセットされる。もし、５ＴＲＩ　
ＮＧ（Ｊ）の最初のエレメントが半子音でも母音でもな
ければ、それは子音にちがいない。この場合、声調・音
節は、５ＴｎＩＮＧ（Ｊ）の２番目の字が半子音か母音
かによって、Ｃ８ｖまたはＣｖの形をとることができる
。If the first element of 5TRIG(J) is not a semiconsonant, the program proceeds to decision block 36 and the 5TRING(J)
Check whether the first element of NG (J) is a vowel. If so, the tone/syllable consists of V and the variable n is set to l (block 38). If 5TRI
If the first element of NG (J) is neither a semiconsonant nor a vowel, it must be a consonant. In this case, the tone/syllable can take the form C8v or Cv, depending on whether the second letter of 5TnING (J) is a semiconsonant or a vowel.

これを決めるために、プログラムは決定ブロック４０に
進み、５ＴＲＩＮＧ（Ｊ）の２番目の字が子音かどうか
を確かめる。もしそうなら、声調・音節はＣ８Ｖの形を
とり、変数ｎは３（ブロック４２）にセットされる。２
番目のエレメントが半子音でもなければ、声調・音節は
ＣＶの形をとり、変数ｎは２（ブロック３４）にセット
される。To determine this, the program proceeds to decision block 40 and checks to see if the second letter of 5TRING (J) is a consonant. If so, the tone/syllable takes the form C8V and the variable n is set to 3 (block 42). 2
If the th element is also not a semiconsonant, the tone/syllable takes the form of a CV and the variable n is set to 2 (block 34).

ブロック３２−４２から成るサブルーチンが声調・音節
の文字数を決め、変数ｎをその数にセットするや、文字
列ｎ＋２文字の長さは、一般化された声調・音節修正記
号ＱはゼロまたはＧにセットされたかどうかを確かめる
検査をしなければならない。これは、ブロック４４−７
６を含むサブルーチンでおこなう。Once the subroutine consisting of blocks 32-42 determines the number of characters in a tone/syllable and sets the variable n to that number, the length of the string n+2 characters is determined by the generalized tone/syllable modifier Q being zero or G. A check must be made to see if it is set. This is block 44-7
This is done in a subroutine including 6.

指示ブロック４４から始め、プログラムは変数Ｎ＝ｎ＋
２、Ｍ＝１．Ｊ＝１をセットする。変数Ｎは、アレイＳ
　Ｅ　Ｇ　（Ｍ）に入れる文字数を決め、変数Ｍは、検
査のおこなわれているアレイＳＥＧ（Ｍ）の特定のエレ
メントを決め、変数Ｊは、検査のおこなわれているアレ
イ５ＴＲＩＮＧ（Ｊ）の特定のエレメントを決める。声
調・音節の直後にくる２字を検査する前に、５Ｔｒｉｔ
ｌＮＧ（Ｊ）の最初のＮ字をアレイＳ　Ｅ　Ｇ　（Ｍ）
に移さなければならない。これは、論理ブロック４６−
５０でおこなう。Starting at instruction block 44, the program begins with variable N=n+
2, M=1. Set J=1. Variable N is array S
Determine the number of characters to be placed in E G (M), variable M determines the specific element of array SEG (M) that is being tested, and variable J determines the specific element of array 5TRIG (J) that is being tested. Decide on the elements. 5 Trits before testing the 2 letters that come immediately after the tone/syllable.
Array the first N character of lNG (J) S E G (M)
must be moved to This is logical block 46-
Do it with 50.

これが終わると、プログラムはＰＣＷにｎまたはｎ＋１
文字（すなわち、一般化された声調・音節Ｑが文字なの
か、あるいはゼロ・セットなのか）のうちいずれが入っ
ているかを確定する決定ブロック５２−７６を含むサブ
ルーチンに進む。これは、アレイＳ　Ｅ　Ｇ　（Ｍ）の
最後の２字を見、この２字がｃｓ、ｃｖ、ｓｖ、または
ＶＰの形をとるかどうかを決め、従って、それら２字の
うちいずれが５Ｔｒｔ　Ｉ　ＮＧ（Ｊ）の２番目のＰＣ
Ｗ単語の頭文字なのかを決めることによって達成される
。アレイＳＥＧ（Ｍ）の最後から２番目の文字が、ＳＴ
ＲＩ　ＮＧ（Ｊ）のＰＣＷの２番目の単語の頭文字なら
、それは意義的類別記号ではなく、ＰＣＷの長さは声調
・音節の長さに等しい。アレイＳ　Ｅ　Ｇ　（Ｍ）の最
後の文字が、５ＴＲＩＮＧ（Ｊ）のｐｃｗの２番目の単
語の頭文字なら、Ｓ　Ｅ　Ｇ　（Ｍ）の最後から２番目
の文字は意義的類別記号である。この場合、ＳＴＲＩＮ
Ｇ（Ｊ）の最初のＰＣＷは、声調０音節より１文字長く
なる。Once this is done, the program will write n or n+1 to the PCW.
Proceeding to a subroutine that includes decision blocks 52-76 that determine which of the letters (i.e., whether the generalized tone/syllable Q is a letter or a zero set) is included. This looks at the last two characters of the array S E G (M) and determines whether these two characters take the form cs, cv, sv, or VP, and therefore which of those two characters is 5Trt I NG(J)'s second PC
This is achieved by determining whether it is the initial letter of the W word. The second to last character of array SEG(M) is ST
If it is the first letter of the second word in the PCW of RI NG (J), it is not a semantic classifier, and the length of the PCW is equal to the tone/syllable length. If the last character of the array S E G (M) is the initial letter of the second word of the pcw of 5TRING (J), then the penultimate character of S E G (M) is a semantic classifier. In this case, STRIN
The first PCW of G(J) is one character longer than the tone 0 syllable.

指示ブロック５２から始め、プログラムは、ＳＥ　Ｇ　
（Ｍ）の語尾が母音声調かどうかを確かめる（ブロック
４６−５０を包含するサブルーチンにおいて、変数Ｍの
値はＮに増えたことを忘れてはならない）。もしＳＥＧ
（Ｍ）の最後の文字が母音声調なら、Ｃ；；　Ｅ　ｃ　
（Ｍ）の最後から２番目の文字も母音声調かどうかを確
認する。もしそうなら、誤り状態が存在する（ＰＣＬの
入力規則で、もし文字列の最初のＰＣＷが母音声調で終
われば、次にくるＰＣＷの頭文字は母音声調にできない
ことになっている）。もし誤り状態が存在するなら、プ
ログラムは指示ブロック５６に進み、ベルを鳴らすか、
他のエラー表示をさせることができる。次にプログラム
は、エラー・フラッグＥが１にセットされる指示ブロッ
ク５８に進み、変数ＰはＮにセットされる。後述される
通り、ここでＰＣＷを入力した人がそれを調べ、どこで
入力ミスが起ったかをつきとめるために、Ｓ　Ｅ　Ｇ　
（Ｍ）にストアされた全ての文字列をディスプレイ・ス
クリーンに映し出さける。Starting at instruction block 52, the program begins with S.E.G.
Determine whether the ending of (M) is vowel (remember that in the subroutine containing blocks 46-50, the value of variable M was increased to N). If SEG
If the last letter of (M) is vowel tone, then C;; E c
Check whether the second to last letter of (M) is also in the vowel tone. If so, an error condition exists (the PCL input rules state that if the first PCW in a string ends in a vowel tone, the initial letter of the next PCW cannot be in a vowel tone). If an error condition exists, the program proceeds to instruction block 56 and either rings a bell or
Other error messages can be displayed. The program then advances to instruction block 58 where error flag E is set to 1 and variable P is set to N. As described below, the person who entered the PCW here should check it and use S E G to find out where the input error occurred.
Project all strings stored in (M) on the display screen.

もし、Ｓ　Ｅ　Ｇ　（Ｍ）の最後から２番目の文字が母
音声調（ブロック５４）でなければ、プログラムは決定
ブロック６２に進み、それが子音かどうかを確かめる。If the penultimate letter of S E G (M) is not a vowel tone (block 54), the program proceeds to decision block 62 to see if it is a consonant.

もしそうなら、ゼロ子音フラッグＺは１にセットされ、
変数ＰはＮにセットされる（ブロック６４および６５）
。もし最後から２番目の文字がゼロ子音でなければ、プ
ログラムは直接指示ブロック６６に進み、変数ＰはＮに
セットされる。いずれの場合でも、−膜化された声調・
音節修正記号Ｑは、無効セットに等しく、従ってＰはＮ
にセットされたことが確認される。これにより、そのＰ
ＣＷは・この声調・音節のみに等しいことが識別される
。If so, the zero consonant flag Z is set to 1;
Variable P is set to N (blocks 64 and 65)
. If the penultimate character is not a zero consonant, the program proceeds directly to instruction block 66 and variable P is set to N. In either case, - membranous tone/
The syllable correction mark Q is equal to the invalid set, so P is N
It is confirmed that it is set to . This allows that P
CW is identified as being equal only to this tone/syllable.

決定ブロック５２に戻り、もしＳ　Ｅ　Ｇ　（Ｍ）の最
後の文字が母音声調でなければ、プログラムはそれが半
子音（決定ブロック６８）かどうかを確かめる。もしそ
うなら、プログラムは次にＳ　Ｅ　Ｇ　（Ｍ）の最後か
ら２番目のエレメントが子音（ブロック７０）かどうか
を確かめる。もしそうなら、２番目のＰＣＷはＳ　Ｅ　
Ｇ　（Ｍ）の最後から２番目の文字で始まり、従って最
初のＰＣＷはｎ文字だけ長くなる。そのため、ＰＣＷの
長さの変数ＰはＮにセットされる（ブロック６６）。も
しＳＥＧ（Ｍ）の最後から２番目の文字が子音でなけれ
ば、Ｓ　Ｅ　Ｇ　（Ｍ）の最後に位置していた半子音は
５ＴＲＩＮＧ（Ｊ）の２番目のＰＣＷの頭文字であるた
め、５ＴＲＩＮＧ（Ｊ）の最初のｐｃｗはｎ＋１文字だ
け長（なる。Returning to decision block 52, if the last letter of S E G (M) is not a vowel tone, the program checks whether it is a semiconsonant (decision block 68). If so, the program then checks whether the penultimate element of S E G (M) is a consonant (block 70). If so, the second PCW is S E
It starts with the penultimate character of G (M), so the first PCW is n characters longer. Therefore, the PCW length variable P is set to N (block 66). If the penultimate letter of SEG (M) is not a consonant, then the semi-consonant located at the end of S E G (M) is the initial letter of the second PCW of 5 TRING (J), so The first pcw of 5TRIG(J) is n+1 characters long.

決定ブロック６８に戻り、もしＳＥＧ（Ｍ）の最後の文
字が母音声調でも半子音でもなければ、子音かゼロ子音
にちがいない。この場合、５ＴｒｔｌＮＧ（Ｊ）の最初
のｐｃｗは、ｎ＋１文字だけ長く、ＰＣＷの長さの変数
Ｐは指示ブロック７６のｎ＋１にセットされる。指示ブ
ロック７６に進む前に、Ｓ　Ｅ　Ｇ　（Ｍ）の最後の文
字がゼロ子音かどうかを確かめるため、プログラムは決
定ブロック７２に進む。もしそうなら、ゼロ子音の変数
Ｚはｌにセットされる。以下に示された通り、ゼロ子音
は、プログラムの後半で５ＴｒｔｌＮＧ（Ｊ）から外さ
れることになる。Returning to decision block 68, if the last letter of SEG(M) is neither a vowel nor a semiconsonant, it must be a consonant or a zero consonant. In this case, the first pcw of 5TrtlNG(J) is longer by n+1 characters and the PCW length variable P is set to n+1 in instruction block 76. Before proceeding to instruction block 76, the program proceeds to decision block 72 to determine whether the last letter of S E G (M) is a zero consonant. If so, the zero consonant variable Z is set to l. As shown below, the zero consonant will be removed from 5TrtlNG(J) later in the program.

この時点では、プログラムは５ＴＲＩＮＧ（Ｊ）の最初
のＰＣＷに何文字入っているかをはっきりと確かめた。At this point, the program has clearly determined how many characters are in the first PCW of 5TRING (J).

このＰＣＷは、次にブロック７８−８４から成るサブル
ーチンに従い、ＰＣＷ（Ｘ）に入れられる。This PCW is then placed into PCW(X) according to the subroutine consisting of blocks 78-84.

決定ブロック８６に戻り、コンピュータは、エラー・フ
ラッグＥ７’ｌ＜１にセットされているかを確かめる。Returning to decision block 86, the computer determines if the error flag E7'l<1 is set.

もしそうなら、プログラムは指示ブロック８８に進み、
キーボード操作員が自らの入力ミスが何であったかを確
かめられるように、ディスプレイ・スクリーンにＳ　Ｅ
　Ｇ　（Ｍ）にストアされた情報を映し出す。If so, the program proceeds to instruction block 88;
S E on the display screen so that the keyboard operator can see what his input error was.
Displays the information stored in G (M).

もしエラー信号が１でなければ、プログラムは指示ブロ
ック９０に進む。コンピュータは、可能なＰＣＷを唯−
前二の表意文字に特異に対応させる単音頭語辞典をもっ
ている。プログラムは、アレイＰＣＷ（Ｘ）のＰＣＷに
よって識別された表意文字を見、この表意文字をディス
プレイに映し出この時点で次におこなう処置は、表意文
字を識別し、それをディスプレイに映は出すために、５
ＴＲＩ　ＮＧ（Ｊ）の次のＰＣＷを調べることである。If the error signal is not one, the program proceeds to instruction block 90. The computer only identifies possible PCWs.
It has a monophonic initial word dictionary that specifically corresponds to the first two ideograms. The program looks at the ideogram identified by the PCW in array PCW(X) and projects this ideogram on the display.The next action it takes at this point is to identify the ideogram and display it on the display. To, 5
The purpose is to check the next PCW of TRI NG (J).

上記の通り、ブロック３２−９０から成るサブルーチン
は、５ＴＲＩＮＧ（Ｊ）に示された最初のＰＣＷを分析
し、５ＴＲＩＮＧ（Ｊ）の最初のエレメントの位置にお
かれた文字が、５ＴＲＩＮＧ（Ｊ）の最初のＰＣＷの頭
文字であると仮定する。プログラムが５ＴＲＩＮＧ（Ｊ
）の２番目のＰＣＷを分析できるよう、５ＴＲＩＮＧ（
Ｊ）の２番目のＰＣＷの頭文字が５ＴｒｔＩ　ＮＧ（Ｊ
）の最初のエレメントの位置におかれていることを保証
するために、５ＴｆｌＩ　ＮＧ（Ｊ）の各文字は、十分
な数だけ左に移動しなければならない。この処理は、第
７Ｄ図のブロック９２−１０４でおこなわれる。As described above, the subroutine consisting of blocks 32-90 analyzes the first PCW indicated in 5TRING(J) and determines that the character placed in the first element position of 5TRING(J) Assume that it is the initial letter of the first PCW. The program is 5 TRING (J
) to analyze the second PCW of 5TRING (
The first letter of the second PCW of J) is 5TrtI NG(J
) Each character in 5TflING(J) must be moved to the left a sufficient number to ensure that it is placed in the position of the first element of ). This processing occurs at blocks 92-104 of Figure 7D.

上記の通り、ＰＣＷの長さの変数Ｐは、５ＴＲＩＮＧ（
Ｊ）の最初のＰＣＷの文字の長さに等しくブロック５８
−６６および７６にセットされる。As mentioned above, the PCW length variable P is 5TRING(
block 58 equal to the length of the first PCW character of J)
-66 and 76.

プログラムが５ＴＲＩ　ＮＧ（Ｊ）の２番目のＰＣＷを
評価できるように、５ＴＲＩＮＧ（Ｊ）の文字をとり除
かなければならない。もしゼロ子音か５ＴＲＩＮＧ（Ｊ
）の最初と２番目のＰＣＷ単語の間を音節分割する文字
として使われるなら、もう１文字とり除かねばならない
。キーボードの操作員が自分の誤りを確かめ、それを訂
正できるように、ｐ＋２文字が既にディスプレイ上に映
しだされているので、もし誤り状態が存在すれば、もう
２文字をとり除かなければ赴らない。これは、ブロック
９２−１０４（第７Ｄ図参照）を構成するサブルーチン
でなされる。The character 5TRING(J) must be removed so that the program can evaluate the second PCW of 5TRING(J). If it is a zero consonant or 5 TRING (J
), one more character must be removed if it is to be used as a syllabic character between the first and second PCW words. Since the character p+2 is already shown on the display so that the keyboard operator can see his error and correct it, if an error condition exists, he will have to remove two more characters. No. This is done in the subroutine comprising blocks 92-104 (see Figure 7D).

ブロック９２から始め、プログラムは、ゼロ子音フラッ
グＺがｌにセットされているかどうかを確かめる。もし
そうなら、ＰＣＷの長さの変数ｐはｐ＋１にセットされ
、プログラムは指示ブロック１００に進む。もしゼロ子
音信号力月にセットされていなければ、プログラムは決
定ブロック９６に進み、エラー信号が１にセットされて
いるかどうかを確かめる。もしそうなら、ＰＣＷの長さ
の変数ｐはｐ＋２（ブロック９８）にセットされ、プロ
グラムは指示ブロック１００に進む。もしエラー信号が
１にセットされていなければ、プログラムは直接指示ブ
ロック１００にすすむ。Starting at block 92, the program checks whether the zero consonant flag Z is set to l. If so, the PCW length variable p is set to p+1 and the program proceeds to instruction block 100. If the zero consonant signal is not set, the program proceeds to decision block 96 to determine if the error signal is set to one. If so, the PCW length variable p is set to p+2 (block 98) and the program proceeds to instruction block 100. If the error signal is not set, the program proceeds directly to instruction block 100.

指示ブロック１００に従い、変数Ｊはｌにセラｌ、され
、プログラムはブロック１０２−１０６を包含するルー
プを入力する。５ＴＲＩＮＧ（Ｊ）の２番目のｐｃｗの
頭文字が、５ＴＲＩ　ＮＧ（Ｊ）の最初のエレメントの
位置にくることを保証するよう、５ＴＲＩＮＧ（Ｊ）の
各エレメントをｐ字分だけ左に移す。決定ブロック１０
４では、この処理はＪの値がＪＭＡＸより小さい限り続
けられるが、このＪＭＡＸは、決定ブロック＋６（第７
Ａ図参照）て空白が最初に探知されるＪ値になるようブ
ロック２３にセットされている。これが終わると、プロ
グラムは、指示ブロック＋０８に進み、アレイ５ＴＲＩ
ＮＧ（Ｊ）が空かどうかを確かめる。According to instruction block 100, variable J is set to l, and the program enters a loop containing blocks 102-106. Shift each element of 5TRING(J) to the left by p characters to ensure that the first letter of the second pcw of 5TRING(J) is in the position of the first element of 5TRING(J). Decision block 10
4, this process continues as long as the value of J is less than JMAX, which is determined by decision block +6 (7th
(See Figure A) is set in block 23 so that the blank is the first J value detected. Once this is done, the program proceeds to instruction block +08 where array 5TRI
Check whether NG (J) is empty.

プログラムのこの時点において、５ＴＲＩＮＧ（Ｊ）の
最初のＰＣＷは分析され、ディスプレイされ、５ＴＲＩ
ＮＧ（Ｊ）の文字は、５ＴＲＩＮＧ（Ｊ）の最初のエレ
メント位置に５ＴＲＩ　ＮＧ（Ｊ）の２番目のＰＣＷの
頭文字を入れるため左に移されている。もしＳＴＲＩ　
ＮＧ（ＪＸブロック１０８）に余分のＰＣＷがあれば、
プログラムは決定ブロック３２（第７Ｂ図）に戻り、上
記手順に従って、現在５ＴＲＩＮＧ（Ｊ）に入っている
最初のｐｃｗを分析する。このＰＣＷが分析され、ディ
スプレイされると、５ＴＲＩＮＧ（Ｊ）の文字は、５Ｔ
ＩＬＥＮＧ（Ｊ）の次のｐｃｗの頭文字が５ＴＲＩＮＧ
（Ｊ）の最初のエレメント位置におかれることを保証す
るために、ふたたび左に移される。この処理は、５ＴＲ
Ｉ　ＮＧ（Ｊ）の中の全てのＰＣＷが評価され、ディス
プレイ（ＳＴＲＩ　ＮＧ（Ｊ）が空になるまで）され続
ける。At this point in the program, the first PCW of 5TRIG(J) is analyzed and displayed, and the 5TRIG(J)
The letter NG(J) has been moved to the left to place the initial letter of the second PCW of 5TRING(J) in the first element position of 5TRING(J). If STRI
If there is extra PCW in NG (JX block 108),
The program returns to decision block 32 (Figure 7B) and analyzes the first pcw currently in 5TRING(J) according to the procedure described above. When this PCW is analyzed and displayed, the character 5TRING (J) will be 5T
The first letter of pcw after ILENG (J) is 5TRING
It is moved to the left again to ensure that it is placed in the first element position of (J). This process is 5TR
All PCWs in ING(J) continue to be evaluated and displayed (until STRING(J) is empty).

５ＴＲＩＮＧ（Ｊ）が空になると、プログラムは決定ブ
ロック１１０に進み、反転音母音フラッグＲ■が１にセ
ットされているかを確かめる。もしそうなら、反転音表
意文字はディスプレイ（ブロックｌｌ２）に映し出され
、プログラムは次のＰＰＣＷ文字列の最初のエレメント
を待つために、指示ブロックＩＯに戻る。もし反転音母
音フラッグＲＶが１にセットされていなければ、プログ
ラムは直ちにブロック１０に進む。Once 5TRIG(J) is empty, the program proceeds to decision block 110 and checks to see if the inverted vowel flag R■ is set to one. If so, the inverted ideogram is shown on the display (block ll2) and the program returns to instruction block IO to wait for the first element of the next PPCW string. If the reversed vowel flag RV is not set to 1, the program immediately proceeds to block 10.

上記のプログラム（例のみで示されているカリの重要な
点は、ＰＣＡの文字列（必ずしもｐｐｃｗを表示しなく
てもよいが、そうした方がなお望まししりが自動的に個
々のＰＣＷに分割され、次いでｐｃｗの表意文字の単音
頭語辞典を利用しつつ、適切な表意文字に明確に変換さ
れ得る。これにより、多音頭語辞典の必要はなくなり、
ＰＣＬが中国語書き言葉の表意的性格にも沿えるように
なる。The important thing about the above program (shown only as an example) is that it automatically splits the PCA string into individual PCWs (although it does not necessarily have to display the ppcw, it is still preferable to do so). and then can be unambiguously converted to the appropriate ideogram, making use of the pcw ideograph mono-initial dictionary. This eliminates the need for a poly-initial dictionary;
PCL will be able to comply with the ideographic character of written Chinese language.

Ｃ，アルファグラミック・リストＰＣＬのもう１つの重要な特徴は、単音頭語および多音
節語のアルファグラミック・リストを簡単かつ直接作成
するために使えることである。アルファグラミック・リ
ストとは、その殆どがアルファベット順になってはいる
が、たとえ正確なアルファベットの順位ではこれら普通
の表意文字を分離してしまう場合であろうとも、同じ頭
文字で始まる表意文字の多音節語や熟語が、必ずひとま
とめにされているリストのことである。これは、本発明
のＰＣＬを使って作ったアルファグラミックの辞典式リ
ストで、第８図を参照すればよく理解できる。第８図で
は、最左側かＰＰＣＷより成り、次側は対応する表意文
字からなっている。C. Alphagrammic Lists Another important feature of PCL is that it can be used to easily and directly create alphagrammic lists of monosyllabic and polysyllabic words. An alphagrammic list is a list of ideograms that begin with the same initial letter, although most of them are in alphabetical order, even though the exact alphabetical order would separate these common ideograms. A list of polysyllabic words and idioms that are always grouped together. This is a lexicographical list of alphagramics created using the PCL of the present invention, and can be better understood by referring to FIG. In FIG. 8, the leftmost side consists of PPCW, and the next side consists of the corresponding ideogram.

中国語のいかなるアルファベット式表現においても、所
定の声調・音節を表示するために使われる文字の数は、
声調・音節（ＣＳＶ、ＣＶ、ＳＶ。In any alphabetic representation of Chinese, the number of characters used to represent a given tone/syllable is:
Tone/Syllable (CSV, CV, SV.

またはＶ）の形の機能としてはいろいろ異なる。or V) has various functions.

意義的類別記号の使用もまた、ＰＣＷの文字の数を変え
る。アルファベット化の基準は、より長い単語はより短
い単語よりも後にリストするよう規定している。この変
化ゆえに、正確なアルファベット類のリストは、もし純
粋にアルファベット類のリストが使われた場合、同じ頭
文字をもつ複合語は切り離されてしまうことになる。例
えば、第８図において、へ億、爪溝、へ月、の単語は、
γという文字には８３という番号が、また、亘という文
字には３５という番号が指定されるので、ダッシュによ
って示されたところまで下げるべきである。こうして、
同じ表意文字を頭文字にもつ第２欄の単語が、お互いに
ばらばらにされてしまう。The use of semantic classification symbols also changes the number of characters in the PCW. Alphabetization standards dictate that longer words be listed after shorter words. Because of this change, an accurate alphabetical list would result in compound words with the same initial letter being separated if a purely alphabetical list were used. For example, in Figure 8, the words ``hebi'', ``nail groove'', and ``hetsuki'' are
The letter γ is assigned the number 83, and the letter Wataru is assigned the number 35, so they should be lowered to the point indicated by the dash. thus,
Words in the second column that start with the same ideogram are thrown apart from each other.

本発明は、アルファグラミックの順位にリストされるべ
きＰＰＣＷを分類する前に、ＰＣＷとＰＰＣＷとの間に
実質上のスペースをおくために、上述の分離論理の形に
修正したものを使うことにより、このような分離をさけ
ている。実質上のスペースには“θ″番号指定されいて
いるので、■の文字の萌にくる文字として分類ルーチン
によって扱われている。The present invention uses a modification of the above-described separation logic to leave a substantial space between PCWs and PPCWs before sorting them to be listed in alphagrammic order. This avoids such separation. Since the actual space is designated with the "θ" number, it is treated by the classification routine as a character that comes next to the character ■.

実質的なスペースは、第７Ｂ−７Ｄ図の分離論理、特に
ブロック３２−８４および９２−１０８の修正形式によ
って挿入される。アルファグラミック・リストを作成す
るために、ｐｐｃｗに実質的なスペースを挿入する目的
で分離論理を使うことは、分離論理が以下のように修正
できるということである。ブロック５４−６４、および
ブロック７２−７４は、不必要なので除外してもよい。Substantial space is inserted by the modified form of the separation logic of FIGS. 7B-7D, particularly blocks 32-84 and 92-108. The use of separation logic for the purpose of inserting substantial space into ppcw to create alphagrammic lists means that the separation logic can be modified as follows. Blocks 54-64 and blocks 72-74 are unnecessary and may be omitted.

第７Ｃ図のフロー・チャートのブロック８２−９０の代
わりに、文字列ＰＣＷ（Ｘ）にストアされたＰＣＷを、
実質的スペースが追加されるべき全ての文字列（これは
複数のＰＣＷでありうる）をいれておく保留アレイにお
くことができる。ＰＣＷが保留アレイにおかれた後、実
質的スペースは、保留アレイの次のエレメントにおかれ
る。その後プログラムは、ブロック９２に戻り、文字列
の全てのＰＣＷが保留アレイにおかれるまで、分離論理
によってループし続ける。その時点で、全ての文字列が
保留アレイからとり除かれ、次回の分類用に大容量記憶
装置に納められる。分類されるべき全ての文字列が分類
用を通過すると、アルファベット順に分類され、実質的
スペースはｌの文字の前にくる文字としてあっかわれる
。これにより、第８図に例示されたアルファグラミック
・リストの型を自動的に生成する。In place of blocks 82-90 of the flow chart of FIG. 7C, the PCW stored in the string PCW(X) is
Substantial space can be placed in a reservation array that holds all the strings to be added (which can be multiple PCWs). After the PCW is placed in the reserved array, substantial space is placed in the next element of the reserved array. The program then returns to block 92 and continues to loop through the isolation logic until all of the PCWs of the string have been placed in the pending array. At that point, all strings are removed from the pending array and placed in mass storage for next classification. Once all the strings to be sorted have passed through the sorter, they are sorted in alphabetical order, and the substantial space is taken up as the letter that comes before the letter l. As a result, the type of alphagramic list illustrated in FIG. 8 is automatically generated.

純粋なアルファベット順位のもう１つの例外として、ア
ルファグラミック・リストは同じ声調をらつＰＣＷを統
括すべきだという点である。例えば、ＬＭＮＶとＬＭＮ
Ｖ−とは同声調語で同じ第一声で発音されるので、■を
無声母音２７としたとき単語ＬＭＮＶの後にＶ′を無声
母音７９としたときの単語ＬＭＮＶ’はくるべきである
。次ぎにリストされるべき２単語は、例えば、Ｖ゛′お
よびＶ゛′をそれぞれ母音２８と８０としたじきのＬＭ
ＮＶ−’とＬ　Ｍ　Ｎ　Ｖ　’−”’ｒなければならな
い。Another exception to pure alphabetical ordering is that alphagrammic lists should govern PCWs that share the same tone. For example, LMNV and LMN
Since V- is a tonal word and is pronounced with the same first voice, when ■ is a voiceless vowel 27, the word LMNV' when V' is a voiceless vowel 79 should come after the word LMNV. The next two words to be listed are, for example, the LM
NV-' and L M NV '-''r.

が後者２つの単語は、ＬＭＮＶおよびＬＭＮＶ’と同じ
音をもちながら第二声で発音される。これは、以下のよ
うにして解決する。The latter two words are pronounced in the second voice, having the same sounds as LMNV and LMNV'. This is solved as follows.

第１２Ａ、１２８図は、先ずアルファグラミック順に入
れるラインはどれかをきめるために、ＰＣＬのテキスト
のラインを単語対単語、または音節対音節のベアで比較
するのに使われる「比較」ルーチンを例示したフロー・
ダイアダラムである。Figures 12A and 128 first illustrate a "comparison" routine used to compare lines of PCL text word-for-word or syllable-for-syllable bare to determine which lines should be placed in alphagrammatic order. Example flow
It is diadarum.

「比較」は、さらに英語のテキストのラインなら普通の
アルファベット順に並べられているという特徴を６つ。``Comparison'' has six additional features: English text lines are arranged in normal alphabetical order.

「比較」は、ここでは「分類」と呼ばれる総括的分類フ
ロシージャ中で使用され、「比較」ルーチンが適切な順
位を識別した後にテキストのラインを再配置する。「比
較」を使用する前に、稼動メモリに全てのテキスト・フ
ァイルをロードする。"Compare" is used in a general classification procedure, herein called "Classification", to rearrange lines of text after the "Compare" routine has identified the appropriate ranking. Load all text files into working memory before using Compare.

順位にならべられるべき単語や熟語を包含した各分野が
、それぞれ別のラインにおかれる。「分類」プログラム
は、テキストの各ラインの頭にポインタのアレイ、すな
わち、各ラインの頭文字のアドレスをらつアレイを設置
する。各ラインの終わりにも探知記号を印しておく。Each field containing words and phrases to be ranked is placed on a separate line. The Sort program places an array of pointers at the beginning of each line of text, ie, an array containing the address of the first letter of each line. Also mark the end of each line with a detection symbol.

「比較」は、比較されるべき１対のラインのアドレスを
受信する。すなわち、「比較」には、アレイラインＩと
アレイラインｌとアレイライン２の２つの引き数があり
これら引き数のおのおのは、「分類」によって作り出さ
れたアレイからのアドレスである。「比較」は、これら
のアドレスの指示ラインを処理し、ラインがアルファグ
ラミック・リストを形成するためのただしい順位となっ
ているかどうかを示す数値を戻す。ラインが不順なら、
「分類」は、ライン自体ではなしに、２つのラインのポ
インタをできることなら切り換える。Compare receives the addresses of a pair of lines to be compared. That is, ``compare'' has two arguments: array line I, array line 1, and array line 2, and each of these arguments is an address from the array produced by ``sort.'' Compare processes the indicated lines of these addresses and returns a number indicating whether the lines are in the correct order to form an alphagramic list. If the line is irregular,
"Classification" preferably switches pointers between two lines, but not the lines themselves.

以下において、「比較」で比較される２つのラインは、
ラインｌおよびライン２と呼ばれる。ブロック２１０に
おいて、「比較」は２つの計数器１１＝１２＝０．にセ
ットする。ＩＩとＩ２は、ライン■およびライン２でそ
れぞれ検査されている単語または音節の中の現在文字の
索引である。このアルゴリズムにおいて、以下に詳しく
検討する通り、１１は普通のＩ２に等しい。In the following, the two lines compared in "Compare" are:
They are called line 1 and line 2. At block 210, a "compare" is performed between the two counters 11=12=0. Set to . II and I2 are the indices of the current letter in the word or syllable being examined on line ■ and line 2, respectively. In this algorithm, 11 is equal to ordinary I2, as discussed in more detail below.

ブロック２２０−２３０では、データがラインｌおよび
ライン２の両方で、まだ比較を続けるかどうかを決める
。もしそうでなければ、萌の処理か何らの差異も探知し
ないまま、両方のラインの終わりに到着してしまったか
、それとも何かの理由で両方のラインにデータが入って
いなかったかである。決定ブロック２２０では、ライン
の末尾の文字が、ラインＩおよび２の両方で探知された
かどうかを確かめる。もしそうなら、指示ブロック２２
２において、「比較」のアルゴリズムはθ値に復帰する
。０値は、ポインタのアドレスを切り換える必要上、２
つのラインの間には何ら差異が探知されなかったことを
示す。Blocks 220-230 determine whether data is still being compared on both line I and line 2. If this is not the case, either Moe's processing has reached the end of both lines without detecting any differences, or there is no data in both lines for some reason. Decision block 220 determines whether the last character of the line was detected on both lines I and 2. If so, instruction block 22
At 2, the "compare" algorithm returns to the θ value. The 0 value is 2 because it is necessary to switch the pointer address.
Indicates that no difference was detected between the two lines.

ライン！およびライン２の両方の終わりに到着していな
ければ、決定ブロック２２４において、ラインｌの終わ
りに到着したかどうかが確かめられる。もしそうなら、
ラインｌはライン２より短いが、その池では同じである
から切り換えはおこなわれないので、２２６でルーチン
は−ｌに復帰する。ラインｌが終っていなれければライ
ン２が終ったかどうかを２２８で確かめる。もしそうな
ら、２３０で＋１値が復帰される。復帰された＋１値は
、ライン２がラインｌより短いので、ラインｌとライン
２は切り換えられなければならないことをしめしている
。line! If the ends of both lines 1 and 2 have not been reached, decision block 224 determines whether the end of line l has been reached. If so,
Line l is shorter than line 2, but since they are the same in that pond, no switch occurs, so the routine returns to -l at 226. If line 1 has not yet ended, it is checked at 228 whether line 2 has ended. If so, the +1 value is returned at 230. The +1 value restored indicates that line 1 and line 2 must be switched since line 2 is shorter than line 1.

どちらのラインが他より短いか確認できなければ、「比
較」は、アルファグラミック順位をきめるために、ライ
ン１およびライン２の次の単語または音節を点検する。If it cannot be ascertained which line is shorter than the other, Compare examines the next word or syllable in line 1 and line 2 to determine the alphagrammatic order.

単語が両方ともＰＣＬ単語でなければ、例えば、片方が
英単語であったりして、これらの単語は、ステップ２４
０−２５０に順に入れられる。指示ブロック２４０で、
ポインタのエンドｌおよび英語２は、ラインｌおよびラ
イン２にそれぞれ入っている現在単語のあとにくるスペ
ースのアドレスにセットされる。習慣として、ＰＣＬの
各単語はスペースで区切られている。従って、スペース
は、ラインｌおよびライン２の中身の単語対単語の比較
において、餌料な区切り記号の役を果たｔ。多数の余白
、制御コード、ゼロ子音、その他無関係の文字は無視す
る。［１およびＩ２は、これらの文字や記号をバイパス
するために増加ずろ。この場合、これら２つの計数器の
数値は、いつも等しいとは限らない。If the words are not both PCL words, e.g. one of them is an English word, then these words are passed to step 24.
They are entered in order from 0 to 250. At instruction block 240,
The end l and english 2 pointers are set to the address of the space that follows the current word in line l and line 2, respectively. By convention, each word in PCL is separated by a space. Thus, the spaces serve as fodder delimiters in the word-for-word comparisons of the contents of lines 1 and 2. Ignore large amounts of white space, control codes, zero consonants, and other extraneous characters. [1 and I2 are incremented to bypass these letters and symbols. In this case, the values of these two counters are not always equal.

決定ブロック２４２で、現在単語が両方ともＰＣＬであ
るかどうかを確かめる検査をする。もしそうでなければ
、ブロック２４４で、現在２単語に「比較テキスト」機
能が応用される。「比較テキスト」は！■によって表示
された現在位置からエンドＩによって表示された終わり
位置までの、ラインｌの部分に入った各文字を点検する
。同様にして、「比較テキスト」は、Ｉ２からエンド２
までのライン２の中身を点検する。これら２単語は、厳
格にアルファベット順に、例えば、標準ＡＳＣＩＩ分類
順位に従って、比較される。「比較テキスト」は、一般
の辞典規則により、ライン２の単語に等しいか、それ以
下またはそれ以上のライン１の単語に等しいか、それ以
下またはそれ以上であるかによって、０１−１、または
＋１に等しいＣＭＰ値に復帰する。Decision block 242 tests to see if the current words are both PCL. If not, at block 244, a "comparison text" function is applied to the current two words. “Comparison text” is! Inspect each character that has entered the portion of line l from the current position indicated by ■ to the end position indicated by end I. Similarly, the "comparison text" is from I2 to end 2.
Check the contents of line 2 up to. These two words are compared strictly alphabetically, eg, according to standard ASCII classification order. The "comparison text" is 01-1, or +1, depending on whether it is equal to, less than, or more than the word on line 1, according to general dictionary rules. Returns to a CMP value equal to .

２４６で、ＣＭＰ＝０かどうかを確かめる。もしそうな
ら、現在単語は同一であり、切り換えの必要はない。２
４８で、Ｉｔ−エンドｌおよびＩ２−エンド２にセット
することにより、ルーチンは次の単語に進む。次に、ブ
ロック２０５で、次の単語の点検を始めるために、■！
およびＩ２をそれぞれｌたけ増加する。At 246, it is checked whether CMP=0. If so, the words are now the same and there is no need to switch. 2
At 48, the routine advances to the next word by setting It-Endl and I2-End2. Next, in block 205, ■! to start checking the next word!
and I2 are each increased by l.

もしＣＭＰが０に等しくなければ、ブロック２５０で「
比較」はＣＭＰに復帰する。すなわち、ＣＭＰ値は、ラ
イン１の現在単語がライン２の現在単語よりも小さいか
大きいかによって、−１または＋■となる。後者の場合
、ラインは切り換えられることになる。If CMP is not equal to 0, block 250
"Compare" returns to CMP. That is, the CMP value is -1 or +■ depending on whether the current word on line 1 is smaller or larger than the current word on line 2. In the latter case, the line will be switched.

もし、決定ブロック２４２で、両方の現在単語がＰＣＬ
単語、すなわち、ｐｃｗまたはＰＰＣＷであることが確
認されたならば、これらは音節対音節（表意文字対表意
文字）で比較されなければならない。これは、ステップ
２６０−２８４でおこなわれる。If, at decision block 242, both current words are PCL
Once a word has been identified, ie, pcw or PPCW, these must be compared syllable-to-syllable (ideograph-to-ideograph). This occurs in steps 260-284.

第１２８図を見ると、指示ブロック２６０で、最初の単
語の語尾、または現在ｐｐｃｗの第■音節が、萌述の分
離論理を使っていることが分かる。Referring to FIG. 128, it can be seen that in instruction block 260, the ending of the first word, or the current syllable of ppcw, uses Moe's separation logic.

「分離」サブルーチンは、エンドシルＩおよびエンドシ
ル２の値に復帰する。エンドシルｌは、１１およびエン
ドｌの間に起こるラインｌの第１音節の終わりの索引を
表示する。エンドシル２はライン２の次の音節の終わり
の索引である。The "separate" subroutine returns the values of Endosil I and Endosil 2. Endosil l indicates the index of the end of the first syllable of line l that occurs between 11 and end l. Endosil 2 is the index of the end of the next syllable in line 2.

両音節の終わりが見つかったら、ブロック２２２で現在
音節の声調が比較される。これは、「声調比較」と呼ば
れるサブルーチンでなされる。「声調テキスト」に似て
いるが、同声調語はアルファグラミック・リストととも
に現れなければならず、かつまた、各単語に関しては、
アルファベット類でなければならないという上記説明ず
みの規Ｉりに従って修正されている。また、同じ頭文字
で始まるＰＰＣＷを分離させる語尾の文字を無視する。Once both syllable ends are found, the tones of the current syllable are compared at block 222. This is done in a subroutine called "Tone Comparison." Similar to "tonal text", but tonal words must appear with an alphagrammatic list, and also for each word,
It has been modified according to Rule I, explained above, that it must be alphabetic. It also ignores the characters at the end of words that separate PPCWs starting with the same initial letter.

「声調比較」のもう１つの有利な特徴は、所定の声調・
音節をもつ全ての同声調語を、同じ発音をもつ１つの先
決された形に変えてから「比較テキスト」を適用するこ
とである。Another advantageous feature of "tone comparison" is that
The idea is to change all tonal words with syllables into one predetermined form with the same pronunciation and then apply the "comparison text".

「声調比較」は、ＣＭＰＴ値に復帰するが、これは、両
方の現在音節が同声調語ならＯであり、ラインｌの現在
音節がライン２の現在音節よりも小さいか大きいかによ
って、−１または＋１となる。"Tone Comparison" returns the CMPT value, which is O if both current syllables are the same tone word, and -1 depending on whether the current syllable in line l is smaller or larger than the current syllable in line 2. Or +1.

ブロック２６４で、もしＣＭＰＴがＯに等しくなければ
、「比較」は指示ブロック２６６で、ＣＭＰＴ値に復帰
する。しかし、ＣＭＰＴが０に等しければ、両方の現在
音節は同声調語だが、さらにそれらがアルファベット順
になっているかを確認しなければならない。これを成し
とげるために、「比較」は、次ぎに上述のサブルーチン
「比較テキスト」を現在音節に適用する。ＰＣＬ文字除
比較に当り、「比較テキスト」は、標準的ＡＳＣＩＩ分
類法に似た規則に従う。このシステムは、英語の文字に
まねて、「分類」がＰＣＬ文字をアルファベット順に並
べられるように、ＡＳ（、ＩＩ文字のセットに指定され
る値よりも高いデイジタル値をＰＣＬ文字に指定する。At block 264, if CMPT is not equal to O, the "compare" returns at instruction block 266 to the CMPT value. However, if CMPT is equal to 0, both current syllables are tonic, but we must also check that they are in alphabetical order. To accomplish this, Compare then applies the subroutine Compare Text described above to the current syllable. For PCL character exclusion comparisons, the "comparison text" follows rules similar to standard ASCII taxonomy. This system assigns digital values to PCL characters that are higher than the values assigned to the AS(, II character set) so that the "classification" allows the PCL characters to be ordered alphabetically, to mimic English characters.

指示ブロック２６８て、「比較テキスト」は」二記に似
た方法でＣＭＰ値に復帰する。At instruction block 268, the "comparison text" is returned to the CMP value in a manner similar to section 2.

２７０で、ＣＭＰがＯに等しいかを確認する。もしそう
でなければ、２７２で「比較」は−１または＋１のＣＭ
Ｐ値に復帰する。At 270, check whether CMP is equal to O. If not, in 272 "comparison" is -1 or +1 CM
Returns to P value.

しかし、もしＣＭＰがＯに等しければ、２つの現在音節
は同声調語であると同時にアルファベット的にも同一で
ある。２７４で、ルーチンは、ｌＩ−エンドシルＩとＩ
２−エンドシル２にセットすることにより、次の現在音
節に進む。However, if CMP is equal to O, the two current syllables are tonal as well as alphabetically identical. At 274, the routine selects lI-endocyl I and I
2 - Advance to the next current syllable by setting endocill 2.

２７６で、ルーチンは、いずれかの単語が終わりにきた
かどうかを確かめるテストをする。すなわち、ＩＩがエ
ンドＩよりら小さく、同様にＩ２がエンド２よりも小さ
いかどうかを確認する。両方の単語がまだ終わりにきて
いなければ、システムは、ライン１とライン２の次ぎに
くる２つの音節の最後の文字を確認し、「声調比較」を
適用するためにブロック２６０に戻る。At 276, the routine tests to see if any words have come to an end. That is, it is checked whether II is smaller than end I and similarly I2 is smaller than end 2. If both words are not yet finished, the system checks the last letters of the next two syllables in lines 1 and 2 and returns to block 260 to apply the "tone comparison."

しかし、もし指示ブロック２７６で、片方の単語だけが
終わりにきたならば、ブロック２７８で両方の単語が終
わりに到着したか確認する。もしそうなら、ルーチンは
、［１もＩ２も１ずっ増加され、次の２つの現在単語が
比較される指示ブロック２０５に進む。However, if instruction block 276 indicates that only one word has reached the end, block 278 determines whether both words have reached the end. If so, the routine advances to instruction block 205 where both [1 and I2 are incremented by 1 and the next two current words are compared.

決定ブロック２７８で片方のみの単語が終わりに到着し
たことが確認されたら、次にはブロック２８０でライン
！の現在単語が終わりに到着したかどうかを確認する。Once decision block 278 confirms that only one word has reached the end, block 280 determines that the line! Check if the current word of has reached the end.

もしまだなら、すなわち、らし１１がエンド１よりも小
さく、ライン１の現在単語がライン２の現在単語がライ
ン２の現在単語よりも長ければ、２つのラインを切り換
えなければならない。従って、ブロック２８２で、ルー
チンは＋１の値に復帰する。一方、もし１１＝エンド１
なら到着したライン１の現在単語の終わりであり、切り
換えは不要である。従って、ブロック２８４で−１の値
が復帰される。If not already, ie, if Rashi 11 is less than End 1 and the current word in line 1 is longer than the current word in line 2, then the two lines must be switched. Therefore, at block 282, the routine returns to the +1 value. On the other hand, if 11 = end 1
If so, it is the end of the current word of line 1 that has arrived, and no switching is necessary. Therefore, a value of -1 is returned at block 284.

Ｄ　キーボードコンピュータ・システム、ワード・プロセッサ、その他
にＰＣＡを入力するのに特に能率的なキーボードが、第
１θ図に例示されている。キーボードの物理的配列は、
標準型のＱＷＥＲＴＹキーボードと同一で、標準的なＱ
ＷＥＲＴＹシンボルは、それぞれのキーの位置の下方に
示されている。それぞれのキーの位置に対応するＰＣＡ
文字は、ＱＷＥＲＴＹキーボード文字の上に示されてい
る。D Keyboard A keyboard that is particularly efficient for entering PCA into computer systems, word processors, etc. is illustrated in FIG. The physical layout of the keyboard is
Identical to the standard QWERTY keyboard, the standard Q
A WERTY symbol is shown below each key position. PCA corresponding to each key position
The characters are shown above the QWERTY keyboard characters.

２つのＰＣＡ文字は、それぞれのキーの位置を示してい
る。左上の文字は、キーボードの大文字の位置（シフト
・キーを押した位置）に対応し、それぞれのキーの位置
の右下の文字は、そのキーの位置の小文字の位置に対応
する。このキーボードは、タイピストまたはキーボード
操作員が、データ・プロセシングまたはワード・プロセ
シング壷システムにＰＣＬ情報を入力できるように、最
高の能率を上げるべくデザインされている。Two PCA letters indicate the position of each key. The upper left letter corresponds to the uppercase letter position on the keyboard (the position where the shift key is pressed), and the lower right letter of each key position corresponds to the lowercase letter position for that key position. This keyboard is designed for maximum efficiency so that a typist or keyboard operator can enter PCL information into a data processing or word processing font system.

能率的なキーボード配列については、多数の出版物があ
る。最も有名な考は、Ａ−Ｄｖｏｒａｋ他の著、ニュー
・ヨーク州アメリカン・ブック・カンパニー、１９３６
年出版０「タイピング・ビヘイビア−（行動）」と題す
る本である。この本は、キーボードに並べる文字は、タ
イピストが、ホームキー（ＱＷＥＲＴＹキーボードの“
ａ、　ｓ、　ｄ、　ｒ、　ｊ。There are numerous publications on efficient keyboard layouts. The most famous idea is A-Dvorak et al., American Book Company, New York, 1936.
The book is titled ``Typing Behavior'', published in 2010. In this book, typists are responsible for arranging the characters on the keyboard using the home key ("" on a QWERTY keyboard).
a, s, d, r, j.

ｋ、　ｌ”のキー）からできる限り指を動かさないです
むよう、統計に基づいて決めることを示唆している。そ
のため、最も頻繁に使われるキーのグループは、ホーム
・ロウ（列）（第１０図の３列目）に並べてあり、次ぎ
に頻繁に使われるキーのグループは、ホーム・ロウのす
ぐ上（第１０図の２列目）に並べられており、３番目に
頻繁に使われるキーのグループは、ホーム・ロウのすぐ
下（第１０図４列目）に並べられており、はんのたまに
しか使われないキーのグループは、ホーム・ロウ２列上
（第１０図の最上列）に並べられている。各列中、最も
頻繁に使われるキーは、人差し指の当たるキーであり、
２番目に頻繁につかわれるキーは、中指のキーであり、
３番目に頻繁に使われるキーは、薬指のキーであり、４
番目に頻繁に使われるキーは、小指のキーである。K, L" keys) to avoid moving your fingers as much as possible. Therefore, the most frequently used group of keys is the home row (row 10). The next most frequently used group of keys is the third most frequently used key group, which is arranged directly above the home row (second column in Fig. 10). The groups of keys are arranged immediately below the home row (fourth row in Figure 10), and the groups of keys that are only occasionally used are arranged two rows above the home row (the top row in Figure 10). ).In each row, the most frequently used keys are the keys that the index finger hits;
The second most frequently used key is the middle finger key.
The third most frequently used key is the ring finger key, 4
The most frequently used key is the little finger key.

Ｄ　ｖｏｒａｋ・システムは、一般に最も能率的とされ
ているが、なるべく左右の指を交互に使うための考慮は
なされていない。本発明のキーボードは、全ての子音と
、できることなら全ての半子音をキーボードの右側に寄
せ、操作員が右手で打てるようにしである。最も頻繁に
使われる母音は、キーボードの左側に並べである。キー
ボードの左側のキーの数よりも母音声調の数の方が多い
ので、ある母音声調はキーボードの右側に並べなければ
ならない。ここで使われている通り、キーボードの左側
とは、第１０図の太線より左側のキーの事である。これ
らのキーは、左手で打つ。キーボードの右側とは、第１
０図の太線より右側のキーのことである。これらのキー
は右手で打つ。Although the D vorak system is generally considered to be the most efficient, no consideration is given to using the left and right fingers alternately as much as possible. The keyboard of the present invention has all consonants and preferably all semi-consonants on the right side of the keyboard so that the operator can type them with his right hand. The most frequently used vowels are on the left side of the keyboard. Since there are more vowel tones than keys on the left side of the keyboard, certain vowel tones must be placed on the right side of the keyboard. As used herein, the left side of the keyboard refers to the keys to the left of the bold line in Figure 10. These keys are played with the left hand. The right side of the keyboard is the first
This is the key to the right of the thick line in Figure 0. These keys are typed with the right hand.

本発明はまた、キーボードの大文字、小文字の機能とし
てのキーボードの文字を、どこにおくから決めるもので
ある。ＰＣＡには８５文字あるので、全てをキーボード
の小文字のところに並べることはできない。小文字のと
ころには、４３文字しか入らない。第１θ図に示された
特殊な文字を選ぶことにより、その文字の使用頻度に基
づいて、７４パーセントの文字が小文字の位置におかれ
る。The present invention also determines the characters on the keyboard as a function of uppercase and lowercase letters based on where they are placed. There are 85 characters in PCA, so you can't line them all up in the lowercase letters on your keyboard. Only 43 characters can fit in the lower case. By selecting the special characters shown in Figure 1θ, 74 percent of the characters are placed in lowercase positions based on the frequency of use of that character.

本発明のキーボードは、また、母音声調のもつ声調の機
能として、どの文字をどのキーに置くから決める。最も
頻繁に起こる声調は、第四声であるから、この声調をも
つ母音声調をホーム・ロウ（第１θ図の３列目）におく
。２番目に頻繁に起こる声調は、第一声であるから、こ
の声調を持つ全ての母音声調は、２列目におく。３番目
に頻繁に起こる声調は、第二声であるから、この声調を
持つ全ての母音声調は、キーボードの最下列におく。The keyboard of the present invention also determines which character is placed on which key as a function of the tone of the vowel tone. Since the most frequently occurring tone is the fourth tone, the vowel tone having this tone is placed in the home row (third column in Figure 1θ). Since the second most frequently occurring tone is the first tone, all vowel tones with this tone are placed in the second column. Since the third most frequently occurring tone is the second tone, all vowel tones with this tone should be placed on the bottom row of the keyboard.

使用頻度の最も少ない声調は、第三声であるから、この
声調・音節をもつ声調は全てキーボードの最上列におく
。The least frequently used tone is the third tone, so all tones with this tone/syllable are placed on the top row of the keyboard.

キーボードの文字の位置をもつ覚えやすくするために、
第１０図のキーボードは、ある母音声調族の母音声調は
、同じ指で入力できるように、母音声調族を一括した。To make it easier to remember the positions of letters on the keyboard,
In the keyboard shown in FIG. 10, vowel tones of a certain vowel tone family can be input with the same finger, so that the vowel tones of a certain vowel tone family can be input together.

第１０図では、母音声調族４７−５０は、全ての左の小
指で打ち、母音声調族５１−５４は、全て左の薬指で打
ち、母音声調族７１−７４は、全て左の中指で打つ、等
、等である。In Figure 10, vowel tones 47-50 are all struck with the left little finger, vowel tones 51-54 are all struck with the left ring finger, and vowel tones 71-74 are all struck with the left middle finger. , etc.

[Brief explanation of the drawing]

第１図は、中国語の西岸の抑揚を示す図、第２図は、音
声中国語アルファベラ）（ＰＣＡ）が、音アルファベッ
トの音声にどのように対応するかを示す図、第３図は、中国語の全ての音音節の併音表現を解説した
音表図、第４Ａ−４Ｊ図は、中国語の全ての声調・音節の音声中
国語言語表現を示した声調図、第５Ａ、５Ｂ、５Ｃ図は
、音音節”ｉ”、Ｕ”、“ｌのそれぞれが音声中国語ア
ルファベットをつかって１２通りに書きだされる方法を
解説した解説図、第６図は、本発明の音声中国語言語に
従って、音声中国語単語かとり得る可能な形の例示図、
第７Ａ−７Ｄ図は、本発明の分離理論を解説したフロー
・ダイアグラム、第８図は、本発明から作り得るアルファグラミックのリ
ストの見本図、第９　Ａ−９Ｂ図は、本発明の意義的類別記号により同
声調語がどのように分類されるかを示した例示図、第１０図は、本発明に沿ったキーボードの配置のスキー
マ図、第１１図は、音声中国語言語が同声調語をどのように解
答するかを提示した提示図、第１２Ａ、１２［３図は、ＰＣＬテキストのラインをア
ルファグラミック順に配置するのに使う「比較」ルーチ
ンを解説するフロー・ダイアグラム、第１３図は、ＰＣ
Ａをディジタルの形で表すための７−ビットのコードの
例示図である。Figure 1 is a diagram showing the West Bank intonation of Chinese, Figure 2 is a diagram showing how the phonetic Chinese alphabet (PCA) corresponds to the sounds of the phonetic alphabet, and Figure 3 is a diagram showing how the phonetic Chinese alphabet (PCA) corresponds to the sounds of the phonetic alphabet. Figures 4A-4J are tone charts showing the phonetic Chinese language expressions of all tones and syllables in Chinese, Figures 5A and 5B are , 5C is an explanatory diagram explaining how each of the syllables "i", "U", and "l" are written out in 12 ways using the phonetic Chinese alphabet, and Figure 6 is the phonetic Chinese alphabet of the present invention. illustrative diagrams of the possible forms that phonetic Chinese words can take, according to the language;
Figures 7A-7D are flow diagrams explaining the separation theory of the present invention, Figure 8 is a sample list of alphagramics that can be created from the present invention, and Figures 9 A-9B are the significance of the present invention. Figure 10 is a schema diagram of the keyboard layout according to the present invention. Figure 11 is an example diagram showing how homotonal words are classified using the same tone classification symbols. 12A, 12[3 is a flow diagram illustrating the "compare" routine used to arrange lines of PCL text in alphagrammatic order, 13. The figure is a PC
FIG. 2 is an exemplary diagram of a 7-bit code for representing A in digital form;

Claims

[Claims] 1. A method for digitally encoding and storing Chinese ideographic characters consisting of a) - e): a) selecting a set of Chinese ideograms to be encoded and stored; b) a) select a unique digital representation for each selected ideograph, and c) a phonetic phonetic representation capable of forming a phonetic Chinese word (PCW) that can fully identify the pronunciation of the selected ideograph. select a set of characters for the Chinese Alphabet (PCA), d) select a unique digital representation for each PCA character, and e) each selected ideogram and its corresponding P
A monosyllabic dictionary is stored that identifies the one-to-one relationship between each digital representation of the CW. 2. The method of claim 1, wherein said PCA characters display the following linguistic elements: a) a plurality of vowels, b) a plurality of tones associated with the pronunciation of said vowels, and c) a plurality of consonants. 3. The method of claim 2, wherein the vowel comprises: a) a plurality of vowel tones each representing a predetermined vowel sound pronounced in a predetermined tone; and b) a predetermined vowel sound unrelated to tone. Multiple semiconsonants each displayed. 4. The method of claim 3, wherein the plurality of tones includes four tones. 5. The method of claim 4, wherein each vowel tone comprises a base word and an index combined to indicate the tone. 6. The method of claim 4, wherein the consonants include: a) a plurality of short consonants, each representing a respective consonant sound, and b) each representing a consonant sound pronounced with each vowel sound. multiple long consonants, and c) an unvoiced zero consonant. 7. The method of claim 6, wherein each PCW takes the form TS+Q, wherein: a) TS is a tone/syllable in the form of any one of CV, CSV, SV, and V; Here, C is a consonant, S is a semiconsonant, and V is a vowel tone.
Moreover, b) Q is a generalized tone/syllable correction symbol that expresses the meaning for distinguishing homotonous words. 8. The method of claim 7, wherein Q takes the form of one of φ and G, wherein a) φ is an invalid set, and b) G distinguishes a tonal word. It is a generalized semantic classification symbol consisting of PCA characters added to the tone/syllable TS to the extent necessary to make it. 9. No. 8 where G takes the form of either V or Z
The claimed method, wherein a) V is a repetition of the final vowel tone of the tone/syllable, and b) Z is a zero consonant. 10. The method of claim 4, wherein the vowel sound "i" is represented by three different PCA letters. 11. The vowel sounds “u” and “■” are two different PCs.
11. The method of claim 10, each represented by a letter A. 12. The PCA has only one vowel sound “i”, “u”
255 homotonal words can be distinguished for PCWs containing ” or “■”; 170 homotonal words can be distinguished for PCWs ending in vowel sounds “i”, “u” or “■”; 12. The method of claim 11, wherein 85 tonal tones are distinguishable for all other PCWs. 13. Four of said vowel tones each represent a vowel sound "e" pronounced in said four tones; Furthermore, “b”, “
7. The method of claim 6, further comprising displaying the vowel sound "i" when followed by "p", "m", "f", and semiconsonants. 14. Four of said vowel tones are pronounced in said four tones. The vowel sound “e” is displayed respectively, but “b”, “p”,
14. The method of claim 13, wherein the vowel sound of "m", "f", and "o" when followed by a semiconsonant is displayed. 15. Four of the vowel tones represent four tones, and when written by themselves or accompanied by a zero consonant, they represent the vowel sound "er", and when followed by a short consonant, the vowel sound 15. The method of claim 14 for displaying an "i". 16. At least about 8, most frequently used in Chinese.
10. The method of claim 9, comprising selecting 000 major combinations of ideograms. 17. Claim 16, wherein at least about 3,900 ideograms of the principal combinations, representing at least 97 percent of the frequency of use, are uniquely identified by PCWs having one of the following forms: TS+φ, TS+V, and TS+Z. the method of. 18. The method of claim 17, wherein all remaining ideograms of Chinese are uniquely identified by PCW in the form TS+G (where G is a PCA character other than V or Z). 19. By using the semantic classifier G, which is a PCA character similar to an ideographic root with a meaning similar to that of the ideogram to be identified, at least about 19. The method of claim 18, wherein 80% are uniquely identified. 20. The method of claim 1, wherein each PCW comprises up to four PCA characters. 21. The second PCW consists of an average of 2.4 characters used frequently.
0 claim method. 22. A keyboard layout method for processing the phonetic Chinese alphabet (PCA), said PCA consisting of a plurality of vowel tones representing each vowel sound pronounced in one of the four tones occurring in Chinese. , said method comprises: a) arranging at least four rows of keys determined in sequence: a top row, a second row, a home row (third row), and a bottom row; b) said four rows; c) the vowel tone and home tone with the most frequently used tone;
Associate row keys. 23.Furthermore, a second method that relates vowel tones as follows:
The method of claim 2: a) associate the second most frequently used tone with the key in the second row, b) associate the third most frequently used tone with the key in the bottom row, and c) associate the fourth most frequently used tone with the key in the bottom row. Associate the tone used for the top row of keys. 24. The method of claim 22, comprising: a) determining the classification of keys in said keyboard operated by the same finger; and b) determining a plurality of vowel tones having the same vowel sound but different tones; Associate the key of each category. 25. The PCA is a second PCA consisting of a plurality of consonants and semiconsonants.
4. The method of claim 4, further comprising: a) determining the relative frequency of use of the consonant, semiconsonant, and vowel tones, and b) frequently used vowel tones and one-handed manipulation. and c) associate frequently used consonants and semiconsonants with keys operated with the other hand. 26. Keys operated with the index finger and frequently used PCs
26. The method of claim 25, comprising associating with the letter A. 27. Keys operated with the middle finger, ring finger, and little finger, and PCA, which is used less frequently than the letter groups described in the previous section.
27. The method of claim 26, comprising associating with characters. 28. Each of the above keys has upper case and lower case conditions,
27. The method of claim 26, wherein said method associates frequently used vowels, consonants, and semiconsonants with lower case conditions of said key. 29. A method of keyboard layout for processing the characters of the Phonetic Chinese Alphabet (PCA), said PCA being a plurality of keyboards representing each vowel sound pronounced in any one of the four tones occurring in Chinese. and further comprising a plurality of consonants and semiconsonants, said method comprising: a) determining a segment of said keyboard having keys operated with the same hand; b) said consonants, semiconsonants; c) associating frequently used vowel tones with keys operated with one hand, and d) frequently used consonants and semiconsonants; Associate the key operated with the other hand. 30. A keyboard for inputting the phonetic Chinese alphabet (PCA) into a computer or similar system, said PCA being 1 when pronounced with any one of the four tones that occur in Chinese. the keyboard further comprises: a) a plurality of keyboards each divided into left and right sections, each to be operated with the left hand; b) keys of one section employed for inputting frequently used vowel tones, and c) keys of the other section employed for inputting frequently used consonants and semiconsonants. 31. The keyboard according to claim 30, wherein a) the keys are divided into four columns: a top row, a second row, a home row (third row), and a bottom row, and b) the keys are arranged in the home row. The key is employed to input the vowel tone with the most frequently used tone. 32. The keyboard of claim 31, wherein: a) the keys are divided into groups of keys designated to be operated with the same finger, and b) at least one group of keys of this kind have the same vowel sound It is adopted to input vowel tones with different tones. 33. The keyboard of claim 32, wherein the keys relatively close to the center of the keyboard are employed for inputting frequently used characters. 34. In the keyboard of claim 33, a) each key has a lower case condition and an upper case condition, and b) the keys in the lower case condition are employed for inputting frequently used characters. 35. A method of text processing comprising the following steps: each character identifies a Chinese sound and/or tone, the string includes at least two sets of characters, each set of characters
A phonetic Chinese language that identifies phonetic Chinese words of different lengths, each phonetic Chinese word identifies a unique ideogram, and provides the phonetic and tonal information necessary to pronounce that ideogram. , and further process the consecutive strings in order to clearly identify the beginning and end of each phonetic Chinese word in the string. 36. The method of claim 35, further comprising referencing a stored monosyllabic dictionary to positively identify the ideogram corresponding to each phonetic Chinese word. 37. A method of creating an alphagrammatic list from a set of word strings, each word string containing multiple characters that combine to form one or more phonetic Chinese words, and each phonetic Chinese word displaying a unique Chinese ideograph and providing the phonetic and tonal information necessary to pronounce the ideograph, said characters being pre-alphabetically ordered, said method comprising the following steps: A set of word strings in which the letters in the word strings are listed in alphabetical order are classified in alphagrammatic order, such that (a) all word strings having the same initial letter as the corresponding Chinese ideogram are To the extent that all words listed together and (b) pronounced with the same sound and tone are considered as one unit for alphabetization purposes, said alphabetical order is disregarded and said (a) and (b) )
All word strings belonging to a group are listed alphabetically with respect to each other. 38. A method for processing a string consisting of: a) input a string of syllabic Chinese alphabets (PCA), where: 1) said PCA has multiple vowel tones (V), semiconsonants (S);
), a consonant (C) and a zero consonant (Z); 2) said string contains at least two separate phonetic Chinese words (
PCW), each PCW takes the form TS+Q, where TS is a tone/syllable taking any one of the forms CV, CSV, SV, or V,
Q is a general meaning modifier that takes one of two forms: the PCA letter and the abbreviation of the PCA letter, but when it occurs at the end of a string, Q represents an inverted ideogram. 3) each of said PCs.
W displays the unique Chinese ideogram and provides the phonetic and tonal information necessary to pronounce the ideogram; 4) the PCW displays one of the forms CVC and CSVC; Whenever you follow a PCW you take, the SV
Each PC that takes either one of the forms +Q and V+Q
The W is preceded by a string with a zero consonant, and b) clearly separates said string into each phonetic Chinese word contained therein. 39. The method of claim 38, further comprising referencing a stored monosyllabic dictionary to positively identify the ideogram corresponding to each phonetic Chinese word. 40. The method of claim 38, further comprising: a) a predetermined alphabetical order for said PCA characters, b) inputting at least two of said PCA strings, and c) characters of said string. are listed in alphabetical order, in which (a) all strings having the same initial letter as the corresponding Chinese ideogram are listed together; (b) said alphabetical order is ignored to the extent that all PCWs pronounced with the same sound and tone are considered as one unit for alphabetization purposes, and all columns belonging to said groups (a) and (b) are Listed alphabetically with respect to each other. 41. A method for encoding and storing Chinese ideograms consisting of: a) selecting a set of characters from the phonetic Chinese alphabet that uniquely identifies all tones and syllables of Chinese; b) assigning each PCA character c) select a set of Chinese ideograms to be encoded and stored; and d) select a unique 7-bit digital representation of the Chinese ideograms that uniquely identify each selected ideogram. Select a unique phonetic Chinese word, and e) store a monosyllabic dictionary that identifies a one-to-one relationship between each selected ideogram and each digital representation of the corresponding PCW. 42. The method of claim 41, wherein said 7-bit digital representation for each PCA character is within the range 80H-FFH. 43. The 7-bit digital display is 80H-DF
43. The method of claim 42 within H. 44. The 7-bit digital display is 81H-DE
42. The method of claim 41 within H.