Chinese is made up of numerous Chinese characters, and these Chinese characters are as then very bothering with inputs such as keyboards.For example also can consider only to prepare to be equivalent to the key of the quantity of Chinese character, but not have practical value because of bond number too expands.Therefore, can adopt Chinese character is divided into left side (being called " partially ") and right-hand part (being called " side "), or be divided into the first half (being called " hat ") and Lower Half (being called " pin "), reach the mode that Chinese character is imported in combination that " side " or " hat " reach " pin " by " partially ".Yet, this kind mode also because of must have numerous " partially " keys, " side " key etc. makes efficient extremely low.
Therefore, just begin promptly to utilize in the following way phonetic labelling method with alphabetic flag Chinese pronunciation, and by the mode of alphabetic character key with Roman capitals mark pattern input Chinese character.Rise and fall because of only not differentiating, so represent to rise and fall with tone with the phonetic labelling method again.Circumflex has four kinds, is called " four tones of standard Chinese pronunciation ".The tone of Chinese character is (what have is not) that regulation is arranged, and the common regulation of phonetic mark must be enclosed these four tones of standard Chinese pronunciation.The four tones of standard Chinese pronunciation are defined by the following stated mode.
One: advance with high tone level.Represent with [-].
Two: rise to high pitch by bass.Represent with [/].
Three: being transferred to bass, also gone up again by high pitch is high pitch.Represent with [∨].
The four tones of standard Chinese pronunciation: reduce to bass by high pitch.With [] expression.
Below, represent with example.For example use " ma " of phonetic mark, as add that tone can tell following Chinese character.
M ā (one): mother (mother)
M á (two): fiber crops (numbness)
M ǎ (three): horse (horse)
M à (four tones of standard Chinese pronunciation): scolding (abuse)
As the labelling method of the four tones of standard Chinese pronunciation, the method for the numeral of adopting is arranged also except that above-mentioned symbol.
According to above explanation,, the phonetic labelling method of 1. not paying the four tones of standard Chinese pronunciation is arranged as the phonetic labelling method; 2. pay the phonetic labelling method of the four tones of standard Chinese pronunciation; 3. use the phonetic labelling method of the numeral four tones of standard Chinese pronunciation etc.
Using the phonetic labelling method occasion do not pay the four tones of standard Chinese pronunciation, during retrieval output because of many homonyms corresponding (with reference to aforementioned " ma ") are arranged, so will be specific go out needed Chinese character, operate loaded down with trivial details, cost a lot of times.Therefore, the processing of the four tones of standard Chinese pronunciation is problems in phonetic mark input mode, and how rationally and efficiently to be handled is a technical problem so far always.
The problem that exists during in addition, with the direct mark Chinese of phonetic is to have that the tab character number increases and shortcoming that treatment effeciency is reduced.For example: open the phonetic mark mode of once having reported on the clear 61-20176 communique with the numeral four tones of standard Chinese pronunciation the spy.If in this manner, for example to Chinese " everlasting ", carry out the words of mark with the phonetic mark mode of the numeral four tones of standard Chinese pronunciation, then be " wan4 gu3 chang2 quing1 ", the key operation increased frequency that the Chinese character that input is made up of four words is required, the treatment effeciency significance difference.
Embodiment
Below be elaborated with regard to embodiments of the invention.At first, rule 1 is illustrated.
At this, if the special word literal 2. of order rule 1 and even symbol adopt " j ", special word literal 4. and even symbol adopt " h ", and aforesaid phonetic mark " ma " is adopted rule 1, and be then as follows.
Ma(one): mother
Maj(two): fiber crops
Maa(three): horse
The mah(four tones of standard Chinese pronunciation): scolding
Rule 2 is illustrated for example with that.The above Chinese idioms of aforesaid three words " everlasting " as with paying the input of four tones of standard Chinese pronunciation phonetic labelling method, then are " w à n g ǔ ch á ng q ū ing ".Now it is adopted the present invention.Because of becoming " wanh ", application rule 1 shows the four tones of standard Chinese pronunciation according to rule 2, the 1 words " ten thousand ".The 2nd word begins, and promptly " ancient green for a long time " then only adopts the first initial of phonetic mark, is " gcq ".The whole input just with " wanh gcq " gone.Key operation number of times (hereinafter referred to as the touching number) when utilizing the phonetic labelling method input of paying the four tones of standard Chinese pronunciation is 17 times, and is relative therewith, the touching number when utilizing the inventive method input only 7 times, and number of operations has reduced significantly as can be known.
Begin only to make the corresponding reason of first initial of each word to describe at this to the 2nd word in the rule 2.Adopt aforementioned four-tone tone symbol, though can distinguish Chinese character to a certain extent, word of a word is how not easy to identify because of homonym again.Ma(one only for example) pairing Chinese character just has a lot of words such as " smearing ", " ant ", " fiber crops ", " mill ", " mother ".For the everyday words of forming by two words, how not easy to identify when not having the tone symbol because of homophone with the phonetic mark, go up then very easily identification of circumflex as paying.
On the other hand, in the everyday words that three words above (comprising 3 words) are formed, the homophony language has just reduced widely, but when Chinese input system etc. is imported as alphabet all being used the input of phonetic labelling method, then the touching number will increase.Thereby, reduce for making the touching number, only expected the method for importing with first character of the phonetic mark of word.For example " people " are as representing then to be " r é n m with the phonetic mark
N ", only get its first initial and be expressed as " rm ".Yet, adopt this mark rule that homophone is increased, retrieval is become very bother.
For this reason, the present invention for 2 word everyday words (remittance of 2 words) all import according to the phonetic mark with interior, occasion for the everyday words more than 3 words (the above Chinese idiom of 3 words), its first initial word is imported with the phonetic mark, the 2nd word begins the phonetic mark ellipsis (Roman capitals mark ellipsis) imported with first initial, thereby makes it identification easily and the touching number reduces.
Because so the Chinese of input is through simple significantly, as directly exporting then interrogatory like this.Thereby, the present invention will according to aforesaid regular 1 and the Roman capitals marks of rule 2 inputs be for conversion into the dictionary (the corresponding one by one Chinese character and words dictionary of Roman capitals mark ellipsis) that the Chinese of complete shape uses and be arranged in the Chinese input system, make the simple mark that is transfused to be for conversion into the Chinese of completeness.In case after being for conversion into the Chinese of completeness, just this Chinese can being exported according to required purpose, or be appended processing.
Fig. 2 is the pie graph that an example of the Chinese input system that the inventive method uses is implemented in expression.Among the figure, 1 is the keyboard of being made up of alphabetic character key and other operating key; 2 for accepting from the CPU(CPU (central processing unit) of carrying out various controls after the input of keyboard 1).3 is the storer that is connected, stores various information with CPU2, and above-mentioned Roman capitals mark ellipsis one corresponding Chinese character mark dictionary 3a is included in its inside.4 are the CRT of expression by the various information of CPU2 output, and 5 is that 6 is the output unit of output transform result to being transformed into the treatment circuit that completeness Chinese is handled as required.As output unit 6, can use for example printing equipment.
Below the action of the system that constitutes is thus described.The operator omits the Chinese that input mode will import and imports from keyboard 1 by deferring to above-mentioned phonetic regular 1, rule 2.The Roman capitals mark ellipsis that CPU2 comes in input and storer 3 interior Roman capitals mark ellipsis one corresponding Chinese character mark dictionary 3a contrast, and read corresponding Chinese character.Finish thus Roman capitals mark ellipsis to the conversion process of corresponding Chinese character.Transformation results is that the cache content that for example is stored in the memory buffer (not shown) set in the CPU2 is presented at CRT4.Number of times carries out this operation repeatedly as required, just can finish the Chinese input and handle.So, express the Chinese of input among the CRT4.
For the occasion of the Chinese that CRT is represented by output unit 6 printouts, with the content of the impact damper in the CPU2 directly to output unit 6 printouts.That is to say that such structure is exactly the word processor of Chinese.If when its word processor as Chinese is worked, also be necessary to make it to increase and a kind of the article that is stored in the memory buffer in the CPU2 such as changed and even correct at editting function, as distribute to the operating key that is attached in the keyboard 1 with various functions, with additional or change some softwares, just can adapt to.
In the treatment circuit 5, can carry out for example handling, perhaps handle for be connected the interface that needs with computing machine for be connected the modified tone that needs with communicating circuit.When linking to each other, become Chinese and pass civilian communication device, just can import the computer operation of Chinese as linking to each other with computing machine with order circuit.
According to the present invention, because keyboard 1 can be with common letter key, so can use the ASC II QWERTY keyboard that also has operating key outside 26 letters.
Below, more specifically the present invention is illustrated.
For following Chinese article being imported, carry out the Chinese character conversion and export to handle being illustrated by the present invention.
<Chinese 〉
The Silk Road has connected east, become on traffic main artery between east and west, has promoted cultural exchanges between east and west, commercial trade and friendly exchanges with West Asia, Europe.
In recent years, the Silk Road is reopened in the common decision of China and some friendly countries.This to the friendship between the developing china and the people of various countries, be bound to make new contribution.
With the information of letter input,, then as follows as representing above-mentioned article with the mark of the phonetic band four tones of standard Chinese pronunciation.
<phonetic band four tones of standard Chinese pronunciation mark 〉
s
chóuzh
lù bǎ dōngfāng gēn x
yà,ōuzhōu liánx
q
lái,chéngle dōngx
fāng de jiāotōng yàodào,cùj
n le dōngx
fāng de wénhuà jiāoliú,tōngshāng màoy
hé yǒuhǎo wǎnglái.
j
nniánlái,zhōngguó hé y
xiē yǒuhaǒ guójiā gòngtóng juéd
ng chóngx
n kāifàng s
chóuzh
lù.zhèdu
fāzhǎn zhōngguō rénm
n hé gèguó rénm
n zh
jiān de yǒuy
,y
d
ng hu
zuòchū x
nde gòngxiàn.
As according to common phonetic Roman capitals mark during with the input of this mark its touching number be 437 times.Then represent identical article with phonetic mark ellipsis of the present invention, then as follows.
<phonetic of the present invention omits mark 〉
siczl baa dongfang gen xiyah,ouzhou lianjxql chengjle dongxf d jiaotyd,cuhjinh l dongxf d wenjhjl,tongsmy hej yoouhwl.
jinhnl,zhongguoj hej yixie yoouhgj gonghtjd chongjxkf siczl,zhehduih fazhaan zhonggrm hej gehgrm zhijian d yoouyih,yidh zuochu xind gonghxianh.
Touching number in the case is 221 times, is about 1/2 when importing according to phonetic Roman capitals mark, and visible the present invention improves efficient greatly.
Fig. 3 pays the contrast figure of four tones of standard Chinese pronunciation mark and phonetic omission mark for Chinese character mark, the phonetic of the used Chinese idiom of present embodiment.Can think and collect the information shown in this figure in a large number among the corresponding one by one Chinese character mark of the Roman capitals mark ellipsis dictionary 3a shown in Fig. 2.At this moment, the capacity of the corresponding one by one Chinese character mark of Roman capitals mark ellipsis dictionary 3a is if any 10~120,000 words, and is then enough for common application target.
As above describe in detail, according to the present invention, by easy pair four tones of standard Chinese pronunciation phonetic mark input method and Chinese idiom ellipsis is combined, can make and utilize common alphabetic character key that required key operation number of times minimizing imported in Chinese, to realize rapidly and to import exactly and handle Chinese language input processing method Chinese, that efficient is high that great practical function is arranged.