JPH06195094A

JPH06195094A - Phonogram string display method and speech synthesizing device

Info

Publication number: JPH06195094A
Application number: JP4345866A
Authority: JP
Inventors: Takashi Aso; 隆麻生; Toshiyuki Noguchi; 利之野口; Yasunori Ohora; 恭則大洞
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1992-12-25
Filing date: 1992-12-25
Publication date: 1994-07-15

Abstract

(57) [Abstract] [Purpose] To make it possible, when a speech synthesizer displays a phonetic character string, to show the temporal positional relationship of the syllables by placing the notation of each syllable unit in equally spaced display spaces. [Constitution] A language processing unit divides the input sentence "赤い提燈" ("red lantern") into reading information and tonal component information. The reading information is written, for example, in Roman letters, and is called phonetic text. The phonetic text of each syllable consists of one or more letters, and the syllable whose phonetic text contains the most characters is extracted. In (b) of the figure, the three-character string "CHO" is the longest, so a display space of three characters is reserved for each syllable when the phonetic text is displayed. The phonetic character string of each syllable is then displayed left-justified ((c) of the figure). In this way, the phonetic text is stored one syllable per equally spaced display space.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a phonetic character string display method and a speech synthesizer. More specifically, it relates to a phonetic character string display method and a speech synthesizer that, in speech synthesis for converting character information into speech, display the reading information obtained from text such as sentences written in mixed kanji and kana.

[0002]

[Prior Art] Rule-based speech synthesizers that convert character information into speech and output it have existed for some time. To convert a mixed kanji and kana sentence into speech, such a synthesizer first performs a process (hereinafter, language processing) that converts the sentence into a "reading" and "tonal components" (intonation and accent components). It must then perform a process (hereinafter, acoustic processing) that adjusts the speech parameters according to the "reading" and "tonal components" generated by the language processing and converts them into speech.

[0003] The data passed from language processing to acoustic processing is generally in an editable text format (hereinafter, phonetic text). Phonetic text can easily be corrected when, for example, the result output by the language processing is wrong. In the phonetic text that is generally used, the reading information is written in Roman letters or kana (hiragana or katakana), and tonal component information (accent positions, pause positions, and so on) is embedded in the reading information.

[0004]

[Problem to Be Solved by the Invention] In the conventional rule-based speech synthesizer described above, however, the number of characters in the phonetic character string for each syllable in the phonetic text varies with the kind of sound being represented. For example, the phonetic character string for the syllable "ちょ" (cho) is "CHO", three characters, while that for the syllable "あ" (a) is "A", one character. Because the phonetic character strings representing the readings of the syllables are uneven in length, it is difficult to read the utterance timing (the temporal position at which a given syllable is uttered) from the phonetic character string. That is, rhythm in spoken Japanese is built on syllable units, so the syllables are uttered at nearly equal intervals in time. In the phonetic text, however, the reading strings corresponding to the syllables are not written at equal intervals.

[0005] For example, when the mixed kanji and kana sentence "赤い提燈" ("red lantern") is converted into the reading of a phonetic text, it is written in Roman letters as "AKAICHO-TIN". Broken down into syllable units, this becomes "A", "KA", "I", "CHO", "-", "TI", "N", which shows that the length of the reading string differs from syllable to syllable. When the reading of "赤い提燈" is written in hiragana, it becomes "あかいちょーちん"; here too, only the reading of the syllable "ちょ" is two characters long, so the syllables are not lined up at equal intervals.
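The unevenness described above is easy to check by counting characters. The following Python sketch uses the syllable split given in the text, copied in by hand rather than produced by any language analyzer; the variable names are illustrative, and an ASCII hyphen stands in for the long-vowel mark of the Roman-letter notation.

```python
# Syllables of "AKAICHO-TIN" as listed in the text ("-" marks the long vowel).
romaji_syllables = ["A", "KA", "I", "CHO", "-", "TI", "N"]
# Hiragana reading of the same sentence, split at the same boundaries.
kana_syllables = ["あ", "か", "い", "ちょ", "ー", "ち", "ん"]

# Number of characters in each syllable's reading string.
romaji_lengths = [len(s) for s in romaji_syllables]
kana_lengths = [len(s) for s in kana_syllables]

print(romaji_lengths)  # [1, 2, 1, 3, 1, 2, 1]
print(kana_lengths)    # [1, 1, 1, 2, 1, 1, 1]
```

In both notations the fourth syllable is the longest, so concatenating the strings directly puts later syllables at positions that no longer match their order in time.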

[0006] The present invention has been made in view of the above problem. Its object is to provide a phonetic character string display method and a speech synthesizer that can show the temporal positional relationship of the syllables by placing the notation of each syllable in equally spaced display spaces when a phonetic character string is displayed.

[0007]

[Means for Solving the Problem] A speech synthesizer of the present invention that achieves the above object has the following configuration. It is a speech synthesizer that analyzes a sentence to generate reading information and tonal component information and generates synthesized speech based on this information, comprising: space determining means for determining the display space for the phonetic character string of each syllable based on the number of characters in the phonetic character string that contains the most characters in one syllable; and display means for displaying the phonetic character string of one syllable in the display space determined by the space determining means.

[0008] A phonetic character string display method according to the present invention that achieves the above object comprises the following steps. It is a phonetic character string display method for displaying, as phonetic character strings, reading information obtained by analyzing a sentence, comprising: a space determining step of determining the display space for the phonetic character string of each syllable based on the number of characters in the phonetic character string that contains the most characters in one syllable; and a display step of displaying the phonetic character string of one syllable in the display space determined by the space determining means.

[0009] In the present invention, a phonetic character string is a string of one or more characters or symbols that represents reading information.

[0010]

[Operation] With the above configuration, in the phonetic character strings representing the reading information obtained by analyzing a mixed kanji and kana sentence or the like, the display space for each syllable unit is determined based on the number of characters in the phonetic character string that contains the most characters in one syllable. The phonetic character strings are then placed and displayed one syllable at a time in the display spaces thus determined. Placing the phonetic character strings syllable by syllable in fixed display spaces makes it easy to see how the temporal spacing of the utterance corresponds to the position of the phonetic notation.

[0011]

[Embodiment] A preferred embodiment of the present invention is described below with reference to the accompanying drawings.

[0012] FIG. 1 is a block diagram showing the schematic configuration of the rule-based speech synthesizer of this embodiment. In the figure, 11 is a CPU, which executes the various controls of the synthesizer. 12 is a ROM, which stores the various control programs that the CPU 11 executes, including the control program represented by the flowchart of FIG. 3 described later. 13 is a RAM, which temporarily stores data and the like that the CPU 11 needs when executing the various controls. 14 is an input unit, used to input various data, control commands, and so on. Editing input for the phonetic text obtained by the language processing of the synthesizer is also entered from the input unit 14. 15 is a dictionary in which the readings and accent information of kanji and the like are registered; it is referred to in the language processing that analyzes an input mixed kanji and kana sentence to obtain reading information. 4 is a display unit, which presents various displays such as the phonetic text representing the reading of a sentence. 6 is an acoustic processing unit, which performs speech synthesis from the obtained phonetic text and converts it into a speech signal. The converted speech signal is output by a speaker 16.

[0013] FIG. 2 is a block diagram showing the functional configuration of the rule-based speech synthesizer of this embodiment. In the figure, 1 is a language processing unit, which analyzes an input mixed kanji and kana sentence and generates a "reading" and "tonal components". 2 is a character string length counter unit, which obtains the character string length of the reading notation corresponding to each syllable. 3 is a phonetic text generation unit, which generates phonetic text based on the "reading" obtained by the language processing unit 1. 4 is a display unit, which displays the phonetic text and the like. 5 is an editing unit, which edits the phonetic text displayed on the display unit 4. 6 is an acoustic processing unit, which generates speech from the phonetic text.

[0014] The operation of the rule-based speech synthesizer of this embodiment, configured as above, is described below with reference to the flowchart of FIG. 3.

[0015] FIG. 3 is a flowchart showing the control procedure in the rule-based speech synthesizer of this embodiment. First, in step S11, the language processing unit 1 analyzes the target mixed kanji and kana sentence and generates the "reading" and "tonal component" information. The "reading" information is generated so that the boundary of each syllable can be identified. Then, in step S12, the character string length counter unit 2 counts the length of each phonetic character string representing the "reading" and obtains the number of characters in the longest phonetic character string. For example, "CHO", the reading notation of "ちょ", has a length of three characters.

[0016] In step S13, the notation space for the phonetic character string of one syllable is determined from the maximum character string length. Next, in step S14, the phonetic text generation unit 3 places the reading strings corresponding to the syllables at equal intervals, syllable by syllable. That is, the longest of the character string lengths obtained by the character string length counter unit 2 is taken as the character string length used for the reading notation of every syllable in the phonetic text, and the reading notation of each syllable is placed within that length.
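Steps S12 and S13 amount to taking the maximum over the per-syllable character counts. The following Python sketch shows one way to express that; the function name `determine_display_space` is hypothetical, and the input is assumed to be a list of reading strings already split at syllable boundaries, as step S11 is said to provide.

```python
def determine_display_space(syllables):
    """Return the display space, in characters, shared by every syllable."""
    if not syllables:
        return 0
    # S12: count the characters of each syllable's phonetic string.
    lengths = [len(s) for s in syllables]
    # S13: the longest count becomes the display space of every syllable.
    return max(lengths)


width = determine_display_space(["A", "KA", "I", "CHO", "-", "TI", "N"])
print(width)  # 3
```

Returning 0 for an empty input is a choice made for this sketch; the source does not say how an empty reading is handled.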

[0017] The processing in each of the above steps is further described with reference to FIG. 4, which shows how phonetic text is obtained from an input sentence. The input sentence is "赤い提燈" ("red lantern"), as shown in FIG. 4(a). When it is converted to a Roman-letter reading, the reading "CHO" is the longest at three characters, as shown in FIG. 4(b). This length (three characters) is therefore used as the space for the reading notation of each syllable. A space of three characters is thus given to the reading string of every syllable, and the reading of each syllable is placed within that three-character space (FIG. 4(c)). In FIG. 4(c), the phonetic character strings giving the readings in Roman letters are placed left-justified within the three-character spaces. FIG. 5, by contrast, shows an example in which the phonetic character strings are placed right-justified within the three-character spaces.
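The left-justified and right-justified arrangements can be sketched in a few lines of Python. This is an illustration under assumptions, not the patent's implementation: the function name `layout` and its parameters are hypothetical, and an underscore is used as the fill character only so that the padding is visible in the output.

```python
def layout(syllables, align="left", fill=" "):
    """Place each syllable in a cell as wide as the longest syllable."""
    width = max((len(s) for s in syllables), default=0)
    if align == "left":
        cells = [s.ljust(width, fill) for s in syllables]
    elif align == "right":
        cells = [s.rjust(width, fill) for s in syllables]
    else:
        raise ValueError("align must be 'left' or 'right'")
    return "".join(cells)


syllables = ["A", "KA", "I", "CHO", "-", "TI", "N"]
print(layout(syllables, "left", "_"))   # A__KA_I__CHO-__TI_N__
print(layout(syllables, "right", "_"))  # __A_KA__ICHO__-_TI__N
```

Either way, the n-th syllable always starts at a multiple of the cell width, which is what lets the horizontal position stand for the position in time.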

[0018] FIG. 6 shows how phonetic text is obtained from the input sentence when the reading is written in hiragana. When hiragana is used for the phonetic text, the reading "ちょ" has the longest character string, as is clear from FIG. 6(b). The phonetic character strings are therefore placed in spaces of two characters (FIG. 6(c)).
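The hiragana case works the same way with a two-character cell. The Python sketch below pads with the ideographic space U+3000 instead of an ASCII blank; that choice is an assumption of this example (it presumes a fixed-pitch display on which a kana and U+3000 occupy the same width), and the function name `layout_kana` is hypothetical.

```python
IDEOGRAPHIC_SPACE = "\u3000"  # full-width blank, assumed as wide as one kana


def layout_kana(syllables):
    """Left-justify each kana syllable in a cell as wide as the longest one."""
    width = max((len(s) for s in syllables), default=0)
    return "".join(s.ljust(width, IDEOGRAPHIC_SPACE) for s in syllables)


kana = ["あ", "か", "い", "ちょ", "ー", "ち", "ん"]
line = layout_kana(kana)
print(line)
print(len(line))  # 14
```

Seven syllables in two-character cells give 14 characters, with "ちょ" as the only syllable that fills its cell completely.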

[0019] In the following step S15, the phonetic text generated as described above is displayed on the display unit 4. If there is input for correcting the phonetic text in step S16, the process proceeds to step S17, where the editing unit 5 makes the correction. The display unit 4 may be the display of a character terminal, the console of a workstation, or the like, and the editing unit 5 may be a text editor or the like. After the correction is made in step S17, the process returns to step S15 and the corrected text is displayed.

[0020] If no text correction is needed in step S16, the process proceeds to step S18, where it is determined whether speech output is to be performed. If speech output is instructed, the phonetic text completed as described above is converted into a speech signal by the acoustic processing unit 6 and reproduced through the speaker 16.

[0021] As described above, placing the reading notation corresponding to each syllable in the phonetic text at equal intervals, syllable by syllable, has the effect of clearly showing the utterance timing of each syllable.

[0022] In the above embodiment, the character string length counter unit 2 obtains the character string length of the reading notation of each syllable. If the reading notation of each syllable is fixed in advance, however, the lengths need not be obtained every time. In that case the character string length counter unit 2 is unnecessary, and the maximum length of the reading notations, which is known in advance, can be used as the character string length for the reading notation of each syllable in the phonetic text.
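This variant can be sketched as a fixed notation table whose maximum length is computed once. In the Python example below, `SYLLABLE_NOTATION` is a small hypothetical excerpt made up for illustration; a real table would cover every syllable the synthesizer can produce.

```python
# Hypothetical excerpt of a fixed kana-to-Roman notation table.
SYLLABLE_NOTATION = {
    "あ": "A", "か": "KA", "い": "I", "ちょ": "CHO",
    "ー": "-", "ち": "TI", "ん": "N",
}

# Known in advance: computed once from the table, not once per sentence.
FIXED_WIDTH = max(len(v) for v in SYLLABLE_NOTATION.values())


def layout_fixed(kana_syllables):
    """Lay out syllables in cells of the precomputed width, left-justified."""
    return "".join(SYLLABLE_NOTATION[k].ljust(FIXED_WIDTH) for k in kana_syllables)


print(repr(layout_fixed(["あ", "か", "い"])))  # 'A  KA I  '
```

One consequence of this design is that the cell width no longer depends on the sentence: the example sentence contains no three-letter syllable, yet each cell is still three characters wide.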

[0023] In the above embodiment, a fixed character string length is given as the space for the reading notation of each syllable in the phonetic text, and the reading notation of each syllable is written within that length. Consequently, when the reading notation of a syllable is shorter than the given length, blank characters are inserted. Alternatively, the character width may be changed so that the reading notation fills the given space. Display examples for that case are shown in FIGS. 7 and 8. FIG. 7 is an example of phonetic character strings in Roman letters, and FIG. 8 an example in hiragana. In these cases, the display unit 4 in FIG. 2 calculates the character width so that the phonetic character string of each syllable fits within the given display space, and displays the phonetic text at that character width.
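The source does not give the width formula, but the simplest reading is that each character gets an equal share of its syllable's cell. The Python sketch below works under that assumption; the function name `char_widths` and the cell width of 24 display units are hypothetical, and syllables are assumed to be non-empty.

```python
def char_widths(syllables, cell_width):
    """Return, per syllable, the width of one character so the syllable fills its cell."""
    return [cell_width / len(s) for s in syllables]


syllables = ["A", "KA", "I", "CHO", "-", "TI", "N"]
widths = char_widths(syllables, cell_width=24)
print(widths)  # [24.0, 12.0, 24.0, 8.0, 24.0, 12.0, 24.0]
```

A one-character syllable such as "A" is drawn three times as wide as each letter of "CHO", so every syllable spans the same 24 units without any blank padding.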

[0024] As described above, when displaying phonetic character strings, the speech synthesizer of this embodiment places the reading string corresponding to each syllable at a suitable position within its display space, or changes the character width as appropriate, so that the reading notations are lined up at equal intervals syllable by syllable. This clearly expresses the temporal positional relationship of the syllables.

[0025] The present invention may be applied to a system composed of a plurality of devices or to an apparatus consisting of a single device. Needless to say, the present invention is also applicable where it is achieved by supplying a program to a system or an apparatus.

[0026]

[Effect of the Invention] As described above, according to the phonetic character string display method and speech synthesizer of the present invention, placing the notation of each syllable unit in equally spaced display spaces when displaying a phonetic character string makes it possible to show the temporal positional relationship of the syllables, and makes the temporal relationship between syllables easy to grasp.

[0027]

[Brief description of drawings]

[FIG. 1] A block diagram showing the schematic configuration of the rule-based speech synthesizer of this embodiment.

[FIG. 2] A block diagram showing the functional configuration of the rule-based speech synthesizer of this embodiment.

[FIG. 3] A flowchart showing the control procedure in the rule-based speech synthesizer of this embodiment.

[FIG. 4] A diagram showing how phonetic text is obtained from an input sentence.

[FIG. 5] A diagram showing an example in which phonetic character strings are placed right-justified within three-character spaces.

[FIG. 6] A diagram showing how phonetic text is obtained from an input sentence when the reading is written in hiragana.

[FIG. 7] A diagram showing an example in which phonetic character strings in Roman letters are displayed with the character width changed to match the size of the display space.

[FIG. 8] A diagram showing an example in which phonetic character strings in hiragana are displayed with the character width changed to match the size of the display space.

[Explanation of symbols]

1: language processing unit; 2: character string length counter unit; 3: phonetic text generation unit; 4: display unit; 5: editing unit; 6: acoustic processing unit; 11: CPU; 12: ROM; 13: RAM; 14: input unit; 15: dictionary; 16: speaker

Claims

[Claims]

1. A speech synthesizer that analyzes a sentence to generate reading information and tonal component information and generates synthesized speech based on this information, comprising: space determining means for determining the display space for the phonetic character string of each syllable based on the number of characters in the phonetic character string that contains the most characters in one syllable; and display means for displaying the phonetic character string of one syllable in the display space determined by the space determining means.

2. A speech synthesizer that analyzes a sentence to generate reading information and tonal component information and generates synthesized speech based on this information, comprising: counting means for counting the number of characters in the phonetic character string of each syllable unit representing the reading information; extraction means for extracting, from the result of the counting by the counting means, the phonetic character string having the most characters; space determining means for determining the display space for the phonetic character string of each syllable based on the number of characters in the phonetic character string extracted by the extraction means; and display means for displaying the phonetic character string of one syllable in the display space determined by the space determining means.

3. The speech synthesizer according to claim 1 or 2, wherein the display means displays the phonetic character string of one syllable in each display space determined by the space determining means and, if space is left over when the phonetic character string corresponding to a syllable is placed in the display space, leaves that space blank.

4. The speech synthesizer according to claim 1 or 2, wherein the display means displays the phonetic character string of one syllable in each display space determined by the space determining means, changing the character width for each syllable so that the phonetic character string fits within the given display space.

5. A phonetic character string display method for displaying reading information for speech synthesis as phonetic character strings, comprising: a space determining step of determining the display space for the phonetic character string of each syllable based on the number of characters in the phonetic character string that contains the most characters in one syllable; and a display step of displaying the phonetic character string of one syllable in the display space determined by the space determining means.

6. A phonetic character string display method for displaying reading information for speech synthesis as phonetic character strings, comprising: a counting step of counting the number of characters in the phonetic character string of each syllable unit representing the reading information; an extraction step of extracting, from the result of the counting in the counting step, the phonetic character string having the most characters; a space determining step of determining the display space for the phonetic character string of each syllable based on the number of characters in the phonetic character string extracted in the extraction step; and a display step of displaying the phonetic character string of one syllable in the display space determined in the space determining step.