JPH09258763A

JPH09258763A - Voice synthesizing device

Info

Publication number: JPH09258763A
Application number: JP8060682A
Authority: JP
Inventors: Mikio Sugiyama; 実輝雄杉山
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-03-18
Filing date: 1996-03-18
Publication date: 1997-10-03

Abstract

PROBLEM TO BE SOLVED: To provide a voice synthesizing device that can read out correctly even in the case of including special words needing special way of reading. SOLUTION: When a kana-kanji mixed sentence to be read out is inputted, a read-out control means 1 supplies it to a tag information retrieving means 2, and a sentence implementor retrieves tag information added at the time of preparing a sentence. In the case of detecting tag information that assigns the way of reading, the special word information of this tag is registered in a special word dictionary 3. In the case of detecting tag information that assigns another way of reading in the same notation, an object word is stored in a read-out buffer 4, substituting for the phoneme-rhythm information of this tag, and the object word is stored in a display buffer 8, substituting for the noted information of this tag. A next processing means 6 converts read-out data, stored in the read-out buffer 4, into a pronounciation mark row, referring to the special word dictionary 3 and a general word dictionary 5 and supplies it to a voice synthesizing means 7.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、漢字仮名混じり文
に対して言語処理を施し、その結果を音声合成すること
により音声として読み上げる音声合成装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice synthesizing apparatus for reading a voice as a voice by performing language processing on a sentence containing kanji and kana and synthesizing the result.

【０００２】[0002]

【従来の技術】近年、日本語の漢字仮名混じり文章に対
して言語処理を施し、その結果に対して音声合成処理を
行う事により音声として読み上げる音声合成装置が実用
化されている。この音声合成装置では、漢字仮名混じり
文章内の単語に対する読み情報やアクセント情報等を登
録した単語辞書を参照して言語処理が行われ、その結果
を用いて読み上げが行われる。2. Description of the Related Art In recent years, a speech synthesizer has been put into practical use, in which a sentence containing a mixture of Japanese kanji and kana is subjected to language processing, and the result is subjected to speech synthesis processing to read it out as speech. In this speech synthesizer, language processing is performed by referring to a word dictionary in which reading information, accent information, and the like for words in a sentence mixed with Kanji and kana are registered, and the result is used to read aloud.

【０００３】しかしながら、漢字仮名混じり文章中の単
語には、一般的で汎用的な単語の他にも、一般的でない
固有名詞や専門用語などの特殊単語も含まれている場合
がある。汎用的な単語のみならず、一般的でない単語に
対しても正しい読みやアクセントを付与するためには、
これら全ての単語を１つの単語辞書に登録すれば良い
が、実際上、これら全ての情報を網羅するように１つの
単語辞書を構成することは不可能である。このため、読
み上げられた文章は、必ずしも文章作成者が意図したも
のとはならない。However, the words in the kanji and kana mixed sentences may include not only general and general-purpose words but also special words such as uncommon proper nouns and technical terms. To give correct reading and accent not only to general-purpose words but also to uncommon words,
Although it is sufficient to register all these words in one word dictionary, it is practically impossible to configure one word dictionary so as to cover all of these pieces of information. Therefore, the read sentence is not always what the sentence creator intended.

【０００４】従来、この種の音声合成装置は、例えば特
開平４−３３１９９８号公報に記載されている。この公
報に記載されている音声合成装置は、単語辞書に登録さ
れていない単語に対して、読み方やアクセントなどの情
報を定義した特殊単語情報を文章中に埋め込み、この情
報を利用して、文章作成者の意図した読み方で正確な読
み上げを行なわせている。Conventionally, this type of speech synthesizer is described in, for example, Japanese Patent Application Laid-Open No. 4-331998. The speech synthesizer described in this publication embeds in a sentence special word information that defines information such as reading and accent for a word that is not registered in a word dictionary, and uses this information to write a sentence. Accurate reading is performed according to the reading intended by the creator.

【０００５】この公報に記載された発明について図面を
参照して説明する。図１５は、公報に記載された音声合
成装置のブロック図である。図１５に示すように、この
音声合成装置は、読み上げの対象となる仮名漢字混じり
文章に対する処理を行うため、入力仮名漢字混じり文章
中には、埋め込まれた特殊単語情報を抽出し、これを特
殊単語情報バッファ１０１に保存するとともに、文章中
の特殊単語情報をこの特殊単語の表記に置き換える特殊
単語抽出部１０２と；特殊単語情報が特殊単語の表記に
置き換えられた結果の仮名漢字混じり文章を解析して、
これを音韻、韻律情報に変換する言語処理部１０３と；
言語処理部１０３により得られた音韻、韻律情報を、特
殊単語情報バッファ１０１に保存されている特殊単語の
音韻、韻律情報に差し替える特殊単語変換部１０４と；
特殊単語変換部１０４からの音韻、韻律情報に基づき音
声合成を行い音声の読み上げを行う音声合成部１０５と
から構成される。The invention described in this publication will be described with reference to the drawings. FIG. 15 is a block diagram of the speech synthesizer described in the publication. As shown in FIG. 15, this speech synthesizer performs processing on a sentence containing Kana-Kanji mixed characters to be read aloud. Therefore, embedded special word information is extracted from the input Kana-Kanji mixed sentence and the special word information is extracted. A special word extraction unit 102 which stores the special word information in the sentence into the notation of this special word while saving it in the word information buffer 101; and analyzes a sentence mixed with kana-kanji resulting from the special word information being replaced with the notation of the special word. do it,
A language processing unit 103 for converting this into phoneme / prosodic information;
A special word conversion unit 104 for replacing the phoneme and prosody information obtained by the language processing unit 103 with the phoneme and prosody information of the special word stored in the special word information buffer 101;
It is composed of a voice synthesis unit 105 which performs voice synthesis based on the phoneme and prosody information from the special word conversion unit 104 and reads the voice.

【０００６】図１６は、図１５に示すシステムに入力さ
せて音声合成装置を行わせる仮名漢字混じり文章の例を
示している。図１６において、特殊単語（ＭＩＣやＬＩ
ＮＥ）については、この表記情報（ＭＩＣ、ＬＩＮＥ）
と音韻、韻律情報（マ‘イク、ラ’イン）とをコロ
ン（：）で区分し、これらの情報からなる特殊単語情報
全体をセミコロン（；）で挟んで、処理対象となる仮名
漢字混じり文章に予め埋め込んでおく。FIG. 16 shows an example of a kana-kanji mixed sentence which is input to the system shown in FIG. In FIG. 16, special words (MIC and LI
For NE), this notation information (MIC, LINE)
And phonological / prosodic information (ma'iku, la'in) are separated by a colon (:), and special word information consisting of these information is sandwiched by a semicolon (;), and a sentence containing kana-kanji to be processed is mixed. Embedded in advance.

【０００７】次に、動作について説明する。いま、文章
作成者によって、例えば図１６に示すような仮名漢字混
じり文章が作成されているとし、これを図１５のシステ
ムに入力させると、特殊単語抽出部１０２は、仮名漢字
混じり文章中に埋め込まれた特殊単語情報を抽出する。
すなわち、仮名漢字混じり文章においてセミコロ
ン（；）を検出すると、このセミコロン（；）からその
後ろのコロン（：）までの間を特殊単語の表記情報とし
て抽出し、このコロン（：）からその後ろのセミコロン
（；）までの間を特殊単語の音韻、韻律情報として抽出
する。この結果、例えば、特殊単語情報「；ＭＩＣ：マ
‘イク；」については、「ＭＩＣ」が表記情報として抽
出され、「マ’イク」が音韻、韻律情報として抽出され
る。Next, the operation will be described. Now, assuming that the sentence creator has created a sentence mixed with Kana-Kanji characters, for example, as shown in FIG. 16, and inputting this into the system of FIG. 15, the special word extraction unit 102 embeds it in the sentence mixed with Kana-Kanji characters. Extracted special word information.
That is, when a semicolon (;) is detected in a sentence containing Kana-Kanji, the space between this semicolon (;) and the colon (:) after it is extracted as the notation information of the special word, and the colon (:) The part up to the semicolon (;) is extracted as phoneme and prosody information of the special word. As a result, for example, with respect to the special word information “; MIC: ma'iku;”, “MIC” is extracted as the notation information, and “ma'iku” is extracted as the phoneme and prosody information.

【０００８】特殊単語抽出部１０２は、このように特殊
単語情報を抽出すると、その結果を特殊単語情報バッフ
ァ１０１に保存する。また、セミコロンからセミコロン
までの文字列をこの特殊単語の表記に置き換え、さらに
仮名漢字混じり文章中で特殊単語に対応する部分を明確
にマーキング（アンダーライン）を施す。After extracting the special word information in this way, the special word extracting unit 102 stores the result in the special word information buffer 101. In addition, the character string from semicolon to semicolon is replaced with the notation of this special word, and the part corresponding to the special word is clearly marked (underlined) in the sentence mixed with Kana and Kanji.

【０００９】この結果、言語処理部１０３には、特殊単
語抽出部１０２から特殊単語の表記にマーキングを施し
た仮名漢字混じり文章が送られる。言語処理部１０３で
は、これを解析し、この文章中の各単語についてこれを
音韻、韻律情報に変換する。この際、マーキングの施さ
れた箇所については、この位置情報を保存するように処
理する。また、各単語の処理については、図１５に図示
されていない、既知の一般的な単語辞書が用いられる。As a result, the language processing unit 103 is sent from the special word extracting unit 102 a sentence mixed with Kana-Kanji in which the notation of the special word is marked. The language processing unit 103 analyzes this and converts each word in this sentence into phonemic and prosodic information. At this time, the marked portion is processed so as to store the position information. For processing each word, a known general word dictionary not shown in FIG. 15 is used.

【００１０】しかる後、特殊単語変換部１０４では、言
語処理部１０３で得られた音韻、韻律情報中においてマ
ーキング位置に対応した音韻、韻律情報については、こ
れを特殊単語情報バッファ１０１中の対応する特殊単語
の音韻、韻律情報に置き換える。Thereafter, in the special word conversion unit 104, regarding the phoneme and prosody information corresponding to the marking position in the phoneme and prosody information obtained by the language processing unit 103, this is corresponded to in the special word information buffer 101. Replace with phoneme and prosody information of special words.

【００１１】これにより、言語処理１０３で処理した結
果、特殊単語について、これらを特殊単語情報バッファ
１０１に保存されている文章作成者の意図した通りに置
き換えて音声合成部１０５に送ることができ、音声合成
部１０５では、既知の規則音声合成技術を用いて音声を
合成し、読み上げを行う。As a result, as a result of the processing by the language processing 103, the special words can be replaced with those stored in the special word information buffer 101 as intended by the sentence creator and sent to the speech synthesizer 105. The speech synthesis unit 105 synthesizes speech using a known rule speech synthesis technique and reads it aloud.

【００１２】また、仮名漢字混じり文章中に同じ特殊単
語が何度も繰り返し存在するときに、脚注風に特殊単語
情報を仮名漢字混じり文章に付加し、これを利用する音
声合成装置の一例も、前記特開平４−３３１９９８号公
報に記載されている。この音声合成装置は、文章中に同
じ特殊単語が何度も存在する場合にも、特殊単語情報を
何度も指定する必要をなくすことを目的としている。[0012] Also, when the same special word repeatedly exists in a sentence mixed with Kana and Kanji, special word information is added to a sentence mixed with Kana and Kanji in a footnote style, and an example of a speech synthesizer utilizing this is also provided. It is described in the above-mentioned JP-A-4-331998. This speech synthesizer aims at eliminating the need to specify special word information many times even when the same special word is present in a sentence many times.

【００１３】この従来技術について図１７を参照して説
明する。図１７に示すように、この音声合成装置は、仮
名漢字混じり文章に対する処理を行うため、仮名漢字混
じり文章に付加された特殊単語情報を抽出する特殊単語
抽出部１１１と；特殊単語抽出部１１１で抽出された特
殊単語情報に基づき特殊単語辞書１１２を作成する特殊
単語辞書作成部１１３と；一般的な単語の音韻、韻律情
報が予め格納されている一般単語辞書１１４と；付加さ
れていた特殊単語情報を削除した形の仮名漢字混じり文
章のみが入力し、この仮名漢字混じり文章を特殊単語辞
書１１２と一般単語辞書１１４との両方を参照して言語
処理を行う言語処理部１１５と；言語処理部１１５から
の音韻、韻律情報に基づき音声合成を行い音声の読み上
げを行う音声合成部１１６と、で構成されている。This prior art will be described with reference to FIG. As shown in FIG. 17, this speech synthesizer performs a process on a sentence mixed with Kana-Kanji characters, so a special word extraction unit 111 for extracting special word information added to a sentence mixed with Kana-Kanji characters; A special word dictionary creating unit 113 that creates a special word dictionary 112 based on the extracted special word information; a general word dictionary 114 in which phoneme and prosody information of general words is stored in advance; special words that have been added A language processing unit 115 for inputting only a Kana-Kanji mixed sentence in a form in which information has been deleted, and performing a language process on the Kana-Kanji mixed sentence by referring to both the special word dictionary 112 and the general word dictionary 114; A voice synthesis unit 116 that performs voice synthesis based on the phoneme and prosody information from the voice 115 and reads the voice aloud.

【００１４】図１８は、図１７に示すシステムに入力さ
せて音声合成処理をおこなわせる仮名漢字混じり文章の
例を示している。図１８において、特殊単語（ＭＩＣや
ＬＩＮＥ）については、この表記情報（ＭＩＣ、ＬＩＮ
Ｅ）と音韻、韻律情報（マ‘イク、ラ’イン）とをコロ
ン（：）で区分し、これらの情報からなる特殊単語情報
全体をセミコロン（；）で挟んで、処理対象となる仮名
漢字混じり文章に予め付加しておく。FIG. 18 shows an example of a kana-kanji mixed sentence which is input to the system shown in FIG. 17 to perform a voice synthesis process. In FIG. 18, regarding the special words (MIC and LINE), this notation information (MIC, LIN)
E) and phonological / prosodic information (ma'iku, la'in) are separated by a colon (:), and special word information consisting of these information is sandwiched by semicolons (;) to be processed kana kanji. Add it to the mixed text in advance.

【００１５】次に、動作について説明する。いま、文章
作成者によって、例えば図１８に示すような特殊単語情
報の付加された仮名漢字混じり文章を図１７のシステム
に入力させると、特殊単語抽出部１１１では、仮名漢字
混じり文章に付加されている特殊単語情報を抽出する。
すなわち、図１８の例では、行頭のセミコロン（；）を
検出すると、このセミコロン（；）からその後ろのコロ
ン（：）までの間を特殊単語の表記情報として抽出し、
このコロン（：）から行末までの間を特殊単語の音韻、
韻律情報として抽出する。Next, the operation will be described. Now, when the sentence creator inputs a kana-kanji mixed sentence with special word information as shown in FIG. 18 into the system of FIG. 17, the special word extraction unit 111 adds it to the kana-kanji mixed sentence. Extract the special word information.
That is, in the example of FIG. 18, when a semicolon (;) at the beginning of a line is detected, the space between this semicolon (;) and the colon (:) after it is extracted as the notation information of the special word,
The phoneme of the special word from this colon (:) to the end of the line,
Extract as prosodic information.

【００１６】特殊単語抽出部１１１は、このようにして
特殊単語情報を抽出すると、その結果を特殊単語辞書作
成部１１３に与え、特殊単語辞書作成部１１３では、抽
出された特殊単語情報に基づき特殊単語辞書１１２を作
成する。When the special word extracting unit 111 extracts the special word information in this way, the result is given to the special word dictionary creating unit 113, and the special word dictionary creating unit 113 specializes based on the extracted special word information. The word dictionary 112 is created.

【００１７】また、特殊単語抽出部１１１は、特殊単語
情報を抽出すると、特殊単語情報を削除して、仮名漢字
混じり文章のみを言語処理部１１５に与える。When the special word extracting unit 111 extracts the special word information, the special word extracting unit 111 deletes the special word information and gives only the sentence containing Kana-Kanji to the language processing unit 115.

【００１８】この結果、言語処理部１１５には、特殊単
語抽出部１１１から特殊単語情報を削除した仮名漢字混
じり文章のみが送られる。As a result, the language processing unit 115 is sent from the special word extracting unit 111 only the sentence containing the kana and kanji characters in which the special word information is deleted.

【００１９】言語処理部１１５では、これを解析し、こ
の文章中の各単語についてこれを音韻、韻律情報に変換
する。この際、ある単語について言語処理を行う場合
に、言語処理部１１５は、特殊単語辞書１１２と一般単
語辞書１１４とを参照するが、一般的な単語の情報は特
殊単語辞書１１２には一般的に登録されておらず、ま
た、特殊単語の情報は一般的には一般単語辞書１１４に
登録されていないので、一般的な単語については一般単
語辞書１１４を参照し、文章中の特殊単語「ＭＩＣ」、
「ＬＩＮＥ」については、特殊単語辞書１１２を参照す
る。この結果、言語処理部１１５は、特殊単語「ＭＩ
Ｃ」、「ＬＩＮＥ」については、これらを特殊単語辞書
１１２に登録されている音韻、韻律情報に置き換えて、
音声合成部１１６に送ることができる。音声合成部１１
６では、既知の規則音声合成技術を用いて音声を合成
し、読み上げを行う。The language processing unit 115 analyzes this and converts each word in this sentence into phoneme and prosody information. At this time, when performing language processing on a certain word, the language processing unit 115 refers to the special word dictionary 112 and the general word dictionary 114, but general word information is generally stored in the special word dictionary 112. Since it is not registered and the information of the special word is not generally registered in the general word dictionary 114, the general word dictionary 114 is referred to for general words, and the special word “MIC” in the sentence is referred to. ,
For "LINE", refer to the special word dictionary 112. As a result, the language processing unit 115 causes the special word “MI
For “C” and “LINE”, these are replaced with phoneme and prosody information registered in the special word dictionary 112,
It can be sent to the voice synthesizer 116. Speech synthesizer 11
In 6, a voice is synthesized by using a known rule voice synthesis technique and read out aloud.

【００２０】[0020]

【発明が解決しようとする課題】第１の問題点は、文章
作成者は、特殊単語の特殊単語情報を対象となる単語が
ある度に付加しなければならないことである。The first problem is that the sentence creator must add the special word information of the special word every time there is a target word.

【００２１】特開平４−３３１９９８号公報に記載され
ている第１の例では、文章作成者が予め特殊単語情報を
対象となる仮名漢字混じり文章に埋め込み、この埋め込
んだ特殊単語情報を利用して文章作成者の意図した読み
方で読み上げをさせている。このため、文章作成者が意
図した読み方をさせたい対象単語毎に特殊単語情報を埋
め込む必要を生じる。In the first example described in Japanese Patent Application Laid-Open No. 4-331998, the sentence creator previously embeds special word information in a sentence containing a mixture of kana and kanji, and utilizes the embedded special word information. The text is read aloud as intended by the author. For this reason, it becomes necessary for the sentence creator to embed special word information for each target word that he / she wants to read.

【００２２】第２の問題点は、表記が同じ単語に対して
複数の読み方を指定できないことである。The second problem is that plural readings cannot be designated for words having the same notation.

【００２３】特開平４−３３１９９８号公報に記載され
ている第２の例では、特殊単語情報を仮名漢字混じり文
章に脚注風に付加し、この付加した特殊単語情報を特殊
単語辞書に登録する。読み上げる際には、対象となる単
語をこの特殊単語辞書の音韻、韻律情報に置き換えて読
み上げている。このため、特殊単語辞書に登録した音
韻、韻律以外の音韻、韻律情報で読み上げさせることが
できなくなる。また、登録した音韻、韻律情報以外で読
み上げさせたい場合、特殊単語情報を再登録する必要が
生じる。In the second example disclosed in Japanese Patent Laid-Open No. 4-331998, special word information is added to a footnote style in a sentence mixed with Kana and Kanji, and the added special word information is registered in the special word dictionary. When reading aloud, the target word is replaced with the phoneme and prosody information of this special word dictionary. For this reason, it becomes impossible to read aloud with phonemes, phonemes other than prosody, and prosody information registered in the special word dictionary. Further, when it is desired to read aloud other than the registered phoneme and prosody information, it is necessary to re-register the special word information.

【００２４】第３の問題点は、仮名漢字混じり文章デー
タをページ単位等で入力した場合、脚注風の特殊単語情
報を取得できない場合があり、文章作成者の意図した読
み上げを行えない場合がある。A third problem is that when sentence data containing kana-kanji characters is input in page units or the like, footnote-like special word information may not be obtained, and it may not be possible to read aloud as intended by the sentence creator. .

【００２５】特開平４−３３１９９８号公報に記載され
ている第２の例では、特殊単語情報を脚注風に仮名漢字
混じり文章に付加している。入力する文章データを自由
に選択した場合、文章作成者が特殊単語情報を付加した
にも関わらず、特殊単語情報を取得できないために、意
図した読み上げが行われない場合が生じる。In the second example described in Japanese Patent Laid-Open No. 4-331998, special word information is added to a sentence mixed with Kana and Kanji in a footnote style. When the sentence data to be input is freely selected, the special creator may add special word information, but the special word information cannot be acquired, so that the intended reading may not be performed.

【００２６】そこで本発明は、上記従来技術の問題点を
解決し、対象となる単語がある度に特殊単語情報を付加
する必要がないこと、文章作成者が表記が同じ単語に対
して複数の読み方を指定することが可能であり、文章作
成者が意図した読み方で読み上げを行わせる音声合成装
置を提供することである。Therefore, the present invention solves the above-mentioned problems of the prior art, and it is not necessary to add special word information every time there is a target word. The purpose of the present invention is to provide a speech synthesizer capable of designating a reading method and reading the speech in a reading method intended by the sentence creator.

【００２７】[0027]

【課題を解決するための手段】本発明の第一の音声合成
装置は、読み上げる仮名漢字混じり文章に付加されたタ
グ情報を検索する手段（図１の２）と、タグ情報から取
得した特殊単語情報を登録する特殊単語辞書（図１の
３）と、入力される仮名漢字混じり文章を特殊単語情報
を用いて、表示データと音声データに変換する読み上げ
制御手段（図１の１）とを有する。The first speech synthesizer of the present invention comprises means for retrieving tag information added to a sentence mixed with kana and kanji to be read (2 in FIG. 1), and a special word obtained from the tag information. It has a special word dictionary (3 in FIG. 1) for registering information, and a reading control means (1 in FIG. 1) for converting an input kana-kanji mixed sentence into display data and voice data by using the special word information. .

【００２８】本発明の第二の音声合成装置は、読み上げ
る仮名漢字混じり文章に付加されたタグ情報を検索する
手段（図５の２）と、対象となる単語をタグ情報に変換
するとともに、入力される仮名漢字混じり文章を特殊単
語情報を用いて、表示データと音声データに変換する読
み上げ制御手段（図５の１）とを有する。The second speech synthesizing device of the present invention is a means for retrieving tag information added to a sentence containing kana-kanji mixed characters to be read out (2 in FIG. 5) and converting a target word into tag information and inputting it. It has a reading control means (1 in FIG. 5) for converting a kana-kanji mixed sentence to be displayed data and voice data by using special word information.

【００２９】本発明の第三の音声合成装置は、読み上げ
制御手段（図７の１）にヘッダ情報と読み上げる仮名漢
字混じり文章を区別して入力する制御手段（図７の１
１）と、タグ情報を検索する手段（図７の２）と、ヘッ
ダ情報の特殊単語情報を特殊単語辞書（図７の３）に登
録するとともに、読み上げる仮名漢字混じり文章を特殊
単語情報を用いて、表示データと音声データに変換する
読み上げ制御手段（図７の１）とを有する。The third voice synthesizing device of the present invention is a control means (1 in FIG. 7) for distinguishing and inputting header information and a sentence mixed with kana and kanji to be read into the reading control means (1 in FIG. 7).
1), means for retrieving tag information (2 in FIG. 7), and registering the special word information in the header information in the special word dictionary (3 in FIG. 7), and using the special word information for a sentence mixed with kana and kanji to be read. And a reading control means (1 in FIG. 7) for converting the display data and the voice data.

【００３０】本発明の第四の音声合成装置は、読み上げ
制御手段（図９の１）にヘッダ情報と読み上げる仮名漢
字混じり文章を区別して入力する制御手段（図９の１
１）と、タグ情報を検索する手段（図９の２）と、対象
となる単語をヘッダ情報のタグ情報の特殊単語情報に変
換するとともに、読み上げる仮名漢字混じり文章を特殊
単語情報を用いて、表示データと音声データに変換する
読み上げ制御手段（図９の１）とを有する。The fourth speech synthesizer of the present invention is a control means (1 in FIG. 9) for distinguishing and inputting header information and a sentence mixed with kana and kanji to be read into the reading control means (1 in FIG. 9).
1), means for searching tag information (2 in FIG. 9), and converting the target word into special word information of the tag information of the header information, and using the special word information for a sentence mixed with kana and kanji to be read. It has reading control means (1 in FIG. 9) for converting display data and voice data.

【００３１】読み上げ制御手段は、タグ情報を検索して
得られた特殊単語情報を、特殊単語辞書に登録するとと
もに、表示データに変換する際には、タグ情報を表記情
報に置き換え、音声データに変換する際には、特殊単語
辞書に登録された特殊単語情報を参照して、正しい読み
方に変換するか、タグ情報が付加されている単語の読み
方を、タグ情報の音韻、韻律情報に置き換え、正しい読
み方で読み上げるようにしている。The reading control means registers the special word information obtained by searching the tag information in the special word dictionary, and at the time of converting it into display data, replaces the tag information with the notation information and converts it into voice data. When converting, refer to the special word information registered in the special word dictionary, convert to the correct reading, or replace the reading of the word to which the tag information is added with the phoneme of the tag information, prosodic information, I try to read it out correctly.

【００３２】また、タグ情報を検索して得られた特殊単
語情報を特殊単語辞書に登録せず、この表記情報と同じ
文字列で読み方を指定していない単語に対して、正しい
読み方を付加する。Further, the special word information obtained by searching the tag information is not registered in the special word dictionary, and the correct reading is added to the word whose reading is not specified by the same character string as this notation information. .

【００３３】[0033]

【発明の実施の形態】本発明の実施の形態を図面を参照
して説明する。図１は本発明の第一の実施の形態を示す
ブロック図である。本発明の第一の実施形態の音声合成
装置は、図１に示すように、文章作成者が意図した読み
方を指定した読み替え情報のタグ情報を付加した仮名漢
字混じり文章から読み上げ文章データと表示データを生
成する読み上げ制御手段１と、入力される仮名漢字混じ
り文章データからタグ情報を検索するタグ情報検索手段
２と；読み方を指定するタグ情報の表記、音韻、韻律情
報を登録する特殊単語辞書３と；読み上げる文章データ
を格納する読み上げバッファ４と；あらかじめ一般的に
使われる単語の音韻、韻律情報が記録されている一般単
語辞書５と；読み上げバッファ４に格納されている読み
上げデータを、特殊単語辞書３および一般単語辞書５を
利用して発音記号列に変換するテキスト処理手段６と；
音声合成技術を利用して発音記号列データを音声波形デ
ータに変換して読み上げを行う音声合成処理手段７と；
入力される仮名漢字混じり文章の表示データを格納する
表示バッファ８と；表示バッファ８に格納されている表
示データを表示形式に変換する表示処理手段９と；表示
形式に変換されたデータを表示するディスプレイ、およ
び液晶ディスプレイ装置などの表示装置１０と、から構
成される。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a first embodiment of the present invention. As shown in FIG. 1, the speech synthesis device according to the first embodiment of the present invention reads aloud sentence data and display data from a sentence mixed with Kana and Kanji to which tag information of reading information specifying the reading intended by the sentence creator is added. And a tag information retrieval unit 2 for retrieving tag information from input sentence data containing Kana-Kanji; a special word dictionary 3 for registering notation of tag information for specifying the reading, phoneme, and prosody information. And; a reading buffer 4 for storing text data to be read; a general word dictionary 5 in which phoneme and prosody information of commonly used words is recorded in advance; and reading data stored in the reading buffer 4 for special words Text processing means 6 for converting into a phonetic symbol string using the dictionary 3 and the general word dictionary 5;
A voice synthesis processing means 7 for converting phonetic symbol string data into voice waveform data by using a voice synthesis technique and reading the voice;
A display buffer 8 for storing display data of a sentence mixed with kana-kanji input; display processing means 9 for converting the display data stored in the display buffer 8 into a display format; and displaying the data converted into the display format It includes a display and a display device 10 such as a liquid crystal display device.

【００３４】読み上げ制御手段１に入力される仮名漢字
混じり文章について、図３、図４を参照して説明する。
仮名漢字混じり文章は、あらかじめ文章作成者が、読み
方を指定するタグ情報、および同一表記で別の読み方を
指定するタグ情報を付加した構成となっている。タグ情
報は、“＜”と“＞”で囲まれたタグ情報の始まりを示
すタグと、“＜／”と“＞”で囲まれたタグ情報の終わ
りを示すタグで構成される。Texts mixed with kana and kanji input to the reading control means 1 will be described with reference to FIGS. 3 and 4.
The kana-kanji mixed sentence has a structure in which the sentence creator adds in advance tag information for designating a reading style and tag information for designating another reading style with the same notation. The tag information is composed of a tag enclosed by "<" and ">" indicating the beginning of the tag information and a tag enclosed by "</" and ">" indicating the end of the tag information.

【００３５】図３は、読み方を指定するタグ情報の書式
を示し、タグ情報の始まりを示すタグには、タグの種類
を示す文字列（ＷＣＨＧ）と、対象単語の音韻、韻律情
報（例えば、マイク）が書かれている。始まりを示すタ
グと終わりを示すタグの間には、対象となる単語の表記
（例えば、ＭＩＣ）が書かれている。図４は、同一表記
で別の読み方を指定するタグ情報の書式を示し、タグ情
報の始まりを示すタグには、同一表記で別の読み方を指
定するタグ情報を示す文字列（ＴＣＨＧ）と、対象単語
の音韻、韻律情報（例えば、あす、あしたなど）が書か
れている。始まりを示すタグと終わりを示すタグの間に
は、対象となる単語の表記（例えば、明日）が書かれて
いる。FIG. 3 shows a format of tag information that specifies the reading. The tag indicating the beginning of the tag information includes a character string (WCHG) indicating the type of tag, phoneme of the target word, and prosodic information (for example, Mike) is written. Between the tag showing the beginning and the tag showing the end, the notation (for example, MIC) of the target word is written. FIG. 4 shows a format of tag information that specifies different readings in the same notation, and a tag indicating the beginning of tag information includes a character string (TCHG) indicating tag information that specifies different readings in the same notation, The phoneme and prosody information (eg, tomorrow, tomorrow) of the target word are written. The notation (for example, tomorrow) of the target word is written between the tag indicating the beginning and the tag indicating the end.

【００３６】図２を参照して、本発明の動作を説明す
る。図２は、本発明の第一の実施の形態の処理を示すフ
ローチャートである。The operation of the present invention will be described with reference to FIG. FIG. 2 is a flowchart showing the processing of the first embodiment of the present invention.

【００３７】まず、読み替え情報が付加された仮名漢字
混じり文章は、読み上げ制御手段１に入力される。読み
上げ制御手段１は、この入力された仮名漢字混じり文章
をタグ情報検索手段２に供給し、タグ情報の有無を検索
する（ステップＡ１、およびＡ２）。仮名漢字混じり文
章中にタグ情報を発見できた場合は、発見したタグ情報
の種類を判別する（ステップＡ３，Ａ４）。発見したタ
グ情報が読み方を指定するタグ情報の場合は、タグ情報
から表記、音韻、および韻律情報を取得し、この取得し
た表記、音韻、および韻律情報を特殊単語辞書３に登録
する（ステップＡ５，Ａ６）。First, a sentence mixed with kana-kanji added with the replacement information is input to the reading control means 1. The reading control means 1 supplies the input mixed kana-kanji characters to the tag information searching means 2 and searches for the presence or absence of tag information (steps A1 and A2). When the tag information can be found in the sentence mixed with Kana and Kanji, the type of the found tag information is determined (steps A3 and A4). When the found tag information is the tag information that specifies the reading, the notation, the phoneme, and the prosody information are acquired from the tag information, and the acquired notation, the phoneme, and the prosody information are registered in the special word dictionary 3 (step A5). , A6).

【００３８】発見したタグ情報が同一表記で別の読み方
を指定するタグ情報の場合は、タグ情報の表記、音韻、
韻律情報を取得し、表記情報をこの取得した音韻、韻律
情報に置き換えて読み上げバッファ４に格納する（ステ
ップＡ７，Ａ８）。表示バッファ８には、発見したタグ
情報を取得した表記情報に置換した後格納する（ステッ
プＡ９）。When the found tag information is the same notation that specifies a different reading, the notation of the tag information, the phoneme,
Prosodic information is acquired, and the notation information is replaced with the acquired phoneme and prosodic information and stored in the reading buffer 4 (steps A7 and A8). The found tag information is stored in the display buffer 8 after being replaced with the acquired notation information (step A9).

【００３９】仮名漢字混じり文章中にタグ情報を発見で
きない場合は、検索文字列を読み上げバッファ４、表示
バッファ８に格納し、入力された仮名漢字混じり文章の
検索を終了したかどうか判定し、終了していなければス
テップＡ１に戻り再度検索をおこない、終了していれば
読み上げ処理を終了する（ステップＡ１５，Ａ１６，Ａ
１７）。When the tag information cannot be found in the sentence mixed with Kana-Kanji, the retrieval character string is stored in the reading buffer 4 and the display buffer 8 and it is judged whether or not the retrieval of the inputted sentence mixed with Kana-Kanji has been completed, and the process ends. If not, the process returns to step A1 to perform the search again, and if completed, the reading process ends (steps A15, A16, A).
17).

【００４０】以上説明したように、上記第一の実施の形
態では、仮名漢字混じり文章中に文章作成者が付加した
読み方を指定するタグ情報を見つけると、この読み替え
情報を特殊単語辞書に登録し、この登録した表記情報が
仮名漢字混じり文章中にある場合、登録してある音韻、
韻律情報に置き換えて読み上げを行う。これにより、文
章作成者が対象となる単語毎に特殊単語情報を付加する
必要がなくなる。また、同一表記で別の読み方を指定す
るタグ情報を、仮名漢字混じり文章中に見つけると、こ
のタグ情報で指定された音韻、韻律情報に置き換えて読
み上げを行う。このことにより、同じ表記を持つ単語
や、前記読み方を指定するタグ情報で読み方を指定した
単語を、文章作成者が意図する複数の読み方で読み上げ
させることができる。As described above, in the first embodiment described above, when the tag information designating the reading added by the sentence creator is found in a sentence mixed with Kana / Kanji, the replacement information is registered in the special word dictionary. , If the registered notation information is in a sentence mixed with Kana and Kanji, the registered phoneme,
It reads it by replacing it with prosody information. This eliminates the need for the sentence creator to add special word information for each target word. Further, when tag information designating different readings with the same notation is found in a sentence mixed with Kana-Kanji, it is replaced by the phoneme and prosody information designated by the tag information and read aloud. As a result, a word having the same notation or a word whose reading is specified by the tag information specifying the reading can be read aloud by a plurality of readings intended by the sentence creator.

【００４１】本発明の第二の実施の形態について図面を
参照して説明する。第二の実施形態は、仮名漢字混じり
文章に付加した読み方を指定するタグ情報を辞書登録せ
ず、文章作成者が意図した読み上げを行うようにしたも
のである。本発明の第二の実施の形態を示すブロック図
を図５に示す。図５に示すとおり、本発明の第二の実施
の形態は、図１に示した第一の実施の形態における音声
合成装置の構成から、特殊単語辞書３が、削除されてい
る。A second embodiment of the present invention will be described with reference to the drawings. In the second embodiment, the tag information that is added to a sentence mixed with kana and kanji is not registered in the dictionary, and the reading is intended by the sentence creator. FIG. 5 is a block diagram showing the second embodiment of the present invention. As shown in FIG. 5, in the second embodiment of the present invention, the special word dictionary 3 is deleted from the configuration of the speech synthesizer in the first embodiment shown in FIG.

【００４２】図６は、本発明の第二の実施の形態の処理
を示すフローチャートである。本発明の第二の実施の形
態の動作は、図２に示した第一の実施の形態において、
読み方を指定するタグを発見した後の処理が異なる（図
１のステップＡ６）。この実施の形態では、読み方を指
定するタグを発見した場合、読み方を指定するタグ情報
の特殊単語情報を取得し、取得した表記情報を仮名漢字
混じり文字列中から検索する。発見した場合、この表記
情報を同一表記で別の読み方を指定するタグ情報に置換
する処理を行う（ステップＢ１）。図６のステップＡ１
−Ａ５、およびＡ７−Ａ１２で示される第二の実施の形
態における読み上げ手段１の動作は、第一の実施の形態
の動作と同一のため説明を省略する。FIG. 6 is a flow chart showing the processing of the second embodiment of the present invention. The operation of the second embodiment of the present invention is the same as that of the first embodiment shown in FIG.
The process after finding the tag designating the reading is different (step A6 in FIG. 1). In this embodiment, when a tag designating the reading is found, special word information of the tag information designating the reading is acquired, and the acquired notation information is searched from the kana-kanji mixed character string. If found, the notation information is replaced with the tag information designating different reading with the same notation (step B1). Step A1 in FIG.
Since the operation of the reading means 1 in the second embodiment shown by -A5 and A7-A12 is the same as the operation in the first embodiment, the description thereof will be omitted.

【００４３】以上説明したように、上記第二の実施の形
態では、入力される仮名漢字混じり文章中の読み方を指
定するタグ情報を発見すると、特殊単語情報を取得する
とともに、この表記情報と同じ文字列を仮名漢字混じり
文章中から検索し、この文字列を同一表記で別の読み方
をするタグ情報に置き換える。このことにより、読み方
を指定するタグ情報の表記、音韻、韻律情報を登録する
必要がなくなり、特殊単語辞書を削除することができ、
辞書の管理が不要となる。As described above, in the second embodiment, when the tag information designating the reading in the input mixed Kana-Kanji character is found, the special word information is acquired and the same as this notation information. A character string is searched from a sentence mixed with Kana and Kanji, and this character string is replaced with tag information which is read differently with the same notation. This eliminates the need to register the notation of tag information that specifies the reading, phoneme, and prosody information, and it is possible to delete the special word dictionary,
No need to manage a dictionary.

【００４４】本発明の第三の実施の形態について図面を
参照して説明する。第三の実施形態は、ユーザが自由に
選択した仮名漢字混じり文章を入力した場合でも、文章
作成者が意図した読み方で読み上げを行うようにしたも
のである。本発明の第三の実施の形態を示すブロック図
を図７に示す。図７を参照すると、本発明の第三の実施
の形態は、制御手段１１と入力装置１２が、図１に示し
た第一の実施の形態における音声合成装置の構成に加え
られている。制御手段１１は、まず、ユーザが選択でき
る仮名漢字混じり文章とは別領域に付加した読み方を指
定するタグ情報を記述したヘッダ情報を読み上げ装置１
に入力し、次に、ユーザがキーボードなどの入力装置１
２を使って選択した仮名漢字混じり文章を読み上げ制御
手段１に入力する。A third embodiment of the present invention will be described with reference to the drawings. In the third embodiment, even when the user inputs a sentence mixed with kana and kanji that is freely selected, the sentence creator reads the sentence according to the intended reading. FIG. 7 is a block diagram showing the third embodiment of the present invention. Referring to FIG. 7, in the third embodiment of the present invention, the control means 11 and the input device 12 are added to the configuration of the speech synthesizer in the first embodiment shown in FIG. The control unit 11 first reads the header information describing the tag information that specifies the reading method added to the area different from the kana-kanji mixed sentence that can be selected by the user.
The user, and then the user enters an input device 1 such as a keyboard.
The sentence mixed with kana and kanji selected using 2 is input to the reading control means 1.

【００４５】図８を参照して本発明の動作を説明する。
図８は、本発明の第三の実施の形態の処理を示すフロー
チャートである。まず、読み上げ制御手段１に、制御手
段１１は、読み上げを行う仮名漢字混じり文章とは別領
域に格納されたヘッダ情報を入力する。読み上げ制御手
段１は、入力されたデータがヘッダ情報か読み上げを行
う文章データか判定する（ステップＣ１）。入力データ
がヘッダ情報の場合は、この入力データをタグ情報検索
手段２に供給し、タグ情報の有無を検索する（ステップ
Ｃ１，Ｃ２）。ヘッダ情報データ中にタグ情報を発見で
きた場合は、発見したタグ情報の種類を判別する（ステ
ップＣ３，Ｃ４）。発見したタグ情報が読み方を指定す
るタグ情報の場合は、タグ情報から表記、音韻、および
韻律情報を取得し、この取得した表記、音韻、および韻
律情報を特殊単語辞書３に登録する（ステップＣ５，Ｃ
６）。この処理を入力されたヘッダ情報を終了するまで
行う（ステップＣ７）。The operation of the present invention will be described with reference to FIG.
FIG. 8 is a flowchart showing the processing of the third embodiment of the present invention. First, the control means 11 inputs to the reading control means 1 header information stored in a different area from the kana-kanji mixed text to be read. The reading control means 1 determines whether the input data is header information or text data to be read (step C1). When the input data is header information, the input data is supplied to the tag information searching means 2 to search for the presence or absence of tag information (steps C1 and C2). If the tag information can be found in the header information data, the type of the found tag information is determined (steps C3 and C4). When the found tag information is the tag information that specifies the reading, the notation, the phoneme, and the prosody information are acquired from the tag information, and the acquired notation, the phoneme, and the prosody information are registered in the special word dictionary 3 (step C5). , C
6). This process is repeated until the input header information is completed (step C7).

【００４６】読み上げ制御手段１に入力したヘッダ情報
の処理を終了すると、制御手段１１は、読み上げを行う
仮名漢字混じり文章を読み上げ制御手段１に入力する。
図８のステップＡ１−Ａ３、およびＡ９−Ａ１２で示さ
れる第三の実施の形態における読み上げ手段１の動作
は、第一の実施の形態の動作と同一のため、説明を省略
する。When the processing of the header information input to the reading control means 1 is completed, the control means 11 inputs to the reading control means 1 a kana-kanji mixed sentence to be read.
The operation of the reading means 1 in the third embodiment shown by steps A1-A3 and A9-A12 in FIG. 8 is the same as the operation in the first embodiment, and therefore its explanation is omitted.

【００４７】上記第三の実施の形態では、読み方を指定
するタグ情報を読み上げを行う仮名漢字混じり文章とは
別の領域に格納し、あらかじめ特殊単語辞書に登録す
る。これにより、ユーザが自由に選択した仮名漢字混じ
り文章の場合でも、あらかじめ登録した特殊単語辞書を
利用して、文章作成者が意図した読み上げが可能とな
る。In the third embodiment, the tag information designating the reading method is stored in an area different from the kana-kanji mixed sentence to be read and registered in advance in the special word dictionary. As a result, even in the case of a sentence mixed with kana-kanji characters freely selected by the user, it is possible to read aloud as intended by the sentence creator by using the special word dictionary registered in advance.

【００４８】本発明の第四の実施の形態について図面を
参照して説明する。本発明の第四の実施の形態を示すブ
ロック図を図９に示す。図９に示すとおり、本発明の第
四の実施の形態は、図８に示した第三の実施の形態にお
ける音声合成装置の構成から、特殊単語辞書３が削除さ
れている。A fourth embodiment of the present invention will be described with reference to the drawings. A block diagram showing a fourth embodiment of the present invention is shown in FIG. As shown in FIG. 9, in the fourth embodiment of the present invention, the special word dictionary 3 is deleted from the configuration of the speech synthesizer in the third embodiment shown in FIG.

【００４９】図１０は、本発明の第四の実施の形態の処
理を示すフローチャートである。本発明の第四の実施の
形態の動作は、読み方を指定するタグを発見した後の処
理が異なる（図８のステップＣ６）。図１０のステップ
Ｃ１−Ｃ５、およびＣ７で示す処理は、本発明の第三の
実施の形態の処理と同一のため説明を省略する。制御手
段１１から読み上げ制御手段１に入力されるヘッダ情報
に記述されている読み方を指定するタグ情報を発見し、
このタグ情報の特殊単語情報を取得した後（ステップＣ
５）、読み上げを行う仮名漢字混じり文章中の取得した
表記情報と同じ文字列を、同一表記で別の読み方をする
タグ情報に差し替える（ステップＤ１）。この処理を入
力されたヘッダ情報が終了するまで行う（ステップＣ
７）。図１０のステップＡ１−Ａ９、Ａ１５およびＡ１
６は、本発明の第二の実施の形態と同一のため説明を省
略する。FIG. 10 is a flow chart showing the processing of the fourth embodiment of the present invention. The operation of the fourth embodiment of the present invention is different in the processing after discovering the tag designating the reading method (step C6 in FIG. 8). The processing shown in steps C1-C5 and C7 of FIG. 10 is the same as the processing of the third embodiment of the present invention, and therefore description thereof is omitted. The tag information that specifies the reading method described in the header information that is input from the control unit 11 to the reading control unit 1 is found,
After acquiring the special word information of this tag information (step C
5) Replace the same character string as the acquired notation information in the sentence with mixed kana and kanji to be read out with tag information for different reading with the same notation (step D1). This process is repeated until the input header information is completed (step C
7). Steps A1-A9, A15 and A1 of FIG.
Since 6 is the same as the second embodiment of the present invention, the description thereof is omitted.

【００５０】上記第四の実施の形態では、読み上げを行
う仮名漢字混じり文章とは別の領域にあらかじめ格納し
てある読み方を指定するタグ情報を取得し、読み上げを
行う仮名漢字混じり文章中の同じ表記の単語を、同一表
記で別の読み方をするタグ情報に置き換える。このこと
により、特殊単語情報を辞書登録する必要がなくなり、
辞書管理の必要がなくなる。In the fourth embodiment, the tag information for specifying the reading method, which is stored in advance in a different area from the kana-kanji mixed sentence to be read out, is acquired and the same in the kana-kanji mixed sentence to be read out. Replace the words in the notation with tag information that reads differently in the same notation. This eliminates the need to register special word information in the dictionary,
Eliminates the need for dictionary management.

【００５１】[0051]

【実施例】本発明の第一の実施の形態の一実施例を説明
する。図１１（ａ）は、読み上げ制御手段１に入力され
る仮名漢字混じり文章の一例を示す。読み上げ制御手段
１は、タグ情報検索処理を行い、読み方を指定するタグ
情報（ＷＣＨＧ）の表記（明日）、音韻および韻律情報
（あした）を特殊単語辞書３に登録する（ステップＡ
６）。韻律情報は、例えば、“あ‘した”など一文字目
に特殊文字を付加してアクセントを示すことも可能であ
る。EXAMPLE An example of the first embodiment of the present invention will be described. FIG. 11 (a) shows an example of a kana-kanji mixed sentence input to the reading control unit 1. The reading control means 1 performs tag information search processing, and registers the notation (tomorrow) of the tag information (WCHG) designating the reading, the phoneme and the prosody information (tomorrow) in the special word dictionary 3 (step A).
6). For the prosody information, for example, a special character may be added to the first character such as "A'shita" to indicate an accent.

【００５２】図１１（ｂ）は、読み上げ制御手段１から
供給され、表示バッファ８に格納される表示データを示
し、表示バッファ８に格納されるデータは、仮名漢字混
じり文章に埋め込まれている同一表記で別の読み方を指
定するタグ情報（ＴＣＨＧ）を、このタグの表記情報
（明日）に置き換えていることを示す。FIG. 11 (b) shows the display data supplied from the reading control means 1 and stored in the display buffer 8. The data stored in the display buffer 8 is the same as that embedded in a kana-kanji mixed sentence. It indicates that the tag information (TCHG) that specifies another reading in the notation is replaced with the notation information (tomorrow) of this tag.

【００５３】この表示データは、表示制御手段９に供給
され、文字列の表示位置や表示サイズなどを決定し、表
示時の文字列を生成して、表示装置１０に供給して表示
を行う。This display data is supplied to the display control means 9, determines the display position and display size of the character string, generates the character string at the time of display, and supplies it to the display device 10 for display.

【００５４】図１１（ｃ）は、音声合成手段７から出力
される音声データを示し、入力された図１１（ａ）の仮
名漢字混じり文章に埋め込まれた同一表記で別の読み方
を指定するタグ情報（ＴＣＨＧ）は、読み替え処理手段
１で、音韻、韻律情報（あす）に置き換え、さらに、テ
キスト処理手段９では特殊単語辞書３を参照して、読み
方を指定するタグ情報（ＷＣＨＧ）と同じ表記情報（明
日）を持った単語を、特殊単語情報の音韻、韻律情報
（あした）に置き換えていることを示す。FIG. 11 (c) shows voice data output from the voice synthesizing means 7, and a tag for designating another reading by the same notation embedded in the inputted mixed Kana / Kanji sentence of FIG. 11 (a). The information (TCHG) is replaced by phoneme and prosody information (tomorrow) in the rewriting processing means 1, and further, the text processing means 9 refers to the special word dictionary 3 to refer to the special word dictionary 3 and the same notation as the tag information (WCHG) that specifies the reading. It shows that the word with information (tomorrow) is replaced with the phoneme and prosody information (tomorrow) of the special word information.

【００５５】本発明の第二の実施の形態の一実施例を説
明する。図１２（ａ）は、読み上げ制御手段１に入力さ
れる仮名漢字混じり文章の一例を示す。An example of the second embodiment of the present invention will be described. FIG. 12A shows an example of a sentence containing kana-kanji characters input to the reading control unit 1.

【００５６】図１２（ｂ）は、読み方を指定するタグ情
報（ＷＣＨＧ）の表記情報（明日）と同じ文字列を同一
表記で別の読み方を指定するタグ情報（ＴＣＨＧ）に置
き換えたデータを示し、このデータの音韻、韻律情報
（あした）は、読み方を指定するタグ情報（ＷＣＨＧ）
から取得した音韻、韻律情報を用い、予め付加されてい
る同一表記で別の読み方を指定するタグ情報の音韻、韻
律情報（あす）は置き換えない。FIG. 12B shows data obtained by replacing the same character string as the notation information (tomorrow) of the tag information (WCHG) designating the reading method with the tag information (TCHG) designating another reading method with the same notation. , The phonological and prosody information (tomorrow) of this data is tag information (WCHG) that specifies the reading method.
Using the phoneme and prosody information acquired from the above, the phoneme and prosody information (tomorrow) of the tag information that specifies a different reading with the same notation added in advance is not replaced.

【００５７】図１２（ｃ）は、読み上げ制御手段１から
供給され、表示バッファ８に格納される表示データを示
し、図１２（ａ）に示す入力される仮名漢字混じり文章
データのタグ情報（ＴＣＨＧ）は、このタグ情報の表記
情報（明日）に置き換わる。FIG. 12C shows the display data supplied from the reading control means 1 and stored in the display buffer 8, and the tag information (TCHG) of the input kana-kanji mixed sentence data shown in FIG. 12A. ) Is replaced by the notation information (tomorrow) of this tag information.

【００５８】図１２（ｄ）は、音声合成手段７から出力
される音声データを示し、タグ情報（ＴＣＨＧ）が音
韻、韻律情報（あした、あすなど）に置き換えられてい
ることを示す。FIG. 12D shows the voice data output from the voice synthesizing means 7, and shows that the tag information (TCHG) is replaced with phoneme and prosody information (tomorrow, tomorrow, etc.).

【００５９】本発明の第三の実施の形態の一実施例を説
明する。図１３（ａ）は、読み上げ制御手段１に入力さ
れるヘッダ情報の一例を示す。ヘッダ情報には、例え
ば、読み方を指定するタグ情報（ＷＣＨＧ）などが記述
されている。これ以外にも、表示に関するタグ情報など
他のタグ情報を含むこともできる。読み上げ制御手段１
は、ヘッダ情報のタグ情報検索処理を行い、特殊単語辞
書３に、取得したタグの音韻、韻律情報を登録する。An example of the third embodiment of the present invention will be described. FIG. 13A shows an example of header information input to the reading control unit 1. In the header information, for example, tag information (WCHG) designating how to read is described. In addition to this, other tag information such as display-related tag information may be included. Reading control means 1
Performs tag information search processing of header information and registers the phoneme and prosody information of the acquired tag in the special word dictionary 3.

【００６０】図１３（ｂ）は、読み上げを行う仮名漢字
混じり文章データの一例を示す。FIG. 13B shows an example of text data mixed with kana and kanji to be read aloud.

【００６１】図１３（ｃ）は、読み上げ制御手段１から
供給され、表示バッファ８に格納される表示データで、
図１３（ｂ）に示す仮名漢字混じり文章データに付加さ
れているタグ情報（ＴＣＨＧ）が、タグの表記情報（明
日、今日）に置き換えられていることを示す。FIG. 13C shows display data supplied from the reading control means 1 and stored in the display buffer 8.
It is indicated that the tag information (TCHG) added to the kana-kanji mixed sentence data shown in FIG. 13B is replaced with the tag notation information (tomorrow, today).

【００６２】図１３（ｄ）は、音声合成手段７から出力
される音声データを示し、同一表記で別の読み方を指定
するタグ情報（ＴＣＨＧ）が音韻、韻律情報（みょうに
ち、あす、こんにち）に置き換えられ、読み方を指定す
るタグ情報（ＷＣＨＧ）と同じ表記の単語の読み方に、
タグの音韻、韻律情報（あした、きょう）に置き換えら
れていることを示す。FIG. 13D shows voice data output from the voice synthesizing means 7. Tag information (TCHG) designating different readings with the same notation is phoneme and prosody information (Myoni, Tomorrow, Konnichi). To the reading of words with the same notation as the tag information (WCHG) that specifies the reading.
Indicates that the tag has been replaced with the phoneme and prosody information (tomorrow, today).

【００６３】本発明の第四の実施の形態の一実施例を説
明する。図１４（ａ）は、読み上げ制御手段１に入力さ
れるヘッダ情報の一例を示し、図１４（ｂ）は、読み上
げを行う仮名漢字混じり文章データの一例を示す。ヘッ
ダ情報には、例えば、読み方を指定するタグ情報（ＷＣ
ＨＧ）などが記述されている。これ以外にも、表示に関
するタグ情報など他のタグ情報を含むこともできる。読
み上げ制御手段１は、ヘッダ情報のタグ情報検索処理を
行い、読み方を指定するタグ情報（ＷＣＨＧ）の特殊単
語情報を取得する。この取得した表記情報（明日、今
日）と同じ文字列を、図１４（ｂ）に示す仮名漢字混じ
り文章から検索し、同一表記で別の読み方を指定するタ
グ情報（ＴＣＨＧ）に置き換える。An example of the fourth embodiment of the present invention will be described. FIG. 14A shows an example of header information input to the reading control unit 1, and FIG. 14B shows an example of text data mixed with Kana and Kanji to be read. The header information includes, for example, tag information (WC
HG) and the like are described. In addition to this, other tag information such as display-related tag information may be included. The reading control unit 1 performs a tag information search process of the header information, and acquires special word information of tag information (WCHG) that specifies the reading method. The same character string as the acquired notation information (tomorrow, today) is searched from a sentence mixed with kana-kanji characters shown in FIG. 14B, and replaced with tag information (TCHG) that specifies different reading with the same notation.

【００６４】図１４（ｃ）は、ヘッダ情報を用いて、読
み上げさせる仮名漢字混じり文章中の対象単語を読み方
を指定するタグ情報（ＴＣＨＧ）に変換したデータであ
る。このデータは、読み上げ制御手段１に供給され、タ
グ検索処理を行い、表示データと読み上げデータに変換
される。FIG. 14C shows data obtained by converting the target word in a sentence mixed with kana and kanji to be read into tag information (TCHG) designating the reading method using the header information. This data is supplied to the reading control means 1 and subjected to tag search processing to be converted into display data and reading data.

【００６５】図１４（ｄ）は、表示バッファ８に格納さ
れる表示データを示し、図１４（ｂ）の仮名漢字混じり
文章に埋め込まれている読み方を指定するタグ情報（Ｔ
ＣＨＧ）が、タグの表記情報（明日、および今日）に変
換されていることを示す。FIG. 14D shows display data stored in the display buffer 8, and tag information (T) for designating the reading embedded in the kana-kanji mixed sentence of FIG. 14B.
CHG) is converted into the notation information (tomorrow and today) of the tag.

【００６６】図１４（ｅ）は、音声合成手段７から出力
される音声データを示し、図１４（ｂ）の仮名漢字混じ
り文章に埋め込まれている同一表記で別の読み方を指定
するタグ情報（ＴＣＨＧ）が、タグの音韻、韻律情報
（みょうにち、あす、および、こんにち）に変換され、
図１４（ａ）のヘッダ情報の読み方を指定するタグ情報
（ＷＣＨＧ）と同じ表記の単語（明日、および今日）の
音韻、韻律情報が、ヘッダ情報に記述されているタグの
音韻、韻律情報（あした、およびきょう）に変換されて
いることを示す。FIG. 14 (e) shows voice data output from the voice synthesizing means 7, and tag information for designating another reading method with the same notation embedded in the kana-kanji mixed sentence of FIG. 14 (b) ( TCHG) is converted into the phoneme and prosody information of the tag (Myoni, Tomorrow, and Hi),
The phoneme and prosody information of the words (tomorrow and today) having the same notation as the tag information (WCHG) that specifies the reading of the header information in FIG. 14A is the phoneme and prosody information of the tag described in the header information ( Tomorrow and today).

【００６７】[0067]

【発明の効果】第１の効果は、仮名漢字混じり文章中に
同じ読み方をする特殊単語が複数存在する場合でも、読
み方やアクセントなどの特殊単語情報を文章作成者が文
章作成時にあらかじめ文章に付加しておくことにより、
存在する毎に特殊単語情報を付加する必要がないことに
ある。The first effect is that even when there are a plurality of special words that have the same reading in a kana-kanji mixed sentence, the sentence creator adds the special word information such as reading and accent to the sentence in advance when creating the sentence. By keeping
There is no need to add special word information every time it exists.

【００６８】その理由は、同じ読み方をする複数の単語
の読み方を指定するタグ情報の特殊単語情報を取得し、
特殊単語辞書に登録し、特殊単語に対応する部分を特殊
単語に登録した正しい音韻、韻律情報に差し替えるため
である。The reason is that the special word information of the tag information that specifies the reading of a plurality of words having the same reading is acquired,
This is to register in the special word dictionary and replace the part corresponding to the special word with the correct phoneme and prosody information registered in the special word.

【００６９】第２の効果は、仮名漢字混じり文章中に同
じ表記で別の読み方をする特殊単語が存在する場合で
も、読み方やアクセントなどの特殊単語情報を文章作成
者が文章作成時にあらかじめ文章に埋め込んでおくこと
により、特殊単語毎に複数の読み方で読み上げられるこ
とにある。The second effect is that even if there is a special word which is read differently in the same notation in a sentence mixed with kana and kanji, the sentence creator prepares the special word information such as the reading and accent in the sentence at the time of writing the sentence. By embedding, each special word can be read out in multiple ways.

【００７０】その理由は、同じ表記で複数の読み方をす
る単語の読み方を指定するタグ情報の特殊単語情報を取
得し、特殊単語に対応する部分を特殊単語情報の正しい
音韻、韻律情報に差し替えるためである。The reason is that the special word information of the tag information that specifies the reading of a plurality of readings with the same notation is acquired, and the portion corresponding to the special word is replaced with the correct phoneme and prosody information of the special word information. Is.

【００７１】第３の効果は、特殊単語情報を登録する辞
書を必要としないことにある。The third effect is that a dictionary for registering special word information is not required.

【００７２】その理由は、文章作成者が文章作成時にあ
らかじめ文章に付加した特殊単語情報を取得すると、入
力文章データの特殊単語に対応する部分を取得した特殊
単語情報の正しい音韻、韻律情報に差し替えるためであ
る。The reason is that, when the sentence creator obtains the special word information added to the sentence in advance at the time of writing the sentence, the portion corresponding to the special word of the input sentence data is replaced with the correct phonological and prosodic information of the obtained special word information. This is because.

【００７３】第４の効果は、ユーザが自由に選択した文
章データを正しい音韻、韻律情報で読み上げさせること
にある。The fourth effect is that the text data freely selected by the user is read aloud with correct phoneme and prosody information.

【００７４】その理由は、文章作成者が文章作成時に文
章に付加した特殊単語情報をあらかじめ取得し、特殊単
語辞書に登録しておき、ユーザが自由に選択した仮名漢
字混じり文章を読み上げさせる場合に、この登録した特
殊単語辞書の正しい音韻、韻律情報に差し替えるためで
ある。The reason is that when the sentence creator acquires the special word information added to the sentence at the time of writing the sentence and registers it in the special word dictionary in advance, and the user reads the sentence mixed with kana and kanji freely selected by the user. This is because the registered special word dictionary is replaced with the correct phoneme and prosody information.

[Brief description of drawings]

【図１】本発明の第一の実施の形態を示すブロック図で
ある。FIG. 1 is a block diagram showing a first embodiment of the present invention.

【図２】本発明の第一の実施の形態の動作を説明するフ
ローチャートである。FIG. 2 is a flowchart illustrating the operation of the first exemplary embodiment of the present invention.

【図３】特殊単語の読み方を指定するタグ情報の一例を
示す図である。FIG. 3 is a diagram showing an example of tag information that specifies how to read special words.

【図４】同一表記で別の読み方を指定するタグ情報の一
例を示す図である。FIG. 4 is a diagram showing an example of tag information designating different readings with the same notation.

【図５】本発明の第二の実施の形態を示すブロック図で
ある。FIG. 5 is a block diagram showing a second embodiment of the present invention.

【図６】本発明の第二の実施の形態の動作を説明するた
めのフローチャートである。FIG. 6 is a flowchart for explaining the operation of the second embodiment of the present invention.

【図７】本発明の第三の実施の形態を示すブロック図で
ある。FIG. 7 is a block diagram showing a third embodiment of the present invention.

【図８】本発明の第三の実施の形態の動作を説明するた
めのフローチャートである。FIG. 8 is a flowchart for explaining the operation of the third exemplary embodiment of the present invention.

【図９】本発明の第四の実施の形態を示すブロック図で
ある。FIG. 9 is a block diagram showing a fourth embodiment of the present invention.

【図１０】本発明の第四の実施の形態の動作を説明する
ためのフローチャートである。FIG. 10 is a flow chart for explaining the operation of the fourth embodiment of the present invention.

【図１１】本発明の第一の実施の形態の実施例を説明す
るための図である。FIG. 11 is a diagram for explaining an example of the first embodiment of the present invention.

【図１２】本発明の第二の実施の形態の実施例を説明す
るための図である。FIG. 12 is a diagram for explaining an example of the second embodiment of the present invention.

【図１３】本発明の第三の実施の形態の実施例を説明す
るための図である。FIG. 13 is a diagram for explaining an example of the third exemplary embodiment of the present invention.

【図１４】本発明の第四の実施の形態の実施例を説明す
るための図である。FIG. 14 is a diagram for explaining an example of the fourth embodiment of the present invention.

【図１５】従来の音声合成装置の一例を示すブロック図
である。FIG. 15 is a block diagram showing an example of a conventional speech synthesizer.

【図１６】特殊単語が埋め込まれた仮名漢字混じり文章
の一例を示す図である。FIG. 16 is a diagram showing an example of a kana-kanji mixed sentence in which special words are embedded.

【図１７】従来の音声合成装置の一例を示すブロック図
である。FIG. 17 is a block diagram showing an example of a conventional speech synthesizer.

【図１８】特殊単語が付加された仮名漢字混じり文章の
一例を示す図である。FIG. 18 is a diagram showing an example of a kana-kanji mixed sentence to which special words are added.

[Explanation of symbols]

１読み上げ制御装置２タグ検索手段３特殊単語辞書４読み上げバッファ５一般単語辞書６テキスト処理手段７音声合成処理手段８表示バッファ９表示制御手段１０表示装置１１制御手段１２入力手段１０１特殊単語バッファ１０２特殊単語抽出部１０３言語処理部１０４特殊単語変換部１０５音声合成部１１１特殊単語抽出部１１２特殊単語辞書１１３特殊単語辞書作成部１１４一般単語辞書１１５言語処理部１１６音声合成部 1 reading control device 2 tag search means 3 special word dictionary 4 reading buffer 5 general word dictionary 6 text processing means 7 voice synthesis processing means 8 display buffer 9 display control means 10 display device 11 control means 12 input means 101 special word buffer 102 special Word extraction unit 103 Language processing unit 104 Special word conversion unit 105 Speech synthesis unit 111 Special word extraction unit 112 Special word dictionary 113 Special word dictionary creation unit 114 General word dictionary 115 Language processing unit 116 Speech synthesis unit

Claims

[Claims]

1. A tag search means for searching tag information describing phoneme, prosody information, etc. of a special word embedded in a sentence to be processed, and a special word of the special word based on an analysis result of the detected tag information. A speech synthesis apparatus comprising: a reading control means for generating phonological and prosody information.

2. The voice synthesizing apparatus according to claim 1, further comprising a reading control means for replacing the special word to which the tag information is added with the notation information of the tag and storing it in the display buffer.

3. The speech synthesis apparatus according to claim 1, further comprising a reading control means for replacing the special word to which the tag information is added with the phoneme and prosody information of the tag and storing it in the reading buffer.

4. Obtaining special word information of the detected tag,
4. The speech synthesizer according to claim 3, further comprising reading control means for registering the acquired notation, phoneme, and prosody information of the special word information in the special word dictionary.

5. A reading control means for detecting a character string that matches the detected notation information of the tag from a sentence to be processed and replacing the notation information of the matched character string with phoneme and prosody information and storing it in a reading buffer. The speech synthesizer according to claim 3, wherein

6. The speech synthesis according to claim 4, further comprising text processing means for converting the reading sentence data stored in the reading buffer into a phonetic symbol string by referring to the special word dictionary and the general dictionary. apparatus.

7. Control means for separately supplying data of only tag information describing phonemes, prosodic information, etc. of a special word and a sentence to be processed in which special word information is embedded to a reading control means, and tag information. 7. The speech synthesizer according to claim 5, further comprising: a reading control unit that acquires tag information from the data of only the data and registers the acquired notation, phoneme, and prosody information of the special word payment in the special word.