JPH03236099A

JPH03236099A - Document read-aloud device

Info

Publication number: JPH03236099A
Application number: JP2033504A
Authority: JP
Inventors: Yoshiaki Teramoto; 寺本　良明
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-02-14
Filing date: 1990-02-14
Publication date: 1991-10-22

Abstract

PURPOSE: To accurately display the read-aloud position by storing the analytic results of plural phrases in a storage means, outputting the analytic results to a synthesizing means in response to a command specifying the read-aloud position in a document, and outputting a voice. CONSTITUTION: A task control part 21 reads a Japanese document out of a memory 25 through an access part 23 and outputs it to the document read-aloud device 33 through a transmission/reception part 29, and a reception part 41 stores the individual texts in a buffer 43 together with added text numbers. The document is analyzed by a word identification part 47 and an analysis part 49 and stored as phoneme and rhythm data in a buffer 51, and the data are output from a speaker 35 through generation parts 55 and 57. The input text number and position information are sent from the generation part 55 to the control part 21 through the transmission/reception part 29, and the control part 21 sends the text number and position to a display device 31, which displays in reverse video the position currently being read aloud. The control part 21 outputs control commands corresponding to the respective keys to the device 33, and the buffer 51 sends the phrase at the position specified by a control command to a voice synthesizing mechanism, which synthesizes a voice.

Description

[Detailed Description of the Invention]

[Summary]

The invention relates to a text reading device that reads Japanese text written in mixed kanji and kana aloud in a natural voice. Its object is to provide functions for repeating, going back to, and skipping ahead to specified phrases, and to display the reading position accurately. The text reading device comprises analysis means that performs linguistic analysis of an input kanji-kana sentence and outputs the analysis results (phonological information and prosodic information) in units of phrases, and synthesis means that synthesizes and outputs speech based on the analysis results. It further comprises storage means that sequentially stores the analysis results of a plurality of phrases and, in response to a control command specifying a reading position in the input sentence, outputs the analysis results of the corresponding phrase to the synthesis means, and display means that displays the position in the sentence corresponding to the phrase output from the storage means to the synthesis means.

[Industrial application field]

The present invention relates to a text reading device that reads Japanese sentences (text) written in mixed kanji and kana aloud in a natural voice.

A text reading device can serve as a means of conveying information when information cannot be conveyed visually. Possible uses include reading out a database over a telephone line, terminals for blind users, proofreading by ear of text entered into a word processor, and voice guidance.

[Conventional technology]

To read a kanji-kana sentence aloud, language analysis technology is needed that correctly recognizes the words in the sentence (hereinafter, "identifies" them), automatically assigns each word its correct phonology (reading) and accent, and automatically gives the sentence natural accents, pauses, intonation, and so on.

The configuration of a text reading device that reads aloud kanji-kana sentences entered from an input device (a keyboard or the like) can therefore be divided broadly into a language processing unit, which performs this language analysis, and a speech synthesis unit, which synthesizes the corresponding speech from the phonological and prosodic information obtained by the analysis.

That is, the language processing unit breaks the input kanji-kana sentence down into predetermined units (words, word strings) and performs word identification, consulting a word dictionary to assign a reading, accent information, and grammatical information to each unit. It then reassembles the units into higher-order linguistic units in the order of compound words, clauses, and relationships between clauses (accent phrases, breath groups), corrects the phonological information for each compound word or sentence, determines prosodic information such as accent and intonation, and passes this information to the speech synthesis unit.

Analysis of Japanese text requires searching a large word dictionary and therefore takes a relatively long time (about 300 milliseconds for 20 characters of Japanese text), so the speech output from a text reading device was sometimes interrupted.

Conventional text reading devices therefore place buffers before and after the analysis processing (word identification and prosody analysis) so that analysis and speech synthesis run in parallel, keeping the speech output from being interrupted even when analysis takes time.
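The buffered, parallel arrangement described above can be illustrated with a small producer/consumer sketch. This is only an illustration under stated assumptions: the function names `analyze` and `run_pipeline`, the buffer size, and the use of threads are hypothetical and do not come from the patent, and the analysis and audio output are replaced by trivial stand-ins.

```python
import queue
import threading


def analyze(text):
    # Hypothetical stand-in for word identification and prosody analysis.
    return text.upper()


def run_pipeline(texts, buffer_size=4):
    """Run analysis and synthesis concurrently, joined by a bounded buffer."""
    analyzed = queue.Queue(maxsize=buffer_size)
    spoken = []
    done = object()  # sentinel marking the end of the input

    def analysis_stage():
        for text in texts:
            analyzed.put(analyze(text))  # blocks while the buffer is full
        analyzed.put(done)

    def synthesis_stage():
        while True:
            item = analyzed.get()  # blocks while the buffer is empty
            if item is done:
                break
            spoken.append(item)  # stand-in for audio output

    workers = [
        threading.Thread(target=analysis_stage),
        threading.Thread(target=synthesis_stage),
    ]
    for worker in workers:
        worker.start()
    for worker in workers:
        worker.join()
    return spoken
```

Because the queue is bounded, the analysis stage can run ahead of the synthesis stage by at most `buffer_size` items, which is the property that lets output continue while a slow analysis step is in progress.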

[Problem to be solved by the invention]

The text reading device described above simply reads the text straight through; it has no function for moving the reading position from outside, such as repeating a specified phrase, going back, or skipping ahead.

Text reading devices also display the input Japanese text on a display device and highlight (in reverse video) the position currently being read. When the buffer provided to keep the speech from being interrupted has a large capacity, however, a time lag can arise between this display output and the speech output.

An object of the present invention is to provide a text reading device that provides functions for repeating, going back to, and skipping ahead to specified phrases, and that can display the reading position accurately.

[Means to solve the problem]

FIG. 1 is a block diagram of the principle of the present invention.

In the figure, the text reading device comprises analysis means 11 and synthesis means 13.

The analysis means 11 performs linguistic analysis of an input kanji-kana sentence and outputs, in units of phrases, the analysis results: readings and other phonological information, and pauses between words, accent combination, and other prosodic information.

The synthesis means 13 synthesizes and outputs speech based on the analysis results.

The present invention further comprises storage means 15 and display means 17.

The storage means 15 sequentially stores the analysis results of a plurality of phrases and, in response to a control command specifying a reading position in the input sentence, outputs the analysis results of the corresponding phrase to the synthesis means 13.

The display means 17 displays the position in the sentence corresponding to the phrase output from the storage means 15 to the synthesis means 13.

[Operation]

In the present invention, the analysis results of a plurality of phrases are stored sequentially in the storage means 15, and in response to a control command specifying a reading position in the sentence, the analysis results of the corresponding phrase are output to the synthesis means 13 to produce speech output. That is, by holding the analysis results of several phrases before and after the phrase currently being read, it becomes possible to repeat the phrase being read and to move the reading position backward or forward in units of phrases.

Moreover, because the position in the sentence corresponding to the phrase output from the storage means 15 to the synthesis means 13 is displayed, the speech output and the display output no longer get out of step, even when the reading position is moved.

[Embodiment]

The operation of an embodiment is described in detail below with reference to the drawings.

FIG. 2 is a block diagram showing the configuration of a system that uses the text reading device of the present invention.

In the figure, a memory 25 is connected to a task management unit 21 via a file access unit 23, and a keyboard 27, a command transmission/reception unit 29, and a display device 31 are also connected to the task management unit 21. A text reading device 33 is connected to the command transmission/reception unit 29. A speaker 35 is connected to the text reading device 33.

FIG. 3 is a block diagram showing the configuration of an embodiment of the text reading device.

In the figure, a Japanese data storage buffer 43 is connected to a receiving unit 41. A word identification unit 47, to which a word dictionary 45 is connected, is connected to the Japanese data storage buffer 43. A phoneme/prosody analysis unit 49 is connected to the word identification unit 47. A phoneme/prosody data storage buffer 51 is connected to the phoneme/prosody analysis unit 49. A synthesis parameter creation unit 55, to which a speech file 53 is connected, is connected to the phoneme/prosody data storage buffer 51. A synthesized speech creation unit 57 is connected to the synthesis parameter creation unit 55. The speaker 35 is connected to the synthesized speech creation unit 57. The command transmission/reception unit 29 is connected to the receiving unit 41, the Japanese data storage buffer 43, and the phoneme/prosody data storage buffer 51.

The word identification unit 47 identifies words or word strings by consulting the word dictionary 45 and retrieves the reading, accent information, and grammatical information for each. The word dictionary 45 registers, for each entry, the notation (written form) used for word identification, grammatical attributes that express how words can connect, and other word attributes, together with the phonology (reading), accent, and accent-combination attributes used in the subsequent phonological and prosodic processing.
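As a rough illustration of dictionary-based word identification, the sketch below uses greedy longest-match lookup. The patent does not state which matching strategy the word identification unit 47 uses, so longest match is an assumption made here for illustration, as are the function name `identify_words` and the tiny dictionary in the usage note. Only readings are returned; accent and grammatical attributes are omitted.

```python
def identify_words(text, dictionary):
    """Split text into dictionary words by greedy longest match.

    dictionary maps a written form to its reading (hypothetical layout).
    Returns a list of (written_form, reading) pairs in text order.
    """
    max_len = max(map(len, dictionary))  # longest written form to try
    result = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for size in range(min(max_len, len(text) - i), 0, -1):
            surface = text[i:i + size]
            if surface in dictionary:
                result.append((surface, dictionary[surface]))
                i += size
                break
        else:
            raise ValueError(f"no dictionary entry at position {i}")
    return result
```

For example, with the hypothetical dictionary `{"今日": "キョウ", "の": "ノ", "天気": "テンキ", "は": "ワ"}`, the text `"今日の天気は"` is split into the four entries 今日, の, 天気, and は. A practical identifier would also use the grammatical connection attributes to choose between competing segmentations.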

The phoneme/prosody analysis unit 49 reassembles the words into higher-order linguistic units in the order of compound words, clauses, and relationships between clauses. That is, starting from the word readings obtained by the word identification unit 47, it creates the various control information (phonological information and prosodic information) needed for speech synthesis by applying phonological rules, which yield information on rendaku (sequential voicing) and euphonic sound changes caused by joining words, and prosodic rules, which use grammatical information at word and clause boundaries to determine accent phrases and breath groups and to obtain other intonation control information.

The synthesis parameter creation unit 55 consults the speech file 53, in which the basic Japanese syllables are stored in parameter form, reads out the required parameters according to the phonological symbols, assembles them into a continuous parameter sequence, and outputs this to the synthesized speech creation unit 57.

The synthesized speech creation unit 57 synthesizes a speech waveform from the synthesis parameters and outputs continuous speech from the speaker 35.

The Japanese data storage buffer 43 and the phoneme/prosody data storage buffer 51 allow language analysis and speech synthesis to be processed in parallel, and serve to prevent the speech output from the speaker 35 from being interrupted when language analysis takes time.

The correspondence between FIG. 1 and FIGS. 2 and 3 is as follows.

The analysis means 11 corresponds to the word dictionary 45, the word identification unit 47, and the phoneme/prosody analysis unit 49.

The synthesis means 13 corresponds to the speech file 53, the synthesis parameter creation unit 55, the synthesized speech creation unit 57, and the speaker 35.

The storage means 15 corresponds to the Japanese data storage buffer 43 and the phoneme/prosody data storage buffer 51.

The display means 17 corresponds to the display device 31.

The receiving unit 41 is a standard component of a text reading device.

The operation of the text reading device is described below with reference to FIGS. 2 and 3.

The task management unit 21 reads Japanese text from the memory 25 via the file access unit 23 and outputs it to the text reading device 33 via the command transmission/reception unit 29.

The Japanese text input to the text reading device 33 is received by the receiving unit 41. So that the individual texts making up the document can be distinguished, each text is given a text number and is then stored in the Japanese data storage buffer 43.

The Japanese text stored in the Japanese data storage buffer 43 is analyzed by the word identification unit 47 and the phoneme/prosody analysis unit 49, and is stored in the phoneme/prosody data storage buffer 51 as phoneme/prosody data for each phrase. For example, the input text "今日の天気は晴れです" ("Today's weather is sunny") is split into three phrases, "キョウノ" (kyouno), "テンキワ" (tenkiwa), and "ハレデス" (haredesu), which are then stored.

The phoneme/prosody data storage buffer 51 is configured to hold the phoneme/prosody data for several phrases to several tens of phrases, and the stored data are output phrase by phrase from the speaker 35 via the synthesis parameter creation unit 55 and the synthesized speech creation unit 57. The phoneme/prosody data stored in the buffer 51 also carry, for each phrase, the input text number and the position in that text (the start position and length of the words; for example, "キョウノ" runs from the first character to the fourth character of the first text). When a phrase is output to the synthesis parameter creation unit 55, this information is output to the task management unit 21 via the command transmission/reception unit 29.
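One possible shape for the per-phrase record described above is sketched below. The class name `PhraseRecord`, its field names, and the helper `build_records` are hypothetical. Positions are 1-based and are computed simply by counting the characters of each written phrase, so the offsets are illustrative only and are not taken from the patent. Prosodic data are omitted for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PhraseRecord:
    reading: str      # phonological data, here just the katakana reading
    text_number: int  # number given to the text by the receiving unit
    start: int        # 1-based position of the first character in the text
    length: int       # number of characters the phrase covers


def build_records(text_number, segments):
    """Build records from (written_form, reading) pairs given in text order."""
    records = []
    position = 1
    for surface, reading in segments:
        records.append(PhraseRecord(reading, text_number, position, len(surface)))
        position += len(surface)
    return records
```

Keeping the text number and character span with each phrase means that whichever phrase is handed to the synthesizer, its location in the displayed text can be reported without consulting the original text again.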

Using the information supplied from the command transmission/reception unit 29, the task management unit 21 outputs the number of the text being read and the reading position to the display device 31, which highlights in reverse video the position currently being read within the displayed Japanese text.
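The highlighting step can be pictured with the following minimal sketch, which marks a span of the displayed text with bracket characters in place of reverse video. The function name `highlight`, the 0-based `start` convention, and the marker characters are assumptions made for this illustration.

```python
def highlight(text, start, length, on="[", off="]"):
    """Return text with the span of `length` characters from `start` marked.

    `start` is 0-based; the markers stand in for reverse-video display.
    """
    if start < 0 or length < 0 or start + length > len(text):
        raise ValueError("span lies outside the text")
    end = start + length
    return text[:start] + on + text[start:end] + off + text[end:]
```

Because the span comes from the phrase actually sent to the synthesizer rather than from the phrase most recently analyzed, the marked region follows the audio regardless of how far analysis has run ahead.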

The task management unit 21 also scans the keyboard 27. When one of the keys provided on the keyboard 27 for repeating the phrase being read, moving the reading position back, skipping ahead, and so on is pressed, it outputs the control command corresponding to that key to the text reading device 33.

In the text reading device 33, the control command is received via the command transmission/reception unit 29 by the Japanese data storage buffer 43 and the phoneme/prosody data storage buffer 51.

On receiving a control command, the phoneme/prosody data storage buffer 51 temporarily halts speech synthesis in the speech synthesis mechanism (the synthesis parameter creation unit 55 and the synthesized speech creation unit 57), sends the phrase at the position specified by the control command to the speech synthesis mechanism, and has it synthesize speech.

For example, as shown in FIG. 4, suppose the phoneme/prosody data storage buffer 51 holds six phrases, numbered sequentially from the beginning of the text, and is outputting the third phrase. In response to the control command "read the phrase two after the one currently being read," it sends the fifth phrase to the speech synthesis mechanism, which outputs it from the speaker 35. In the same way, the reading position can be set by control command to repeat the phrase being output or to go back, for example with "read the phrase two before."
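The FIG. 4 example can be sketched as a buffer with a movable reading position. The class name `PhraseBuffer`, the method `seek`, and the encoding of a control command as a signed phrase offset are assumptions for illustration; the patent does not specify a command format.

```python
class PhraseBuffer:
    """Holds analysed phrases and the index of the phrase being read."""

    def __init__(self, phrases, current=0):
        self.phrases = list(phrases)
        self.current = current  # 0-based index of the phrase being read

    def seek(self, offset):
        """Move the reading position by `offset` phrases and return that phrase.

        An offset of 0 repeats the current phrase, a negative offset goes
        back, and a positive offset skips ahead.
        """
        target = self.current + offset
        if not 0 <= target < len(self.phrases):
            raise IndexError("target phrase is not held in the buffer")
        self.current = target
        return self.phrases[target]
```

With six phrases held and the third being read (`current=2`), `seek(2)` returns the fifth phrase, matching the example in the text; a following `seek(0)` repeats it and `seek(-2)` returns to the third.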

When the reading position is moved ahead beyond the range held in the phoneme/prosody data storage buffer 51 (for example, to read a phrase four or more after the one currently being output), the text stored in the Japanese data storage buffer 43 is, in response to the control command, output to the word identification unit 47 and analyzed, then stored in the phoneme/prosody data storage buffer 51 and output to the synthesis parameter creation unit 55.
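The out-of-range case can be sketched as analysis on demand: if the requested phrase has not yet been analysed, pending text is analysed until it has. The function name `seek_with_refill`, its parameters, and the choice to keep earlier phrases in the list are assumptions for illustration; the patent does not describe how buffer contents are retained or discarded.

```python
def seek_with_refill(buffered, pending_texts, current, offset, analyze):
    """Return (index, phrase) for the phrase `offset` away from `current`.

    buffered: list of already analysed phrases (extended in place).
    pending_texts: list of texts not yet analysed (consumed from the front).
    analyze: callable turning one text into a list of phrases (hypothetical).
    """
    target = current + offset
    if target < 0:
        raise IndexError("reading position is before the start of the buffer")
    # Analyse further texts until the target phrase is available.
    while target >= len(buffered) and pending_texts:
        buffered.extend(analyze(pending_texts.pop(0)))
    if target >= len(buffered):
        raise IndexError("reading position is past the end of the document")
    return target, buffered[target]
```

In this sketch a skip within the analysed range costs nothing extra, while a longer skip incurs the analysis delay for the intervening text.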

[Effect of the invention]

As described above, the present invention makes full use of the buffering in the text reading device so that Japanese text is not merely read straight through: the phrase being read can be repeated, and the reading position can be moved back or ahead. It therefore becomes possible to have the same passage read repeatedly, to go back and listen again to a passage that was missed, and to skip passages that are not needed.

Furthermore, because the display output uses the position in the Japanese text of the phrase that has been output to the speech synthesis mechanism, the display showing the position being read can be kept synchronized with the speech output even when the buffer capacity is large and even when the reading position is moved.

[Brief explanation of drawings]

FIG. 1 is a block diagram of the principle of the present invention, FIG. 2 is a block diagram showing the configuration of the system, FIG. 3 is a block diagram showing the configuration of the embodiment, and FIG. 4 is a diagram explaining the operation of the buffer.

In the figures, 11 is the analysis means, 13 is the synthesis means, 15 is the storage means, 17 is the display means, 21 is the task management unit, 23 is the file access unit, 25 is the memory, 27 is the keyboard, 29 is the command transmission/reception unit, 31 is the display device, 33 is the text reading device, 35 is the speaker, 41 is the receiving unit, 43 is the Japanese data storage buffer, 45 is the word dictionary, 47 is the word identification unit, 49 is the phoneme/prosody analysis unit, 51 is the phoneme/prosody data storage buffer, 53 is the speech file, 55 is the synthesis parameter creation unit, and 57 is the synthesized speech creation unit.

Claims

[Claims]

(1) A text reading device comprising analysis means (11) that performs linguistic analysis of an input sentence containing kanji and kana and outputs analysis results of phonological information and prosodic information in units of phrases, and synthesis means (13) that synthesizes and outputs speech based on the analysis results, the text reading device further comprising: storage means (15) that sequentially stores the analysis results of a plurality of phrases and, in response to a control command specifying a reading position in the input sentence, outputs the analysis results of the corresponding phrase to the synthesis means (13); and display means (17) that displays the position in the sentence corresponding to the phrase output from the storage means (15) to the synthesis means (13).