JPH02176797A

JPH02176797A - Speech synthesis system

Info

Publication number: JPH02176797A
Application number: JP63331744A
Authority: JP
Inventors: Shiyuuichi Kawama; 河間　修一; Jiyungo Kitou; 鬼頭　淳悟
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1988-12-28
Filing date: 1988-12-28
Publication date: 1990-07-09

Abstract

PURPOSE:To decrease the amount of data of auxiliary information at a storage part by adding code data indicating the terminal of a segment and code data indicating the frequency of repetition of segment reproducing operation successively to the end of each segment and storing them. CONSTITUTION:The code data indicating the end of the segment of a waveform such as a pitch waveform segment and the code data indicating the frequency of repetition of segment reproducing operation are added successively right before or after speech code data at the end of the segment of the waveform and they are stored in a storage part 1. When read code data is speech code data, a composition part 5 decodes the speech waveform data and when the data is code data indicating the end of the segment, a repetitive resending processing part 4 outputs a necessary decision processing and control signal for the repetitive reproduction of the segment of a repetitive reproduction processing part 4, and the composition part 5 repeats segment reproducing operation from the speech code data of the head of the segment is repeated as many times as the contents of the sign data. Consequently, the amount of data of the auxiliary information stored on the storage part 1 can be reduced.

Description

【発明の詳細な説明】〈産業上の利用分野〉この発明は、波形素片を繰り返し再生することによって
合成音声を生成する音声合成方式に関する。DETAILED DESCRIPTION OF THE INVENTION <Industrial Application Field> The present invention relates to a speech synthesis method that generates synthesized speech by repeatedly reproducing waveform segments.

〈従来の技術〉音声における母音等の定常部においては、音の高さ（ピ
ッチ）に対応した周期でほぼ同じ音声波形声符号データ
のビット長を３〜４ビット程度とすると、上記素片の繰
り返し数は音声符号データと同じビット長で符号化する
ことができる。<Prior art> In the stationary parts of speech, such as vowels, if the bit length of the speech waveform voice code data, which is approximately the same at a period corresponding to the pitch of the sound, is about 3 to 4 bits, the above-mentioned segment The number of repetitions can be encoded with the same bit length as the voice code data.

しかしながら、１素片に含まれる音声符号データ数は数
百となる場合がある。例えば、音声のわたり部（音素と
音素の中間部）においては素片の繰り返し再生を行わな
いために、上記わたり部における素片においては１素片
に３ピツチ分の波形を含むとし、サンプリング周波数が
８ＫＨｚであるとする。そうすると、ピッチ周波数１０
０１（ｚのときの音声符号データ数は、８０００ｘ３／
＋００　＝　２４．０個となる。したがって、この場合
に１素片に含まれる音声符号データ数を表わすのに必要
なビット長は８ビツトとなり、音声符号データのビット
長が３〜４ビツトであるのに比較して２倍以上になる。However, the number of voice code data included in one elemental piece may be several hundred. For example, in order not to repeatedly play a segment at the transition part of speech (the middle part between phonemes), it is assumed that one segment at the transition part contains a waveform of 3 pitches, and the sampling frequency is Suppose that the frequency is 8KHz. Then, the pitch frequency is 10
01 (the number of voice code data when z is 8000x3/
+00 = 24.0 pieces. Therefore, in this case, the bit length required to represent the number of speech code data included in one segment is 8 bits, which is more than twice the bit length of speech code data, which is 3 to 4 bits. Become.

すなわち、記憶部に格納される素片の数が多くなると、
それだけ各素片に含まれる音声符号データ数の情報が占
めるビット数も多くなり、記憶部に格納される補助情報
のデータ量が多くなるといが繰り返されるピッチ構造が
見られる。In other words, when the number of fragments stored in the storage section increases,
The number of bits occupied by the information on the number of voice code data included in each segment increases accordingly, and as the amount of auxiliary information stored in the storage section increases, a pitch structure in which data is repeated is observed.

そこて、従来より音声波形における１ピツチの波形素片
やこれに準じた音声波形素片等の素片の波形を表す符号
データを予め記憶部に記憶しておき、この素片の波形を
表す符号データを必要回数だけ繰り返して復号化して合
成音声波形を再生することによって、記憶部に記憶する
音声波形の符号データ量の低減を図った音声合成方式が
ある。Conventionally, code data representing the waveform of a one-pitch waveform segment in a speech waveform or a similar speech waveform segment is stored in advance in a storage unit, and the waveform of this segment is represented. There is a speech synthesis method that aims to reduce the amount of coded data of a speech waveform stored in a storage unit by decoding coded data repeatedly a necessary number of times to reproduce a synthesized speech waveform.

上記音声合成方式においては、各素片毎に素片の波形を
符号化した音声符号データを記憶部に格納している。そ
の際に、素片の音声符号データに基づく合成音声波形生
成（以下、素片再生と言う）時における補助情報として
、素片の繰り返し数１素片に含まれている音声符号デー
タの個数および最終素片か否かの情報等も記憶部に格納
している。In the above speech synthesis method, speech code data obtained by encoding the waveform of each segment is stored in the storage unit. At that time, as auxiliary information when generating a synthesized speech waveform based on the speech code data of the segment (hereinafter referred to as segment reproduction), the number of repetitions of the segment, the number of speech code data included in one segment, and Information such as whether it is the final segment or not is also stored in the storage unit.

〈発明が解決しようとする課題〉通常、素片再生時に用いる補助情報のうち素片の繰り返
し数は２〜４程度であり２ビツトのビット長で符号化す
ることができる。したがって、音う問題がある。<Problems to be Solved by the Invention> Normally, among the auxiliary information used when reproducing a segment, the number of repetitions of a segment is about 2 to 4, and can be encoded with a bit length of 2 bits. Therefore, there is a noise problem.

そこで、この発明の目的は、素片の音声符号データの個
数を記憶しないことによって、記憶部に記憶する補助情
報のデータ量を少なくすることができる音声合成方式を
提供することにある。SUMMARY OF THE INVENTION Accordingly, it is an object of the present invention to provide a speech synthesis method that can reduce the amount of auxiliary information stored in a storage section by not storing the number of speech code data of a segment.

〈課題を解決するための手段〉上記目的を達成するため、この発明は、音声波形を符号
化して得られた音声符号データをピッチ波形素片等の波
形の素片毎に格納する記憶部と、上記記憶部に素片毎に
格納された音声符号データを順次読み出して復号化する
素片再生動作を繰り返して行うことによって合成音声波
形を生成する合成部を有する音声合成方式において、各
素片の終端の音声符号データの直航または直後に、上記
素片の終端を示す符号データと上記素片再生動作の繰り
返し数を示す符号データとを連続して付加して上記記憶
部に格納し、上記記憶部に格納された各符号データを順
次１個ずつ読み出して、この読み出された符号データが
上記素片の終端を示す符号データか音声符号データかの
判別を素片終端判別手段によって行い、上記素片終端判
別手段が、上記読み出された符号データが音声符号デー
タであると判別した場合は、上記合成部によってその音
声符号データを音声波形データへ復号化する一方、上記
読み出された符号データが上記素片の終端を示す符号デ
ータであると判別した場合は、」−記合成部によって上
記繰り返し数を示す符号データの内容が示す回数だけ同
じ素片の先頭の音声符号データに戻って上記素片再生動
作を繰り返すようになしたことを特徴としている。<Means for Solving the Problems> In order to achieve the above object, the present invention includes a storage unit that stores speech code data obtained by encoding a speech waveform for each waveform segment such as a pitch waveform segment; , in a speech synthesis method having a synthesis unit that generates a synthesized speech waveform by repeatedly reading and decoding the speech code data stored for each segment in the storage unit, each segment is Directly or immediately after the voice code data at the end of the segment, code data indicating the end of the segment and code data indicating the number of repetitions of the segment playback operation are successively added and stored in the storage unit; Each piece of code data stored in the storage section is read out one by one, and a piece end determination means determines whether the read code data is code data indicating the end of the piece or speech code data. If the segment end determining means determines that the read code data is voice code data, the synthesizer decodes the voice code data into voice waveform data, while decoding the read code data into voice waveform data. If it is determined that the encoded data is the code data indicating the end of the segment, the "-" synthesis unit repeats the code data at the beginning of the same segment the number of times indicated by the content of the code data indicating the number of repetitions. It is characterized in that it returns and repeats the above-mentioned segment reproduction operation.

また、この発明は、上記音声合成方式において、上記素
片が上記記憶部に格納される最終素片である場合は、上
記最終素片の終端の音声符号データの直前または直後に
、上記素片の終端を示す符号データと、繰り返し数を示
す符号データに変わる最終素片を示す符号データとを連
続して付加して上記記憶部に格納し、上記素片終端判別
手段が上記読み出された符号データが上記素片の終端を
示す符号データであると判別した場合に、次に読み出さ
れた符号データが上記最終素片を示す符号デ＝７上記記憶部に格納された符号データが順次１個ずつ読み
出される。そうすると、素片終端判別手段によって、上
記読み出された符号データが上記素片の終端を示す符号
データか音声符号データかが判別される。そして、その
結果上記読み出された符号データが音声符号データであ
ると判別された場合は、合成部によってその音声符号デ
ータが音声波形データへ復号化される一方、上記読み出
された符号データが上記素片の終端を示す符号データで
あると判別された場合は、上記合成部によって上記繰り
返し数を示す符号データの内容が示す回数だけ同じ素片
の先頭の音声符号データに戻って上記素片再生動作が繰
り返して実行される。Further, in the speech synthesis method, when the segment is the final segment stored in the storage unit, the segment is added to the segment immediately before or after the end speech code data of the final segment. Code data indicating the end of the segment and code data indicating the final segment that changes to code data indicating the number of repetitions are successively added and stored in the storage unit, and the segment end discriminating means reads the segment end. When it is determined that the code data is the code data indicating the end of the segment, the code data read next is code data indicating the final segment. They are read out one by one. Then, the segment end determining means determines whether the read code data is code data indicating the end of the segment or audio code data. As a result, if it is determined that the read code data is voice code data, the voice code data is decoded into voice waveform data by the synthesis unit, while the read code data is If it is determined that the code data indicates the end of the segment, the synthesizing unit returns to the speech code data at the beginning of the same segment the number of times indicated by the content of the code data indicating the number of repetitions, and returns to the speech code data at the beginning of the segment. The playback operation is executed repeatedly.

したがって、ｌ素片に含まれる音声符号データの数を示
す符号データを用いなくても、上記素片単位で素片再生
動作を実行することができる。Therefore, even without using code data indicating the number of speech code data included in an element, the elemental piece reproduction operation can be performed on an elemental piece basis.

また、この発明の音声合成方法においては、上記素片が
上記記憶部に格納される最終素片である場合には、上記
最終素片の終端の音声符号データの直前または直後に、
上記素片の終端を示す符号−タか繰り返し数を示す符号
データかの判別を最終素片判別手段によって行い、上記
最終素片判別手段が、上記読み出された符号データが最
終素片を示す符号データであると判別した場合は、上記
合成部は上記素片再生動作を終了する一方、上記読み出
された符号データが上記繰り返し数を示す符号データで
あると判別した場合は、上記合成部によって上記繰り返
し数を示す符号データの内容が示す回数だけ同じ素片の
先頭の音声符号データに戻って上記素片再生動作を繰り
返すようになしたことを特徴としている。Further, in the speech synthesis method of the present invention, when the segment is the final segment stored in the storage section, immediately before or after the voice code data at the end of the final segment,
The final segment determining means determines whether the code data indicates the end of the segment or the code data indicating the number of repetitions, and the final segment discriminating means determines whether the read code data indicates the final segment. If it is determined that the read code data is code data, the synthesis unit ends the segment reproduction operation, while if it is determined that the read code data is code data indicating the number of repetitions, the synthesis unit The present invention is characterized in that the segment reproduction operation is repeated by returning to the first speech code data of the same segment as many times as indicated by the content of the code data indicating the number of repetitions.

く作用〉この発明の音声合成方法においては、ピッチ波形素片等
の波形の素片における終端の音声符号データの直前また
は直後に、上記素片の終端を示す符号データと上記記憶
部に素片毎に格納された音声符号データを順次読み出し
て復号化する素片再生動作の繰り返し数を示す符号デー
タとを連続して付加して記憶部に格納される。Effects> In the speech synthesis method of the present invention, the code data indicating the end of the segment and the segment are stored in the storage section immediately before or after the voice code data at the end of the segment of the waveform such as the pitch waveform segment. The code data indicating the number of repetitions of the segment playback operation in which the stored voice code data is sequentially read and decoded is sequentially added and stored in the storage unit.

そして、上記素片再生動作を実行する際には、データと
、上記繰り返し数を示す符号データに変わる最終素片を
示す符号データとを連続して付加して記憶部に格納され
る。When performing the segment reproduction operation, data and code data indicating the final segment that changes to the code data indicating the number of repetitions are successively added and stored in the storage section.

上記素片再生動作を実行する際には、上記記憶部に格納
された符号データが順次１個ずつ読み出される。そして
、上記素片終端判別手段によって上記読み出された符号
データが上記素片の終端を示す符号データであると判別
された場合は、次に読み出された符号データが上記最終
素片を示す符号データか素片の終端を示す符号データか
の判別が最終素片判別手段によって判別される。When performing the segment reproduction operation, the code data stored in the storage section is sequentially read out one by one. If the segment end determination means determines that the read code data is code data indicating the end of the segment, the next read code data indicates the final segment. The final segment determining means determines whether the code data is the code data or the code data indicating the end of the segment.

その結果、上記読み出された符号データが上記最終素片
を示す符号データであると判別された場合は、上記合成
部によって上記素片再生動作が終了される。一方、上記
読み出された符号データが上記素片の終端を示す符号デ
ータであると判別された場合は、上記合成部によって上
記繰り返し数を示す符号データの内容が示す回数だけ同
じ素片の先頭の音声符号データに戻って上記素片再生動
作が繰り返して実行される。As a result, if it is determined that the read code data is code data indicating the final segment, the synthesis section ends the segment reproduction operation. On the other hand, if the read code data is determined to be the code data indicating the end of the segment, the synthesizing section repeats the start of the same segment the number of times indicated by the content of the code data indicating the number of repetitions. The voice code data is returned to and the above segment reproduction operation is repeated.

したがって、Ｉ素片に含まれる音声符号データの数を示
す符号データを用いなくても、上記素片単位で素片再生
動作を実行することができる。また、上記記憶部上に最
終素片を示す符号データのための領域を別に確保するこ
となく、再生された素片が最終素片であることを判別し
て」１記素片再生動作を終了することができる。Therefore, even without using code data indicating the number of speech code data included in an I segment, the segment reproduction operation can be performed on a segment-by-segment basis. In addition, without securing a separate area on the storage unit for code data indicating the final segment, it is determined that the reproduced segment is the final segment, and the 1 segment reproduction operation is terminated. can do.

〈実施例〉以下、この発明を図示の実施例により詳細に説明する。<Example> Hereinafter, the present invention will be explained in detail with reference to illustrated embodiments.

第１図はこの発明に係る音声合成装置のブロック図であ
る。記憶部としてのＲＯＭ（リード・オンリ・メモリ）
１はアナログの音声波形データを図示しない符号化器に
よってＤ　Ｐ　ＣＭ（差分パルス符号化）、ＡＤＰＣＭ
（適応差分パルス符号化）等の波形符号化方式で符号化
して得られた音声符号データや補助情報等の符号データ
を格納し、アドレス・カウンタ２はＲＯＭＩをアクセス
する際のアドレスを指示する。このアドレス・カウンタ
２はＲＯＭＩから音声符号データが読み出されるごと−
Ｉ＋音声符号データＣを各素片ごとにＲＱＭＩに格納する。FIG. 1 is a block diagram of a speech synthesis device according to the present invention. ROM (read-only memory) as a storage unit
1 converts analog audio waveform data into D PCM (differential pulse coding) and ADPCM by an encoder (not shown).
The address counter 2 stores coded data such as audio coded data and auxiliary information obtained by encoding using a waveform encoding method such as (adaptive differential pulse encoding), and the address counter 2 indicates an address when accessing the ROMI. This address counter 2 is set every time voice code data is read from ROMI.
I+ Voice code data C is stored in RQMI for each segment.

そして、各素片の最後の音声符号データＣ（１，ｎ）、
・・・、Ｃ（Ｌ、ｍ）の後に、素片の終端を示ず符号デ
ータＥと素片再生の繰り返しを行う際の繰り返し数を示
す符号データＲとを格納する。もし、素片再生を繰り返
さない場合には繰り返し数を示す符号データＲの内容を
“０″にする。また、この素片がＲＯＭＩに格納された
最後の素片である最終素片（Ｌ）である場合は、繰り返
し数を表す符号データＲを格納する場所に最終素片（Ｌ
）を示す符号データＲＥを格納する。ここで、素片の終
端を示す符号データＥ、繰り返し数を示ず符号データＲ
１最終素片（Ｌ）を示す符号データＲＥおよび音声符号
データＣは互いに異なる符号データでなければならない
。すなわち、例えば、上記各符号データＥ、Ｒ，ＲＥの
ビット長を４ビツトとすると、上記各符号データＥ、Ｒ
，ＲＥは０〜１５の数字で表わすことができる。このう
ち、音声符号データＣおよび繰り返し数を示す符号デー
タＲに０−１４を割り当て、素片の終端を示す符号デー
タＥおよにアドレス・カウンタ自身の内容に“ビを加算
する。アドレス・スタック３は上記素片における先頭の
音声符号データが格納されているＲＯＭＩ上のアドレス
の値を格納する。このアドレスはアドレス・カウンタ２
から供給される。Then, the last speech code data C(1,n) of each segment,
. . . After C(L, m), code data E which does not indicate the end of the segment and code data R which indicates the number of repetitions when repeating the segment reproduction are stored. If segment reproduction is not repeated, the content of code data R indicating the number of repetitions is set to "0". In addition, if this elemental piece is the final elemental piece (L) that is the last elemental element stored in ROMI, the final elemental element (L) is stored in the location where code data R representing the number of repetitions is stored.
) is stored. Here, code data E indicating the end of the segment, code data R indicating the number of repetitions,
Code data RE and voice code data C indicating one final segment (L) must be different code data from each other. That is, for example, if the bit length of each of the code data E, R, and RE is 4 bits, then each of the code data E, R, and
, RE can be represented by numbers from 0 to 15. Of these, 0-14 is assigned to the voice code data C and the code data R indicating the number of repetitions, and "bi" is added to the code data E indicating the end of the segment and the contents of the address counter itself.Address stack 3 stores the value of the address on the ROMI where the first voice code data in the segment is stored.This address is stored in the address counter 2.
Supplied from.

繰り返し再生処理部４は素片再生を繰り返して行う際に
必要な判別処理や制御信号の出力を行う。The repetitive reproduction processing section 4 performs discrimination processing and outputs control signals necessary for repeatedly performing fragment reproduction.

復号化部５は入力された音声符号データを復号化してデ
ィジタルの合成音声波形データを出力する。The decoding unit 5 decodes the input voice code data and outputs digital synthesized voice waveform data.

Ｄ／Ａ変換器６は入力されたディジタルの合成音声波形
データをＤ／Ａ変換してアナログの音声波形データを出
力する。パラメータ・スタック７は素片の先頭の音声符
号データを復号化する際に必要なパラメータ類を記憶す
る。繰り返し数カウンタ８は素片再生時における同じ素
片の繰り返し数を格納し、同じ素片に基づく素片再生を
繰り返す毎に上記格納した繰り返し数を減算する。The D/A converter 6 performs D/A conversion on the input digital synthesized voice waveform data and outputs analog voice waveform data. The parameter stack 7 stores parameters necessary for decoding the audio code data at the beginning of a segment. The repetition number counter 8 stores the number of repetitions of the same elemental piece during elemental piece reproduction, and subtracts the stored repetition number each time elemental piece reproduction based on the same elemental piece is repeated.

第２図はＲＯＭＩに格納される音声符号データ等の符号
データのフォーマットの一例を示す。図示しない符号化
器によって符号化されて得られたび最終素片を示す符号
データＲＥに１５を割り当てればよい。FIG. 2 shows an example of the format of code data such as voice code data stored in ROMI. It is sufficient to allocate 15 to code data RE indicating the final elemental piece obtained by encoding by an encoder (not shown).

ここで素片とは、繰り返し再生を行う場合には１ピッチ
周期の音声波形素片あるいはこれに準する音声波形素片
であり、繰り返し再生を行わない場合は繰り返し再生を
行う２つの素片間の音声波形素片である。Here, a segment is a speech waveform segment with one pitch period or a similar speech waveform segment when repeated playback is performed, and an interval between two segments that are repeatedly played back when repeated playback is not performed. This is a speech waveform segment.

第２図（ａ）のフォーマットで各符号データが格納され
ているＲＯＭＩを有する上記構成の音声合成装置は、第
３図に示す素片再生動作のフローチャートに従って動作
する。以下、このフローチャートに従って素片再生動作
を詳細に説明する。The speech synthesis apparatus having the above configuration and having the ROMI in which each code data is stored in the format shown in FIG. 2(a) operates according to the flowchart of the segment playback operation shown in FIG. 3. Hereinafter, the segment reproduction operation will be explained in detail according to this flowchart.

ステップＳ１で、素片再生動作が開始されると各部の初
期化が次のように行われる。すなわち、アドレス・カウ
ンタ２およびアドレス・スタック３の内容は、先頭の素
片（１）の最初の音声符号データＣ（１，１）を格納し
ているＲＯＭ１のアドレスに設定される。また、パラメ
ータ・スタック７の内容は、復号化部５内に格納されて
いる初期化されたパラメータ類と同じに設定される。さ
らに、繰り返し数カウンタ８の内容は“０”に設定され
る。In step S1, when the elemental piece reproduction operation is started, each part is initialized as follows. That is, the contents of the address counter 2 and the address stack 3 are set to the address of the ROM 1 storing the first voice code data C(1,1) of the first segment (1). Further, the contents of the parameter stack 7 are set to be the same as the initialized parameters stored in the decoding unit 5. Furthermore, the content of the repetition number counter 8 is set to "0".

ステップＳ２で、アドレス・カウンタ２が指示するＲＯ
Ｍ＋のアドレスに格納されている符号データが読み出さ
れて、繰り返し再生処理部４および復号化部５に出力さ
れる。そして、アドレス・カウンタ２の内容ａに“ビが
加算されて、次にＲＯＭ１から符号データを読み出す際
のアドレスに更新される。In step S2, the RO indicated by address counter 2
The encoded data stored at the address M+ is read out and output to the repetitive reproduction processing section 4 and the decoding section 5. Then, "bi" is added to the content a of the address counter 2, and the address is updated to the address for reading code data from the ROM 1 next time.

ステップＳ３で、入力された符号データが素片の終端を
示す符号データＥであるか音声符号データＣであるかが
繰り返し再生処理部４によって判別される。その結果、
素片の終端を示す符号データＥの場合はステップＳ６に
進み、音声符号データＣの場合はステップＳ４に進む。In step S3, the repetitive reproduction processing unit 4 determines whether the input code data is code data E indicating the end of a segment or voice code data C. the result,
In the case of code data E indicating the end of a segment, the process proceeds to step S6, and in the case of speech code data C, the process proceeds to step S4.

ステップＳ４で、上記ステップＳ２においてＲＯＭ１か
ら読み込まれた符号データは音声符号データＣであるの
で、この音声符号データＣが復号化部５によって復号化
されてディジタルの合成音声波形データが得られる。In step S4, since the coded data read from the ROM 1 in step S2 is voice coded data C, this voice coded data C is decoded by the decoding section 5 to obtain digital synthesized voice waveform data.

ステップＳ５で、上記ステップＳ４において得ら込まれ
た符号データが最終素片（Ｌ）を示す符号データＲＥで
あるか再生の繰り返し数を示す符号データＲＥであるか
が判別される。その結果最終素片（Ｌ）を示す符号デー
タＲＥであると判別された場合には素片再生動作を終了
し、そうでなければステップＳ８に進む。In step S5, it is determined whether the code data obtained in step S4 is code data RE indicating the final segment (L) or code data RE indicating the number of repetitions of reproduction. As a result, if it is determined that the code data RE indicates the final elemental piece (L), the elemental piece reproduction operation is ended; otherwise, the process advances to step S8.

ステップＳ８で、繰り返し数カウンタ８の内容ｒが“０
″であるか否かが判別される。その結果“θ″であれば
ステップＳ９に進み、そうでなければステップＳ１２に
進む。In step S8, the content r of the repetition number counter 8 is “0”.
If the result is "θ", the process advances to step S9; otherwise, the process advances to step S12.

ステップＳ９で、上記ステップＳ６において読み込まれ
た繰り返し数を示す符号データＲの内容が繰り返し数カ
ウンタ８にセットされる。In step S9, the content of the code data R indicating the number of repetitions read in step S6 is set in the repetition number counter 8.

ステップＳＩＯで、繰り返し数カウンタ８の内容ｒがθ
″であるか否かが判別される。その結果“０″であれば
ステップＳｌｌに進んで次の素片に基づく素片再生の準
備に入る一方、そうでなければステップＳ１４に進んで
同じ素片に基づく素片再生の繰り返しの準備に入る。In step SIO, the content r of the repetition number counter 8 is θ
If the result is "0", the process proceeds to step Sll and preparations are made to reproduce the next elemental piece, while if not, the process proceeds to step S14 and the same elemental piece is reproduced. Preparation begins for repeated fragment reproduction based on fragments.

ステップＳＬＩで、ＲＯＭ１の次の素片におけるれたデ
ジタルの合成音声波形データが、Ｄ／Ａ変換器６によっ
てＤ／Ａ変換されてアナログの合成音声波形が出力され
る。そして、ステップＳ２に戻り次のアドレスの音声符
号データの処理に入る。At step SLI, the digital synthesized speech waveform data in the next segment of the ROM 1 is D/A converted by the D/A converter 6, and an analog synthesized speech waveform is output. Then, the process returns to step S2 and begins processing the voice code data at the next address.

一方、ステップＳ３において、入力された符号データが
素片の終端を表す符号データＥであると判別された場合
には、以下のステップ８６〜ステツプＳ１４の処理が繰
り返し再生処理部４によって実行される。On the other hand, if it is determined in step S3 that the input code data is code data E representing the end of the segment, the following processes from step 86 to step S14 are executed by the repeat reproduction processing unit 4. .

ステップＳ６で、次の符号データがＲＯＭ１から読み込
まれる。この場合、読み込まれた符号データは素片の終
端を示す符号データＥの次のアドレスから読み出された
符号データであるから、繰り返し数を示す符号データＲ
あるいは最終素片を示す符号データＲＥである。そして
、アドレス・カウンタ２の内容ａに“ビが加算される。In step S6, the next code data is read from ROM1. In this case, the read code data is the code data read from the address next to the code data E indicating the end of the segment, so the code data R indicating the number of repetitions
Alternatively, it is code data RE indicating the final elemental piece. Then, "bi" is added to the content a of the address counter 2.

そうすると、アドレス・カウンタ２の内容ａは、次の素
片における先頭の音声符号データＣのアドレスに更新さ
れる。Then, the content a of the address counter 2 is updated to the address of the first speech code data C in the next segment.

ステップＳ７で、上記ステップＳ６において読み先頭の
音声符号データＣのアドレスを格納しているアドレス・
カウンタの内容ａをアドレス・スタック３にセットする
。また、この場合復号化部５にも次の素片の先頭の音声
符号データＣを復号化する際のパラメータ類が保持され
ている。そして、この復号化部５に保持されているパラ
メータ類の値ｐｉがパラメータ・スタック７にセットさ
れ、ステップＳ２へ戻る。In step S7, the address storing the address of the audio code data C at the beginning of the reading in step S6 is checked.
Set counter content a to address stack 3. Further, in this case, the decoding unit 5 also holds parameters for decoding the first speech code data C of the next segment. Then, the values pi of the parameters held in the decoding unit 5 are set in the parameter stack 7, and the process returns to step S2.

ステップＳ１２で、繰り返し数カウンタ８の内容ｒから
“１”が減算される。In step S12, "1" is subtracted from the content r of the repetition number counter 8.

ステップＳ１３で、繰り返し数カウンタ８の内容ｒがθ
″か否かが判別される。その結果“０″である場合はス
テップＳｌｌに進んで次の素片における素片再生の準備
に入り、そうでなければステップＳ１４に進んで同じ素
片における素片再生の繰り返しの準備に入る。In step S13, the content r of the repetition number counter 8 is θ
If the result is "0", the process advances to step Sll to prepare for the reproduction of the next elemental piece; otherwise, the process advances to step S14 to reproduce the elemental element in the same elemental piece. Begins preparation for repeating one-sided playback.

ステップ８１４で、今回復号化が終了した素片における
先頭の音声符号データＣのＲＯＭ１上のアドレスを格納
しているアドレス・スタック３の内容ａ°がアドレス・
カウンタ２にセットされると共に、復号化部５のパラメ
ータ類ｐｉがパラメータ・スタック７の内容ｐｉ“に更
新される。In step 814, the contents a° of the address stack 3 storing the address on ROM 1 of the first speech code data C in the fragment that has just been decoded is the address.
The counter 2 is set, and the parameters pi of the decoding unit 5 are updated to the contents pi of the parameter stack 7.

こうすることにより、今回復号化が終了した素片の先頭
の音声符号データＣを再度復号化することが可能になり
、同じ素片に基づく素片再生の繰り返しの準備が完了す
る。そして、ステップＳ２へ戻って同じ素片に基づいて
素片再生の繰り返しが実行される。By doing so, it becomes possible to decode again the speech code data C at the beginning of the segment for which decoding has just been completed, and preparations for repeating segment reproduction based on the same segment are completed. Then, the process returns to step S2, and repetition of elemental piece reproduction is performed based on the same elemental piece.

以下、素片再生動作をより具体的に説明する。The fragment reproduction operation will be explained in more detail below.

第５図（ａ）はＲＯＭＩの内容の一例を示し、各符号デ
ータＣには第２図（ａ）と同じ番号がついているものと
する。第５図（ｂ）は第５図（ａ）に示すＲＯＭ１の内
容に従ってＤ／Ａ変換器６から出力される合成音声波形
を示す。FIG. 5(a) shows an example of the contents of the ROMI, and it is assumed that each code data C has the same number as in FIG. 2(a). FIG. 5(b) shows a synthesized speech waveform output from the D/A converter 6 according to the contents of the ROM 1 shown in FIG. 5(a).

素片再生動作が開始すると、第３図に示すステップＳ１
で、アドレス・レジスタ２の内容ａが先頭素片（１）の
最初の符号データＣ（１，１）が格納されているＲＯＭ
Ｉのアドレス″０″になる。さらに、繰り返し数カウン
タ８の内容ｒも“０”になる。そして、ステップ８２〜
ステツプＳ５の処理を繰り返が実行されて行く。When the fragment reproduction operation starts, step S1 shown in FIG.
Then, the content a of address register 2 is a ROM in which the first code data C (1, 1) of the first segment (1) is stored.
The address of I becomes "0". Further, the content r of the repetition number counter 8 also becomes "0". Then, step 82~
The process of step S5 is repeated.

ステップ８２〜ステツプＳ５を繰り返して素片（２）の
合成音声波形を生成している際に、ステ・ツブＳ３にお
いて素片（２）の終端を示す符号データＥを検出すると
ステップＳ６に進み、ステ・ンプＳ６において再生の繰
り返し数を示す符号データＲ２（＝２）を得る。そうす
ると、Ｒ３の値は“θ″でないので、ステップＳ９にお
いて繰り返し数カウンタ８にＲ２の値“２”がセットさ
れた後ステップＳ１４に進む。そして、ステップＳ１４
において、先に素片（２）の最初の音声符号データＣ（
２，１）のアドレス″Ｘ”になっているアドレス・スタ
ック３の内容が再度アドレス・カウンタ２にセットされ
る。While repeating steps 82 to S5 to generate the synthesized speech waveform of segment (2), if code data E indicating the end of segment (2) is detected in step S3, the process advances to step S6. At step S6, code data R2 (=2) indicating the number of reproduction repetitions is obtained. Then, since the value of R3 is not "θ", the value of R2 is set to "2" in the repetition number counter 8 in step S9, and then the process proceeds to step S14. Then, step S14
First, the first speech code data C(
The contents of the address stack 3, which is the address "X" of 2, 1), are set in the address counter 2 again.

さらに、素片（２）の先頭の音声符号データＣ（２゜ｌ
）を復号化する際に使われる復号化部５のパラメータ類
ｐ１がパラメータ・スタック７から転送される。そして
、ステップＳ２へ戻って、素片（２）の先頭の音声符号
データＣ（２，１）が読み出されて、再度素片（２）に
基づく素片再生動作が実行されて行く。Furthermore, the speech code data C (2゜l
) are transferred from the parameter stack 7. Then, the process returns to step S2, and the audio code data C(2,1) at the beginning of the segment (2) is read out, and the segment reproduction operation based on the segment (2) is performed again.

しながらＲＯＭＩから先頭素片（１）の音声符号データ
Ｃを読み込んで合成音声波形を生成していく。At the same time, the speech code data C of the first segment (1) is read from the ROMI and a synthesized speech waveform is generated.

こうして、先頭素片（１）の最後の音声符号データＣ（
１，ｎ）を復号化すると、ステップＳ２において素片の
終端を示す符号データＥが読み出されて、ステップＳ３
を介してステップＳ６へ進むのである。In this way, the last speech code data C(
1, n), the code data E indicating the end of the segment is read out in step S2, and the code data E indicating the end of the segment is read out in step S3.
The process then proceeds to step S6.

ステップＳ６において再生の繰り返し数を示す符号デー
タＲ，（−“０”）を読み出すと、繰り返し数カウンタ
８の内容ｒが“０″となるから、ステップＳ７ステツプ
Ｓ８．ステップＳ９．ステップＳＩＯを介してステップ
８１１に進む。そうすると、アドレス・カウンタ２の内
容ａは、次の素片（２）の最初の音声符号データＣ（２
，１）を格納しているＲＯＭ１のアドレス”ｘ”になっ
ており、この値“ｘ”がステップＳｌｌにおいてアドレ
ス・スタック３にセットされる。さらに、復号化部５が
素片（２）の先頭の音声符号データＣを復号する際に用
いるパラメータ類ｐｌが、復号化部５からパラメータ・
スタック７に転送されてセントされる。そして、ステッ
プＳ２にへ戻って次の素片（２）に基づく素片再生動作
ステップ８２〜ステツプＳ５を繰り返して、再度素片（
２）の合成音声波形を生成している際に、ステップＳ３
において素片（２）の終端を示す符号データＥを検出す
るとステップＳ６に進み、ステップＳ７を介してステッ
プ８に進む。繰り返し数カウンタ８の内容ｒは“２”な
のでステップＳ１２に進み、ステップＳ１２において繰
り返し数カウンタ８の内容は“ビになる。したがって、
繰り返し数カウンタ８の内容は“０”ではないので、上
述と同様に、ステップＳ１４において、素片（２）の先
頭の音声符号データＣ（２，１）を復号化する際のＲＯ
ＭＩのアドレス”ｘ″がアドレス・カウンタ２に３度セ
ットされ、復号化部５のパラメータ類ｐｉが復号化部５
に３度セットされる。その後、ステップＳ２に進んで素
片（２）の先頭の音声符号データＣ（２，１）が読み出
され、３炭素片（２）に基づく素片再生動作が実行され
て行く。When the code data R, (-"0") indicating the number of reproduction repetitions is read out in step S6, the content r of the repetition number counter 8 becomes "0", so that the process proceeds to step S7 and step S8. Step S9. The process advances to step 811 via step SIO. Then, the content a of address counter 2 is the first speech code data C (2) of the next segment (2).
, 1), and this value "x" is set in the address stack 3 in step Sll. Further, the parameters pl used when the decoding unit 5 decodes the first speech code data C of the segment (2) are sent from the decoding unit 5 as parameters pl.
It is transferred to stack 7 and is cented. Then, the process returns to step S2 and repeats the fragment reproducing operation step 82 to step S5 based on the next fragment (2).
2) When generating the synthesized speech waveform, step S3
When code data E indicating the end of elemental piece (2) is detected at step S6, the process proceeds to step S7 and then to step 8. Since the content r of the repetition number counter 8 is "2", the process advances to step S12, and in step S12, the content of the repetition number counter 8 becomes "bi". Therefore,
Since the content of the repetition number counter 8 is not "0", similarly to the above, in step S14, the RO when decoding the speech code data C(2,1) at the beginning of the segment (2) is
The address “x” of MI is set in the address counter 2 three times, and the parameters pi of the decoding unit 5 are set in the address counter 2 three times.
is set three times. Thereafter, the process proceeds to step S2, where the audio code data C(2,1) at the beginning of the segment (2) is read out, and the segment reproduction operation based on the 3-carbon segment (2) is performed.

ステップ８２〜ステツプＳ５を繰り返して、３炭素片（
２）の合成音声波形を生成している際に、ステップＳ３
において素片（２）の終端を示す符号デ−タＥを検出す
るとステップＳ６に進み、さらに、ステップＳ７ステッ
プＳ８を介してステップＳＩ２に進み繰り返し数カウン
タ８の内容ｒは“０”となり、素片（２）に基づく素片
再生動作の繰り返しが終了するのである。そして、ステ
ップ９１３からステップＳｌｌに進む。Steps 82 to S5 are repeated until the 3 carbon pieces (
2) When generating the synthesized speech waveform, step S3
When code data E indicating the end of elemental piece (2) is detected at step S6, the process proceeds to step SI2 via step S7 and step S8, and the content r of the repetition number counter 8 becomes "0", and the element The repetition of the elemental piece reproduction operation based on piece (2) ends. Then, the process advances from step 913 to step Sll.

そうすると、アドレス・レジスタ２の内容ａは、次の素
片（３）の先頭の音声符号データＣ（３，１）を格納し
ているＲＯＭＩのアドレス“ｙ″になっており、この値
″ｙ”がステップＳｌｌにおいてアドレス・スタック３
にセットされる。さらに、復号化部５が素片（３）の先
頭の音声符号データＣ（３，１）を復号する際に用いる
パラメータ類ｐ１がパラメータ・スタック７にセットさ
れる。そして、ステップＳ２にへ戻って次の素片（３）
に基づく素片再生動作が実行されて行く。Then, the content a of the address register 2 becomes the address "y" of the ROMI that stores the first speech code data C (3, 1) of the next segment (3), and this value "y" ” is added to address stack 3 in step Sll.
is set to Further, parameters p1 used when the decoding unit 5 decodes the first speech code data C(3,1) of the segment (3) are set in the parameter stack 7. Then, return to step S2 and proceed to the next elemental piece (3)
The elemental piece reproduction operation based on the above is executed.

ステップ８２〜ステツプＳ５を繰り返し、前の素片（２
）と同様にして素片（３）の合成音声波形を生成して行
く。その際に、ＲＯＭＩに格納されている素片（３）に
おける再生の繰り返し数を示す符号（Ｌ、ｍ）の復号化
が終了したと判断して素片再生動作を終了する。Steps 82 to S5 are repeated, and the previous fragment (2
), the synthesized speech waveform of segment (3) is generated. At this time, it is determined that the decoding of the code (L, m) indicating the number of reproduction repetitions in the segment (3) stored in the ROMI has been completed, and the segment reproduction operation is ended.

このようにして、第５図（ａ）のＲＯＭＩの内容に基づ
いて素片再生動作が実行された結果、第５図（ｂ）に示
すように、素片（１）に基づく合成音声波形に続いて素
片（２）に基づく合成音声波形が３回続き、さらに素片
（３）に基づ（合成音声波形が２回続く合成音声波形が
出力されるのである。In this way, as a result of executing the segment playback operation based on the contents of the ROMI shown in FIG. 5(a), as shown in FIG. 5(b), a synthesized speech waveform based on segment (1) is generated. Subsequently, a synthesized speech waveform based on segment (2) continues three times, and a synthesized speech waveform based on segment (3) (synthesized speech waveform continues twice) is then output.

上述のように、この発明の音声合成方式においては、音
声符号データＣをＲＯＭ１に格納する際に、各素片の終
端の音声符号データＣの直後に、素片の終端を示す符号
データＥおよび再生の繰り返し数を示す符号データＲを
同一ビットで付加する。また、最終素片の終端の音声符
号データＣの直後に、素片の終端を示す符号データＥお
よび再生の繰り返し数を示す符号データＲに変わる最終
素片を示す符号データＲＥを同一ビットで付加する。そ
して、素片再生時においては、音声符号データＣを順次
読み出して復号化処理を行い素片の終端を示す符号デー
タＥを読み出した場合には、データＲ３は“１”である
から、素片（３）に基づく素片再生動作が１回繰り返さ
れることになる。そして、素片（３）に基づく合成音声
波形が２回出力されると、アドレス・レジスタ２の内容
ａは、次の最終素片（Ｌ）の先頭の音声符号データＣ（
Ｌ、１）を格納しているＲＯＭＩのアドレス“Ｚ”にな
っており、この値“２”がステップＳｌｌにおいてアド
レス・スタック３にセットされる。さらに、復号化部５
が最終素片（Ｌ）の先頭の音声符号データＣ（Ｌ、１）
を復号する際に用いるパラメータ類ｐｉがパラメータ・
スタック７にセットされる。そして、ステップＳ２にへ
戻って次の最終素片（Ｌ）に基づく素片再生動作が実行
されて行く。As described above, in the speech synthesis method of the present invention, when the speech code data C is stored in the ROM 1, immediately after the speech code data C at the end of each segment, the code data E and the code data indicating the end of the segment are stored. Code data R indicating the number of reproduction repetitions is added using the same bits. Immediately after the audio code data C at the end of the final segment, code data RE indicating the final segment, which changes to code data E indicating the end of the segment and code data R indicating the number of repetitions of reproduction, is added in the same bit. do. Then, when playing a segment, when the audio code data C is sequentially read out and decoded, and the code data E indicating the end of the segment is read out, data R3 is "1", so the segment The segment reproduction operation based on (3) is repeated once. Then, when the synthesized speech waveform based on the segment (3) is output twice, the content a of the address register 2 becomes the speech code data C(
This is the address "Z" of the ROMI storing the data L, 1), and this value "2" is set in the address stack 3 in step Sll. Furthermore, the decoding unit 5
is the first speech code data C(L, 1) of the final segment (L)
The parameters pi used when decoding are the parameters
Set on stack 7. Then, the process returns to step S2, and an elemental piece reproduction operation based on the next final elemental piece (L) is executed.

こうして、ステップ８２〜ステツプＳ５を繰り返して最
終素片（Ｌ）の合成音声波形を生成している際に、ステ
ップＳ３において最終素片（Ｌ）の終端を示す符号デー
タＥを検出するとステップＳ６に進み、ステップＳ６に
おいて最終素片を示す符号データＲＥを読み出す。そう
すると、ステップＳ７において最終素片（Ｌ）の最後の
音声符号データＣ次に読み出す繰り返し数を示す符号デ
ータＲの内容に応じた回数だけ同じ素片に基づく素片再
生を繰り返す。一方、素片の終端を示す符号データＥに
続く符号データが最終素片を示す符号データＲＥの場合
には、素片再生動作を終了するようにしている。そのた
めに、ｌ素片に含まれる音声符号データ数を示す情報を
用いなくても素片単位で素片再生を実行することができ
、ｌ素片に含まれる音声符号データ数を示す符号データ
を記憶部に記憶する必要がない。したがって、記憶部に
おける補助情報のデータ量を少なくすることができる。In this way, when the synthesized speech waveform of the final segment (L) is generated by repeating steps 82 to S5, when code data E indicating the end of the final segment (L) is detected in step S3, the process proceeds to step S6. Then, in step S6, code data RE indicating the final elemental piece is read out. Then, in step S7, the segment reproduction based on the same segment is repeated a number of times according to the content of the code data R indicating the number of repetitions to be read next to the last audio code data C of the final segment (L). On the other hand, if the code data following the code data E indicating the end of a segment is the code data RE indicating the final segment, the segment reproduction operation is ended. Therefore, it is possible to perform segment-by-fragment playback without using information indicating the number of speech code data included in l segment, and code data indicating the number of speech code data included in l segment can be reproduced. There is no need to store it in the storage unit. Therefore, the amount of auxiliary information in the storage section can be reduced.

上記実施例の第３図のフローチャートにおいて、ステッ
プＳ１４でアドレス・レジスタ２の内容ａをアドレス・
スタック３に格納されている素片の最初の音声符号デー
タＣのアドレスに更新して、次にステップＳ２において
このアドレスから音声符号データＣを読み出して素片再
生を繰り返す際に次のような問題がある。すなわち、Ｒ
ＯＭ１のアクセス時間（ＲＯＭＩのアドレスが確定して
からＲＯＭＩのデータを出力するまでの時間）が長い場
合には、ステップＳ２において音声符号データＣが読み
出されるまでしばらく待つ操作が必要となる。そこで、
次の実施例においては、このような操作を必要としない
ようにしている。In the flowchart of FIG. 3 of the above embodiment, the content a of the address register 2 is set to the address in step S14.
When updating the address of the first voice code data C of the segment stored in the stack 3, and then reading the voice code data C from this address in step S2 and repeating the segment playback, the following problem occurs. There is. That is, R
If the access time of OM1 (the time from when the ROMI address is determined to when the ROMI data is output) is long, it is necessary to wait for a while until the voice code data C is read out in step S2. Therefore,
In the following embodiment, such an operation is not required.

この実施例においては、ＲＯＭＩのフォーマットを第２
図（ｂ）に示すように、最終素片（Ｌ）以外の素片の終
端を示す符号データＥと再生の繰り返し数を示す符号デ
ータＲとを、最後の音声符号データＣ（＋、ｎ）、・・
の直前に付加する。そして、第３図のフローチャートの
ステップＳ８以降を第４図に示すようにするのである。In this embodiment, the ROMI format is
As shown in Figure (b), the code data E indicating the end of the segment other than the final segment (L) and the code data R indicating the number of repetitions of reproduction are converted into the last audio code data C(+,n). ,...
Add immediately before. Then, the steps after step S8 in the flowchart of FIG. 3 are performed as shown in FIG.

すなわち、ステップＳ２９で、上記ステップＳ６におい
て読み込まれた繰り返し数を示す符号データＲの内容が
繰り返し数カウンタ８にセットされる。That is, in step S29, the content of the code data R indicating the number of repetitions read in step S6 is set in the repetition number counter 8.

ステップＳ３０で、繰り返し数カウンタ８の内容ｒが０
”であるか否かが判別される。その結果″０”であれば
ステップＳ３１に進んで次の素片に基づく素片再生動作
の準備に入り、そうでなければステップＳ３７に進んで
同じ素片に基づく素片再生の繰り換器６によってＤ／Ａ
変換されてアナログの合成音声波形が出力される。そし
て、ステップＳ２に戻り次のアドレスの音声符号データ
の処理に入る。In step S30, the content r of the repetition number counter 8 is 0.
If the result is "0", the process advances to step S31 to prepare for the next elemental piece reproduction operation; otherwise, the process advances to step S37 to reproduce the same elemental element. D/A by the repeater 6 of fragment-based fragment reproduction
After conversion, an analog synthesized speech waveform is output. Then, the process returns to step S2 and begins processing the voice code data at the next address.

ステップＳ３５で、繰り返し数カウンタ８の内容ｒが“
１″だけ減算される。In step S35, the content r of the repetition number counter 8 is “
1″ is subtracted.

ステップ８３６で、繰り返し数カウンタ８の内容ｒが“
０″か否かが判別される。その結果“０”であればステ
ップＳ３１に進んで次の素片における素片再生の準備に
入り、そうでなければステップＳ３７に進んで同じ素片
に基づく素片再生の繰り返しの準備に入る。At step 836, the content r of the repetition number counter 8 is “
0". If the result is "0", the process advances to step S31 and preparations are made for the reproduction of the next elemental piece; if not, the process advances to step S37 and the reproduction is based on the same elemental piece. Begins preparation for repeating fragment playback.

ステップＳ３７で、現在の素片における最後の音声符号
データＣが読み出され、前回復号化が終了した素片にお
ける先頭の音声符号データＣのＲＯＭ１上のアドレスを
格納しているアドレス・スタック３の内容ａ゛がアドレ
ス・カウンタ２にセットされる。In step S37, the last speech code data C in the current segment is read out, and the address stack 3 stores the address on ROM 1 of the first speech code data C in the segment for which the previous decoding has been completed. The contents a' are set in address counter 2.

ステップ８２Ｂで、上記ステップＳ３７においてＲＯＭ
１から読み込まれた最後の音声符号データＣが復号化部
５によって復号化されてディジタルの返しの準備に入る
。In step 82B, the ROM is
The last audio coded data C read from 1 is decoded by the decoding section 5, and preparations are made for digital return.

ステップＳ３１で、現在の素片における最後の音声符号
データＣが読み出されて、アドレス・カウンタ２の内容
ａに“ビが加算される。そうすると、アドレス・カウン
タ２の内容ａは、次の素片における先頭の音声符号デー
タＣのアドレスに更新される。In step S31, the last speech code data C in the current element is read out, and "bi" is added to the content a of the address counter 2. Then, the content a of the address counter 2 becomes the next element. It is updated to the address of the first voice code data C in the piece.

ステップ８３２で、上記ステップＳ３１においてＲＯＭ
１から読み出された最後の音声符号データＣが復号化部
５によって復号化されてディジタルの合成音声波形デー
タが得られる。In step 832, the ROM in step S31 above is
The last voice code data C read out from 1 is decoded by the decoding section 5 to obtain digital synthesized voice waveform data.

ステップＳ３３で、次の素片における先頭の音声符号デ
ータＣのＲＯＭＩ上のアドレスを格納しているアドレス
・カウンタの内容ａをアドレス・スタック３にセットす
る。また、復号化部５に保持されている次の素片の先頭
の音声符号データを復号化する際のパラメータ類の値ｐ
ｉがパラメータ・スタック７にセットされる。In step S33, the contents a of the address counter storing the address on the ROMI of the first audio code data C in the next segment are set in the address stack 3. Also, the value p of parameters when decoding the first speech code data of the next segment held in the decoding unit 5
i is set in parameter stack 7.

ステップ８３４で、上記ステップＳ３２において得られ
たデジタルの合成音声波形データがＤ／Ａ変合成音声波
形データが得られる。In step 834, the digital synthesized speech waveform data obtained in step S32 is converted into D/A synthesized speech waveform data.

ステップＳ３９で、復号化部５のパラメータ類ｐｉがパ
ラメータ・スタック７の内容ｐｉ゛に更新される。In step S39, the parameters pi of the decoding unit 5 are updated to the contents pi of the parameter stack 7.

こうすることにより、前回復号化が終了した素片の先頭
の音声符号データＣを再度復号化することが可能になり
、同じ素片に基づく素片再生の繰り返しの準備が完了す
る。By doing this, it becomes possible to decode again the speech code data C at the head of the segment for which the previous decoding has been completed, and preparations for repeating segment reproduction based on the same segment are completed.

ステップＳ４０で、上記ステップ８３８において得られ
たデジタルの合成音声波形データがＤ／Ａ変換器６によ
ってＤ／Ａ変換されてアナログの合成音声波形が出力さ
れる。そして、ステップＳ２に戻り次のアドレスの音声
符号データＣの処理に入る。In step S40, the digital synthesized speech waveform data obtained in step 838 is D/A converted by the D/A converter 6, and an analog synthesized speech waveform is output. Then, the process returns to step S2 and starts processing the voice code data C at the next address.

すなわち、この実施例においては、素片再生を繰り返す
際には、ステップＳ３７においてアドレス・カウンタ２
にアドレス・スタック３の内容ａ°をセットした後、ス
テップ９３８において復号化処理を行い、ステップＳ３
９において復号化部５のパラメータ類ｐｉにパラメータ
・スタック７の内容ｐｉ°をセットし、ステップＳ４０
においてＤ／Ａ変換処理を行った後に、ステップＳ２に
おいて上記ステップＳ３７でアドレス・カウンタ２にセ
ットしたｒ’ｔＯＭ＋のアドレスから音声符号データＣ
を読み出すようにしている。したがって、アドレス・カ
ウンタ２にアドレス・スタック３の内容ａ”をセットし
てから、そのアドレス・カウンタ２の内容ａで示される
アドレスから音声符号データＣが読み出されるまで十分
な時間がある。そのために、ＲＯＭ１のアクセス時間が
長くてもステップＳ２における符号データ読み出しを待
つ操作が不用となるのである。That is, in this embodiment, when repeating elemental piece reproduction, the address counter 2 is set in step S37.
After setting the content a° of address stack 3 to , decoding processing is performed in step 938, and step S3
9, the contents pi° of the parameter stack 7 are set to the parameters pi of the decoding unit 5, and step S40
After performing D/A conversion processing in step S2, voice code data C is obtained from the address r'tOM+ set in the address counter 2 in step S37.
I am trying to read out the . Therefore, there is sufficient time after the address counter 2 is set to the contents a'' of the address stack 3 until the audio code data C is read from the address indicated by the contents a of the address counter 2. , even if the access time to ROM1 is long, the operation of waiting for readout of code data in step S2 is unnecessary.

〈発明の効果〉以上より明らかなように、この発明の音声合成方式は、
各素片の終端の音声符号データの直前または直後に、上
記素片の終端を示す符号データと素片再生動作の繰り返
し数を示す符号データとを連続して付加して記憶部に記
憶し、上記記憶部から読み出した符号データが音声符号
データである場合にはその音声符号データを復号化する
一方、上記素片の終端を示す符号データである場合には
同じ素片に基づいて再生動作を繰り返すようにし第１図
はこの発明に係る音声合成装置のブロック図、第２図（
ａ）および第２図（ｂ）は第１図にお（プるＲＯＭに格
納される符号データの）Ａ−マットの例を示す図、第３
図は第２図（ａ）のフォーマットで符号データが格納さ
れているＲＯＭを用いた場合の素片再生動作のフローチ
ャート、第４図は第２図（ｂ）のフォーマットで符号デ
ータが格納されているＲＯＭを用いた場合にお（づる第
３図のフーヂャートのステップＳ８以降のフローチャー
１・、第５図（ａ）はＲＯＭの内容の一例を示す図、第
５図（ｂ）は第５図（ａ）に示すＲＯＭの内容に従って
素片再生動作を行って得られた合成音声波形を示す図で
ある。<Effects of the Invention> As is clear from the above, the speech synthesis method of the present invention has the following effects:
Immediately or immediately after the audio code data at the end of each segment, code data indicating the end of the segment and code data indicating the number of repetitions of the segment playback operation are successively added and stored in a storage unit; If the coded data read from the storage unit is audio coded data, the audio coded data is decoded, while if it is coded data indicating the end of the segment, playback operation is performed based on the same segment. As will be repeated, Fig. 1 is a block diagram of a speech synthesis device according to the present invention, and Fig. 2 (
a) and FIG. 2(b) are diagrams showing an example of the A-mat (of code data stored in the ROM) shown in FIG. 1, and FIG.
The figure is a flowchart of the segment playback operation when using a ROM in which encoded data is stored in the format of Figure 2(a), and Figure 4 is a flowchart of the segment reproduction operation when encoded data is stored in the format of Figure 2(b). When using a ROM with FIG. 6 is a diagram showing a synthesized speech waveform obtained by performing a segment playback operation according to the contents of the ROM shown in FIG.

１−ＲＯＭ、　　　　　２・・・アドレス・カウンタ、
３−　アドレス・スタック、４・繰り返し再生処理部、５・復号化部、６　・Ｄ／Ａ
変換器、　　７・パラメータ・スタック、８・・繰り返
し数カウンタ。1-ROM, 2...address counter,
3-Address stack, 4.Repetitive playback processing section, 5.Decoding section, 6.D/A
Converter, 7. Parameter stack, 8. Repeat number counter.

特許出願人　　シャープ株式会社代理人　　弁理士　　青　山　　葆　ほか１名たので、
１素片に含まれる音声符号データの数を示す符号データ
を用いることなく素片単位で素片再生動作を実行するこ
とができる。したがって、上記記憶部における補助情報
のデータ量を少なくすることができる。Patent applicant Sharp Co., Ltd. agent Patent attorney Aoyama Aoyama and one other person,
It is possible to perform an elemental piece reproduction operation on an elemental piece basis without using code data indicating the number of audio code data included in one elemental piece. Therefore, the amount of auxiliary information in the storage section can be reduced.

また、この発明の音声合成方式は、上記素片が最終素片
である場合には、その最終素片の終端の音声符号データ
の直前または直後に、上記素片の終端を示す符号データ
と、上記繰り返し数を示す符号データに変わる最終素片
を示す符号データとを連続して付加して記憶部に記憶し
、上記記憶部から読み出した符号データが上記素片の終
端′を示ず符号データであり、次に読み出した符号デー
タが上記最終素片を示す符号データである場合には素片
再生動作を終了するようにしたので、記憶部に最終素片
を示す符号データを記憶するための領域を別に確保する
必要がないので、さらに、上記記憶部におけるデータ量
を少なくすることができる。Furthermore, in the speech synthesis method of the present invention, when the segment is a final segment, code data indicating the end of the segment is added immediately before or after the speech code data at the end of the final segment; Code data indicating the final segment to be changed to the code data indicating the number of repetitions are successively added and stored in the storage unit, and the code data read from the storage unit does not indicate the end of the segment. If the next read code data is the code data indicating the final segment, the segment reproduction operation is terminated. Therefore, the code data indicating the final segment is stored in the storage section. Since there is no need to secure a separate area, the amount of data in the storage section can be further reduced.

【図面の簡単な説明】[Brief explanation of the drawing]

Claims

[Claims]

(1) a storage unit that stores voice code data obtained by encoding a voice waveform for each waveform element such as a pitch waveform element;
In a speech synthesis method having a synthesis unit that generates a synthesized speech waveform by repeatedly reading and decoding the speech code data stored for each segment in the storage unit, immediately before or after the end speech code data, code data indicating the end of the segment and code data indicating the number of repetitions of the segment playback operation are successively added and stored in the storage unit; Each piece of code data stored in the segment is read out one by one, and the segment end discriminating means determines whether the read code data is code data indicating the end of the segment or audio code data, and When the segment end determination means determines that the read code data is voice code data, the synthesizer decodes the voice code data into voice waveform data, while decoding the read code data. If it is determined that the data is code data indicating the end of the segment, the synthesizing unit returns to the speech code data at the beginning of the same segment the number of times indicated by the content of the code data indicating the number of repetitions. A speech synthesis method characterized by repeating one-sided playback operations.

(2) In the speech synthesis method according to claim 1, if the segment is the final segment stored in the storage unit,
Immediately or immediately after the audio code data at the end of the final segment, code data indicating the end of the segment and code data indicating the final segment that changes to code data indicating the number of repetitions are successively added. When the segment end determination means determines that the read code data is code data indicating the end of the segment, the next read code data is stored in the segment end discriminating unit. A final segment discriminating means determines whether the coded data indicates a segment or the number of repetitions, and the final segment discriminating means determines that the read coded data is coded data indicating a final segment. If it is determined that
The synthesizing section ends the segment reproduction operation, and if the read code data is determined to be the code data indicating the repetition number, the synthesis section causes the content of the code data indicating the repetition number. A speech synthesis method characterized in that the above-mentioned segment reproduction operation is repeated by returning to the first speech code data of the same segment a number of times indicated by .