JPH06308992A

JPH06308992A - Voice ebook

Info

Publication number: JPH06308992A
Application number: JP5116599A
Authority: JP
Inventors: Hiroshi Ishibashi; 広石橋
Original assignee: Advance Co Ltd
Current assignee: Advance Co Ltd
Priority date: 1993-04-21
Filing date: 1993-04-21
Publication date: 1994-11-04
Also published as: KR950702323A; WO1994024667A1

Abstract

(57)【要約】【目的】書籍の朗読音声出力を長時間行う音声式電子
ブック【構成】実質的に無音声部を削除した様式で、音声信
号をディジタル記憶する記憶手段、所望の発話速度でデ
ィジタル音声信号を音声再生する音声再生手段より成
る。 (57) [Abstract] [Purpose] A voice-type electronic book that outputs a reading voice of a book for a long time [Configuration] Storage means for digitally storing a voice signal in a format in which substantially no voice portion is deleted, desired speech rate And a voice reproducing means for reproducing a digital voice signal by voice.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音声式電子ブックに関
する。FIELD OF THE INVENTION The present invention relates to voice electronic books.

【０００２】[0002]

【従来の技術】ＣＤ−ＲＯＭ等のディジタル高容量記憶
媒体を用いて、音声再生用の装置があるが、その再生時
間は、せいぜい70分程度である。この程度の再生時間
は、音楽を録音するには充分であるが、文庫本、学習書
等の書籍を朗読した朗読音声全部を録音するには不足し
ている。特に聴者に理解と認識を与える為の学習器等の
様な繰り返し且つ、明確な音声を低速でしかも長時間再
生出力する場合、上述のデイジタル記憶媒体の使用は、
学習内容を調整乃至省略しない限り困難なことであり、
その他の記憶媒体であっては、なおさらに困難である。2. Description of the Related Art There is a device for audio reproduction using a digital high capacity storage medium such as a CD-ROM, but the reproduction time is about 70 minutes at most. Although such a reproduction time is sufficient for recording music, it is insufficient for recording all the read voices read aloud books such as paperback books and study books. In particular, when a repetitive and clear sound such as a learning device for giving an understanding and recognition to a listener is reproduced and output at a low speed for a long time, the use of the above digital storage medium is
It is difficult unless you adjust or omit the learning content,
It is even more difficult with other storage media.

【０００３】[0003]

【課題を解決するための手段】上記に鑑み本発明は、鋭
意研究の結果、実質的に無音声部を削除した様式でディ
ジタル音声データを記憶媒体に記憶させ、再生時、この
無音声時間を付加することにより、記憶媒体には、充分
な音声データが格納でき、しかも再生時この無音声時間
が付加されていることから、自然の朗読に近い音声出力
が長時間得られる音声式電子ブックを実現した。本発明
で無音声部とは、例えば音節間、文節間等々の音声的に
無音乃至無音に近い部分を示すものである。又、無音声
部の実質的削除の様式とは、例えば無音声部の全部又は
１部の削除あるいは、無音声部を他の符号に変換するこ
と等々を示すものである。SUMMARY OF THE INVENTION In view of the above, the present invention has as a result of earnest research that digital voice data is stored in a storage medium in a manner in which a non-voice portion is substantially deleted, and this voiceless time is reduced during reproduction. By adding this, a sufficient amount of audio data can be stored in the storage medium, and since this silent time is added during playback, it is possible to obtain an audio e-book that can obtain audio output that is close to natural reading for a long time. It was realized. In the present invention, the non-speech portion indicates a portion of the sound, such as inter-syllables and inter-syllables, which is silent or close to silence. In addition, the method of substantially deleting the silent part indicates, for example, deleting all or a part of the silent part or converting the silent part into another code.

【０００４】[0004]

【実施例】以下、本発明の実施例を図面を参照して詳細
に説明する。図１は、記憶手段の一例であり、以下、記
録部とした。(11)は、記録媒体であり、主に光ディス
ク、光磁気ディスク、磁気ディスク等のディジタル記憶
媒体よりなる。(111)は、書き込み手段であり、書き込
み用ヘッド、ヘッド駆動用ドライバ等から構成される。
(01)は、アナログ音声入力手段であり、マイクロフォ
ン、フィルタ、増幅器等から構成される。(02)は、Ａ／
Ｄ変換手段であり、アナログ音声信号をデジタル音声信
号に変換する。更にＡ／Ｄ変換手段(02)は、ADPCM等の
デジタル信号圧縮手段を組み込む場合もある。(03)は、
無音声検出手段であり、無音声部を自動的、あるいは目
視的によって検出する部分である。(04)は、変換手段で
あり、無音声検出手段(03)及びＡ／Ｄ変換手段(02)の出
力信号を入力し、無音声検出手段(03)からの入力信号に
基づいてデジタル音声信号の無音声部にたいし、削除あ
るいは、他の符号に変換処理を行う手段である。無音声
検出手段(03)並びに変換手段(04)は、ＣＰＵ，ＤＳＰな
どを用いてアルゴリズム的処理を施すものであってもよ
い。この場合、両手段(03)(04)の区別は、無くなるもの
である。図１は、アナログ音声を最初にデジタル音声に
変換した後、無音声部の実質的削除を行う構成を示した
が、これに限られるものではなく、例えばデジタル変換
行程中、あるいはアナログ音声時に無音声部の実質的削
除がおこなわれるものであってもよい。Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 1 shows an example of a storage unit, which will be referred to as a recording unit hereinafter. Reference numeral (11) is a recording medium, which mainly comprises a digital storage medium such as an optical disc, a magneto-optical disc, a magnetic disc, or the like. Reference numeral (111) is a writing unit, which includes a writing head, a head driving driver, and the like.
Reference numeral (01) is an analog voice input means, which is composed of a microphone, a filter, an amplifier and the like. (02) is A /
The D conversion means converts an analog audio signal into a digital audio signal. Further, the A / D conversion means (02) may incorporate digital signal compression means such as ADPCM. (03) is
It is a voiceless detection means, and is a portion that automatically or visually detects a voiceless portion. Reference numeral (04) is a conversion means, which inputs the output signals of the non-voice detection means (03) and the A / D conversion means (02), and which is based on the input signal from the non-voice detection means (03). It is a means for deleting the unvoiced part of the above or converting it to another code. The non-voice detecting means (03) and the converting means (04) may perform algorithmic processing by using a CPU, DSP or the like. In this case, the distinction between the means (03) and (04) is lost. FIG. 1 shows a configuration in which analog voice is first converted to digital voice, and then the non-voice portion is substantially deleted. However, the present invention is not limited to this. For example, during the digital conversion process or during analog voice, The audio part may be substantially deleted.

【０００５】図２は、音声再生手段の一例であり、以下
再生部とした。(11)は、記録媒体であり、図１で示した
ものである。(112)は読み取り手段であり、読み取り用
ピックアップ、記録手段(11)を回転させる手段、読み取
り用ピックアップを摺動させる手段等から構成される。
(12)は、検出手段であり、読み取り手段(112)が出力す
るディジタル音声から、実質的に削除された無音声部を
検出し、検出した無音声部を復元又は、新たに形成又
は、これらと同等の意味を持つ信号に変換し、出力する
ものである。(13)は調整手段であり、読み取り手段(11
2)から出力されたディジタル音声信号と、検出手段(12)
が出力した無音声信号とを組み合わせた後、この組み合
わせ信号を出力する。検出手段(12)、調整手段(13)は、
１つのＣＰＵ、ＤＳＰワンチップマイコン等によってア
ルゴリズム的に処理される場合がある。この場合、両手
段(12)(13)の区別する必要はなく、すくなくとも削除さ
れた無音声部を任意の無音声時間、又は原無音声時間を
有する無音声ディジタル信号に変換し、ディジタル音声
と組み合わせて出力するプログラムルーチン等のアルゴ
リズムを有すればよいものである。(14)はＤ／Ａ変換手
段であり、調整手段(13)から出力されるディジタル音声
をアナログ音声に変換するものである。この時、図１で
しめすＡ／Ｄ変換手段(02)が圧縮手段を有している場
合、Ｄ／Ａ変換手段(14)は、復元手段を有するものであ
る。又、Ｄ／Ａ変換手段(14)が、検出手段(12)、調整手
段(13)を兼ねる場合もある。(15)は、増幅手段であり、
アナログ音声を電気的に増幅する手段である。尚、増幅
手段(15)には更に周波数フィルタ特性が付加されたもの
であってもよい。(16)は、発声手段であり、スピーカ、
イヤホーンの何れか、あるいは全部等よりなる。尚、記
録部及び再生部は両部一体型または別体型何れの場合で
も良い。FIG. 2 shows an example of the audio reproducing means, which will be referred to as a reproducing section hereinafter. (11) is a recording medium, which is shown in FIG. Reference numeral (112) is a reading means, which is composed of a reading pickup, a means for rotating the recording means (11), a means for sliding the reading pickup, and the like.
Reference numeral (12) is a detection means, which detects a virtually deleted voiceless portion from the digital voice output by the reading means (112) and restores or newly forms the detected voiceless portion, or these It is converted into a signal having the same meaning as and output. (13) is an adjusting means, and a reading means (11
Digital voice signal output from 2) and detection means (12)
After combining with the non-voice signal output by, the combined signal is output. The detection means (12) and the adjustment means (13) are
It may be processed algorithmically by one CPU, DSP one-chip microcomputer, or the like. In this case, it is not necessary to distinguish between the means (12) and (13), and at least the deleted voiceless part is converted into a voiceless digital signal having an arbitrary voiceless time or an original voiceless time, and the digital voice and It suffices if it has an algorithm such as a program routine that outputs it in combination. Reference numeral (14) is a D / A conversion means for converting the digital voice output from the adjusting means (13) into an analog voice. At this time, when the A / D converting means (02) shown in FIG. 1 has a compressing means, the D / A converting means (14) has a restoring means. The D / A conversion means (14) may also serve as the detection means (12) and the adjustment means (13). (15) is an amplification means,
It is a means for electrically amplifying analog voice. The amplifying means (15) may have a frequency filter characteristic added thereto. (16) is a voicing means, a speaker,
It consists of any or all of the earphones. Incidentally, the recording unit and the reproducing unit may be of both type integrated type or separate type.

【０００６】次に図１及び図２の動作の一例を説明す
る。図１で示す記録部において、アナログ音声入力部(0
1)に入力されたアナログ音声は、ろ波、増幅されたの
ち、Ａ／Ｄ変換手段(02)において、デジタル音声信号
（図３（１））に変換される。デジタル音声信号は、無
音声検出手段(03)並びに変換手段(04)に入力される。無
音声検出手段(03)で、図３（１）で示す無音声部(31)が
検出され、変換手段(04)で図３（２）でしめす(32)のよ
うに無音声部は、実質的に削除される。無音声部が実質
的に削除されたデジタル音声信号（図３（２））は、書
き込み手段(111)を介して記録媒体(11)に書き込まれ
る。尚、ディジタル音声信号列は、非常にこまかいこと
から、省略して描いた。又、デイジタル音声列の１つ
は、１音節、１文節、１段落あるいは、無音声部から、
次の無音声部迄等が示される。Next, an example of the operation of FIGS. 1 and 2 will be described. In the recording section shown in FIG. 1, the analog voice input section (0
The analog voice input to 1) is filtered and amplified, and then converted into a digital voice signal ((1) in FIG. 3) by the A / D conversion means (02). The digital audio signal is input to the silence detection means (03) and the conversion means (04). The non-voice detection section (03) detects the non-voice section (31) shown in FIG. 3 (1), and the conversion section (04) shows the non-voice section as shown by (32) in FIG. 3 (2). Effectively deleted. The digital audio signal (FIG. 3 (2)) from which the non-voice part is substantially deleted is written in the recording medium (11) via the writing means (111). The digital audio signal sequence is omitted because it is very detailed. Also, one of the digital voice strings is from one syllable, one syllable, one paragraph, or a silent part,
Up to the next unvoiced part is shown.

【０００７】次に記録媒体(11)に記録されたデイジタル
音声信号を再生する再生部を示す図２に於て、記録手段
(11)を読み取り手段(112)にセットし、記録手段(11)か
ら、実質的に無音声部が削除されたディジタル音声信号
が読み取られ、検出手段(12)、並びに調整手段(13)に出
力される。検出手段(12)は、削除された無音声部（図３
（２））(32)を検出し、任意の時間幅又は原時間幅を有
する無音声ディジタル信号に変換し、調整手段(13)に出
力する。調整手段(13)は、記録手段(11)から入力された
実質的に無音声部が削除されたディジタル音声信号の削
除部に検出手段(12)から入力された無音声ディジタル信
号を組み合わせて、この組み合わせディジタル音声信号
（図３（１））をＤ／Ａ変換手段(14)に出力する。Ｄ／
Ａ変換手段(14)は、入力された組み合わせディジタル音
声信号をアナログ音声信号に変換出力する。増幅手段(1
5)は、このアナログ音声信号を増幅、場合によってろ波
し、発声手段(16)に出力する。発声手段(16)は、スピー
カ、イヤホンを媒体として音声を出力する。この時、無
音声デイジタル信号量の数値的加算、減算等の調整によ
り、発話速度は自在に調整され、低速発話も容易に実施
できる。この調整は、聴者が調整できるように調整用の
ツマミを装置上に装着される場合もある。Next, in FIG. 2 showing a reproducing section for reproducing the digital audio signal recorded on the recording medium (11), recording means
(11) is set in the reading means (112), the digital sound signal from which the substantially silent part is deleted is read from the recording means (11), and is detected by the detecting means (12) and the adjusting means (13). Is output. The detection means (12) uses the deleted non-voice part (see FIG. 3).
(2)) Detects (32), converts it to a voiceless digital signal having an arbitrary time width or original time width, and outputs it to the adjusting means (13). The adjusting means (13) combines the non-voice digital signal input from the detecting means (12) with the deletion portion of the digital voice signal in which the substantially non-voice portion input from the recording means (11) is deleted, This combined digital audio signal (FIG. 3 (1)) is output to the D / A conversion means (14). D /
The A conversion means (14) converts the input combined digital audio signal into an analog audio signal and outputs it. Amplification means (1
5) amplifies this analog audio signal, filters it depending on the case, and outputs it to the voicing means (16). The voicing means (16) outputs a sound using a speaker and an earphone as a medium. At this time, the utterance speed can be freely adjusted by adjusting numerically adding or subtracting the amount of voiceless digital signal, and low speed utterance can be easily performed. In this adjustment, a knob for adjustment may be mounted on the device so that the listener can adjust it.

【０００８】又、実質的に無音声部が削除されたディジ
タル音声は、図４で示す様に記録媒体に記録される場合
もある。図１の記録部において、無音声検出手段(03)、
変換手段(04)は、図４(１)で示す原ディジタル音声信号
の無音声部(41)を図４(２)で示す様に、他の符号(42)で
置換する。図４（２）で示すデイジタル音声信号は、書
き込み手段(111)を介して記録手段(11)に書き込まれ
る。図４（２）で示す他の符号(42)とは、単なる目印
の他、無音声時間幅の情報、無音声部の性質を示す情報
を具備した数ビットの符号等を示すものである。図２の
再生部に於て、記録手段(11)は図４(２)で示すディジタ
ル音声信号を記録している。読み取り手段(112)は、
この記録手段(11)に記録された実質的に無音声部が削除
されたディジタル音声信号を読み出し、検出手段(12)、
調整手段(13)に出力する。検出手段(12)は、入力された
ディジタル音声信号の削除された無音声部に代替付加さ
れている符号を検出した後、その符号を解読し、解読内
容に従った信号を調整手段(13)に出力する。図４（２）
で示す他の符号(42)の内容は上述の様にその部分の原無
音声部の時間幅等である。調整手段(13)は、検出手段(1
2)から入力された信号と、読み取り手段(112)から入力
された無音声部が削除されたディジタル音声信号より、
無音声部を付加乃至再現したディジタル音声信号（図４
（１））をＤ／Ａ変換手段(14)に出力する。Ｄ／Ａ変換
手段(14)以降の動作は、前述と同一なので説明は省略す
る。Further, the digital voice from which the non-voice portion is substantially deleted may be recorded on the recording medium as shown in FIG. In the recording unit of FIG. 1, the voiceless detection means (03),
The conversion means (04) replaces the unvoiced part (41) of the original digital audio signal shown in FIG. 4 (1) with another code (42) as shown in FIG. 4 (2). The digital audio signal shown in FIG. 4 (2) is written in the recording means (11) via the writing means (111). The other code (42) shown in FIG. 4 (2) is, in addition to a mere mark, a code of several bits provided with information of the non-voice time width, information indicating the property of the non-voice portion, and the like. In the reproducing section of FIG. 2, the recording means (11) records the digital audio signal shown in FIG. 4 (2). The reading means (112) is
The recording means (11) reads out the digital voice signal from which the substantially silent portion is deleted, and the detection means (12),
Output to the adjusting means (13). The detecting means (12) detects the code added to the deleted voiceless part of the input digital voice signal, decodes the code, and adjusts the signal according to the decoded content (13). Output to. Figure 4 (2)
The content of the other code (42) indicated by is the time width or the like of the original unvoiced portion of the portion as described above. The adjusting means (13) is provided with the detecting means (1
From the signal input from (2) and the digital audio signal from which the voiceless part input from the reading means (112) has been deleted,
A digital voice signal with or without a voiceless portion added (see FIG. 4).
(1)) is output to the D / A conversion means (14). Since the operation after the D / A conversion means (14) is the same as that described above, its explanation is omitted.

【０００９】次に無音声部を実質的に削除する他のアル
ゴリズムの一例について説明する。図１で示す記録部に
於て、無音声部に対し、図５のウィンドウを予じめ設定
しておく。Ｌｔｈは、無音声と判断する為の閾値であ
り、(＋)(−)方向に設定されている。図５で示すＡ〜
Ｄの符号は予じめ決定されており、又Ａ〜Ｄの符号間
の時間幅の初期値も予じめ設定されている。尚、時間幅
は初期値だけであって可変可能である。現時点ｔｓに於
いて時刻ｔｓ＋１からｔａまでの間で(１)式を満たす最
小のｔｎを見つける。｜Ｖ（ｔｎ）−Ｖ（ｔｓ）｜＞Ｌｔｈ（１）ｔｎが見つからなければ符号Ａをとり、再びこの符号Ａ
を現時点ｔｓとして図５で示すウィンドウ上で次のｔｎ
を見つける動作をする。その他の場合、ｔｂ＜ｔｎ≦ｔ
ａの時は符号Ｂを取り、その後、符号の付与を中止す
る。以下同じくｔｓ＋２＜ｔｎ＜ｔｂのときは符号Ｃ
を取り、ｔｎ≦ｔｓ＋２のときは符号Ｄをとり、その
後、それぞれ符号の付与を中止する。次に｜Ｖ（ｔｉ）−Ｖ（ｔｓ）｜≦Ｌｔｈとなった時、無音声の削除処理が再開される。この時、
再開を示す符号が付与される。符号Ａが繰り返し、又は
多数の頻度で選択される場合、Ａ〜Ｄ符号間の時間幅
の全体乃至一部は長くなる。Ｖ（ｔｉ）は、現時点ｔｓ
から、所定の時間前乃至時間後の時間ｔｉ時の電圧値で
ある。本実施例で使用される符号は、Ａ〜Ｄの４個で
あるから、２ビット程度で表現されるので記録手段上で
の無音声部はわずかの符号列で置き換わるものである。
尚、符号の数は、少ない方が好ましいが、特に限定され
るものではない。上述した行程に於て決定された符号Ａ
〜Ｄが書き込み手段(111)を介して記録媒体(11)に記録
される。この様にして、ディジタル音声が記録された記
録手段が図２で示す再生部で再生される際の動作を説明
する。記録手段(11)で記録されたディジタル音声が読み
取り手段(112)で読み取られ、検出手段(12)並びに調整
手段(13)に入力される。検出手段(12)は、図５で示した
符号Ａ〜Ｄ乃至無音声開始を示す信号並びに符号を検
出し、図５でしめしたウインドウに当てはめ、その符号
に応じた時間幅を有する無音声部で復元し、調整手段(1
3)に出力する。調整手段(13)は、ディジタル音声の符号
Ａ〜Ｄの部分に検出手段(12)から出力された無音声部
を挿入していく。又、検出手段(12)は、符号Ａが繰り返
し出現する場合、図５で示す符号Ａ〜Ｄの時間幅の一部
乃至全部も長くなり、復元される無音声部の時間幅も繰
り返し回数に比例する様に自動的に長くなっていく。以
上の様に、記録時、無音声部が少ない符号で自動的に置
き換え可能であることから、非常に至便、且つ合理性に
富み、再生時、少ない符号であっても正確な無音声時間
を復元でき、しかも復元処理時間が短いので、再生音声
出力に支障がない等の効果がある。尚、上述したＡ〜
Ｄの符号の付与並びに符号に基づく処理内容等々はあく
まで一例であり、限られるものではない。上述した実施
例を使用して構成させる装置の大きさは、携帯型ができ
る程度が好ましく、学習書であれば、反復する音声を出
力する機能や、しおり的な機能を付加する場合もある。
又、装置の大きさは、記録媒体の大きさにも左右される
ことから、記録媒体は、小さくてしかも高容量であるも
の、例えばＣＤ−ＲＯＭ、ミニ光磁気デイスク、３．５
インチフロッピイデイスク、デジタルオーデイオテープ
等が適当である。尚、ディジタル音声は、合成音声、自
然音声をＡ／Ｄ変換、圧縮処理した音声等、特に限定す
る必要はなく、既存の方式によって変換された音声を示
すものである。Next, an example of another algorithm for substantially eliminating the voiceless portion will be described. In the recording section shown in FIG. 1, the window shown in FIG. 5 is preset for the silent section. Lth is a threshold for determining that there is no voice, and is set in the (+) (-) direction. A shown in FIG.
The code of D has been determined in advance, and the initial value of the time width between the codes of A to D has also been set in advance. The time width is only an initial value and can be changed. At the present time ts, the minimum tn satisfying the expression (1) is found from the time ts + 1 to ta. | V (tn) -V (ts) |> Lth (1) If tn is not found, the code A is taken, and the code A is used again.
Is the current time ts and the next tn on the window shown in FIG.
To find out. In other cases, tb <tn ≦ t
When it is a, the code B is taken, and thereafter, the code addition is stopped. Similarly, when ts + 2 <tn <tb, the code C
, And when tn ≦ ts + 2, the code D is taken, and thereafter, the addition of the code is stopped. Next, when | V (ti) −V (ts) | ≦ Lth, the voiceless deletion process is restarted. This time,
A code indicating restart is added. When the code A is selected repeatedly or with a large number of frequencies, the whole or part of the time width between the A to D codes becomes long. V (ti) is currently ts
Is the voltage value at time ti before or after the predetermined time. Since there are four codes A to D used in this embodiment, they are represented by about 2 bits, so that the non-voice part on the recording means is replaced by a small code string.
The number of codes is preferably as small as possible, but is not particularly limited. Code A determined in the above process
To D are recorded on the recording medium (11) via the writing means (111). The operation when the recording means in which digital voice is recorded in this way is reproduced by the reproducing section shown in FIG. 2 will be described. The digital voice recorded by the recording means (11) is read by the reading means (112) and input to the detecting means (12) and the adjusting means (13). The detection means (12) detects the signals A to D shown in FIG. 5 to the signal indicating the start of non-voice and the code, applies them to the window shown in FIG. 5, and has a voiceless section having a time width corresponding to the code. Restoration with the adjustment means (1
Output to 3). The adjusting means (13) inserts the non-voice part output from the detecting means (12) into the portions A to D of the digital voice. Further, when the code A appears repeatedly, the detecting means (12) lengthens a part or all of the time widths of the codes A to D shown in FIG. 5, and the time width of the restored non-voice part also becomes the number of repetitions. It will automatically increase in proportion. As described above, since it is possible to automatically replace a code with few voiceless parts during recording, it is very convenient and rational, and an accurate voiceless time can be provided during playback even with a small number of codes. Since the restoration can be performed and the restoration processing time is short, the output of the reproduced voice is not hindered. In addition, the above A ~
The addition of the D code, the processing content based on the code, and the like are merely examples, and the present invention is not limited thereto. The size of the device configured by using the above-described embodiment is preferably such that it is portable, and if it is a learning book, a function of outputting repetitive voice or a bookmark-like function may be added.
Further, since the size of the device depends on the size of the recording medium, the recording medium has a small size and a high capacity, such as a CD-ROM, a mini magneto-optical disk, 3.5.
Inch floppy disks, digital audio tapes, etc. are suitable. It should be noted that the digital voice does not need to be particularly limited, such as synthetic voice, voice obtained by A / D conversion of natural voice, compression processing, and the like, and indicates voice converted by an existing method.

【０００１０】[00010]

【発明の効果】以上詳述の如く本発明は、一般に提供さ
れている記憶媒体であっても書籍の朗読音声を充分に記
憶し、しかも再生時、発話速度を可変自在とし、且つ通
常の朗読と変わらない音声を出力させることができる等
の効果を有する。As described above in detail, according to the present invention, even if it is a storage medium which is generally provided, the reading voice of the book is sufficiently stored, and the utterance speed can be freely varied during the reproduction, and the ordinary reading can be performed. It has the effect of being able to output the same sound as the above.

[Brief description of drawings]

【図１】本発明の記憶部の実施例を示す図FIG. 1 is a diagram showing an embodiment of a storage unit of the present invention.

【図２】本発明の再生部の実施例を示す図FIG. 2 is a diagram showing an embodiment of a reproducing unit of the present invention.

【図３】[Figure 3]

【図４】[Figure 4]

【図５】本発明の実施例を説明するための図FIG. 5 is a diagram for explaining an embodiment of the present invention.

[Explanation of symbols]

01 アナログ音声入力手段 02 Ａ／Ｄ変換手段 03 無音声検出手段 04 変換手段 111 書き込み手段 11 記録媒体 12 検出手段 13 調整手段 14 Ｄ／Ａ変換手段 15 増幅手段 16 発生手段 112 読み取り手段 01 analog voice input means 02 A / D conversion means 03 silence detection means 04 conversion means 111 writing means 11 recording medium 12 detection means 13 adjusting means 14 D / A conversion means 15 amplification means 16 generation means 112 reading means

Claims

[Claims]

1. A voice electronic book comprising a storage means for digitally storing a voice signal in a manner in which substantially no voice portion is deleted, and a voice reproducing means for voice-reproducing the digital voice signal at a desired speech rate.