JPH06308992A - Voice ebook - Google Patents

Voice ebook

Info

Publication number
JPH06308992A
JPH06308992A JP5116599A JP11659993A JPH06308992A JP H06308992 A JPH06308992 A JP H06308992A JP 5116599 A JP5116599 A JP 5116599A JP 11659993 A JP11659993 A JP 11659993A JP H06308992 A JPH06308992 A JP H06308992A
Authority
JP
Japan
Prior art keywords
voice
digital
code
signal
voiceless
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP5116599A
Other languages
Japanese (ja)
Inventor
Hiroshi Ishibashi
広 石橋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advance Co Ltd
Original Assignee
Advance Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advance Co Ltd filed Critical Advance Co Ltd
Priority to JP5116599A priority Critical patent/JPH06308992A/en
Priority to PCT/JP1994/000661 priority patent/WO1994024667A1/en
Priority to EP94913792A priority patent/EP0652560A4/en
Publication of JPH06308992A publication Critical patent/JPH06308992A/en
Priority to KR1019940704661A priority patent/KR950702323A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/04Electrically-operated educational appliances with audible presentation of the material to be studied
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/005Reproducing at a different information rate from the information rate of recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • G11B2220/21Disc-shaped record carriers characterised in that the disc is of read-only, rewritable, or recordable type
    • G11B2220/215Recordable discs
    • G11B2220/218Write-once discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • G11B2220/25Disc-shaped record carriers characterised in that the disc is based on a specific recording technology
    • G11B2220/2537Optical discs
    • G11B2220/2545CDs

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Educational Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Educational Administration (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

(57)【要約】 【目的】 書籍の朗読音声出力を長時間行う音声式電子
ブック 【構成】 実質的に無音声部を削除した様式で、音声信
号をディジタル記憶する記憶手段、所望の発話速度でデ
ィジタル音声信号を音声再生する音声再生手段より成
る。
(57) [Abstract] [Purpose] A voice-type electronic book that outputs a reading voice of a book for a long time [Configuration] Storage means for digitally storing a voice signal in a format in which substantially no voice portion is deleted, desired speech rate And a voice reproducing means for reproducing a digital voice signal by voice.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、音声式電子ブックに関
する。
FIELD OF THE INVENTION The present invention relates to voice electronic books.

【0002】[0002]

【従来の技術】CD−ROM等のディジタル高容量記憶
媒体を用いて、音声再生用の装置があるが、その再生時
間は、せいぜい70分程度である。この程度の再生時間
は、音楽を録音するには充分であるが、文庫本、学習書
等の書籍を朗読した朗読音声全部を録音するには不足し
ている。特に聴者に理解と認識を与える為の学習器等の
様な繰り返し且つ、明確な音声を低速でしかも長時間再
生出力する場合、上述のデイジタル記憶媒体の使用は、
学習内容を調整乃至省略しない限り困難なことであり、
その他の記憶媒体であっては、なおさらに困難である。
2. Description of the Related Art There is a device for audio reproduction using a digital high capacity storage medium such as a CD-ROM, but the reproduction time is about 70 minutes at most. Although such a reproduction time is sufficient for recording music, it is insufficient for recording all the read voices read aloud books such as paperback books and study books. In particular, when a repetitive and clear sound such as a learning device for giving an understanding and recognition to a listener is reproduced and output at a low speed for a long time, the use of the above digital storage medium is
It is difficult unless you adjust or omit the learning content,
It is even more difficult with other storage media.

【0003】[0003]

【課題を解決するための手段】上記に鑑み本発明は、鋭
意研究の結果、実質的に無音声部を削除した様式でディ
ジタル音声データを記憶媒体に記憶させ、再生時、この
無音声時間を付加することにより、記憶媒体には、充分
な音声データが格納でき、しかも再生時この無音声時間
が付加されていることから、自然の朗読に近い音声出力
が長時間得られる音声式電子ブックを実現した。本発明
で無音声部とは、例えば音節間、文節間等々の音声的に
無音乃至無音に近い部分を示すものである。又、無音声
部の実質的削除の様式とは、例えば無音声部の全部又は
1部の削除あるいは、無音声部を他の符号に変換するこ
と等々を示すものである。
SUMMARY OF THE INVENTION In view of the above, the present invention has as a result of earnest research that digital voice data is stored in a storage medium in a manner in which a non-voice portion is substantially deleted, and this voiceless time is reduced during reproduction. By adding this, a sufficient amount of audio data can be stored in the storage medium, and since this silent time is added during playback, it is possible to obtain an audio e-book that can obtain audio output that is close to natural reading for a long time. It was realized. In the present invention, the non-speech portion indicates a portion of the sound, such as inter-syllables and inter-syllables, which is silent or close to silence. In addition, the method of substantially deleting the silent part indicates, for example, deleting all or a part of the silent part or converting the silent part into another code.

【0004】[0004]

【実施例】以下、本発明の実施例を図面を参照して詳細
に説明する。図1は、記憶手段の一例であり、以下、記
録部とした。(11)は、記録媒体であり、主に光ディス
ク、光磁気ディスク、磁気ディスク等のディジタル記憶
媒体よりなる。(111)は、書き込み手段であり、書き込
み用ヘッド、ヘッド駆動用ドライバ等から構成される。
(01)は、アナログ音声入力手段であり、マイクロフォ
ン、フィルタ、増幅器等から構成される。(02)は、A/
D変換手段であり、アナログ音声信号をデジタル音声信
号に変換する。更にA/D変換手段(02)は、ADPCM等の
デジタル信号圧縮手段を組み込む場合もある。(03)は、
無音声検出手段であり、無音声部を自動的、あるいは目
視的によって検出する部分である。(04)は、変換手段で
あり、無音声検出手段(03)及びA/D変換手段(02)の出
力信号を入力し、無音声検出手段(03)からの入力信号に
基づいてデジタル音声信号の無音声部にたいし、削除あ
るいは、他の符号に変換処理を行う手段である。無音声
検出手段(03)並びに変換手段(04)は、CPU,DSPな
どを用いてアルゴリズム的処理を施すものであってもよ
い。この場合、両手段(03)(04)の区別は、無くなるもの
である。図1は、アナログ音声を最初にデジタル音声に
変換した後、無音声部の実質的削除を行う構成を示した
が、これに限られるものではなく、例えばデジタル変換
行程中、あるいはアナログ音声時に無音声部の実質的削
除がおこなわれるものであってもよい。
Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 1 shows an example of a storage unit, which will be referred to as a recording unit hereinafter. Reference numeral (11) is a recording medium, which mainly comprises a digital storage medium such as an optical disc, a magneto-optical disc, a magnetic disc, or the like. Reference numeral (111) is a writing unit, which includes a writing head, a head driving driver, and the like.
Reference numeral (01) is an analog voice input means, which is composed of a microphone, a filter, an amplifier and the like. (02) is A /
The D conversion means converts an analog audio signal into a digital audio signal. Further, the A / D conversion means (02) may incorporate digital signal compression means such as ADPCM. (03) is
It is a voiceless detection means, and is a portion that automatically or visually detects a voiceless portion. Reference numeral (04) is a conversion means, which inputs the output signals of the non-voice detection means (03) and the A / D conversion means (02), and which is based on the input signal from the non-voice detection means (03). It is a means for deleting the unvoiced part of the above or converting it to another code. The non-voice detecting means (03) and the converting means (04) may perform algorithmic processing by using a CPU, DSP or the like. In this case, the distinction between the means (03) and (04) is lost. FIG. 1 shows a configuration in which analog voice is first converted to digital voice, and then the non-voice portion is substantially deleted. However, the present invention is not limited to this. For example, during the digital conversion process or during analog voice, The audio part may be substantially deleted.

【0005】図2は、音声再生手段の一例であり、以下
再生部とした。(11)は、記録媒体であり、図1で示した
ものである。(112)は読み取り手段であり、読み取り用
ピックアップ、記録手段(11)を回転させる手段、読み取
り用ピックアップを摺動させる手段等から構成される。
(12)は、検出手段であり、読み取り手段(112)が出力す
るディジタル音声から、実質的に削除された無音声部を
検出し、検出した無音声部を復元又は、新たに形成又
は、これらと同等の意味を持つ信号に変換し、出力する
ものである。(13)は調整手段であり、読み取り手段(11
2)から出力されたディジタル音声信号と、検出手段(12)
が出力した無音声信号とを組み合わせた後、この組み合
わせ信号を出力する。検出手段(12)、調整手段(13)は、
1つのCPU、DSPワンチップマイコン等によってア
ルゴリズム的に処理される場合がある。この場合、両手
段(12)(13)の区別する必要はなく、すくなくとも削除さ
れた無音声部を任意の無音声時間、又は原無音声時間を
有する無音声ディジタル信号に変換し、ディジタル音声
と組み合わせて出力するプログラムルーチン等のアルゴ
リズムを有すればよいものである。(14)はD/A変換手
段であり、調整手段(13)から出力されるディジタル音声
をアナログ音声に変換するものである。この時、図1で
しめすA/D変換手段(02)が圧縮手段を有している場
合、D/A変換手段(14)は、復元手段を有するものであ
る。又、D/A変換手段(14)が、検出手段(12)、調整手
段(13)を兼ねる場合もある。(15)は、増幅手段であり、
アナログ音声を電気的に増幅する手段である。尚、増幅
手段(15)には更に周波数フィルタ特性が付加されたもの
であってもよい。(16)は、発声手段であり、スピーカ、
イヤホーンの何れか、あるいは全部等よりなる。尚、記
録部及び再生部は両部一体型または別体型何れの場合で
も良い。
FIG. 2 shows an example of the audio reproducing means, which will be referred to as a reproducing section hereinafter. (11) is a recording medium, which is shown in FIG. Reference numeral (112) is a reading means, which is composed of a reading pickup, a means for rotating the recording means (11), a means for sliding the reading pickup, and the like.
Reference numeral (12) is a detection means, which detects a virtually deleted voiceless portion from the digital voice output by the reading means (112) and restores or newly forms the detected voiceless portion, or these It is converted into a signal having the same meaning as and output. (13) is an adjusting means, and a reading means (11
Digital voice signal output from 2) and detection means (12)
After combining with the non-voice signal output by, the combined signal is output. The detection means (12) and the adjustment means (13) are
It may be processed algorithmically by one CPU, DSP one-chip microcomputer, or the like. In this case, it is not necessary to distinguish between the means (12) and (13), and at least the deleted voiceless part is converted into a voiceless digital signal having an arbitrary voiceless time or an original voiceless time, and the digital voice and It suffices if it has an algorithm such as a program routine that outputs it in combination. Reference numeral (14) is a D / A conversion means for converting the digital voice output from the adjusting means (13) into an analog voice. At this time, when the A / D converting means (02) shown in FIG. 1 has a compressing means, the D / A converting means (14) has a restoring means. The D / A conversion means (14) may also serve as the detection means (12) and the adjustment means (13). (15) is an amplification means,
It is a means for electrically amplifying analog voice. The amplifying means (15) may have a frequency filter characteristic added thereto. (16) is a voicing means, a speaker,
It consists of any or all of the earphones. Incidentally, the recording unit and the reproducing unit may be of both type integrated type or separate type.

【0006】次に図1及び図2の動作の一例を説明す
る。図1で示す記録部において、アナログ音声入力部(0
1)に入力されたアナログ音声は、ろ波、増幅されたの
ち、A/D変換手段(02)において、デジタル音声信号
(図3(1))に変換される。デジタル音声信号は、無
音声検出手段(03)並びに変換手段(04)に入力される。無
音声検出手段(03)で、図3(1)で示す無音声部(31)が
検出され、変換手段(04)で図3(2)でしめす(32)のよ
うに無音声部は、実質的に削除される。無音声部が実質
的に削除されたデジタル音声信号(図3(2))は、書
き込み手段(111)を介して記録媒体(11)に書き込まれ
る。尚、ディジタル音声信号列は、非常にこまかいこと
から、省略して描いた。又、デイジタル音声列の1つ
は、1音節、1文節、1段落あるいは、無音声部から、
次の無音声部迄等が示される。
Next, an example of the operation of FIGS. 1 and 2 will be described. In the recording section shown in FIG. 1, the analog voice input section (0
The analog voice input to 1) is filtered and amplified, and then converted into a digital voice signal ((1) in FIG. 3) by the A / D conversion means (02). The digital audio signal is input to the silence detection means (03) and the conversion means (04). The non-voice detection section (03) detects the non-voice section (31) shown in FIG. 3 (1), and the conversion section (04) shows the non-voice section as shown by (32) in FIG. 3 (2). Effectively deleted. The digital audio signal (FIG. 3 (2)) from which the non-voice part is substantially deleted is written in the recording medium (11) via the writing means (111). The digital audio signal sequence is omitted because it is very detailed. Also, one of the digital voice strings is from one syllable, one syllable, one paragraph, or a silent part,
Up to the next unvoiced part is shown.

【0007】次に記録媒体(11)に記録されたデイジタル
音声信号を再生する再生部を示す図2に於て、記録手段
(11)を読み取り手段(112)にセットし、記録手段(11)か
ら、実質的に無音声部が削除されたディジタル音声信号
が読み取られ、検出手段(12)、並びに調整手段(13)に出
力される。検出手段(12)は、削除された無音声部(図3
(2))(32)を検出し、任意の時間幅又は原時間幅を有
する無音声ディジタル信号に変換し、調整手段(13)に出
力する。調整手段(13)は、記録手段(11)から入力された
実質的に無音声部が削除されたディジタル音声信号の削
除部に検出手段(12)から入力された無音声ディジタル信
号を組み合わせて、この組み合わせディジタル音声信号
(図3(1))をD/A変換手段(14)に出力する。D/
A変換手段(14)は、入力された組み合わせディジタル音
声信号をアナログ音声信号に変換出力する。増幅手段(1
5)は、このアナログ音声信号を増幅、場合によってろ波
し、発声手段(16)に出力する。発声手段(16)は、スピー
カ、イヤホンを媒体として音声を出力する。この時、無
音声デイジタル信号量の数値的加算、減算等の調整によ
り、発話速度は自在に調整され、低速発話も容易に実施
できる。この調整は、聴者が調整できるように調整用の
ツマミを装置上に装着される場合もある。
Next, in FIG. 2 showing a reproducing section for reproducing the digital audio signal recorded on the recording medium (11), recording means
(11) is set in the reading means (112), the digital sound signal from which the substantially silent part is deleted is read from the recording means (11), and is detected by the detecting means (12) and the adjusting means (13). Is output. The detection means (12) uses the deleted non-voice part (see FIG. 3).
(2)) Detects (32), converts it to a voiceless digital signal having an arbitrary time width or original time width, and outputs it to the adjusting means (13). The adjusting means (13) combines the non-voice digital signal input from the detecting means (12) with the deletion portion of the digital voice signal in which the substantially non-voice portion input from the recording means (11) is deleted, This combined digital audio signal (FIG. 3 (1)) is output to the D / A conversion means (14). D /
The A conversion means (14) converts the input combined digital audio signal into an analog audio signal and outputs it. Amplification means (1
5) amplifies this analog audio signal, filters it depending on the case, and outputs it to the voicing means (16). The voicing means (16) outputs a sound using a speaker and an earphone as a medium. At this time, the utterance speed can be freely adjusted by adjusting numerically adding or subtracting the amount of voiceless digital signal, and low speed utterance can be easily performed. In this adjustment, a knob for adjustment may be mounted on the device so that the listener can adjust it.

【0008】又、実質的に無音声部が削除されたディジ
タル音声は、図4で示す様に記録媒体に記録される場合
もある。図1の記録部において、無音声検出手段(03)、
変換手段(04)は、図4(1)で示す原ディジタル音声信号
の無音声部(41)を図4(2)で示す様に、他の符号(42)で
置換する。図4(2)で示すデイジタル音声信号は、書
き込み手段(111)を介して記録手段(11)に書き込まれ
る。 図4(2)で示す他の符号(42)とは、単なる目印
の他、無音声時間幅の情報、無音声部の性質を示す情報
を具備した数ビットの符号等を示すものである。図2の
再生部に於て、記録手段(11)は図4(2)で示すディジタ
ル音声信号を記録している。 読み取り手段(112)は、
この記録手段(11)に記録された実質的に無音声部が削除
されたディジタル音声信号を読み出し、検出手段(12)、
調整手段(13)に出力する。検出手段(12)は、入力された
ディジタル音声信号の削除された無音声部に代替付加さ
れている符号を検出した後、その符号を解読し、解読内
容に従った信号を調整手段(13)に出力する。図4(2)
で示す他の符号(42)の内容は上述の様にその部分の原無
音声部の時間幅等である。調整手段(13)は、検出手段(1
2)から入力された信号と、読み取り手段(112)から入力
された無音声部が削除されたディジタル音声信号より、
無音声部を付加乃至再現したディジタル音声信号(図4
(1))をD/A変換手段(14)に出力する。D/A変換
手段(14)以降の動作は、前述と同一なので説明は省略す
る。
Further, the digital voice from which the non-voice portion is substantially deleted may be recorded on the recording medium as shown in FIG. In the recording unit of FIG. 1, the voiceless detection means (03),
The conversion means (04) replaces the unvoiced part (41) of the original digital audio signal shown in FIG. 4 (1) with another code (42) as shown in FIG. 4 (2). The digital audio signal shown in FIG. 4 (2) is written in the recording means (11) via the writing means (111). The other code (42) shown in FIG. 4 (2) is, in addition to a mere mark, a code of several bits provided with information of the non-voice time width, information indicating the property of the non-voice portion, and the like. In the reproducing section of FIG. 2, the recording means (11) records the digital audio signal shown in FIG. 4 (2). The reading means (112) is
The recording means (11) reads out the digital voice signal from which the substantially silent portion is deleted, and the detection means (12),
Output to the adjusting means (13). The detecting means (12) detects the code added to the deleted voiceless part of the input digital voice signal, decodes the code, and adjusts the signal according to the decoded content (13). Output to. Figure 4 (2)
The content of the other code (42) indicated by is the time width or the like of the original unvoiced portion of the portion as described above. The adjusting means (13) is provided with the detecting means (1
From the signal input from (2) and the digital audio signal from which the voiceless part input from the reading means (112) has been deleted,
A digital voice signal with or without a voiceless portion added (see FIG. 4).
(1)) is output to the D / A conversion means (14). Since the operation after the D / A conversion means (14) is the same as that described above, its explanation is omitted.

【0009】次に無音声部を実質的に削除する他のアル
ゴリズムの一例について説明する。図1で示す記録部に
於て、無音声部に対し、図5のウィンドウを予じめ設定
しておく。Lthは、無音声と判断する為の閾値であ
り、(+)(−)方向に設定されている。図5で示すA 〜
Dの符号は予じめ決定されており、又A 〜 Dの符号間
の時間幅の初期値も予じめ設定されている。尚、時間幅
は初期値だけであって可変可能である。現時点tsに於
いて時刻ts+1からtaまでの間で(1)式を満たす最
小のtnを見つける。 |V(tn)−V(ts)|>Lth (1) tnが見つからなければ符号Aをとり、再びこの符号A
を現時点tsとして図5で示すウィンドウ上で次のtn
を見つける動作をする。その他の場合、tb<tn≦t
aの時は符号Bを取り、その後、符号の付与を中止す
る。以下同じく ts+2<tn<tbのときは符号C
を取り、tn≦ts+2のときは符号Dをとり、その
後、それぞれ符号の付与を中止する。次に |V(ti)−V(ts)|≦Lth となった時、無音声の削除処理が再開される。この時、
再開を示す符号が付与される。符号Aが繰り返し、又は
多数の頻度で選択される場合、A 〜 D符号間の時間幅
の全体乃至一部は長くなる。V(ti)は、現時点ts
から、所定の時間前乃至時間後の時間ti時の電圧値で
ある。本実施例で使用される符号は、A 〜 Dの4個で
あるから、2ビット程度で表現されるので記録手段上で
の無音声部はわずかの符号列で置き換わるものである。
尚、符号の数は、少ない方が好ましいが、特に限定され
るものではない。上述した行程に於て決定された符号A
〜Dが書き込み手段(111)を介して記録媒体(11)に記録
される。この様にして、ディジタル音声が記録された記
録手段が図2で示す再生部で再生される際の動作を説明
する。記録手段(11)で記録されたディジタル音声が読み
取り手段(112)で読み取られ、検出手段(12)並びに調整
手段(13)に入力される。検出手段(12)は、図5で示した
符号A 〜 D乃至無音声開始を示す信号並びに符号を検
出し、図5でしめしたウインドウに当てはめ、その符号
に応じた時間幅を有する無音声部で復元し、調整手段(1
3)に出力する。調整手段(13)は、ディジタル音声の符号
A 〜 Dの部分に検出手段(12)から出力された無音声部
を挿入していく。又、検出手段(12)は、符号Aが繰り返
し出現する場合、図5で示す符号A〜Dの時間幅の一部
乃至全部も長くなり、復元される無音声部の時間幅も繰
り返し回数に比例する様に自動的に長くなっていく。以
上の様に、記録時、無音声部が少ない符号で自動的に置
き換え可能であることから、非常に至便、且つ合理性に
富み、再生時、少ない符号であっても正確な無音声時間
を復元でき、しかも復元処理時間が短いので、再生音声
出力に支障がない等の効果がある。尚、上述したA 〜
Dの符号の付与並びに符号に基づく処理内容等々はあく
まで一例であり、限られるものではない。上述した実施
例を使用して構成させる装置の大きさは、携帯型ができ
る程度が好ましく、学習書であれば、反復する音声を出
力する機能や、しおり的な機能を付加する場合もある。
又、装置の大きさは、記録媒体の大きさにも左右される
ことから、記録媒体は、小さくてしかも高容量であるも
の、例えばCD−ROM、ミニ光磁気デイスク、3.5
インチフロッピイデイスク、デジタルオーデイオテープ
等が適当である。尚、ディジタル音声は、合成音声、自
然音声をA/D変換、圧縮処理した音声等、特に限定す
る必要はなく、既存の方式によって変換された音声を示
すものである。
Next, an example of another algorithm for substantially eliminating the voiceless portion will be described. In the recording section shown in FIG. 1, the window shown in FIG. 5 is preset for the silent section. Lth is a threshold for determining that there is no voice, and is set in the (+) (-) direction. A shown in FIG.
The code of D has been determined in advance, and the initial value of the time width between the codes of A to D has also been set in advance. The time width is only an initial value and can be changed. At the present time ts, the minimum tn satisfying the expression (1) is found from the time ts + 1 to ta. | V (tn) -V (ts) |> Lth (1) If tn is not found, the code A is taken, and the code A is used again.
Is the current time ts and the next tn on the window shown in FIG.
To find out. In other cases, tb <tn ≦ t
When it is a, the code B is taken, and thereafter, the code addition is stopped. Similarly, when ts + 2 <tn <tb, the code C
, And when tn ≦ ts + 2, the code D is taken, and thereafter, the addition of the code is stopped. Next, when | V (ti) −V (ts) | ≦ Lth, the voiceless deletion process is restarted. This time,
A code indicating restart is added. When the code A is selected repeatedly or with a large number of frequencies, the whole or part of the time width between the A to D codes becomes long. V (ti) is currently ts
Is the voltage value at time ti before or after the predetermined time. Since there are four codes A to D used in this embodiment, they are represented by about 2 bits, so that the non-voice part on the recording means is replaced by a small code string.
The number of codes is preferably as small as possible, but is not particularly limited. Code A determined in the above process
To D are recorded on the recording medium (11) via the writing means (111). The operation when the recording means in which digital voice is recorded in this way is reproduced by the reproducing section shown in FIG. 2 will be described. The digital voice recorded by the recording means (11) is read by the reading means (112) and input to the detecting means (12) and the adjusting means (13). The detection means (12) detects the signals A to D shown in FIG. 5 to the signal indicating the start of non-voice and the code, applies them to the window shown in FIG. 5, and has a voiceless section having a time width corresponding to the code. Restoration with the adjustment means (1
Output to 3). The adjusting means (13) inserts the non-voice part output from the detecting means (12) into the portions A to D of the digital voice. Further, when the code A appears repeatedly, the detecting means (12) lengthens a part or all of the time widths of the codes A to D shown in FIG. 5, and the time width of the restored non-voice part also becomes the number of repetitions. It will automatically increase in proportion. As described above, since it is possible to automatically replace a code with few voiceless parts during recording, it is very convenient and rational, and an accurate voiceless time can be provided during playback even with a small number of codes. Since the restoration can be performed and the restoration processing time is short, the output of the reproduced voice is not hindered. In addition, the above A ~
The addition of the D code, the processing content based on the code, and the like are merely examples, and the present invention is not limited thereto. The size of the device configured by using the above-described embodiment is preferably such that it is portable, and if it is a learning book, a function of outputting repetitive voice or a bookmark-like function may be added.
Further, since the size of the device depends on the size of the recording medium, the recording medium has a small size and a high capacity, such as a CD-ROM, a mini magneto-optical disk, 3.5.
Inch floppy disks, digital audio tapes, etc. are suitable. It should be noted that the digital voice does not need to be particularly limited, such as synthetic voice, voice obtained by A / D conversion of natural voice, compression processing, and the like, and indicates voice converted by an existing method.

【00010】[00010]

【発明の効果】以上詳述の如く本発明は、一般に提供さ
れている記憶媒体であっても書籍の朗読音声を充分に記
憶し、しかも再生時、発話速度を可変自在とし、且つ通
常の朗読と変わらない音声を出力させることができる等
の効果を有する。
As described above in detail, according to the present invention, even if it is a storage medium which is generally provided, the reading voice of the book is sufficiently stored, and the utterance speed can be freely varied during the reproduction, and the ordinary reading can be performed. It has the effect of being able to output the same sound as the above.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の記憶部の実施例を示す図FIG. 1 is a diagram showing an embodiment of a storage unit of the present invention.

【図2】本発明の再生部の実施例を示す図FIG. 2 is a diagram showing an embodiment of a reproducing unit of the present invention.

【図3】[Figure 3]

【図4】[Figure 4]

【図5】本発明の実施例を説明するための図FIG. 5 is a diagram for explaining an embodiment of the present invention.

【符号の説明】[Explanation of symbols]

01 アナログ音声入力手段 02 A/D変換手段 03 無音声検出手段 04 変換手段 111 書き込み手段 11 記録媒体 12 検出手段 13 調整手段 14 D/A変換手段 15 増幅手段 16 発生手段 112 読み取り手段 01 analog voice input means 02 A / D conversion means 03 silence detection means 04 conversion means 111 writing means 11 recording medium 12 detection means 13 adjusting means 14 D / A conversion means 15 amplification means 16 generation means 112 reading means

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 実質的に無音声部を削除した様式で音声
信号をディジタル記憶する記憶手段、所望の発話速度で
ディジタル音声信号を音声再生する音声再生手段より成
る音声式電子ブック。
1. A voice electronic book comprising a storage means for digitally storing a voice signal in a manner in which substantially no voice portion is deleted, and a voice reproducing means for voice-reproducing the digital voice signal at a desired speech rate.
JP5116599A 1993-04-21 1993-04-21 Voice ebook Pending JPH06308992A (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP5116599A JPH06308992A (en) 1993-04-21 1993-04-21 Voice ebook
PCT/JP1994/000661 WO1994024667A1 (en) 1993-04-21 1994-04-21 Apparatus for recording and reproducing voice
EP94913792A EP0652560A4 (en) 1993-04-21 1994-04-21 Apparatus for recording and reproducing voice.
KR1019940704661A KR950702323A (en) 1993-04-21 1994-12-20 Audio recording / playback device (APPARATUS FOR RECORDING AND REPRODUCING VOICE)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5116599A JPH06308992A (en) 1993-04-21 1993-04-21 Voice ebook

Publications (1)

Publication Number Publication Date
JPH06308992A true JPH06308992A (en) 1994-11-04

Family

ID=14691150

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5116599A Pending JPH06308992A (en) 1993-04-21 1993-04-21 Voice ebook

Country Status (3)

Country Link
JP (1) JPH06308992A (en)
KR (1) KR950702323A (en)
WO (1) WO1994024667A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7020663B2 (en) 2001-05-30 2006-03-28 George M. Hay System and method for the delivery of electronic books
JP2019168668A (en) * 2018-06-27 2019-10-03 株式会社アセンド Voice data optimization system
JP2019168604A (en) * 2018-03-23 2019-10-03 株式会社アセンド Voice data optimization system

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL110883A (en) * 1994-09-05 1997-03-18 Ofer Bergman Reading tutorial system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03184171A (en) * 1989-12-13 1991-08-12 Hitachi Ltd Electronic book reproducing device
JPH03248398A (en) * 1990-12-13 1991-11-06 Sharp Corp Recording and reproducing system for digital recording and reproducing machine

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59195307A (en) * 1983-04-20 1984-11-06 Casio Comput Co Ltd Recording system of sound information
JPS6035795A (en) * 1983-08-05 1985-02-23 赤井電機株式会社 Signal pitch converter
JPS62125577A (en) * 1985-11-26 1987-06-06 Nec Corp Voice storing and reproducing device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03184171A (en) * 1989-12-13 1991-08-12 Hitachi Ltd Electronic book reproducing device
JPH03248398A (en) * 1990-12-13 1991-11-06 Sharp Corp Recording and reproducing system for digital recording and reproducing machine

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7020663B2 (en) 2001-05-30 2006-03-28 George M. Hay System and method for the delivery of electronic books
JP2019168604A (en) * 2018-03-23 2019-10-03 株式会社アセンド Voice data optimization system
JP2019168668A (en) * 2018-06-27 2019-10-03 株式会社アセンド Voice data optimization system

Also Published As

Publication number Publication date
KR950702323A (en) 1995-06-19
WO1994024667A1 (en) 1994-10-27

Similar Documents

Publication Publication Date Title
US6088313A (en) Method and apparatus for reproducing audio signals at various speeds by dividing original audio signals into a sequence of frames based on zero-cross points
Stockdale Tools for digital audio recording in qualitative research
JPH06308992A (en) Voice ebook
CN1084916C (en) Data recording apparatus and method for semiconductor memory card
JP2838159B2 (en) Audio signal processing device
EP0652560A1 (en) Apparatus for recording and reproducing voice
KR100372576B1 (en) Method of Processing Audio Signal
JPH07272447A (en) Voice data editing system
JP2741566B2 (en) Voice-based e-book
JPH07160282A (en) Voice reproducing device
JPS6253093B2 (en)
JP4779954B2 (en) Audio data processing apparatus, method and program
JP2001117596A (en) Audio signal reproduction method and audio signal reproduction device
JPH07261779A (en) Syllable recognizer
JP3490655B2 (en) Audio signal decoder
JPH0927189A (en) Voice information reproducing system
JPH0744199A (en) Voice recording / playback device
JP2011215314A (en) Recorder
JP2962777B2 (en) Audio signal time-base expansion / compression device
JPH0242497A (en) Audio recording and playback device
JPH07153188A (en) Audio playback device
Stockdale UPDATE social
JP2005140858A (en) Recording / reproducing apparatus and method
Nash Evaluating the use of adaptive transform acoustic coding (ATRAC) data compression in acoustic phonetics
JPH07169291A (en) Voice recording device and voice reproducing device