JPH01238698A

JPH01238698A - Voice fundamental period extractor

Info

Publication number: JPH01238698A
Application number: JP6637688A
Authority: JP
Inventors: Akihiro Kimura; 晋太木村
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1988-03-19
Filing date: 1988-03-19
Publication date: 1989-09-22
Anticipated expiration: 2012-09-10
Also published as: JP2650954B2

Abstract

PURPOSE:To stably extract a voice fundamental period even if there is much change in the voice fundamental period by providing plural voice analyzing means, a selective coupling section for the outputs thereof and a voice fundamental period extracting section. CONSTITUTION:A microphone 1 is led via an A/D converter 2 to the voice fundamental period extractor by a cepstrum method. A spectral analyzing section 3-1 is shortest in analysis section length and 3-n is longest. The analysis section length of the analyzing section 5-5 past a logarithmic conversion section 4 is shortest and is longest in 5-n. The adequate cepstra of the analysis section length are selected and coupled according to the frequency part in a cepstrum selective coupling section 6. The max. value of the cepstra obtd. in the selective coupling section 6 is determined and the position of said max. value is determined as the fundamental period in the fundamental period extraction section 7. The flexible follow-up to the fundamental frequencies of the heavily changing voices is enabled and the stable extraction of the fundamental frequencies is possible according to this constitution.

Description

【発明の詳細な説明】［概　要］本発明は音声分析に係る音声基本周期の抽出、を行なう
方式に関し、音声の基本周期の変化が多い場合てあっても安定に基本
周期を抽出することのできる音声基本周期抽出装置を提
供することを目的とし、入力された音声テイジタル信号
を分析するそれぞれ分析区間長の異なる複数の音声分析
手段と、重複する分析区間については上記各分析手段の内の最適
なピーク検出を行なうことのできる分析手段の出力を選
択し各分析手段の出力を結合する選択結合部と、該選択結合部の出力から音声の基本周期に対応するピー
ク検出を行なう基本周期抽出部とを具備することにより
構成する。[Detailed Description of the Invention] [Summary] The present invention relates to a method for extracting the fundamental period of speech related to speech analysis, and is capable of stably extracting the fundamental period even when there are many changes in the fundamental period of speech. The purpose of the present invention is to provide a speech fundamental period extraction device that can analyze an input speech digital signal, and includes a plurality of speech analysis means each having a different analysis section length, and for overlapping analysis sections, one of the above-mentioned analysis means. a selection combination unit that selects the output of the analysis means that can perform optimal peak detection and combines the outputs of each analysis unit; and a fundamental period extraction unit that performs peak detection corresponding to the fundamental period of speech from the output of the selection combination unit. It is constituted by comprising a section.

［産業上の利用分野］本発明は音声分析に係る音声基本周期の抽出を行なう方
式に関するものてあって、特に、基本周期の変化が大き
い音声の場合てあっても、その基本周期を安定に抽出す
ることの可能な音声基本周期抽出装置に係る。[Field of Industrial Application] The present invention relates to a method for extracting the fundamental period of speech related to speech analysis, and in particular, it is a method for stabilizing the fundamental period even in the case of speech whose fundamental period changes greatly. The present invention relates to a speech fundamental period extraction device capable of extracting a fundamental period of speech.

［従来の技術］第６図は従来のケプストラム法による基本周期抽出方式
の例を示す図である。同図において５１は音声を電気信
号に変換するマイクロホン、５２は音声電気信号をディ
ジタル化するＡＤ変換部、５３は数ミリ秒から数十ミリ
秒毎に音声ティジタル信号の固定された区間長の短区間
パワースペクトルを分析するスペクトル分析部、５４は
スペクトル分析部５３て得られたパワースペクトルの対
数を計算する対数変換部、５５は対数スペクトルからケ
プストラムを計算するケプストラム分析部、５６はケプ
ストラム上のピークを音声のピッチ（基本周期）として
抽出するピッチ抽出部を表している。[Prior Art] FIG. 6 is a diagram showing an example of a fundamental period extraction method using the conventional cepstrum method. In the figure, 51 is a microphone that converts audio into an electrical signal, 52 is an AD converter that digitizes the audio electrical signal, and 53 is a short circuit that converts the audio digital signal into a fixed section length every several milliseconds to several tens of milliseconds. A spectrum analysis section that analyzes the interval power spectrum, 54 a logarithmic conversion section that calculates the logarithm of the power spectrum obtained by the spectrum analysis section 53, 55 a cepstrum analysis section that calculates the cepstrum from the logarithmic spectrum, and 56 a peak on the cepstrum. It represents a pitch extracting unit that extracts as the pitch (fundamental period) of the voice.

ＡＤ変換部５２において、標本化およびディジタル化さ
れた音声信号を＋Ｘｏ＋とする。スペクトル分析部５３
においては、式Ｉおよび式■を用いて短区間パワースペ
クトル（Ｐ８）を計算する。ここでサンプリング周波数
をｆ５　　）１ｚ、分析区間長をτ秒とした時、分析区
間の標本数（Ｌ）は式■で得られる。式Ｉは高速フーリ
エ変換（ＦＦＴ）の算法を用いて効率よく計算すること
ができる。式Ｉでｊは虚数単位であり（ｊ２−−１）、
ｅｘｐは自然指数関数である。式■でＦ８′はＦ、の共
役複素数を表している。The audio signal sampled and digitized in the AD converter 52 is designated as +Xo+. Spectrum analysis section 53
In , the short-term power spectrum (P8) is calculated using Equation I and Equation (2). Here, when the sampling frequency is f5)1z and the length of the analysis interval is τ seconds, the number of samples (L) in the analysis interval can be obtained by formula (2). Equation I can be efficiently calculated using fast Fourier transform (FFT) algorithms. In formula I, j is an imaginary unit (j2--1),
exp is a natural exponential function. In formula (2), F8' represents the conjugate complex number of F.

Ｆ、、−’Ｅ　ω　、、　　Ｘｎ　　　ｅｘｐ（−２π
　、ｉｎｋ／Ｌ）　　（式　Ｉ　）Ｐ、　　＝Ｆ　　う
　・　Ｆ、”　　　　　　　　　　　　　　　　　　　
　　　　　　　（式　■　）Ｌ　−τ・　ｆ５　　　　
　　　　　　　　（弐■）対数変換部５４においてはＰ
、、の対数を弐■により計算する。F,,-'E ω,, Xn exp(-2π
, ink/L) (Formula I) P, =F ・F,”
(Formula ■) L −τ・f5
(2) In the logarithmic conversion unit 54, P
Calculate the logarithm of , , using 2■.

Ｐｋ’＝ｌｏｇ（Ｐ、）　　　　　　　　　　　　（式
■）第７図（、）に対数変換された音声パワースペクト
ル（Ｐ、”）の例を示す。Pk'=log(P,) (Formula ■) An example of a logarithmically transformed audio power spectrum (P,'') is shown in FIG. 7 (,).

ケプストラム分析部５５においては、対数変換部５４て
対数変換された短区間パワースペクトルに弐■を用いて
フーリエ変換を施すことによりケプストラム分析を行な
う。The cepstrum analysis unit 55 performs cepstrum analysis by subjecting the short-term power spectrum logarithmically transformed by the logarithmic transformation unit 54 to Fourier transformation using 2).

Ｃｎ　’Ｘ　　Ｐ　Ｍ　’　　ｅｘｐ（２πｊｎｋ／　
Ｌ）　　　（弐■）第７図（ｂ）にケプストラムの例を
示す。Cn 'X P M' exp(2πjnk/
L) (2■) Figure 7(b) shows an example of the cepstrum.

基本周期抽出部５６（ピッチ抽出部）では、人間の音声
の基本周期にあたる約２ミリ秒から約２０ミリ秒の範囲
（サンプリング周波数が１０ｋＨｚの場合、＋Ｃｎ＋の
ｎが２０から２００の範囲）でケプストラムの最大値を
求め、その最大値の位置を基本周期とする。The fundamental period extraction unit 56 (pitch extraction unit) extracts the cepstrum in the range of about 2 milliseconds to about 20 milliseconds, which is the fundamental cycle of human speech (when the sampling frequency is 10 kHz, n of +Cn+ is in the range of 20 to 200). Find the maximum value of and set the position of the maximum value as the fundamental period.

音声の基本周期抽出方式には、上述したケプストラム法
の他にも、自己相関法および変形自己相関法などがある
が、これらのケプストラム法、自己相関法および偏自己
相関法の詳細については、電子通信学会編「デジタル信
号処理」およびＮＴＴ技術移転株式会社ｍｒ音声情報工
学」に詳しく解説されている。In addition to the cepstrum method mentioned above, there are other fundamental period extraction methods for speech, such as the autocorrelation method and the modified autocorrelation method. It is explained in detail in ``Digital Signal Processing'' edited by the Institute of Communication Engineers and ``Mr.Speech Information Engineering'' by NTT Technology Transfer Co., Ltd.

［発明か解決しようとする課題］音声分析において、安定に基本周期を抽出するなめには
、−ｉに基本周期の分析区間（本方式では短区間スペク
トルを計算する区間）の中に音声波形の数周期分以上が
入る必要がある。[Problem to be solved by the invention] In speech analysis, in order to stably extract the fundamental period, it is necessary to -i include the speech waveform in the analysis interval of the fundamental period (in this method, the interval in which the short-term spectrum is calculated). It is necessary to include at least several cycles.

定常な音声を分析する場合は広い分析区間にすれは安定
に基本周期を得ることがてきる。しかし基本周期の変化
の多い場合、広い分析区間を用いると分析結果が実際の
音声の基本周期変化に追従したものにならない。また分
析区間内で実際の音声の基本周期が変化するためケプス
トラノ＼上での最大値の場所が不明確になり基本周期抽
出が不安定になることがある。また語尾や文末なとのよ
うに基本周期が長くなったり２０ミリ秒以上）場合に、
３０ミリ秒の固定分析長ては音声波形が１５個程度しか
入らす、ケプストラム上での最大値か不明確になり基本
周期抽出が不安定になりやすい　という問題点があった
。When analyzing stationary speech, it is possible to stably obtain the fundamental period over a wide analysis interval. However, when there are many changes in the fundamental period, if a wide analysis interval is used, the analysis result will not follow the actual changes in the fundamental period of speech. Furthermore, since the fundamental period of the actual voice changes within the analysis interval, the location of the maximum value on the cepstrano\ becomes unclear, and fundamental period extraction may become unstable. Also, when the fundamental period is long (20 milliseconds or more), such as at the end of words or sentences,
With a fixed analysis length of 30 milliseconds, only about 15 audio waveforms can be included, and the problem is that the maximum value on the cepstrum is unclear, and fundamental period extraction tends to become unstable.

第８図はこのような基本周期の抽出について説明する図
であって、（ａ）は十分基本周期が抽出できる場合を示
しており　（ｂ）は基本周期の抽出が不安定になる場合
を示している。Figure 8 is a diagram explaining extraction of such a fundamental period, where (a) shows a case where the fundamental period can be extracted sufficiently, and (b) shows a case where extraction of the fundamental period becomes unstable. ing.

本発明は上述したような従来の問題点に鑑み、実際の音
声の基本周期が変化した場合であっても安定に基本周期
を抽出することのできる音声基本周期の抽出装置を提供
することを目的としている。In view of the conventional problems as described above, an object of the present invention is to provide a speech fundamental period extraction device that can stably extract the fundamental period even when the actual fundamental period of speech changes. It is said that

［課題を解決するための手段］本発明によれば上述の目的は前記特許請求の範囲に記載
した手段により達成される。すなわち、本発明は、入力
された音声ディジタル信号を分析するそれぞれの分析区
間長の異なる複数の音声分析手段と、重複する分析区間
については上記各分析手段の内の最適なピーク検出を行
なうことのできる分析手段の出力を選択し各分析手段の
出力を結合する選択結合部と、該選択結合部の出力から
音声の基本周期に対応するピーク検出を行なう基本周期
抽出部とを具備する音声基本周期抽出装置である。[Means for Solving the Problems] According to the present invention, the above objects are achieved by the means described in the claims. That is, the present invention includes a plurality of audio analysis means for analyzing an input audio digital signal, each having a different analysis section length, and for overlapping analysis sections, performing optimum peak detection among the above-mentioned analysis sections. A speech fundamental period comprising: a selection combination section that selects the outputs of the analysis means that can be used and combines the outputs of the respective analysis means; and a fundamental period extraction section that detects a peak corresponding to the fundamental period of the speech from the output of the selection combination section. It is an extraction device.

［作　用］本発明においては、それぞれ分析区間長の異なる複数の
音声分析手段の分析結果の中から、短い基本周期のＰ４
キは短い分析区間長による分析が選択されるため高い時
間追従性が得られるとともに、長い基本周期の場合は長
い分析区間長による分析が選択されるため安定抽出が行
なえる。本方式では、分析区間長の変更を意識的に行な
うことなく等価的に（自動的に）分析区間長の変更を実
現していることになる。[Function] In the present invention, P4 with a short fundamental period is selected from among the analysis results of a plurality of speech analysis means each having a different analysis section length.
Key is that analysis with a short analysis interval length is selected, so high time followability can be obtained, and in the case of a long fundamental period, analysis with a long analysis interval length is selected, so stable extraction can be performed. In this method, the analysis interval length is equivalently (automatically) changed without consciously changing the analysis interval length.

本発明の作用について、例えば音声分析手段としてケプ
ストラム法を採った場合について更に説明すれは、ケプ
ストラムの低ケフレンシー部分については分析区間長が
短い分析により得られたケプストラムを選択し、中ケフ
レンシー部分については分析区間長が中庸の分析により
得られたケプストラムを選択し、高ケフレンシ一部分に
ついては分析区間長が長い分析により一８＝得られたケプストラムを選択し、さらに各選択されたケ
プストラムを結合し新たに選択結合ケプストラムを作成
し、最後にこの選択結合ケプストラム上で基本周期抽出
を行なっている。To further explain the operation of the present invention, for example, when the cepstrum method is adopted as a speech analysis means, a cepstrum obtained by analysis with a short analysis interval length is selected for the low quefrency part of the cepstrum, and a cepstrum obtained by analysis with a short analysis interval length is selected for the low quefrency part of the cepstrum, and for the middle quefrency part. Select a cepstrum obtained by an analysis with a medium analysis interval length, and for a part with high quefrency, select a cepstrum obtained by an analysis with a long analysis interval length, and then combine each selected cepstrum to create a new one. A selectively combined cepstrum is created, and finally fundamental period extraction is performed on this selectively combined cepstrum.

これにより前述のように分析区間長を自動的に最適なも
のとする制御が実現される。As a result, control for automatically optimizing the analysis interval length is realized as described above.

［実施例］第１図は本発明の第１の実施例の構成を示すブロック図
であって、ケプストラム法による音声基本周期の抽出装
置の例を示している。同図において、１はマイクロホン
、２はＡＤ変換部を表している。また、３−１〜３−ｎ
はそれぞれスペクトル分析部であり、各スペクトル分析
部は従来のスペクトル分析部と同じ機能であるが分析区
間長が異なっていて、スペクトル分析部３−１は最も短
い分析区間長を有し、他のスペクトル分析部は番号順に
より長い分析区間長を有し、スペクトル分析部３−ｎが
最も長い分析区間長を有している。各分析区間の相対関
係を第２図に示す。各分析区間は同図に示すように音声
の基本周期を分析しない時間位置を中心に両側に徐々に
分析区間長を長くしたものになりいる。４−１〜４−ｎ
は対数変換部であり、従来例の対数変換部と全く同じ機
能のものである。５−１〜５−ｎはゲプストラム分析部
群であり、各スペクトル分析部は従来のケプストラム分
析部と同し機能であるが、前記スペクトル分析部群と同
様にそれぞれ分析化区間長が異なっていて、ケプストラ
ム分析部５−１は最も短い分析区間長を有し、他のケプ
ストラム分析部は番号順により長い分析区間長を有し、
ケプストラム分析部５−ｎは最も長い分析区間長を有す
る。６はケプストラノ＼選択結き部てあり、ケプストラ
ム分析部５−１〜５−「ｌから得られたゲプスドラム群
から目的の選択結きケプストラムを作成する。選択結合
の処理の例を第３図に示す。[Embodiment] FIG. 1 is a block diagram showing the configuration of a first embodiment of the present invention, and shows an example of an apparatus for extracting fundamental periods of speech using the cepstral method. In the figure, 1 represents a microphone, and 2 represents an AD converter. Also, 3-1 to 3-n
are respectively spectrum analysis units, and each spectrum analysis unit has the same function as a conventional spectrum analysis unit, but has different analysis interval lengths, and the spectrum analysis unit 3-1 has the shortest analysis interval length, and the other The spectrum analysis sections have longer analysis section lengths in numerical order, and spectrum analysis section 3-n has the longest analysis section length. Figure 2 shows the relative relationship between each analysis section. As shown in the figure, each analysis section is such that the length of the analysis section is gradually increased on both sides of the time position where the fundamental period of the voice is not analyzed. 4-1 to 4-n
is a logarithmic conversion section, which has exactly the same function as the logarithmic conversion section of the conventional example. 5-1 to 5-n are a group of gepstrum analysis units, and each spectrum analysis unit has the same function as a conventional cepstrum analysis unit, but like the spectrum analysis unit group, each has a different analysis interval length. , the cepstrum analysis unit 5-1 has the shortest analysis interval length, and the other cepstrum analysis units have longer analysis interval lengths in numerical order,
The cepstrum analysis section 5-n has the longest analysis interval length. 6 is a cepstrano\selection connection section, and a cepstrum analysis section 5-1 to 5-5-" creates a desired selection connection cepstrum from the gepstrum group obtained from 5-1. An example of the selection connection process is shown in Fig. 3. show.

７は基本周期抽出部であり、従来の基本周期抽出部と同
じ機能のものであるが、ケプストラム選択結合部６で得
られた選択結合ケプストラｌ＼の最大値を求め、その最
大値の位置を基本周期とする。Reference numeral 7 denotes a fundamental period extraction section, which has the same function as the conventional fundamental period extraction section, but calculates the maximum value of the selectively combined cepstra l\ obtained by the cepstrum selection and combination section 6, and determines the position of the maximum value. This is the basic period.

第４図は本発明の第２の実施例の構成を示すブロック図
であって、自己相関分析による音声基本周期の抽出装置
の例を示している。FIG. 4 is a block diagram showing the configuration of a second embodiment of the present invention, and shows an example of an apparatus for extracting fundamental periods of speech using autocorrelation analysis.

同図において、ｌ、２は第１図と同様であって、８−１
〜８−ｎは自己相関分析部群であり、各自己相関分析部
では、前記第２図で示されたような分析区間長の異なる
自己相関分析を行なう。具体的には、ＡＤ変換された音
声信号を（×１）とし、分析区間切り出しの影響を除去
するための窓関数を　（ωｌ１ｌ−０＋Ｌ〜＋　　（１
−は分析区間長）とすると分析区間長がＬの場合の自己
相関関数（σ４．Ｌ）は次の式■で計算される。ｎは分
析中心位置である。In the figure, l and 2 are the same as in Figure 1, and 8-1
8-n is a group of autocorrelation analysis units, and each autocorrelation analysis unit performs autocorrelation analysis with different analysis interval lengths as shown in FIG. Specifically, the AD-converted audio signal is set to (×1), and the window function for removing the influence of cutting out the analysis section is (ωl1l-0+L~+ (1
- is the analysis interval length), the autocorrelation function (σ4.L) when the analysis interval length is L is calculated by the following formula (2). n is the analysis center position.

９は自己相関選択結合部であり、前述したゲブストラム
分析の場合のケプストラム選択結合部と同様な機能を有
するものである。Reference numeral 9 denotes an autocorrelation selection combination unit, which has the same function as the cepstrum selection combination unit in the case of the Gebstral analysis described above.

また、基本周期抽出部１０も前記第１の実施例の場合と
同様な機能を有するものである。Further, the fundamental period extraction section 10 also has the same function as in the first embodiment.

第５図は本発明の第３の実施例のブロック図であって、
変形自己相関分析による音声基本周期の抽出装置の例に
ついて示している。FIG. 5 is a block diagram of a third embodiment of the present invention,
An example of a device for extracting fundamental periods of speech using modified autocorrelation analysis is shown.

同図において、１．２は、第１図あるいは第４図の場合
と同様であり、１１−１〜１１−ｎはそれぞれ自己相関
分析部であって、これも前記、第２の実施例の場合と同
様のものである。In the figure, 1.2 is the same as in FIG. 1 or 4, and 11-1 to 11-n are autocorrelation analysis units, which are also similar to those in the second embodiment. It is similar to the case.

１２−１〜１２−ｎは線形予測分析部であって各自己相
関分析部で得られた自己相関関数（σ１、Ｌ）より式■
を満たす（α１．Ｌ）を計算する。12-1 to 12-n are linear prediction analysis units, and the formula ■ is calculated from the autocorrelation function (σ1, L) obtained by each autocorrelation analysis unit.
Calculate (α1.L) that satisfies the following.

この計算にはＬｅｖｉｎｓｏｎ法か利用できる。The Levinson method can be used for this calculation.

（式■）つぎに、式■により　（α１．Ｌ）の相関係数であるＡ
パラメーターを計算する。(Formula ■) Next, by Formula ■, A which is the correlation coefficient of (α1.L)
Calculate parameters.

１３−１〜１３−〇はそれぞれは変形自己相関分析部で
あり、式■に従って変形自己相関分析部＋Ｗ、、Ｌ）を
計算する。Each of 13-1 to 13-0 is a modified autocorrelation analysis section, which calculates the modified autocorrelation analysis section +W, , L) according to equation (2).

−１−Σ　＾Ｊ＋Ｌσ、−１（式ＩＸ）１４は自己相関
選択結合部てあり、ケプストム分析の場きのケプスラム
選択結き部６に相当する。-1-Σ ^J+Lσ, -1 (Formula IX) 14 is an autocorrelation selection coupling part, which corresponds to the cepslum selection coupling part 6 in the case of cepstom analysis.

［発明の効果］以上説明したように本発明によれば、変化の激しい音声
の基本周波数に柔軟に追従てき、しかも安定に基本周波
数を抽出てきる音声の基本周波数抽出装置を実現できる
。[Effects of the Invention] As described above, according to the present invention, it is possible to realize a fundamental frequency extracting device for speech that can flexibly follow the fundamental frequency of speech that changes drastically and can extract the fundamental frequency stably.

本発明の方式は従来の場合に比し、若干処理量は増大す
るものの、複雑なアルゴリズムなしに非常に優れた基本
周期抽出装置を実現てきる。Although the method of the present invention requires a slight increase in the amount of processing compared to the conventional method, it is possible to realize an extremely excellent fundamental period extraction device without complicated algorithms.

そして、この程度の処理量の増大は、近年のＬＳＩ技術
によれは全く問題となるものではない。This level of increase in processing amount is not a problem at all with recent LSI technology.

[Brief explanation of the drawing]

第１図は本発明の第１の実施例の構成を示すブロック図
、第２図は各スペクトル分析部の分析区間の相対関係を
示す図、第３図はケプストラムの選択結合の処理の例を
示す図、第４図は本発明の第２の実施例の構成を示すブ
ロック図、第５図は本発明の第３の実施例のブロック図
、第６図は従来のケプストラム法による基本周期抽出方
式の例を示す図、第７図は対数変換された音声パワース
ペクトルとケプストラムの例を示す図、第８図は基本周
期の抽出について説明する図である。ｌ　・マイクロホン、２・・・−・ＡＤ変換部、３−１
〜３−ｎ−・・・・スペクトル分析部、４−１〜４−ロ
ー・・対数変換部、５−１〜５−１１　　ケプストラム
分析部、６・・・・ケプストラム選択結合部、　７．１
Ｏ５１５・・・・・・基本周期抽出部、８−１〜８−ｎ
、１１−１〜１１−ｎ・・・・・・自己相関分析部、９
・・・・自己相関選択結合部、１２−１〜１２−ｎ　・
・線形予測分析部、１３−１〜１３−ｎ　・変形自己相
関分析部、１４・・・・・変形自己相関選択結合部＼、代理人　弁理士　井　桁　貞　−。４＼Ｘ　、。＼□□ （ａ）　　分断区間新同期の抽出１第　ａ（ｂ）分析ｙ間二ついて説明マる回図FIG. 1 is a block diagram showing the configuration of the first embodiment of the present invention, FIG. 2 is a diagram showing the relative relationship between the analysis intervals of each spectrum analysis section, and FIG. 3 is an example of cepstrum selective combination processing. 4 is a block diagram showing the configuration of the second embodiment of the present invention, FIG. 5 is a block diagram of the third embodiment of the present invention, and FIG. 6 is fundamental period extraction using the conventional cepstral method. FIG. 7 is a diagram showing an example of the method, FIG. 7 is a diagram showing an example of a logarithmically transformed audio power spectrum and cepstrum, and FIG. 8 is a diagram explaining extraction of the fundamental period. l ・Microphone, 2...-AD converter, 3-1
~3-n-... Spectral analysis unit, 4-1 to 4-Rho... Logarithmic conversion unit, 5-1 to 5-11 Cepstrum analysis unit, 6... Cepstrum selection combination unit, 7.1
O515... Fundamental period extraction unit, 8-1 to 8-n
, 11-1 to 11-n... Autocorrelation analysis section, 9
...Autocorrelation selection combination unit, 12-1 to 12-n ・
・Linear prediction analysis section, 13-1 to 13-n ・Modified autocorrelation analysis section, 14...Modified autocorrelation selection combination section\, Agent Patent attorney Sada Igeta -. 4\X,. ＼□□ (a) Extraction of new synchronization in divided section 1 Part a (b) Diagram with two explanations between analysis y

Claims

[Scope of Claims] 1. A plurality of audio analysis means each having a different analysis section length for analyzing an input audio digital signal, and for overlapping analysis sections, optimal peak detection among the above-mentioned analysis means is performed. A selection combination unit that selects the outputs of the analysis means that can perform the analysis and combines the outputs of the analysis means, and a fundamental period extraction unit that detects a peak corresponding to the fundamental period of the voice from the output of the selection combination unit. Characteristic voice fundamental period extraction device. 2. The speech fundamental period extraction device according to claim 1, wherein each of the plurality of speech analysis means comprises a respective spectrum analysis section, a logarithmic conversion section, and a cepstrum analysis section, and the selective combination section is a cepstrum selection combination section. 3. The speech fundamental period extraction device according to claim 1, wherein each of the plurality of speech analysis means is an analysis means based on autocorrelation analysis, and the selective combination section is an autocorrelation selection combination section. 4. The speech fundamental period extraction device according to claim 1, wherein each of the plurality of speech analysis means includes an autocorrelation analysis section, a linear prediction analysis section, and a modified autocorrelation analysis section, and the selective combination section is a modified autocorrelation selection combination section. 5. The speech fundamental period extraction device according to claim 1, wherein the plurality of analysis sections have the same center position and different analysis section lengths.