JPH024920B2 - - Google Patents
Info
- Publication number
- JPH024920B2 JPH024920B2 JP58117712A JP11771283A JPH024920B2 JP H024920 B2 JPH024920 B2 JP H024920B2 JP 58117712 A JP58117712 A JP 58117712A JP 11771283 A JP11771283 A JP 11771283A JP H024920 B2 JPH024920 B2 JP H024920B2
- Authority
- JP
- Japan
- Prior art keywords
- phoneme
- speech
- similarity
- speech unit
- acoustic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 239000011159 matrix material Substances 0.000 claims 7
- 238000001514 detection method Methods 0.000 claims 2
Description
【発明の詳細な説明】 産業上の利用分野 本発明は音声認識装置に関するものである。[Detailed description of the invention] Industrial applications The present invention relates to a speech recognition device.
従来例の構成とその問題点
音声認識装置は、人間の音声命令によつて対象
の機器の動作を制御することを目的とする命令入
力装置の一種であり、○
覗犧郛紊侶盈Configuration of conventional examples and their problems A voice recognition device is a type of command input device whose purpose is to control the operation of target equipment using human voice commands.
Claims (1)
して音韻系列を出力する音韻識別手段と、認識し
ようとする音声単位が有すると予想される標準的
な音韻系列が予め登録された音声単位標準パター
ンと、各音韻間の類似性を表わす音韻類似度行列
と、前記音韻識別手段から出力される音韻系列と
前記音声単位標準パターンと前記音韻類似度行列
とを用いて入力音声の音韻性と前記音声単位標準
パターンの音韻性との間の類似度を前記短時間区
間の音韻毎に表現する音声単位音韻比較行列と、
この音声単位音韻比較行列によつて表わされると
ころの音韻類似性を音声単位毎に求める音声単位
類似度計算手段とを備え、予め音韻別に雑音の強
度・特性に応じた複数組の重み付け係数を用意し
ておき、入力音声に重畳する雑音の強度及び特性
を検出する雑音検出手段から得られる情報に基づ
いて、適宜特定の重み付け係数の組を選択し、前
記音声単位類似度計算手段により前記重み付け係
数を用いて音声単位毎に音韻類似度を計算する構
成とした音声認識装置。 2 入力音声を任意の短時間区間毎に音響分析し
てその音響特徴系列を出力する音響分析手段と、
認識しようとする音声単位が有すると予想される
標準的な音響特徴系列が予め登録された音声単位
標準パターンと、前記音響分析手段から出力され
る音響特徴系列と前記音声単位標準パターンとを
用いて入力音声と音声単位標準パターンとの間の
類似度を前記短時間区間の入力音声毎に表現する
音声単位音響比較行列と、この音声単位音響比較
行列によつて表わされるところの音響類似度を音
声単位毎に求める音声単位類似度計算手段とを備
え、予め音響特徴量別に雑音の強度・特性に応じ
た複数組の重み付け係数を用意しておき、入力音
声に重畳する雑音の強度及び特性を検出する雑音
検出手段から得られる情報に基づいて、適宜特定
の重み付け係数の組を選択し、前記音声単位類似
度計算手段により前記重み付け係数を用いて音声
単位毎に音響類似度を計算する構成とした音声認
識装置。[Claims] 1. A phoneme identification means for identifying the phonemes of an input speech every arbitrary short time interval and outputting a phoneme sequence, and a standard phoneme sequence expected to be possessed by a phonetic unit to be recognized. Using a pre-registered phoneme unit standard pattern, a phoneme similarity matrix representing the similarity between each phoneme, a phoneme sequence output from the phoneme identification means, the phoneme standard pattern, and the phoneme similarity matrix, a speech unit phoneme comparison matrix that expresses the degree of similarity between the phonology of the input speech and the phonology of the speech unit standard pattern for each phoneme of the short time interval;
A speech unit similarity calculating means is provided for calculating the phoneme similarity represented by this speech unit phoneme comparison matrix for each speech unit, and multiple sets of weighting coefficients are prepared in advance for each phoneme according to noise intensity and characteristics. Based on the information obtained from the noise detection means that detects the intensity and characteristics of the noise superimposed on the input speech, a specific set of weighting coefficients is appropriately selected, and the weighting coefficients are determined by the speech unit similarity calculation means. A speech recognition device configured to calculate phonological similarity for each speech unit using . 2. acoustic analysis means for acoustically analyzing input speech for each arbitrary short time interval and outputting the acoustic feature sequence;
A speech unit standard pattern in which a standard acoustic feature sequence that is expected to be possessed by the speech unit to be recognized is registered in advance, an acoustic feature series output from the acoustic analysis means, and the speech unit standard pattern are used. A speech unit acoustic comparison matrix that expresses the similarity between the input speech and the speech unit standard pattern for each input speech in the short time period, and a speech unit acoustic comparison matrix that expresses the acoustic similarity expressed by this speech unit acoustic comparison matrix. It is equipped with a speech unit similarity calculation means that is calculated for each unit, prepares in advance multiple sets of weighting coefficients according to the intensity and characteristics of noise for each acoustic feature value, and detects the intensity and characteristics of noise superimposed on input speech. Based on the information obtained from the noise detection means, a specific set of weighting coefficients is selected as appropriate, and the acoustic similarity calculation means calculates acoustic similarity for each speech unit using the weighting coefficients. Speech recognition device.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP58117712A JPS607496A (en) | 1983-06-28 | 1983-06-28 | voice recognition device |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP58117712A JPS607496A (en) | 1983-06-28 | 1983-06-28 | voice recognition device |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JPS607496A JPS607496A (en) | 1985-01-16 |
| JPH024920B2 true JPH024920B2 (en) | 1990-01-30 |
Family
ID=14718435
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP58117712A Granted JPS607496A (en) | 1983-06-28 | 1983-06-28 | voice recognition device |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPS607496A (en) |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS61143070A (en) * | 1985-11-29 | 1986-06-30 | 帝人株式会社 | Heat sterilization of artificial kidney |
| JPS61143072A (en) * | 1985-11-29 | 1986-06-30 | 帝人株式会社 | Heat sterilization of artificial kidney |
| JPS61143071A (en) * | 1985-11-29 | 1986-06-30 | 帝人株式会社 | Heat sterilization of artificial kidney |
| JPS6343669A (en) * | 1986-08-08 | 1988-02-24 | 帝人株式会社 | Production of blood treatment device |
| JPH0426900A (en) * | 1990-05-22 | 1992-01-30 | Nec Corp | Voice recognition device |
-
1983
- 1983-06-28 JP JP58117712A patent/JPS607496A/en active Granted
Also Published As
| Publication number | Publication date |
|---|---|
| JPS607496A (en) | 1985-01-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Bou-Ghazale et al. | A comparative study of traditional and newly proposed features for recognition of speech under stress | |
| JPWO2003015076A1 (en) | Dog emotion discrimination device and method based on voice feature analysis | |
| EP1022722A3 (en) | Speaker adaptation based on eigenvoices | |
| Shanthi Therese et al. | Review of feature extraction techniques in automatic speech recognition | |
| JPH024920B2 (en) | ||
| CN86100298A (en) | Speech Recognition | |
| Zhou et al. | Linear and nonlinear speech feature analysis for stress classification. | |
| Dai et al. | An improved model of masking effects for robust speech recognition system | |
| JPH0766734A (en) | Equipment and method for voice coding | |
| JP3083915U (en) | Dog emotion discrimination device based on phonetic feature analysis of call | |
| Koniaris et al. | Selecting static and dynamic features using an advanced auditory model for speech recognition | |
| JP2658426B2 (en) | Voice recognition method | |
| JPH07210197A (en) | Method of identifying speaker | |
| JPH03276199A (en) | Speaker recognizing system | |
| Xue | Joint speech and spearker recognition using neural networks | |
| JPH04219798A (en) | Speaker adaptive voice recognizing method and device | |
| Jing et al. | Auditory-modeling inspired methods of feature extraction for robust automatic speech recognition | |
| KR100488121B1 (en) | Speaker verification apparatus and method applied personal weighting function for better inter-speaker variation | |
| JPS6148898A (en) | Voiced/unvoiced determination device | |
| Milinković | Pure software-based speech recognition for os-less embedded systems | |
| Ramos et al. | Voice Detection with Noise Reduction using Dynamic Time Warping and Mel-Frequency Cepstral Coefficients Algorithm Applied to Home Automation | |
| JPS59180598A (en) | Voice input system | |
| Hermansky | Data-guided processing of speech | |
| Abdul-Kadir et al. | Vowel effects towards dental Arabic consonants based on spectrogram | |
| JPH087599B2 (en) | Speech recognition method |