JPH11507443A - 話者確認システム - Google Patents
話者確認システムInfo
- Publication number
- JPH11507443A JPH11507443A JP9501618A JP50161897A JPH11507443A JP H11507443 A JPH11507443 A JP H11507443A JP 9501618 A JP9501618 A JP 9501618A JP 50161897 A JP50161897 A JP 50161897A JP H11507443 A JPH11507443 A JP H11507443A
- Authority
- JP
- Japan
- Prior art keywords
- speaker
- feature
- voice
- classifier
- classifiers
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000000034 method Methods 0.000 claims abstract description 85
- 238000005070 sampling Methods 0.000 claims abstract description 5
- 239000013598 vector Substances 0.000 claims description 45
- 238000012795 verification Methods 0.000 claims description 27
- 238000000605 extraction Methods 0.000 claims description 17
- 230000001419 dependent effect Effects 0.000 claims description 13
- 230000001537 neural effect Effects 0.000 claims description 9
- 238000012549 training Methods 0.000 abstract description 32
- 230000004927 fusion Effects 0.000 abstract description 18
- 238000012360 testing method Methods 0.000 abstract description 13
- 238000001914 filtration Methods 0.000 abstract description 9
- 230000000694 effects Effects 0.000 abstract description 8
- 238000013459 approach Methods 0.000 abstract description 2
- 238000012790 confirmation Methods 0.000 description 19
- 238000010586 diagram Methods 0.000 description 17
- 238000001228 spectrum Methods 0.000 description 13
- 238000010606 normalization Methods 0.000 description 10
- 238000012545 processing Methods 0.000 description 9
- 239000011159 matrix material Substances 0.000 description 8
- 238000013528 artificial neural network Methods 0.000 description 7
- 238000003909 pattern recognition Methods 0.000 description 7
- 230000003595 spectral effect Effects 0.000 description 7
- 230000009466 transformation Effects 0.000 description 7
- 238000007689 inspection Methods 0.000 description 6
- 230000003044 adaptive effect Effects 0.000 description 5
- 230000004044 response Effects 0.000 description 4
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 210000002364 input neuron Anatomy 0.000 description 3
- 238000013138 pruning Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000012952 Resampling Methods 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000008602 contraction Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000002790 cross-validation Methods 0.000 description 1
- 238000003066 decision tree Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000012850 discrimination method Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 210000004205 output neuron Anatomy 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/10—Multimodal systems, i.e. based on the integration of multiple recognition engines or fusion of expert systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/20—Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/14—Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Image Analysis (AREA)
- Circuit For Audible Band Transducer (AREA)
- Electrically Operated Instructional Devices (AREA)
- Selective Calling Equipment (AREA)
- Traffic Control Systems (AREA)
- Electric Propulsion And Braking For Vehicles (AREA)
- Train Traffic Observation, Control, And Security (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Eye Examination Apparatus (AREA)
Abstract
Description
Claims (1)
- 【特許請求の範囲】 1.話者確認方法であって、 前記話者が発音した第1音声から少なくとも1つの特徴と抽出するステップと 、 複数の分類出力を形成するための複数の分類部によって、前記少なくとも1つ の特徴を分類する手段と、 前記複数の分類出力および予め前記話者が発音した第2音声の類似性を判定す ることによって、前記複数の分類出力を認識する手段と、 前記認識した複数の分類出力から、前記話者を受認するかまたは拒絶するかに ついて判定を行う手段と から成ることを特徴とする方法。 2.請求項1記載の方法において、該方法は更に、 前記認識した複数の分類出力から信頼度を判定するステップ を備えていることを特徴とする方法。 3.請求項2記載の方法において、前記少なくとも1つの特徴を分類するステッ プの前に、前記方法は更に、 前記少なくとも1つの特徴を、予め記憶されている前記話者に対するデータと 比較することによって、前記話者が発音した前記第1音声に対してワード認識を 行い、前記話者を暫定的に受認するか、または暫定的に拒絶するかについて判定 を行うステップと、 前記話者を暫定的に受認すると判定した場合に、前記少なくとも1つの特徴を 分類する前記ステップをイネーブルし、または前記話者を暫定的に拒絶すると判 定した場合に、取り消しモジュールをイネーブルするステップと を備えていることを特徴とする方法。 4.請求項3記載の方法において、前記第1音声は、前記話者に対するパスワー ドの少なくとも1つの発声から成ることを特徴とする方法。 5.請求項4記載の方法において、前記データは、予め前記話者が発音した第1 音声から形成された話者依存テンプレートと、予め少なくともひとりの第2話者 が発音した第1音声によって形成された話者独立テンプレートとから成ることを 特徴とする方法。 6.請求項1記載の方法において、前記分類ステップは、ニューラル・ツリー・ ネットワーク(NTN)分類部および動的時間ワープ分類部によって実行するこ とを特徴とする方法。 7.請求項1記載の方法において、前記分類ステップは、改良ニューラル・ツリ ー・ネットワーク(MNTN)および動的時間ワープ分類部によって実行するこ とを特徴とする方法。 8.請求項7記載の方法において、前記MNTN分類部の話者スコアは、clは 話者Siに対する信頼度スコア、c0は他の全話者に対する信頼度スコア、Mおよ びNはそれぞれ"1"および"0"と分類されたベクトルの数に対応するとしたとき、 によって、定義されることを特徴とする方法。 9.請求項1記載の方法において、前記認識ステップは、 前記複数の分類部の内1対に、前記話者に対する音声の複数の第1発声を印加 し、抜き取り発声と定義された、前記発声の1つを抜き取り、 前記抜き取った音声を前記分類部対に印加し、 前記分類部対において、前記分類部の各々について確率を計算し、 前記確率から、前記分類部対内の前記分類部の各々についてスレシホルドを決 定すること によって訓練され、 前記複数の分類出力の前記類似性は、前記分類部を前記スレシホルドと比較す ることによって判定される ことを特徴とする方法。 10.請求項1記載の方法において、前記抽出ステップは、前記第1および第2 音声のポール・フィルタ処理を行い、前記少なくとも1つの特徴を抽出すること によって実行することを特徴とする方法。 11.請求項1記載の方法において、該方法はさらに、 前記抽出ステップの後に、前記少なくとも1つの特徴をサブワードに細分する ステップ を含むことを特徴とする方法。 12.請求項11記載の方法において、前記サブワードは音素であることを特徴 とする方法。 13.請求項12記載の方法において、前記サブワードは話者に依存することを 特徴とする方法。 14.請求項12記載の方法において、前記サブワードは話者に独立であること を特徴とする方法。 15.請求項1記載の方法において、前記少なくとも1つの特徴は、疑似マップ 変形を用いて補正されることを特徴とする方法。 16.話者確認システムであって、 前記話者が発音した第1音声から少なくとも1つの特徴と抽出する手段と、 複数の分類出力を形成するための複数の分類部によって、前記少なくとも1つ の特徴を分類する手段と、 前記複数の分類出力および予め前記話者が発音した第2音声の類似性を判定す ることによって、前記複数の分類出力を認識する手段と、 前記認識した複数の分類出力から、前記話者を受認するかまたは拒絶するかに ついて判定を行う手段と から成ることを特徴とするシステム。 17.請求項16記載のシステムにおいて、該システムは更に、 前記少なくとも1つの特徴を、予め記憶されている前記話者に対するデータと 比較することによって、前記話者が発音した前記第1音声に対してワード認識を 行い、前記話者を暫定的に受認するか、または暫定的に拒絶するかについて判定 を行う手段と、 前記話者を暫定的に受認すると判定した場合に、前記少なくとも1つの特徴を 分類する前記手段をイネーブルし、または前記話者を暫定的に拒絶すると判定し た場合に、取り消しモジュールをイネーブルする手段と を備えていることを特徴とするシステム。 18.請求項17記載のシステムにおいて、前記データは、予め前記話者が発音 した第1音声から形成された話者依存テンプレートと、予め少なくともひとりの 第2話者が発音した第1音声によって形成された話者独立テンプレートとから成 ることを特徴とするシステム。 19.請求項18記載のシステムにおいて、前記分類手段は、改良ニューラル・ ツリー・ネットワーク(MNTN)および動的時間ワープ分類部から成ることを 特徴とするシステム。 20.請求項19記載のシステムにおいて、前記抽出手段は、全ポール・フィル タと共に動作することを特徴とするシステム。 21.請求項20記載のシステムにおいて、前記少なくとも1つの特徴は、疑似 変形を用いて補正されることを特徴とするシステム。
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/479,012 US5839103A (en) | 1995-06-07 | 1995-06-07 | Speaker verification system using decision fusion logic |
| US08/479,012 | 1995-06-07 | ||
| PCT/US1996/009260 WO1996041334A1 (en) | 1995-06-07 | 1996-06-06 | Speaker verification system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| JPH11507443A true JPH11507443A (ja) | 1999-06-29 |
Family
ID=23902297
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP9501618A Ceased JPH11507443A (ja) | 1995-06-07 | 1996-06-06 | 話者確認システム |
Country Status (16)
| Country | Link |
|---|---|
| US (1) | US5839103A (ja) |
| EP (1) | EP0870300B1 (ja) |
| JP (1) | JPH11507443A (ja) |
| KR (1) | KR19990022391A (ja) |
| CN (1) | CN1197526A (ja) |
| AT (1) | ATE323934T1 (ja) |
| AU (1) | AU711496B2 (ja) |
| CA (1) | CA2221415A1 (ja) |
| DE (1) | DE69636057T2 (ja) |
| FI (1) | FI117954B (ja) |
| IL (1) | IL122354A (ja) |
| NO (1) | NO321125B1 (ja) |
| NZ (1) | NZ311289A (ja) |
| RU (1) | RU2161336C2 (ja) |
| TR (1) | TR199701555T1 (ja) |
| WO (1) | WO1996041334A1 (ja) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008126627A1 (ja) * | 2007-03-26 | 2008-10-23 | Nec Corporation | 音声分類装置、音声分類方法、および音声分類用プログラム |
Families Citing this family (108)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5937381A (en) * | 1996-04-10 | 1999-08-10 | Itt Defense, Inc. | System for voice verification of telephone transactions |
| US6038528A (en) * | 1996-07-17 | 2000-03-14 | T-Netix, Inc. | Robust speech processing with affine transform replicated data |
| US6003002A (en) * | 1997-01-02 | 1999-12-14 | Texas Instruments Incorporated | Method and system of adapting speech recognition models to speaker environment |
| US6076055A (en) * | 1997-05-27 | 2000-06-13 | Ameritech | Speaker verification method |
| US7630895B2 (en) * | 2000-01-21 | 2009-12-08 | At&T Intellectual Property I, L.P. | Speaker verification method |
| CA2304747C (en) * | 1997-10-15 | 2007-08-14 | British Telecommunications Public Limited Company | Pattern recognition using multiple reference models |
| US6519561B1 (en) * | 1997-11-03 | 2003-02-11 | T-Netix, Inc. | Model adaptation of neural tree networks and other fused models for speaker verification |
| US6233555B1 (en) * | 1997-11-25 | 2001-05-15 | At&T Corporation | Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models |
| US6243695B1 (en) * | 1998-03-18 | 2001-06-05 | Motorola, Inc. | Access control system and method therefor |
| EP1072035A1 (en) * | 1998-04-20 | 2001-01-31 | Koninklijke KPN N.V. | Theshold setting and training of a speaker verification system |
| AU3889799A (en) * | 1998-05-08 | 1999-11-29 | T-Netix, Inc. | Channel estimation system and method for use in automatic speaker verification systems |
| JP3090119B2 (ja) * | 1998-05-15 | 2000-09-18 | 日本電気株式会社 | 話者照合装置、方法及び記憶媒体 |
| DE19824353A1 (de) * | 1998-05-30 | 1999-12-02 | Philips Patentverwaltung | Vorrichtung zur Verifizierung von Signalen |
| DE19824354A1 (de) * | 1998-05-30 | 1999-12-02 | Philips Patentverwaltung | Vorrichtung zur Verifizierung von Signalen |
| US6178400B1 (en) * | 1998-07-22 | 2001-01-23 | At&T Corp. | Method and apparatus for normalizing speech to facilitate a telephone call |
| TW418383B (en) * | 1998-09-23 | 2001-01-11 | Ind Tech Res Inst | Telephone voice recognition system and method and the channel effect compensation device using the same |
| US6411930B1 (en) * | 1998-11-18 | 2002-06-25 | Lucent Technologies Inc. | Discriminative gaussian mixture models for speaker verification |
| JP2000200098A (ja) * | 1999-01-07 | 2000-07-18 | Sony Corp | 学習装置および学習方法、並びに認識装置および認識方法 |
| JP2000259198A (ja) * | 1999-03-04 | 2000-09-22 | Sony Corp | パターン認識装置および方法、並びに提供媒体 |
| US20010044818A1 (en) * | 2000-02-21 | 2001-11-22 | Yufeng Liang | System and method for identifying and blocking pornogarphic and other web content on the internet |
| US6735562B1 (en) * | 2000-06-05 | 2004-05-11 | Motorola, Inc. | Method for estimating a confidence measure for a speech recognition system |
| US6735563B1 (en) * | 2000-07-13 | 2004-05-11 | Qualcomm, Inc. | Method and apparatus for constructing voice templates for a speaker-independent voice recognition system |
| US6671669B1 (en) * | 2000-07-18 | 2003-12-30 | Qualcomm Incorporated | combined engine system and method for voice recognition |
| US6728674B1 (en) * | 2000-07-31 | 2004-04-27 | Intel Corporation | Method and system for training of a classifier |
| US20040190688A1 (en) * | 2003-03-31 | 2004-09-30 | Timmins Timothy A. | Communications methods and systems using voiceprints |
| US20020147694A1 (en) * | 2001-01-31 | 2002-10-10 | Dempsey Derek M. | Retraining trainable data classifiers |
| US6792434B2 (en) * | 2001-04-20 | 2004-09-14 | Mitsubishi Electric Research Laboratories, Inc. | Content-based visualization and user-modeling for interactive browsing and retrieval in multimedia databases |
| GB0112749D0 (en) * | 2001-05-25 | 2001-07-18 | Rhetorical Systems Ltd | Speech synthesis |
| EP1399915B1 (en) * | 2001-06-19 | 2009-03-18 | Speech Sentinel Limited | Speaker verification |
| US20050055208A1 (en) * | 2001-07-03 | 2005-03-10 | Kibkalo Alexandr A. | Method and apparatus for fast calculation of observation probabilities in speech recognition |
| US7493258B2 (en) * | 2001-07-03 | 2009-02-17 | Intel Corporation | Method and apparatus for dynamic beam control in Viterbi search |
| RU2276810C2 (ru) * | 2001-07-03 | 2006-05-20 | Интел Зао | Способ и устройство для динамической регулировки луча в поиске по витерби |
| US7844476B2 (en) | 2001-12-31 | 2010-11-30 | Genworth Financial, Inc. | Process for case-based insurance underwriting suitable for use by an automated system |
| US7895062B2 (en) | 2001-12-31 | 2011-02-22 | Genworth Financial, Inc. | System for optimization of insurance underwriting suitable for use by an automated system |
| US7899688B2 (en) | 2001-12-31 | 2011-03-01 | Genworth Financial, Inc. | Process for optimization of insurance underwriting suitable for use by an automated system |
| US7818186B2 (en) | 2001-12-31 | 2010-10-19 | Genworth Financial, Inc. | System for determining a confidence factor for insurance underwriting suitable for use by an automated system |
| US8793146B2 (en) * | 2001-12-31 | 2014-07-29 | Genworth Holdings, Inc. | System for rule-based insurance underwriting suitable for use by an automated system |
| US8005693B2 (en) | 2001-12-31 | 2011-08-23 | Genworth Financial, Inc. | Process for determining a confidence factor for insurance underwriting suitable for use by an automated system |
| US7630910B2 (en) | 2001-12-31 | 2009-12-08 | Genworth Financial, Inc. | System for case-based insurance underwriting suitable for use by an automated system |
| US7844477B2 (en) | 2001-12-31 | 2010-11-30 | Genworth Financial, Inc. | Process for rule-based insurance underwriting suitable for use by an automated system |
| US20030149881A1 (en) * | 2002-01-31 | 2003-08-07 | Digital Security Inc. | Apparatus and method for securing information transmitted on computer networks |
| US6687672B2 (en) | 2002-03-15 | 2004-02-03 | Matsushita Electric Industrial Co., Ltd. | Methods and apparatus for blind channel estimation based upon speech correlation structure |
| US7424425B2 (en) * | 2002-05-19 | 2008-09-09 | International Business Machines Corporation | Optimization of detection systems using a detection error tradeoff analysis criterion |
| FR2848715B1 (fr) * | 2002-12-11 | 2005-02-18 | France Telecom | Procede et systeme de correction multi-references des deformations spectrales de la voix introduites par un reseau de communication |
| US7734025B2 (en) * | 2003-02-28 | 2010-06-08 | Grape Technology Group, Inc. | Methods and systems for providing on-line bills for use in communications services |
| US7567914B2 (en) | 2003-04-30 | 2009-07-28 | Genworth Financial, Inc. | System and process for dominance classification for insurance underwriting suitable for use by an automated system |
| US7813945B2 (en) | 2003-04-30 | 2010-10-12 | Genworth Financial, Inc. | System and process for multivariate adaptive regression splines classification for insurance underwriting suitable for use by an automated system |
| US7383239B2 (en) | 2003-04-30 | 2008-06-03 | Genworth Financial, Inc. | System and process for a fusion classification for insurance underwriting suitable for use by an automated system |
| US7801748B2 (en) | 2003-04-30 | 2010-09-21 | Genworth Financial, Inc. | System and process for detecting outliers for insurance underwriting suitable for use by an automated system |
| CN1308911C (zh) * | 2003-07-10 | 2007-04-04 | 上海优浪信息科技有限公司 | 一种说话者身份识别方法和系统 |
| US7698159B2 (en) | 2004-02-13 | 2010-04-13 | Genworth Financial Inc. | Systems and methods for performing data collection |
| US20050288930A1 (en) * | 2004-06-09 | 2005-12-29 | Vaastek, Inc. | Computer voice recognition apparatus and method |
| US7386448B1 (en) | 2004-06-24 | 2008-06-10 | T-Netix, Inc. | Biometric voice authentication |
| KR100571574B1 (ko) * | 2004-07-26 | 2006-04-17 | 한양대학교 산학협력단 | 비선형 분석을 이용한 유사화자 인식방법 및 그 시스템 |
| US7865362B2 (en) * | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition |
| US7827032B2 (en) * | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system |
| US8200495B2 (en) | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition |
| US7895039B2 (en) * | 2005-02-04 | 2011-02-22 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system |
| US7949533B2 (en) * | 2005-02-04 | 2011-05-24 | Vococollect, Inc. | Methods and systems for assessing and improving the performance of a speech recognition system |
| US7853539B2 (en) * | 2005-09-28 | 2010-12-14 | Honda Motor Co., Ltd. | Discriminating speech and non-speech with regularized least squares |
| US7539616B2 (en) * | 2006-02-20 | 2009-05-26 | Microsoft Corporation | Speaker authentication using adapted background models |
| CN101051463B (zh) * | 2006-04-06 | 2012-07-11 | 株式会社东芝 | 说话人认证的验证方法及装置 |
| CN101154380B (zh) * | 2006-09-29 | 2011-01-26 | 株式会社东芝 | 说话人认证的注册及验证的方法和装置 |
| US7822605B2 (en) * | 2006-10-19 | 2010-10-26 | Nice Systems Ltd. | Method and apparatus for large population speaker identification in telephone interactions |
| RU2351023C2 (ru) * | 2007-05-02 | 2009-03-27 | Общество с ограниченной ответственностью "Тридакна" | Способ верификации пользователя в системах санкционирования доступа |
| US8886663B2 (en) * | 2008-09-20 | 2014-11-11 | Securus Technologies, Inc. | Multi-party conversation analyzer and logger |
| US8145483B2 (en) * | 2009-08-05 | 2012-03-27 | Tze Fen Li | Speech recognition method for all languages without using samples |
| RU2419890C1 (ru) * | 2009-09-24 | 2011-05-27 | Общество с ограниченной ответственностью "Центр речевых технологий" | Способ идентификации говорящего по фонограммам произвольной устной речи на основе формантного выравнивания |
| RU2421699C1 (ru) * | 2010-05-19 | 2011-06-20 | ОБЩЕСТВО С ОГРАНИЧЕННОЙ ОТВЕТСТВЕННОСТЬЮ "Интегрированные Биометрические Решения И Системы" (ООО "ИБРиС") | Способ верификации личности по голосу на основе анатомических параметров человека |
| US8775341B1 (en) | 2010-10-26 | 2014-07-08 | Michael Lamport Commons | Intelligent control with hierarchical stacked neural networks |
| US9015093B1 (en) | 2010-10-26 | 2015-04-21 | Michael Lamport Commons | Intelligent control with hierarchical stacked neural networks |
| US20120116764A1 (en) * | 2010-11-09 | 2012-05-10 | Tze Fen Li | Speech recognition method on sentences in all languages |
| WO2012068705A1 (en) * | 2010-11-25 | 2012-05-31 | Telefonaktiebolaget L M Ericsson (Publ) | Analysis system and method for audio data |
| US8914290B2 (en) | 2011-05-20 | 2014-12-16 | Vocollect, Inc. | Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment |
| US9390445B2 (en) | 2012-03-05 | 2016-07-12 | Visa International Service Association | Authentication using biometric technology through a consumer device |
| CN102664011B (zh) * | 2012-05-17 | 2014-03-12 | 吉林大学 | 一种快速说话人识别方法 |
| EA023695B1 (ru) * | 2012-07-16 | 2016-07-29 | Ооо "Центр Речевых Технологий" | Способ распознавания речевых сообщений и устройство для его осуществления |
| US9240184B1 (en) * | 2012-11-15 | 2016-01-19 | Google Inc. | Frame-level combination of deep neural network and gaussian mixture models |
| US9230550B2 (en) * | 2013-01-10 | 2016-01-05 | Sensory, Incorporated | Speaker verification and identification using artificial neural network-based sub-phonetic unit discrimination |
| US8694315B1 (en) | 2013-02-05 | 2014-04-08 | Visa International Service Association | System and method for authentication using speaker verification techniques and fraud model |
| US9865266B2 (en) * | 2013-02-25 | 2018-01-09 | Nuance Communications, Inc. | Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system |
| US9978395B2 (en) | 2013-03-15 | 2018-05-22 | Vocollect, Inc. | Method and system for mitigating delay in receiving audio stream during production of sound from audio stream |
| US9621713B1 (en) | 2014-04-01 | 2017-04-11 | Securus Technologies, Inc. | Identical conversation detection method and apparatus |
| US10237399B1 (en) | 2014-04-01 | 2019-03-19 | Securus Technologies, Inc. | Identical conversation detection method and apparatus |
| CN103986725A (zh) * | 2014-05-29 | 2014-08-13 | 中国农业银行股份有限公司 | 一种客户端、服务器端以及身份认证系统和方法 |
| US9922048B1 (en) | 2014-12-01 | 2018-03-20 | Securus Technologies, Inc. | Automated background check via facial recognition |
| CN104410697A (zh) * | 2014-12-02 | 2015-03-11 | 广东安居宝数码科技股份有限公司 | 考勤信息的处理方法和系统 |
| JP6481939B2 (ja) * | 2015-03-19 | 2019-03-13 | 株式会社レイトロン | 音声認識装置および音声認識プログラム |
| US10133538B2 (en) * | 2015-03-27 | 2018-11-20 | Sri International | Semi-supervised speaker diarization |
| CN109313902A (zh) | 2016-06-06 | 2019-02-05 | 思睿逻辑国际半导体有限公司 | 语音用户接口 |
| US20180018973A1 (en) | 2016-07-15 | 2018-01-18 | Google Inc. | Speaker verification |
| CN106228976B (zh) * | 2016-07-22 | 2019-05-31 | 百度在线网络技术(北京)有限公司 | 语音识别方法和装置 |
| US10714121B2 (en) | 2016-07-27 | 2020-07-14 | Vocollect, Inc. | Distinguishing user speech from background speech in speech-dense environments |
| CN107886955B (zh) * | 2016-09-29 | 2021-10-26 | 百度在线网络技术(北京)有限公司 | 一种语音会话样本的身份识别方法、装置及设备 |
| US10614813B2 (en) * | 2016-11-04 | 2020-04-07 | Intellisist, Inc. | System and method for performing caller identity verification using multi-step voice analysis |
| KR102125549B1 (ko) * | 2017-04-20 | 2020-06-22 | 한국전자통신연구원 | 심층신경망 기반 음성 인식 시스템을 위한 발화 검증 방법 |
| DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
| US10957318B2 (en) * | 2018-11-02 | 2021-03-23 | Visa International Service Association | Dynamic voice authentication |
| US11024291B2 (en) | 2018-11-21 | 2021-06-01 | Sri International | Real-time class recognition for an audio stream |
| US11114103B2 (en) | 2018-12-28 | 2021-09-07 | Alibaba Group Holding Limited | Systems, methods, and computer-readable storage media for audio signal processing |
| US10891318B2 (en) * | 2019-02-22 | 2021-01-12 | United States Of America As Represented By The Secretary Of The Navy | Temporal logic fusion of real time data |
| EP3982360A4 (en) * | 2019-06-07 | 2022-06-08 | NEC Corporation | DEVICE AND METHOD FOR VOICE PROCESSING, AND NON-TRANSITORY COMPUTER READABLE MEDIA ON WHICH A PROGRAM IS STORED |
| JP7259981B2 (ja) * | 2019-10-17 | 2023-04-18 | 日本電気株式会社 | 話者認証システム、方法およびプログラム |
| JP7395960B2 (ja) * | 2019-10-30 | 2023-12-12 | 富士通株式会社 | 予測モデル説明方法、予測モデル説明プログラム、予測モデル説明装置 |
| CN111081255B (zh) * | 2019-12-31 | 2022-06-03 | 思必驰科技股份有限公司 | 说话人确认方法和装置 |
| JP7548316B2 (ja) * | 2020-08-11 | 2024-09-10 | 日本電気株式会社 | 音声処理装置、音声処理方法、プログラム、および音声認証システム |
| CN114004353B (zh) * | 2021-09-30 | 2025-02-28 | 中国科学院计算技术研究所 | 减少光器件数量的光神经网络芯片构建方法及系统 |
| CN116153336B (zh) * | 2023-04-19 | 2023-07-21 | 北京中电慧声科技有限公司 | 一种基于多域信息融合的合成语音检测方法 |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4053710A (en) * | 1976-03-01 | 1977-10-11 | Ncr Corporation | Automatic speaker verification systems employing moment invariants |
| US4837831A (en) * | 1986-10-15 | 1989-06-06 | Dragon Systems, Inc. | Method for creating and using multiple-word sound models in speech recognition |
| US4975961A (en) * | 1987-10-28 | 1990-12-04 | Nec Corporation | Multi-layer neural network to which dynamic programming techniques are applicable |
| JPH0673080B2 (ja) * | 1987-11-25 | 1994-09-14 | 日本電気株式会社 | 連続音声認識方式 |
| SU1629917A1 (ru) * | 1989-02-10 | 1991-02-23 | Институт Систем Управления Ан Гсср | Способ идентификации говор щего |
| DE3931638A1 (de) * | 1989-09-22 | 1991-04-04 | Standard Elektrik Lorenz Ag | Verfahren zur sprecheradaptiven erkennung von sprache |
| DE69030561T2 (de) * | 1989-12-28 | 1997-10-09 | Sharp Kk | Spracherkennungseinrichtung |
| US5220640A (en) * | 1990-09-20 | 1993-06-15 | Motorola, Inc. | Neural net architecture for rate-varying inputs |
| US5271088A (en) * | 1991-05-13 | 1993-12-14 | Itt Corporation | Automated sorting of voice messages through speaker spotting |
| US5430827A (en) * | 1993-04-23 | 1995-07-04 | At&T Corp. | Password verification system |
| US5528728A (en) * | 1993-07-12 | 1996-06-18 | Kabushiki Kaisha Meidensha | Speaker independent speech recognition system and method using neural network and DTW matching technique |
| DE4325404C2 (de) * | 1993-07-29 | 2002-04-11 | Tenovis Gmbh & Co Kg | Verfahren zum Ermitteln und Klassifizieren von Störgeräuschtypen |
| WO1995005656A1 (en) * | 1993-08-12 | 1995-02-23 | The University Of Queensland | A speaker verification system |
| US5457770A (en) * | 1993-08-19 | 1995-10-10 | Kabushiki Kaisha Meidensha | Speaker independent speech recognition system and method using neural network and/or DP matching technique |
| US5522011A (en) * | 1993-09-27 | 1996-05-28 | International Business Machines Corporation | Speech coding apparatus and method using classification rules |
| US5522012A (en) * | 1994-02-28 | 1996-05-28 | Rutgers University | Speaker identification and verification system |
-
1995
- 1995-06-07 US US08/479,012 patent/US5839103A/en not_active Expired - Lifetime
-
1996
- 1996-06-06 RU RU98100221/09A patent/RU2161336C2/ru not_active IP Right Cessation
- 1996-06-06 NZ NZ311289A patent/NZ311289A/xx unknown
- 1996-06-06 KR KR1019970708871A patent/KR19990022391A/ko not_active Ceased
- 1996-06-06 AT AT96921329T patent/ATE323934T1/de not_active IP Right Cessation
- 1996-06-06 EP EP96921329A patent/EP0870300B1/en not_active Expired - Lifetime
- 1996-06-06 JP JP9501618A patent/JPH11507443A/ja not_active Ceased
- 1996-06-06 CN CN96194550A patent/CN1197526A/zh active Pending
- 1996-06-06 TR TR97/01555T patent/TR199701555T1/xx unknown
- 1996-06-06 AU AU62576/96A patent/AU711496B2/en not_active Ceased
- 1996-06-06 IL IL12235496A patent/IL122354A/xx not_active IP Right Cessation
- 1996-06-06 DE DE69636057T patent/DE69636057T2/de not_active Expired - Lifetime
- 1996-06-06 CA CA002221415A patent/CA2221415A1/en not_active Abandoned
- 1996-06-06 WO PCT/US1996/009260 patent/WO1996041334A1/en not_active Ceased
-
1997
- 1997-11-26 FI FI974339A patent/FI117954B/fi not_active IP Right Cessation
- 1997-11-28 NO NO19975475A patent/NO321125B1/no unknown
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008126627A1 (ja) * | 2007-03-26 | 2008-10-23 | Nec Corporation | 音声分類装置、音声分類方法、および音声分類用プログラム |
| US8630853B2 (en) | 2007-03-26 | 2014-01-14 | Nec Corporation | Speech classification apparatus, speech classification method, and speech classification program |
Also Published As
| Publication number | Publication date |
|---|---|
| US5839103A (en) | 1998-11-17 |
| FI974339A0 (fi) | 1997-11-26 |
| RU2161336C2 (ru) | 2000-12-27 |
| WO1996041334A1 (en) | 1996-12-19 |
| AU711496B2 (en) | 1999-10-14 |
| EP0870300B1 (en) | 2006-04-19 |
| DE69636057T2 (de) | 2007-04-12 |
| DE69636057D1 (de) | 2006-05-24 |
| AU6257696A (en) | 1996-12-30 |
| FI117954B (fi) | 2007-04-30 |
| EP0870300A4 (en) | 1999-04-21 |
| CN1197526A (zh) | 1998-10-28 |
| FI974339L (fi) | 1998-02-06 |
| CA2221415A1 (en) | 1996-12-19 |
| NO975475L (no) | 1998-01-21 |
| NZ311289A (en) | 1998-12-23 |
| NO975475D0 (no) | 1997-11-28 |
| IL122354A (en) | 2000-10-31 |
| IL122354A0 (en) | 1998-04-05 |
| ATE323934T1 (de) | 2006-05-15 |
| EP0870300A1 (en) | 1998-10-14 |
| NO321125B1 (no) | 2006-03-20 |
| TR199701555T1 (xx) | 1998-04-21 |
| KR19990022391A (ko) | 1999-03-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JPH11507443A (ja) | 話者確認システム | |
| US6539352B1 (en) | Subword-based speaker verification with multiple-classifier score fusion weight and threshold adaptation | |
| EP1399915B1 (en) | Speaker verification | |
| US6519561B1 (en) | Model adaptation of neural tree networks and other fused models for speaker verification | |
| JP3532346B2 (ja) | ミックスチャ分解識別による話者検証方法と装置 | |
| US7603275B2 (en) | System, method and computer program product for verifying an identity using voiced to unvoiced classifiers | |
| US20090171660A1 (en) | Method and apparatus for verification of speaker authentification and system for speaker authentication | |
| EP0892388B1 (en) | Method and apparatus for providing speaker authentication by verbal information verification using forced decoding | |
| AU2002311452A1 (en) | Speaker recognition system | |
| Ozaydin | Design of a text independent speaker recognition system | |
| Ilyas et al. | Speaker verification using vector quantization and hidden Markov model | |
| WO2002029785A1 (en) | Method, apparatus, and system for speaker verification based on orthogonal gaussian mixture model (gmm) | |
| Georgescu et al. | GMM-UBM modeling for speaker recognition on a Romanian large speech corpora | |
| KR100917419B1 (ko) | 화자 인식 시스템 | |
| Ahmad et al. | Client-wise cohort set selection by combining speaker-and phoneme-specific I-vectors for speaker verification | |
| MXPA97009615A (en) | High verification system | |
| Fakotakis et al. | High performance text-independent speaker recognition system based on voiced/unvoiced segmentation and multiple neural nets. | |
| Morris et al. | Discriminative Feature Projection for Noise Robust Speaker Identification | |
| Suh et al. | Filling acoustic holes through leveraged uncorellated GMMs for in-set/out-of-set speaker recognition. | |
| Jianping et al. | Speaker Recognition Using Radial Basis Function Neural Networks | |
| HK1016727A (en) | Speaker verification system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20060228 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20060530 |
|
| A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20060714 |
|
| A313 | Final decision of rejection without a dissenting response from the applicant |
Free format text: JAPANESE INTERMEDIATE CODE: A313 Effective date: 20061016 |
|
| A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20061121 |