NO20021631L - Fremgangsmåte og anordning for taledata - Google Patents
Fremgangsmåte og anordning for taledataInfo
- Publication number
- NO20021631L NO20021631L NO20021631A NO20021631A NO20021631L NO 20021631 L NO20021631 L NO 20021631L NO 20021631 A NO20021631 A NO 20021631A NO 20021631 A NO20021631 A NO 20021631A NO 20021631 L NO20021631 L NO 20021631L
- Authority
- NO
- Norway
- Prior art keywords
- prediction
- speech
- class
- sound quality
- unit
- Prior art date
Links
- 239000000284 extract Substances 0.000 abstract 2
- 230000015572 biosynthetic process Effects 0.000 abstract 1
- 238000003786 synthesis reaction Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
Det er beskrevet en talebehandlingsanordning, der forutsigelsesutgang for å finne forutsigelsesverdier for talen som har høy lydkvalitet, blir trukket ut fra den syntetiserte lyd som er fremkommet ved å føre lineære forutsigelseskoeffisienter og restsignaler, frembragt fra en forhåndsstilt kode, til et talesyntesefilter der talen med høy lydkvalitet har høyere lydkvalitet enn den syntetiserte lyd, og der forutsigelsesuttakene blir benyttet sammen med forhåndsstihe uttakskoefEsienter for å utføre forhåndsstilte forutsigelsesberegninger for å finne forutsigelsesverdiene for talen som har høy lydkvalitet. Lyden som har høy lydkvalitet har høyere lydkvalitet enn den syntetiserte lyd. Anordningen omfatter en enhet (45) til uttrekning av forutsigelsesuttak fra den syntetiserte lyd, der forutsigelsesuttakene benyttes til forutsigelse av talen som har høy kvalitet, som måltale, for hvilken forutsigelsesverdi og en enhet (46) for uttrekning av klasseuttak, benyttet til klassifisering av måltalen i en av et flertall klasser fra den ovenstående kode. Anordningen omfatter også en k]assifiseringsenhet(47) for å finne klassen for måltalen basert på klasseuttakene, uthentningsenhet og uthéntning av uttakskoefEsienter som er knyttet til klassen for måltalen fra blant uttakskoefifsientene som er funnet ved opplæring fra klasse til klasse, og enforutsigelsesenhet (49) for å finne forutsigelsesverdiene for måltalen ved bruk av forutsigelsesuttak og uttakskoefifsientene som er knyttet til klassen for måltalen.
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2000241062 | 2000-08-09 | ||
| JP2000251969A JP2002062899A (ja) | 2000-08-23 | 2000-08-23 | データ処理装置およびデータ処理方法、学習装置および学習方法、並びに記録媒体 |
| JP2000346675A JP4517262B2 (ja) | 2000-11-14 | 2000-11-14 | 音声処理装置および音声処理方法、学習装置および学習方法、並びに記録媒体 |
| PCT/JP2001/006708 WO2002013183A1 (en) | 2000-08-09 | 2001-08-03 | Voice data processing device and processing method |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| NO20021631D0 NO20021631D0 (no) | 2002-04-05 |
| NO20021631L true NO20021631L (no) | 2002-06-07 |
| NO326880B1 NO326880B1 (no) | 2009-03-09 |
Family
ID=27344301
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| NO20021631A NO326880B1 (no) | 2000-08-09 | 2002-04-05 | Fremgangsmate og anordning for taledata |
| NO20082401A NO20082401L (no) | 2000-08-09 | 2008-05-26 | Fremgangsmate og anordning for taledata |
| NO20082403A NO20082403L (no) | 2000-08-09 | 2008-05-26 | Fremgangsmate og anordning for taledata |
Family Applications After (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| NO20082401A NO20082401L (no) | 2000-08-09 | 2008-05-26 | Fremgangsmate og anordning for taledata |
| NO20082403A NO20082403L (no) | 2000-08-09 | 2008-05-26 | Fremgangsmate og anordning for taledata |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US7912711B2 (no) |
| EP (3) | EP1944760B1 (no) |
| KR (1) | KR100819623B1 (no) |
| DE (3) | DE60140020D1 (no) |
| NO (3) | NO326880B1 (no) |
| TW (1) | TW564398B (no) |
| WO (1) | WO2002013183A1 (no) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4857467B2 (ja) | 2001-01-25 | 2012-01-18 | ソニー株式会社 | データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体 |
| JP4857468B2 (ja) | 2001-01-25 | 2012-01-18 | ソニー株式会社 | データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体 |
| JP4711099B2 (ja) | 2001-06-26 | 2011-06-29 | ソニー株式会社 | 送信装置および送信方法、送受信装置および送受信方法、並びにプログラムおよび記録媒体 |
| DE102006022346B4 (de) * | 2006-05-12 | 2008-02-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Informationssignalcodierung |
| US8504090B2 (en) * | 2010-03-29 | 2013-08-06 | Motorola Solutions, Inc. | Enhanced public safety communication system |
| US9363068B2 (en) | 2010-08-03 | 2016-06-07 | Intel Corporation | Vector processor having instruction set with sliding window non-linear convolutional function |
| WO2013063440A1 (en) * | 2011-10-27 | 2013-05-02 | Lsi Corporation | Vector processor having instruction set with vector convolution funciton for fir filtering |
| RU2012102842A (ru) | 2012-01-27 | 2013-08-10 | ЭлЭсАй Корпорейшн | Инкрементное обнаружение преамбулы |
| ES2549953T3 (es) * | 2012-08-27 | 2015-11-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Aparato y método para la reproducción de una señal de audio, aparato y método para la generación de una señal de audio codificada, programa de ordenador y señal de audio codificada |
| US9923595B2 (en) | 2013-04-17 | 2018-03-20 | Intel Corporation | Digital predistortion for dual-band power amplifiers |
Family Cites Families (46)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS6011360B2 (ja) | 1981-12-15 | 1985-03-25 | ケイディディ株式会社 | 音声符号化方式 |
| JP2797348B2 (ja) | 1988-11-28 | 1998-09-17 | 松下電器産業株式会社 | 音声符号化・復号化装置 |
| US5293448A (en) * | 1989-10-02 | 1994-03-08 | Nippon Telegraph And Telephone Corporation | Speech analysis-synthesis method and apparatus therefor |
| US5261027A (en) * | 1989-06-28 | 1993-11-09 | Fujitsu Limited | Code excited linear prediction speech coding system |
| CA2031965A1 (en) | 1990-01-02 | 1991-07-03 | Paul A. Rosenstrach | Sound synthesizer |
| JP2736157B2 (ja) | 1990-07-17 | 1998-04-02 | シャープ株式会社 | 符号化装置 |
| JPH05158495A (ja) | 1991-05-07 | 1993-06-25 | Fujitsu Ltd | 音声符号化伝送装置 |
| CA2483324C (en) * | 1991-06-11 | 2008-05-06 | Qualcomm Incorporated | Estimation of background noise in a variable rate vocoder |
| JP3076086B2 (ja) * | 1991-06-28 | 2000-08-14 | シャープ株式会社 | 音声合成装置用ポストフィルタ |
| US5233660A (en) * | 1991-09-10 | 1993-08-03 | At&T Bell Laboratories | Method and apparatus for low-delay celp speech coding and decoding |
| US5371853A (en) * | 1991-10-28 | 1994-12-06 | University Of Maryland At College Park | Method and system for CELP speech coding and codebook for use therewith |
| US5327520A (en) * | 1992-06-04 | 1994-07-05 | At&T Bell Laboratories | Method of use of voice message coder/decoder |
| JP2779886B2 (ja) * | 1992-10-05 | 1998-07-23 | 日本電信電話株式会社 | 広帯域音声信号復元方法 |
| US5455888A (en) * | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
| US5491771A (en) * | 1993-03-26 | 1996-02-13 | Hughes Aircraft Company | Real-time implementation of a 8Kbps CELP coder on a DSP pair |
| JP3043920B2 (ja) * | 1993-06-14 | 2000-05-22 | 富士写真フイルム株式会社 | ネガクリップ |
| JP3293700B2 (ja) | 1993-11-04 | 2002-06-17 | 京セラミタ株式会社 | 磁性粒子およびその製造方法 |
| US5717823A (en) * | 1994-04-14 | 1998-02-10 | Lucent Technologies Inc. | Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders |
| JPH08202399A (ja) | 1995-01-27 | 1996-08-09 | Kyocera Corp | 復号音声の後処理方法 |
| SE504010C2 (sv) * | 1995-02-08 | 1996-10-14 | Ericsson Telefon Ab L M | Förfarande och anordning för prediktiv kodning av tal- och datasignaler |
| JP3235703B2 (ja) * | 1995-03-10 | 2001-12-04 | 日本電信電話株式会社 | ディジタルフィルタのフィルタ係数決定方法 |
| EP0732687B2 (en) * | 1995-03-13 | 2005-10-12 | Matsushita Electric Industrial Co., Ltd. | Apparatus for expanding speech bandwidth |
| JP2993396B2 (ja) * | 1995-05-12 | 1999-12-20 | 三菱電機株式会社 | 音声加工フィルタ及び音声合成装置 |
| FR2734389B1 (fr) * | 1995-05-17 | 1997-07-18 | Proust Stephane | Procede d'adaptation du niveau de masquage du bruit dans un codeur de parole a analyse par synthese utilisant un filtre de ponderation perceptuelle a court terme |
| GB9512284D0 (en) * | 1995-06-16 | 1995-08-16 | Nokia Mobile Phones Ltd | Speech Synthesiser |
| JPH0990997A (ja) * | 1995-09-26 | 1997-04-04 | Mitsubishi Electric Corp | 音声符号化装置、音声復号化装置、音声符号化復号化方法および複合ディジタルフィルタ |
| JP3248668B2 (ja) * | 1996-03-25 | 2002-01-21 | 日本電信電話株式会社 | ディジタルフィルタおよび音響符号化/復号化装置 |
| US6014622A (en) * | 1996-09-26 | 2000-01-11 | Rockwell Semiconductor Systems, Inc. | Low bit rate speech coder using adaptive open-loop subframe pitch lag estimation and vector quantization |
| JP3095133B2 (ja) * | 1997-02-25 | 2000-10-03 | 日本電信電話株式会社 | 音響信号符号化方法 |
| JP3946812B2 (ja) * | 1997-05-12 | 2007-07-18 | ソニー株式会社 | オーディオ信号変換装置及びオーディオ信号変換方法 |
| US5995923A (en) | 1997-06-26 | 1999-11-30 | Nortel Networks Corporation | Method and apparatus for improving the voice quality of tandemed vocoders |
| JP4132154B2 (ja) * | 1997-10-23 | 2008-08-13 | ソニー株式会社 | 音声合成方法及び装置、並びに帯域幅拡張方法及び装置 |
| US6014618A (en) * | 1998-08-06 | 2000-01-11 | Dsp Software Engineering, Inc. | LPAS speech coder using vector quantized, multi-codebook, multi-tap pitch predictor and optimized ternary source excitation codebook derivation |
| JP2000066700A (ja) * | 1998-08-17 | 2000-03-03 | Oki Electric Ind Co Ltd | 音声信号符号器、音声信号復号器 |
| JP4099879B2 (ja) | 1998-10-26 | 2008-06-11 | ソニー株式会社 | 帯域幅拡張方法及び装置 |
| US6539355B1 (en) | 1998-10-15 | 2003-03-25 | Sony Corporation | Signal band expanding method and apparatus and signal synthesis method and apparatus |
| US6260009B1 (en) | 1999-02-12 | 2001-07-10 | Qualcomm Incorporated | CELP-based to CELP-based vocoder packet translation |
| US6434519B1 (en) * | 1999-07-19 | 2002-08-13 | Qualcomm Incorporated | Method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder |
| CN1578159B (zh) * | 2000-05-09 | 2010-05-26 | 索尼公司 | 数据处理装置和方法 |
| JP4517448B2 (ja) | 2000-05-09 | 2010-08-04 | ソニー株式会社 | データ処理装置およびデータ処理方法、並びに記録媒体 |
| JP4752088B2 (ja) | 2000-05-09 | 2011-08-17 | ソニー株式会社 | データ処理装置およびデータ処理方法、並びに記録媒体 |
| US7283961B2 (en) * | 2000-08-09 | 2007-10-16 | Sony Corporation | High-quality speech synthesis device and method by classification and prediction processing of synthesized sound |
| JP4857468B2 (ja) * | 2001-01-25 | 2012-01-18 | ソニー株式会社 | データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体 |
| JP4857467B2 (ja) * | 2001-01-25 | 2012-01-18 | ソニー株式会社 | データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体 |
| JP3876781B2 (ja) * | 2002-07-16 | 2007-02-07 | ソニー株式会社 | 受信装置および受信方法、記録媒体、並びにプログラム |
| JP4554561B2 (ja) * | 2006-06-20 | 2010-09-29 | 株式会社シマノ | 釣り用グローブ |
-
2001
- 2001-08-03 EP EP08003539A patent/EP1944760B1/en not_active Expired - Lifetime
- 2001-08-03 EP EP01956800A patent/EP1308927B9/en not_active Expired - Lifetime
- 2001-08-03 KR KR1020027004559A patent/KR100819623B1/ko not_active Expired - Fee Related
- 2001-08-03 EP EP08003538A patent/EP1944759B1/en not_active Expired - Lifetime
- 2001-08-03 DE DE60140020T patent/DE60140020D1/de not_active Expired - Lifetime
- 2001-08-03 DE DE60143327T patent/DE60143327D1/de not_active Expired - Lifetime
- 2001-08-03 WO PCT/JP2001/006708 patent/WO2002013183A1/ja not_active Ceased
- 2001-08-03 DE DE60134861T patent/DE60134861D1/de not_active Expired - Lifetime
- 2001-08-08 TW TW090119402A patent/TW564398B/zh not_active IP Right Cessation
-
2002
- 2002-04-05 NO NO20021631A patent/NO326880B1/no not_active IP Right Cessation
-
2007
- 2007-09-21 US US11/903,550 patent/US7912711B2/en not_active Expired - Fee Related
-
2008
- 2008-05-26 NO NO20082401A patent/NO20082401L/no not_active Application Discontinuation
- 2008-05-26 NO NO20082403A patent/NO20082403L/no not_active Application Discontinuation
Also Published As
| Publication number | Publication date |
|---|---|
| NO20021631D0 (no) | 2002-04-05 |
| KR100819623B1 (ko) | 2008-04-04 |
| NO326880B1 (no) | 2009-03-09 |
| EP1944760A3 (en) | 2008-07-30 |
| NO20082401L (no) | 2002-06-07 |
| EP1308927A1 (en) | 2003-05-07 |
| US7912711B2 (en) | 2011-03-22 |
| EP1308927B9 (en) | 2009-02-25 |
| KR20020040846A (ko) | 2002-05-30 |
| EP1944759A3 (en) | 2008-07-30 |
| DE60134861D1 (de) | 2008-08-28 |
| EP1308927B1 (en) | 2008-07-16 |
| TW564398B (en) | 2003-12-01 |
| US20080027720A1 (en) | 2008-01-31 |
| EP1944760B1 (en) | 2009-09-23 |
| EP1944759B1 (en) | 2010-10-20 |
| EP1944760A2 (en) | 2008-07-16 |
| NO20082403L (no) | 2002-06-07 |
| WO2002013183A1 (en) | 2002-02-14 |
| DE60140020D1 (de) | 2009-11-05 |
| DE60143327D1 (de) | 2010-12-02 |
| EP1308927A4 (en) | 2005-09-28 |
| EP1944759A2 (en) | 2008-07-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| NO20082401L (no) | Fremgangsmate og anordning for taledata | |
| Pasad et al. | Comparative layer-wise analysis of self-supervised speech models | |
| Labied et al. | Automatic speech recognition features extraction techniques: A multi-criteria comparison | |
| JPS63225300A (ja) | パタ−ン認識装置 | |
| Erzin | Improving throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings | |
| US5241649A (en) | Voice recognition method | |
| US5144672A (en) | Speech recognition apparatus including speaker-independent dictionary and speaker-dependent | |
| JPH04270398A (ja) | 音声符号化方式 | |
| EP1465153B1 (en) | Method and apparatus for formant tracking using a residual model | |
| Hai et al. | Improved linear predictive coding method for speech recognition | |
| KR20040041740A (ko) | 적은 복잡도를 가진 고정 코드북 검색방법 및 장치 | |
| KR100323487B1 (ko) | 버스트여기선형예측 | |
| Sunny et al. | Feature extraction methods based on linear predictive coding and wavelet packet decomposition for recognizing spoken words in malayalam | |
| JPH0764600A (ja) | 音声のピッチ符号化装置 | |
| JP2003216183A (ja) | 情報検索方法及び装置 | |
| KR100766170B1 (ko) | 다중 레벨 양자화를 이용한 음악 요약 장치 및 방법 | |
| JPH058839B2 (no) | ||
| CN117351988B (zh) | 一种基于数据分析的远程音频信息处理方法及系统 | |
| JP3095758B2 (ja) | ベクトル量子化のコードベクトル検索方法 | |
| Nofal et al. | Arabic/English automatic spoken language identification | |
| Singh et al. | A perfect balance of sparsity and acoustic hole in speech signal and its application in speaker recognition system | |
| JPH0235994B2 (no) | ||
| Agarwal et al. | Robustness of end-to-end Automatic Speech Recognition Models–A Case Study using Mozilla DeepSpeech | |
| JPH0736119B2 (ja) | 区分的最適関数近似方法 | |
| JPH0233200A (ja) | データベース検索方式 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MM1K | Lapsed by not paying the annual fees |