ATE214832T1 - METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM - Google Patents
METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEMInfo
- Publication number
- ATE214832T1 ATE214832T1 AT98932337T AT98932337T ATE214832T1 AT E214832 T1 ATE214832 T1 AT E214832T1 AT 98932337 T AT98932337 T AT 98932337T AT 98932337 T AT98932337 T AT 98932337T AT E214832 T1 ATE214832 T1 AT E214832T1
- Authority
- AT
- Austria
- Prior art keywords
- speech
- unit
- determines
- intelligible
- voice
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephonic Communication Services (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
- Telephone Function (AREA)
- Interconnected Communication Systems, Intercoms, And Interphones (AREA)
Abstract
The characteristics of the speech received by the decoding unit are altered by a processing unit 10 based upon an analysis of the listener's current background noise before the speech is output to enhance its intelligibility to a listener. An analysis unit 12 determines the type and level of the background noise by use of a microphone 13. A decision unit 11 then determines whether the speech currently being received and replayed would be intelligible to an average listener in the current background noise. If unit 11 determines that the speech is readily intelligible then no processing is necessary and the processing unit 10 does not alter the speech which has been passed to it. However, if unit 11 determines that the speech would be unintelligible, then unit 10 alters the speech before passing it to the output to make the speech more intelligible. In a particularly preferred embodiment, the speech characteristics are altered by altering line spectral pair/formant data representing the speech.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GBGB9714001.6A GB9714001D0 (en) | 1997-07-02 | 1997-07-02 | Method and apparatus for speech enhancement in a speech communication system |
| PCT/GB1998/001936 WO1999001863A1 (en) | 1997-07-02 | 1998-07-01 | Method and apparatus for speech enhancement in a speech communication system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE214832T1 true ATE214832T1 (en) | 2002-04-15 |
Family
ID=10815285
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT98932337T ATE214832T1 (en) | 1997-07-02 | 1998-07-01 | METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM |
Country Status (12)
| Country | Link |
|---|---|
| EP (1) | EP0993670B1 (en) |
| JP (1) | JP2002507291A (en) |
| KR (1) | KR20010014352A (en) |
| CN (1) | CN1265217A (en) |
| AT (1) | ATE214832T1 (en) |
| AU (1) | AU8227798A (en) |
| CA (1) | CA2235455A1 (en) |
| DE (1) | DE69804310D1 (en) |
| GB (2) | GB9714001D0 (en) |
| PL (1) | PL337717A1 (en) |
| WO (1) | WO1999001863A1 (en) |
| ZA (1) | ZA985607B (en) |
Families Citing this family (36)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE9903553D0 (en) * | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing conceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL) |
| FR2794322B1 (en) * | 1999-05-27 | 2001-06-22 | Sagem | NOISE SUPPRESSION PROCESS |
| US7120579B1 (en) | 1999-07-28 | 2006-10-10 | Clear Audio Ltd. | Filter banked gain control of audio in a noisy environment |
| US6876968B2 (en) * | 2001-03-08 | 2005-04-05 | Matsushita Electric Industrial Co., Ltd. | Run time synthesizer adaptation to improve intelligibility of synthesized speech |
| DE10124189A1 (en) * | 2001-05-17 | 2002-11-21 | Siemens Ag | Signal reception procedure |
| JP2003255993A (en) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program |
| KR20050010927A (en) * | 2002-06-19 | 2005-01-28 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Audio signal processing apparatus |
| EP1609134A1 (en) * | 2003-01-31 | 2005-12-28 | Oticon A/S | Sound system improving speech intelligibility |
| KR20050049103A (en) * | 2003-11-21 | 2005-05-25 | 삼성전자주식회사 | Method and apparatus for enhancing dialog using formant |
| CA2621916C (en) * | 2004-09-07 | 2015-07-21 | Sensear Pty Ltd. | Apparatus and method for sound enhancement |
| US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
| GB2433849B (en) | 2005-12-29 | 2008-05-21 | Motorola Inc | Telecommunications terminal and method of operation of the terminal |
| DE102006001730A1 (en) | 2006-01-13 | 2007-07-19 | Robert Bosch Gmbh | Sound system, method for improving the voice quality and / or intelligibility of voice announcements and computer program |
| EP1814109A1 (en) * | 2006-01-27 | 2007-08-01 | Texas Instruments Incorporated | Voice amplification apparatus for modelling the Lombard effect |
| JP2007295347A (en) * | 2006-04-26 | 2007-11-08 | Mitsubishi Electric Corp | Audio processing device |
| KR101414233B1 (en) | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving intelligibility of speech signal |
| JP4926005B2 (en) | 2007-11-13 | 2012-05-09 | ソニー・エリクソン・モバイルコミュニケーションズ株式会社 | Audio signal processing apparatus, audio signal processing method, and communication terminal |
| EP2232700B1 (en) | 2007-12-21 | 2014-08-13 | Dts Llc | System for adjusting perceived loudness of audio signals |
| JP5453740B2 (en) * | 2008-07-02 | 2014-03-26 | 富士通株式会社 | Speech enhancement device |
| US8538042B2 (en) | 2009-08-11 | 2013-09-17 | Dts Llc | System for increasing perceived loudness of speakers |
| EP2372700A1 (en) * | 2010-03-11 | 2011-10-05 | Oticon A/S | A speech intelligibility predictor and applications thereof |
| KR102060208B1 (en) | 2011-07-29 | 2019-12-27 | 디티에스 엘엘씨 | Adaptive voice intelligibility processor |
| CN103002105A (en) * | 2011-09-16 | 2013-03-27 | 宏碁股份有限公司 | Mobile Communication Method That Increases the Clarity of Communication Content |
| CN103297896B (en) * | 2012-02-27 | 2016-07-06 | 联想(北京)有限公司 | A kind of audio-frequency inputting method and electronic equipment |
| US9020818B2 (en) | 2012-03-05 | 2015-04-28 | Malaspina Labs (Barbados) Inc. | Format based speech reconstruction from noisy signals |
| US9312829B2 (en) | 2012-04-12 | 2016-04-12 | Dts Llc | System for adjusting loudness of audio signals in real time |
| EP3010017A1 (en) * | 2014-10-14 | 2016-04-20 | Thomson Licensing | Method and apparatus for separating speech data from background data in audio communication |
| JP6565206B2 (en) * | 2015-02-20 | 2019-08-28 | ヤマハ株式会社 | Audio processing apparatus and audio processing method |
| EP3107097B1 (en) | 2015-06-17 | 2017-11-15 | Nxp B.V. | Improved speech intelligilibility |
| US9847093B2 (en) | 2015-06-19 | 2017-12-19 | Samsung Electronics Co., Ltd. | Method and apparatus for processing speech signal |
| JP6790732B2 (en) * | 2016-11-02 | 2020-11-25 | ヤマハ株式会社 | Signal processing method and signal processing device |
| ES2801924T3 (en) * | 2017-01-03 | 2021-01-14 | Lizn Aps | Oligonucleotide-based inhibitors comprising a blocked nucleic acid motif |
| CN108369805B (en) * | 2017-12-27 | 2019-08-13 | 深圳前海达闼云端智能科技有限公司 | A voice interaction method, device and intelligent terminal |
| CN109346058B (en) * | 2018-11-29 | 2024-06-28 | 西安交通大学 | A system for expanding speech acoustic features |
| KR102845224B1 (en) * | 2019-12-09 | 2025-08-12 | 삼성전자주식회사 | Electronic apparatus and controlling method thereof |
| US11817114B2 (en) * | 2019-12-09 | 2023-11-14 | Dolby Laboratories Licensing Corporation | Content and environmentally aware environmental noise compensation |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5870292A (en) * | 1981-10-22 | 1983-04-26 | 日産自動車株式会社 | Voice recognition equipment for vehicle |
| US4538295A (en) * | 1982-08-16 | 1985-08-27 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle |
| DE3689035T2 (en) * | 1985-07-01 | 1994-01-20 | Motorola Inc | NOISE REDUCTION SYSTEM. |
| GB8801014D0 (en) * | 1988-01-18 | 1988-02-17 | British Telecomm | Noise reduction |
| US5235669A (en) * | 1990-06-29 | 1993-08-10 | At&T Laboratories | Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec |
| CA2056110C (en) * | 1991-03-27 | 1997-02-04 | Arnold I. Klayman | Public address intelligibility system |
| FI102337B (en) * | 1995-09-13 | 1998-11-13 | Nokia Mobile Phones Ltd | Procedure and circuit arrangement for processing audio signal |
| GB2306086A (en) * | 1995-10-06 | 1997-04-23 | Richard Morris Trim | Improved adaptive audio systems |
-
1997
- 1997-07-02 GB GBGB9714001.6A patent/GB9714001D0/en not_active Ceased
-
1998
- 1998-04-21 CA CA002235455A patent/CA2235455A1/en not_active Abandoned
- 1998-06-26 ZA ZA9805607A patent/ZA985607B/en unknown
- 1998-07-01 GB GB9814279A patent/GB2327835B/en not_active Expired - Fee Related
- 1998-07-01 KR KR1019997012508A patent/KR20010014352A/en not_active Withdrawn
- 1998-07-01 WO PCT/GB1998/001936 patent/WO1999001863A1/en not_active Ceased
- 1998-07-01 CN CN98807458A patent/CN1265217A/en active Pending
- 1998-07-01 AU AU82277/98A patent/AU8227798A/en not_active Abandoned
- 1998-07-01 JP JP50665899A patent/JP2002507291A/en active Pending
- 1998-07-01 PL PL98337717A patent/PL337717A1/en unknown
- 1998-07-01 DE DE69804310T patent/DE69804310D1/en not_active Expired - Lifetime
- 1998-07-01 AT AT98932337T patent/ATE214832T1/en not_active IP Right Cessation
- 1998-07-01 EP EP98932337A patent/EP0993670B1/en not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| AU8227798A (en) | 1999-01-25 |
| GB9714001D0 (en) | 1997-09-10 |
| GB2327835A (en) | 1999-02-03 |
| WO1999001863A1 (en) | 1999-01-14 |
| PL337717A1 (en) | 2000-08-28 |
| CA2235455A1 (en) | 1999-01-02 |
| CN1265217A (en) | 2000-08-30 |
| JP2002507291A (en) | 2002-03-05 |
| EP0993670A1 (en) | 2000-04-19 |
| KR20010014352A (en) | 2001-02-26 |
| EP0993670B1 (en) | 2002-03-20 |
| ZA985607B (en) | 2000-06-01 |
| GB2327835B (en) | 2000-04-19 |
| GB9814279D0 (en) | 1998-09-02 |
| DE69804310D1 (en) | 2002-04-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE214832T1 (en) | METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM | |
| DE69620585D1 (en) | METHOD AND DEVICE FOR DETECTING AND Bypassing TANDEM SPEECH CODING | |
| Liu et al. | Efficient joint compensation of speech for the effects of additive noise and linear filtering | |
| Servetti et al. | Perception-based partial encryption of compressed speech | |
| ATE267443T1 (en) | DEVICE FOR VOICE DETECTION IN AMBIENT NOISE | |
| JP2002014689A (en) | Method and device for improving understandability of digitally compressed speech | |
| AU2001277647A1 (en) | Method for noise robust classification in speech coding | |
| BR9204112A (en) | PROCESS AND APPARATUS FOR TEACHING LANGUAGES | |
| DE69739545D1 (en) | METHOD AND SYSTEM FOR THE AUTOMATIC TEXT-INDEPENDENT EVALUATION OF THE LANGUAGE DIRECTORY | |
| GB2343822A (en) | Using LSP to alter frequency characteristics of speech | |
| El-Maleh | Classification-based Techniques for Digital Coding of Speech-plus-noise | |
| JP3166797B2 (en) | Voice coding method, voice decoding method, and voice codec | |
| SU1674226A1 (en) | Method and apparatus for detecting speech signals and their boundaries | |
| Cox | Current methods of speech coding | |
| Riedhammer et al. | A software kit for automatic voice descrambling | |
| KR100624694B1 (en) | Sound quality improvement device for call connection sound and its method | |
| Bertrand | Secure narrowband digital conferencing | |
| Patwardhan et al. | Effect of voice quality on frequency-warped modeling | |
| Bunnell et al. | Speech processing program | |
| McGahan et al. | Modelling listeners’ identification of concurrent vowels using a Kohonen net | |
| JPS5853349B2 (en) | Speech analysis and synthesis method | |
| Burchfield et al. | Command and Control Related Computer Technology. Part 2. Speech Compression | |
| O'Brien et al. | Preliminary study of multilevel peak‐clipped and time‐quantized speech |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |