ATE214832T1 - METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM - Google Patents

METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM

Info

Publication number
ATE214832T1
ATE214832T1 AT98932337T AT98932337T ATE214832T1 AT E214832 T1 ATE214832 T1 AT E214832T1 AT 98932337 T AT98932337 T AT 98932337T AT 98932337 T AT98932337 T AT 98932337T AT E214832 T1 ATE214832 T1 AT E214832T1
Authority
AT
Austria
Prior art keywords
speech
unit
determines
intelligible
voice
Prior art date
Application number
AT98932337T
Other languages
German (de)
Inventor
Robert James Chance
Ian Vince Mcloughlin
Original Assignee
Simoco Int Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Simoco Int Ltd filed Critical Simoco Int Ltd
Application granted granted Critical
Publication of ATE214832T1 publication Critical patent/ATE214832T1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43Signal processing in hearing aids to enhance the speech intelligibility

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Telephone Function (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)

Abstract

The characteristics of the speech received by the decoding unit are altered by a processing unit 10 based upon an analysis of the listener's current background noise before the speech is output to enhance its intelligibility to a listener. An analysis unit 12 determines the type and level of the background noise by use of a microphone 13. A decision unit 11 then determines whether the speech currently being received and replayed would be intelligible to an average listener in the current background noise. If unit 11 determines that the speech is readily intelligible then no processing is necessary and the processing unit 10 does not alter the speech which has been passed to it. However, if unit 11 determines that the speech would be unintelligible, then unit 10 alters the speech before passing it to the output to make the speech more intelligible. In a particularly preferred embodiment, the speech characteristics are altered by altering line spectral pair/formant data representing the speech.
AT98932337T 1997-07-02 1998-07-01 METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM ATE214832T1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB9714001.6A GB9714001D0 (en) 1997-07-02 1997-07-02 Method and apparatus for speech enhancement in a speech communication system
PCT/GB1998/001936 WO1999001863A1 (en) 1997-07-02 1998-07-01 Method and apparatus for speech enhancement in a speech communication system

Publications (1)

Publication Number Publication Date
ATE214832T1 true ATE214832T1 (en) 2002-04-15

Family

ID=10815285

Family Applications (1)

Application Number Title Priority Date Filing Date
AT98932337T ATE214832T1 (en) 1997-07-02 1998-07-01 METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM

Country Status (12)

Country Link
EP (1) EP0993670B1 (en)
JP (1) JP2002507291A (en)
KR (1) KR20010014352A (en)
CN (1) CN1265217A (en)
AT (1) ATE214832T1 (en)
AU (1) AU8227798A (en)
CA (1) CA2235455A1 (en)
DE (1) DE69804310D1 (en)
GB (2) GB9714001D0 (en)
PL (1) PL337717A1 (en)
WO (1) WO1999001863A1 (en)
ZA (1) ZA985607B (en)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE9903553D0 (en) * 1999-01-27 1999-10-01 Lars Liljeryd Enhancing conceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
FR2794322B1 (en) * 1999-05-27 2001-06-22 Sagem NOISE SUPPRESSION PROCESS
US7120579B1 (en) 1999-07-28 2006-10-10 Clear Audio Ltd. Filter banked gain control of audio in a noisy environment
US6876968B2 (en) * 2001-03-08 2005-04-05 Matsushita Electric Industrial Co., Ltd. Run time synthesizer adaptation to improve intelligibility of synthesized speech
DE10124189A1 (en) * 2001-05-17 2002-11-21 Siemens Ag Signal reception procedure
JP2003255993A (en) * 2002-03-04 2003-09-10 Ntt Docomo Inc Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program
KR20050010927A (en) * 2002-06-19 2005-01-28 코닌클리케 필립스 일렉트로닉스 엔.브이. Audio signal processing apparatus
EP1609134A1 (en) * 2003-01-31 2005-12-28 Oticon A/S Sound system improving speech intelligibility
KR20050049103A (en) * 2003-11-21 2005-05-25 삼성전자주식회사 Method and apparatus for enhancing dialog using formant
CA2621916C (en) * 2004-09-07 2015-07-21 Sensear Pty Ltd. Apparatus and method for sound enhancement
US8280730B2 (en) 2005-05-25 2012-10-02 Motorola Mobility Llc Method and apparatus of increasing speech intelligibility in noisy environments
GB2433849B (en) 2005-12-29 2008-05-21 Motorola Inc Telecommunications terminal and method of operation of the terminal
DE102006001730A1 (en) 2006-01-13 2007-07-19 Robert Bosch Gmbh Sound system, method for improving the voice quality and / or intelligibility of voice announcements and computer program
EP1814109A1 (en) * 2006-01-27 2007-08-01 Texas Instruments Incorporated Voice amplification apparatus for modelling the Lombard effect
JP2007295347A (en) * 2006-04-26 2007-11-08 Mitsubishi Electric Corp Audio processing device
KR101414233B1 (en) 2007-01-05 2014-07-02 삼성전자 주식회사 Apparatus and method for improving intelligibility of speech signal
JP4926005B2 (en) 2007-11-13 2012-05-09 ソニー・エリクソン・モバイルコミュニケーションズ株式会社 Audio signal processing apparatus, audio signal processing method, and communication terminal
EP2232700B1 (en) 2007-12-21 2014-08-13 Dts Llc System for adjusting perceived loudness of audio signals
JP5453740B2 (en) * 2008-07-02 2014-03-26 富士通株式会社 Speech enhancement device
US8538042B2 (en) 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
EP2372700A1 (en) * 2010-03-11 2011-10-05 Oticon A/S A speech intelligibility predictor and applications thereof
KR102060208B1 (en) 2011-07-29 2019-12-27 디티에스 엘엘씨 Adaptive voice intelligibility processor
CN103002105A (en) * 2011-09-16 2013-03-27 宏碁股份有限公司 Mobile Communication Method That Increases the Clarity of Communication Content
CN103297896B (en) * 2012-02-27 2016-07-06 联想(北京)有限公司 A kind of audio-frequency inputting method and electronic equipment
US9020818B2 (en) 2012-03-05 2015-04-28 Malaspina Labs (Barbados) Inc. Format based speech reconstruction from noisy signals
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
EP3010017A1 (en) * 2014-10-14 2016-04-20 Thomson Licensing Method and apparatus for separating speech data from background data in audio communication
JP6565206B2 (en) * 2015-02-20 2019-08-28 ヤマハ株式会社 Audio processing apparatus and audio processing method
EP3107097B1 (en) 2015-06-17 2017-11-15 Nxp B.V. Improved speech intelligilibility
US9847093B2 (en) 2015-06-19 2017-12-19 Samsung Electronics Co., Ltd. Method and apparatus for processing speech signal
JP6790732B2 (en) * 2016-11-02 2020-11-25 ヤマハ株式会社 Signal processing method and signal processing device
ES2801924T3 (en) * 2017-01-03 2021-01-14 Lizn Aps Oligonucleotide-based inhibitors comprising a blocked nucleic acid motif
CN108369805B (en) * 2017-12-27 2019-08-13 深圳前海达闼云端智能科技有限公司 A voice interaction method, device and intelligent terminal
CN109346058B (en) * 2018-11-29 2024-06-28 西安交通大学 A system for expanding speech acoustic features
KR102845224B1 (en) * 2019-12-09 2025-08-12 삼성전자주식회사 Electronic apparatus and controlling method thereof
US11817114B2 (en) * 2019-12-09 2023-11-14 Dolby Laboratories Licensing Corporation Content and environmentally aware environmental noise compensation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5870292A (en) * 1981-10-22 1983-04-26 日産自動車株式会社 Voice recognition equipment for vehicle
US4538295A (en) * 1982-08-16 1985-08-27 Nissan Motor Company, Limited Speech recognition system for an automotive vehicle
DE3689035T2 (en) * 1985-07-01 1994-01-20 Motorola Inc NOISE REDUCTION SYSTEM.
GB8801014D0 (en) * 1988-01-18 1988-02-17 British Telecomm Noise reduction
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
CA2056110C (en) * 1991-03-27 1997-02-04 Arnold I. Klayman Public address intelligibility system
FI102337B (en) * 1995-09-13 1998-11-13 Nokia Mobile Phones Ltd Procedure and circuit arrangement for processing audio signal
GB2306086A (en) * 1995-10-06 1997-04-23 Richard Morris Trim Improved adaptive audio systems

Also Published As

Publication number Publication date
AU8227798A (en) 1999-01-25
GB9714001D0 (en) 1997-09-10
GB2327835A (en) 1999-02-03
WO1999001863A1 (en) 1999-01-14
PL337717A1 (en) 2000-08-28
CA2235455A1 (en) 1999-01-02
CN1265217A (en) 2000-08-30
JP2002507291A (en) 2002-03-05
EP0993670A1 (en) 2000-04-19
KR20010014352A (en) 2001-02-26
EP0993670B1 (en) 2002-03-20
ZA985607B (en) 2000-06-01
GB2327835B (en) 2000-04-19
GB9814279D0 (en) 1998-09-02
DE69804310D1 (en) 2002-04-25

Similar Documents

Publication Publication Date Title
ATE214832T1 (en) METHOD AND DEVICE FOR VOICE IMPROVEMENT IN A VOICE TRANSMISSION SYSTEM
DE69620585D1 (en) METHOD AND DEVICE FOR DETECTING AND Bypassing TANDEM SPEECH CODING
Liu et al. Efficient joint compensation of speech for the effects of additive noise and linear filtering
Servetti et al. Perception-based partial encryption of compressed speech
ATE267443T1 (en) DEVICE FOR VOICE DETECTION IN AMBIENT NOISE
JP2002014689A (en) Method and device for improving understandability of digitally compressed speech
AU2001277647A1 (en) Method for noise robust classification in speech coding
BR9204112A (en) PROCESS AND APPARATUS FOR TEACHING LANGUAGES
DE69739545D1 (en) METHOD AND SYSTEM FOR THE AUTOMATIC TEXT-INDEPENDENT EVALUATION OF THE LANGUAGE DIRECTORY
GB2343822A (en) Using LSP to alter frequency characteristics of speech
El-Maleh Classification-based Techniques for Digital Coding of Speech-plus-noise
JP3166797B2 (en) Voice coding method, voice decoding method, and voice codec
SU1674226A1 (en) Method and apparatus for detecting speech signals and their boundaries
Cox Current methods of speech coding
Riedhammer et al. A software kit for automatic voice descrambling
KR100624694B1 (en) Sound quality improvement device for call connection sound and its method
Bertrand Secure narrowband digital conferencing
Patwardhan et al. Effect of voice quality on frequency-warped modeling
Bunnell et al. Speech processing program
McGahan et al. Modelling listeners’ identification of concurrent vowels using a Kohonen net
JPS5853349B2 (en) Speech analysis and synthesis method
Burchfield et al. Command and Control Related Computer Technology. Part 2. Speech Compression
O'Brien et al. Preliminary study of multilevel peak‐clipped and time‐quantized speech

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties