ATE450034T1 - Wahrnehmungsbezogene normierung digitaler audiosignale - Google Patents

Wahrnehmungsbezogene normierung digitaler audiosignale

Info

Publication number
ATE450034T1
ATE450034T1 AT03718091T AT03718091T ATE450034T1 AT E450034 T1 ATE450034 T1 AT E450034T1 AT 03718091 T AT03718091 T AT 03718091T AT 03718091 T AT03718091 T AT 03718091T AT E450034 T1 ATE450034 T1 AT E450034T1
Authority
AT
Austria
Prior art keywords
digital audio
audio signals
bands
audio data
sub
Prior art date
Application number
AT03718091T
Other languages
English (en)
Inventor
Alex Lopez-Estrada
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Application granted granted Critical
Publication of ATE450034T1 publication Critical patent/ATE450034T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Diaphragms For Electromechanical Transducers (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Stereophonic System (AREA)
AT03718091T 2002-06-03 2003-03-28 Wahrnehmungsbezogene normierung digitaler audiosignale ATE450034T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/158,908 US7050965B2 (en) 2002-06-03 2002-06-03 Perceptual normalization of digital audio signals
PCT/US2003/009538 WO2003102924A1 (en) 2002-06-03 2003-03-28 Perceptual normalization of digital audio signals

Publications (1)

Publication Number Publication Date
ATE450034T1 true ATE450034T1 (de) 2009-12-15

Family

ID=29582771

Family Applications (1)

Application Number Title Priority Date Filing Date
AT03718091T ATE450034T1 (de) 2002-06-03 2003-03-28 Wahrnehmungsbezogene normierung digitaler audiosignale

Country Status (10)

Country Link
US (1) US7050965B2 (de)
EP (1) EP1509905B1 (de)
JP (1) JP4354399B2 (de)
KR (1) KR100699387B1 (de)
CN (1) CN100349209C (de)
AT (1) ATE450034T1 (de)
AU (1) AU2003222105A1 (de)
DE (1) DE60330239D1 (de)
TW (1) TWI260538B (de)
WO (1) WO2003102924A1 (de)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7542892B1 (en) * 2004-05-25 2009-06-02 The Math Works, Inc. Reporting delay in modeling environments
KR100902332B1 (ko) * 2006-09-11 2009-06-12 한국전자통신연구원 변형 선형예측 부호화를 이용한 오디오 부호화 및 복호화장치 및 그 방법
KR101301245B1 (ko) * 2008-12-22 2013-09-10 한국전자통신연구원 스펙트럼 계수의 서브대역 할당 방법 및 장치
EP2717263B1 (de) * 2012-10-05 2016-11-02 Nokia Technologies Oy Verfahren, Vorrichtung und Computerprogrammprodukt zur kategorischen räumlichen Analyse-Synthese des Spektrums eines Mehrkanal-Audiosignals
WO2014148848A2 (ko) * 2013-03-21 2014-09-25 인텔렉추얼디스커버리 주식회사 오디오 신호 크기 제어 방법 및 장치
JP2016514856A (ja) * 2013-03-21 2016-05-23 インテレクチュアル ディスカバリー カンパニー リミテッド オーディオ信号大きさの制御方法及び装置
US9350312B1 (en) * 2013-09-19 2016-05-24 iZotope, Inc. Audio dynamic range adjustment system and method
WO2017100619A1 (en) * 2015-12-10 2017-06-15 Ascava, Inc. Reduction of audio data and data stored on a block processing storage system
CN106504757A (zh) * 2016-11-09 2017-03-15 天津大学 一种基于听觉模型的自适应音频盲水印方法
EP3598440B1 (de) * 2018-07-20 2022-04-20 Mimi Hearing Technologies GmbH Systeme und verfahren zur codierung eines audiosignals mit personalisierten psychoakustischen modellen
US10455335B1 (en) * 2018-07-20 2019-10-22 Mimi Hearing Technologies GmbH Systems and methods for modifying an audio signal using custom psychoacoustic models

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2067599A1 (en) * 1991-06-10 1992-12-11 Bruce Alan Smith Personal computer with riser connector for alternate master
US5285498A (en) * 1992-03-02 1994-02-08 At&T Bell Laboratories Method and apparatus for coding audio signals based on perceptual model
US5632003A (en) * 1993-07-16 1997-05-20 Dolby Laboratories Licensing Corporation Computationally efficient adaptive bit allocation for coding method and apparatus
US5646961A (en) * 1994-12-30 1997-07-08 Lucent Technologies Inc. Method for noise weighting filtering
US5819215A (en) * 1995-10-13 1998-10-06 Dobson; Kurt Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US5825320A (en) * 1996-03-19 1998-10-20 Sony Corporation Gain control method for audio encoding device
US6345125B2 (en) * 1998-02-25 2002-02-05 Lucent Technologies Inc. Multiple description transform coding using optimal transforms of arbitrary dimension
US6128593A (en) * 1998-08-04 2000-10-03 Sony Corporation System and method for implementing a refined psycho-acoustic modeler

Also Published As

Publication number Publication date
DE60330239D1 (de) 2010-01-07
US7050965B2 (en) 2006-05-23
TW200405195A (en) 2004-04-01
CN1675685A (zh) 2005-09-28
US20030223593A1 (en) 2003-12-04
TWI260538B (en) 2006-08-21
KR20040111723A (ko) 2004-12-31
EP1509905B1 (de) 2009-11-25
EP1509905A1 (de) 2005-03-02
KR100699387B1 (ko) 2007-03-26
CN100349209C (zh) 2007-11-14
WO2003102924A1 (en) 2003-12-11
JP4354399B2 (ja) 2009-10-28
JP2005528648A (ja) 2005-09-22
AU2003222105A1 (en) 2003-12-19

Similar Documents

Publication Publication Date Title
NO20045717L (no) Fremgangsmate og anordning for frekvensselektiv tonehoydeforsterkning av syntetisk tale
Erfani et al. Audio watermarking using spikegram and a two-dictionary approach
EP4531037A3 (de) End-zu-end-sprachumwandlung
Parvaix et al. A watermarking-based method for informed source separation of audio signals with a single sensor
EP2186087A4 (de) Verbesserte transformationskodierung von sprach- und audiosignalen
EP4629107A2 (de) Auf künstlicher intelligenz basierendes text-zu-sprache-system und verfahren
US8954320B2 (en) System and method for noise reduction in processing speech signals by targeting speech and disregarding noise
EP4383249A3 (de) Sprecherdiarisierung unter verwendung von sprechereinbettung(en) und trainiertem generativem modell
ATE450034T1 (de) Wahrnehmungsbezogene normierung digitaler audiosignale
WO2005018275A3 (en) Speech-based optimization of digital hearing devices
SG10201710911VA (en) Reconstruction of the spectrum of an audiosignal with incomplete spectrum based on frequency translation
DE60101148D1 (de) Vorrichtung und verfahren zur sprachsignalmodifizierung
CN104995680A (zh) 使用高级频谱延拓降低量化噪声的压扩装置和方法
Gopalan Audio steganography by cepstrum modification
ATE234533T1 (de) Verfahren und vorrichtung zum einbringen von informationen in einen datenstrom sowie verfahren und vorrichtung zum codieren eines audiosignals
US9842607B2 (en) Speech intelligibility improving apparatus and computer program therefor
DE60311619D1 (de) Datenreduktion in Audiokodierern unter Ausnutzung nichtharmonischer Effekte
WO2005101898A3 (en) A method and system for sound source separation
CN109616131B (zh) 一种数字实时语音变音方法
Alam et al. Perceptual improvement of Wiener filtering employing a post-filter
Nouza et al. Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech
Ratanasanya et al. New psychoacoustic models for wavelet based audio watermarking
Nie et al. A perception-based processing strategy for cochlear implants and speech coding
CN106310664A (zh) 声控玩具及其控制方法
Wang et al. ANA-Mix: A Synthetic Corpus of Mandarin Speech in Airport Noise Conditions

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties