ATE450034T1 - Wahrnehmungsbezogene normierung digitaler audiosignale - Google Patents
Wahrnehmungsbezogene normierung digitaler audiosignaleInfo
- Publication number
- ATE450034T1 ATE450034T1 AT03718091T AT03718091T ATE450034T1 AT E450034 T1 ATE450034 T1 AT E450034T1 AT 03718091 T AT03718091 T AT 03718091T AT 03718091 T AT03718091 T AT 03718091T AT E450034 T1 ATE450034 T1 AT E450034T1
- Authority
- AT
- Austria
- Prior art keywords
- digital audio
- audio signals
- bands
- audio data
- sub
- Prior art date
Links
- 238000010606 normalization Methods 0.000 title 1
- 230000009466 transformation Effects 0.000 abstract 3
- 230000000873 masking effect Effects 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Diaphragms For Electromechanical Transducers (AREA)
- Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/158,908 US7050965B2 (en) | 2002-06-03 | 2002-06-03 | Perceptual normalization of digital audio signals |
| PCT/US2003/009538 WO2003102924A1 (en) | 2002-06-03 | 2003-03-28 | Perceptual normalization of digital audio signals |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE450034T1 true ATE450034T1 (de) | 2009-12-15 |
Family
ID=29582771
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AT03718091T ATE450034T1 (de) | 2002-06-03 | 2003-03-28 | Wahrnehmungsbezogene normierung digitaler audiosignale |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US7050965B2 (de) |
| EP (1) | EP1509905B1 (de) |
| JP (1) | JP4354399B2 (de) |
| KR (1) | KR100699387B1 (de) |
| CN (1) | CN100349209C (de) |
| AT (1) | ATE450034T1 (de) |
| AU (1) | AU2003222105A1 (de) |
| DE (1) | DE60330239D1 (de) |
| TW (1) | TWI260538B (de) |
| WO (1) | WO2003102924A1 (de) |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7542892B1 (en) * | 2004-05-25 | 2009-06-02 | The Math Works, Inc. | Reporting delay in modeling environments |
| KR100902332B1 (ko) * | 2006-09-11 | 2009-06-12 | 한국전자통신연구원 | 변형 선형예측 부호화를 이용한 오디오 부호화 및 복호화장치 및 그 방법 |
| KR101301245B1 (ko) * | 2008-12-22 | 2013-09-10 | 한국전자통신연구원 | 스펙트럼 계수의 서브대역 할당 방법 및 장치 |
| EP2717263B1 (de) * | 2012-10-05 | 2016-11-02 | Nokia Technologies Oy | Verfahren, Vorrichtung und Computerprogrammprodukt zur kategorischen räumlichen Analyse-Synthese des Spektrums eines Mehrkanal-Audiosignals |
| WO2014148848A2 (ko) * | 2013-03-21 | 2014-09-25 | 인텔렉추얼디스커버리 주식회사 | 오디오 신호 크기 제어 방법 및 장치 |
| JP2016514856A (ja) * | 2013-03-21 | 2016-05-23 | インテレクチュアル ディスカバリー カンパニー リミテッド | オーディオ信号大きさの制御方法及び装置 |
| US9350312B1 (en) * | 2013-09-19 | 2016-05-24 | iZotope, Inc. | Audio dynamic range adjustment system and method |
| WO2017100619A1 (en) * | 2015-12-10 | 2017-06-15 | Ascava, Inc. | Reduction of audio data and data stored on a block processing storage system |
| CN106504757A (zh) * | 2016-11-09 | 2017-03-15 | 天津大学 | 一种基于听觉模型的自适应音频盲水印方法 |
| EP3598440B1 (de) * | 2018-07-20 | 2022-04-20 | Mimi Hearing Technologies GmbH | Systeme und verfahren zur codierung eines audiosignals mit personalisierten psychoakustischen modellen |
| US10455335B1 (en) * | 2018-07-20 | 2019-10-22 | Mimi Hearing Technologies GmbH | Systems and methods for modifying an audio signal using custom psychoacoustic models |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2067599A1 (en) * | 1991-06-10 | 1992-12-11 | Bruce Alan Smith | Personal computer with riser connector for alternate master |
| US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
| US5632003A (en) * | 1993-07-16 | 1997-05-20 | Dolby Laboratories Licensing Corporation | Computationally efficient adaptive bit allocation for coding method and apparatus |
| US5646961A (en) * | 1994-12-30 | 1997-07-08 | Lucent Technologies Inc. | Method for noise weighting filtering |
| US5819215A (en) * | 1995-10-13 | 1998-10-06 | Dobson; Kurt | Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data |
| US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
| US5825320A (en) * | 1996-03-19 | 1998-10-20 | Sony Corporation | Gain control method for audio encoding device |
| US6345125B2 (en) * | 1998-02-25 | 2002-02-05 | Lucent Technologies Inc. | Multiple description transform coding using optimal transforms of arbitrary dimension |
| US6128593A (en) * | 1998-08-04 | 2000-10-03 | Sony Corporation | System and method for implementing a refined psycho-acoustic modeler |
-
2002
- 2002-06-03 US US10/158,908 patent/US7050965B2/en not_active Expired - Fee Related
-
2003
- 2003-03-28 DE DE60330239T patent/DE60330239D1/de not_active Expired - Lifetime
- 2003-03-28 WO PCT/US2003/009538 patent/WO2003102924A1/en not_active Ceased
- 2003-03-28 KR KR1020047019734A patent/KR100699387B1/ko not_active Expired - Fee Related
- 2003-03-28 EP EP03718091A patent/EP1509905B1/de not_active Expired - Lifetime
- 2003-03-28 CN CNB038186225A patent/CN100349209C/zh not_active Expired - Fee Related
- 2003-03-28 AU AU2003222105A patent/AU2003222105A1/en not_active Abandoned
- 2003-03-28 JP JP2004509926A patent/JP4354399B2/ja not_active Expired - Fee Related
- 2003-03-28 AT AT03718091T patent/ATE450034T1/de not_active IP Right Cessation
- 2003-05-02 TW TW092112134A patent/TWI260538B/zh not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| DE60330239D1 (de) | 2010-01-07 |
| US7050965B2 (en) | 2006-05-23 |
| TW200405195A (en) | 2004-04-01 |
| CN1675685A (zh) | 2005-09-28 |
| US20030223593A1 (en) | 2003-12-04 |
| TWI260538B (en) | 2006-08-21 |
| KR20040111723A (ko) | 2004-12-31 |
| EP1509905B1 (de) | 2009-11-25 |
| EP1509905A1 (de) | 2005-03-02 |
| KR100699387B1 (ko) | 2007-03-26 |
| CN100349209C (zh) | 2007-11-14 |
| WO2003102924A1 (en) | 2003-12-11 |
| JP4354399B2 (ja) | 2009-10-28 |
| JP2005528648A (ja) | 2005-09-22 |
| AU2003222105A1 (en) | 2003-12-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| NO20045717L (no) | Fremgangsmate og anordning for frekvensselektiv tonehoydeforsterkning av syntetisk tale | |
| Erfani et al. | Audio watermarking using spikegram and a two-dictionary approach | |
| EP4531037A3 (de) | End-zu-end-sprachumwandlung | |
| Parvaix et al. | A watermarking-based method for informed source separation of audio signals with a single sensor | |
| EP2186087A4 (de) | Verbesserte transformationskodierung von sprach- und audiosignalen | |
| EP4629107A2 (de) | Auf künstlicher intelligenz basierendes text-zu-sprache-system und verfahren | |
| US8954320B2 (en) | System and method for noise reduction in processing speech signals by targeting speech and disregarding noise | |
| EP4383249A3 (de) | Sprecherdiarisierung unter verwendung von sprechereinbettung(en) und trainiertem generativem modell | |
| ATE450034T1 (de) | Wahrnehmungsbezogene normierung digitaler audiosignale | |
| WO2005018275A3 (en) | Speech-based optimization of digital hearing devices | |
| SG10201710911VA (en) | Reconstruction of the spectrum of an audiosignal with incomplete spectrum based on frequency translation | |
| DE60101148D1 (de) | Vorrichtung und verfahren zur sprachsignalmodifizierung | |
| CN104995680A (zh) | 使用高级频谱延拓降低量化噪声的压扩装置和方法 | |
| Gopalan | Audio steganography by cepstrum modification | |
| ATE234533T1 (de) | Verfahren und vorrichtung zum einbringen von informationen in einen datenstrom sowie verfahren und vorrichtung zum codieren eines audiosignals | |
| US9842607B2 (en) | Speech intelligibility improving apparatus and computer program therefor | |
| DE60311619D1 (de) | Datenreduktion in Audiokodierern unter Ausnutzung nichtharmonischer Effekte | |
| WO2005101898A3 (en) | A method and system for sound source separation | |
| CN109616131B (zh) | 一种数字实时语音变音方法 | |
| Alam et al. | Perceptual improvement of Wiener filtering employing a post-filter | |
| Nouza et al. | Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech | |
| Ratanasanya et al. | New psychoacoustic models for wavelet based audio watermarking | |
| Nie et al. | A perception-based processing strategy for cochlear implants and speech coding | |
| CN106310664A (zh) | 声控玩具及其控制方法 | |
| Wang et al. | ANA-Mix: A Synthetic Corpus of Mandarin Speech in Airport Noise Conditions |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |