ATE450034T1 - Perceptual normalization of digital audio signals

Perceptual normalization of digital audio signals

Info

Publication number: ATE450034T1
Authority: AT; Austria
Prior art keywords: digital audio; audio signals; bands; audio data; sub
Prior art date: 2002-06-03

Application number

AT03718091T

Other languages

English (en)

Inventor

Alex Lopez-Estrada

Original Assignee

Intel Corp

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2002-06-03

Filing date

2003-03-28

Publication date

2009-12-15

2003-03-28 Application filed by Intel Corp

2009-12-15 Application granted

2009-12-15 Publication of ATE450034T1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

Engineering & Computer Science (AREA)
Quality & Reliability (AREA)
Human Computer Interaction (AREA)
Signal Processing (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Computational Linguistics (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Diaphragms For Electromechanical Transducers (AREA)
Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
Stereophonic System (AREA)

AT03718091T 2002-06-03 2003-03-28 Perceptual normalization of digital audio signals ATE450034T1 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US10/158,908 US7050965B2 (en)	2002-06-03	2002-06-03	Perceptual normalization of digital audio signals
PCT/US2003/009538 WO2003102924A1 (en)	2002-06-03	2003-03-28	Perceptual normalization of digital audio signals

Publications (1)

Publication Number	Publication Date
ATE450034T1 (de)	2009-12-15

Family

ID=29582771

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
AT03718091T ATE450034T1 (de)	2002-06-03	2003-03-28	Perceptual normalization of digital audio signals

Country Status (10)

Country	Link
US (1)	US7050965B2 (de)
EP (1)	EP1509905B1 (de)
JP (1)	JP4354399B2 (de)
KR (1)	KR100699387B1 (de)
CN (1)	CN100349209C (de)
AT (1)	ATE450034T1 (de)
AU (1)	AU2003222105A1 (de)
DE (1)	DE60330239D1 (de)
TW (1)	TWI260538B (de)
WO (1)	WO2003102924A1 (de)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7542892B1 (en) *	2004-05-25	2009-06-02	The Math Works, Inc.	Reporting delay in modeling environments
KR100902332B1 (ko) *	2006-09-11	2009-06-12	한국전자통신연구원	Audio encoding and decoding apparatus and method using warped linear predictive coding
KR101301245B1 (ko) *	2008-12-22	2013-09-10	한국전자통신연구원	Method and apparatus for subband allocation of spectral coefficients
EP2717263B1 (de) *	2012-10-05	2016-11-02	Nokia Technologies Oy	Method, apparatus and computer program product for categorical spatial analysis-synthesis on the spectrum of a multichannel audio signal
WO2014148848A2 (ko) *	2013-03-21	2014-09-25	인텔렉추얼디스커버리 주식회사	Method and apparatus for controlling audio signal level
JP2016514856A (ja) *	2013-03-21	2016-05-23	インテレクチュアルディスカバリーカンパニーリミテッド	Method and apparatus for controlling audio signal level
US9350312B1 (en) *	2013-09-19	2016-05-24	iZotope, Inc.	Audio dynamic range adjustment system and method
WO2017100619A1 (en) *	2015-12-10	2017-06-15	Ascava, Inc.	Reduction of audio data and data stored on a block processing storage system
CN106504757A (zh) *	2016-11-09	2017-03-15	天津大学	An adaptive blind audio watermarking method based on an auditory model
EP3598440B1 (de) *	2018-07-20	2022-04-20	Mimi Hearing Technologies GmbH	Systems and methods for encoding an audio signal using personalized psychoacoustic models
US10455335B1 (en) *	2018-07-20	2019-10-22	Mimi Hearing Technologies GmbH	Systems and methods for modifying an audio signal using custom psychoacoustic models

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
CA2067599A1 (en) *	1991-06-10	1992-12-11	Bruce Alan Smith	Personal computer with riser connector for alternate master
US5285498A (en) *	1992-03-02	1994-02-08	At&T Bell Laboratories	Method and apparatus for coding audio signals based on perceptual model
US5632003A (en) *	1993-07-16	1997-05-20	Dolby Laboratories Licensing Corporation	Computationally efficient adaptive bit allocation for coding method and apparatus
US5646961A (en) *	1994-12-30	1997-07-08	Lucent Technologies Inc.	Method for noise weighting filtering
US5819215A (en) *	1995-10-13	1998-10-06	Dobson; Kurt	Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data
US5956674A (en) *	1995-12-01	1999-09-21	Digital Theater Systems, Inc.	Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US5825320A (en) *	1996-03-19	1998-10-20	Sony Corporation	Gain control method for audio encoding device
US6345125B2 (en) *	1998-02-25	2002-02-05	Lucent Technologies Inc.	Multiple description transform coding using optimal transforms of arbitrary dimension
US6128593A (en) *	1998-08-04	2000-10-03	Sony Corporation	System and method for implementing a refined psycho-acoustic modeler

2002
- 2002-06-03 US: application US10/158,908, publication US7050965B2 (en), status: Expired - Fee Related
2003
- 2003-03-28 DE: application DE60330239T, publication DE60330239D1 (de), status: Expired - Lifetime
- 2003-03-28 WO: application PCT/US2003/009538, publication WO2003102924A1 (en), status: Ceased
- 2003-03-28 KR: application KR1020047019734A, publication KR100699387B1 (ko), status: Expired - Fee Related
- 2003-03-28 EP: application EP03718091A, publication EP1509905B1 (de), status: Expired - Lifetime
- 2003-03-28 CN: application CNB038186225A, publication CN100349209C (zh), status: Expired - Fee Related
- 2003-03-28 AU: application AU2003222105A, publication AU2003222105A1 (en), status: Abandoned
- 2003-03-28 JP: application JP2004509926A, publication JP4354399B2 (ja), status: Expired - Fee Related
- 2003-03-28 AT: application AT03718091T, publication ATE450034T1 (de), status: IP Right Cessation
- 2003-05-02 TW: application TW092112134A, publication TWI260538B (zh), status: IP Right Cessation

Also Published As

Publication number	Publication date
DE60330239D1 (de)	2010-01-07
US7050965B2 (en)	2006-05-23
TW200405195A (en)	2004-04-01
CN1675685A (zh)	2005-09-28
US20030223593A1 (en)	2003-12-04
TWI260538B (en)	2006-08-21
KR20040111723A (ko)	2004-12-31
EP1509905B1 (de)	2009-11-25
EP1509905A1 (de)	2005-03-02
KR100699387B1 (ko)	2007-03-26
CN100349209C (zh)	2007-11-14
WO2003102924A1 (en)	2003-12-11
JP4354399B2 (ja)	2009-10-28
JP2005528648A (ja)	2005-09-22
AU2003222105A1 (en)	2003-12-19

Similar Documents

Publication	Publication Date	Title
NO20045717L (no)	2004-12-30	Method and device for frequency-selective pitch enhancement of synthesized speech
Erfani et al.	2016	Audio watermarking using spikegram and a two-dictionary approach
EP4531037A3 (de)	2025-08-06	End-to-end speech conversion
Parvaix et al.	2009	A watermarking-based method for informed source separation of audio signals with a single sensor
EP2186087A4 (de)	2010-11-24	Improved transform coding of speech and audio signals
EP4629107A2 (de)	2025-10-08	Artificial intelligence-based text-to-speech system and method
US8954320B2 (en)	2015-02-10	System and method for noise reduction in processing speech signals by targeting speech and disregarding noise
EP4383249A3 (de)	2024-07-10	Speaker diarization using speaker embedding(s) and trained generative model
ATE450034T1 (de)	2009-12-15	Perceptual normalization of digital audio signals
WO2005018275A3 (en)	2006-05-18	Speech-based optimization of digital hearing devices
SG10201710911VA (en)	2018-02-27	Reconstruction of the spectrum of an audio signal with incomplete spectrum based on frequency translation
DE60101148D1 (de)	2003-12-11	Device and method for speech signal modification
CN104995680A (zh)	2015-10-21	Companding apparatus and method for reducing quantization noise using advanced spectral extension
Gopalan	2005	Audio steganography by cepstrum modification
ATE234533T1 (de)	2003-03-15	Method and device for introducing information into a data stream, and method and device for encoding an audio signal
US9842607B2 (en)	2017-12-12	Speech intelligibility improving apparatus and computer program therefor
DE60311619D1 (de)	2007-03-22	Data reduction in audio coders exploiting non-harmonic effects
WO2005101898A3 (en)	2005-12-29	A method and system for sound source separation
CN109616131B (zh)	2023-07-07	A digital real-time voice changing method
Alam et al.	2011	Perceptual improvement of Wiener filtering employing a post-filter
Nouza et al.	2013	Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech
Ratanasanya et al.	2005	New psychoacoustic models for wavelet based audio watermarking
Nie et al.	2004	A perception-based processing strategy for cochlear implants and speech coding
CN106310664A (zh)	2017-01-11	Voice-controlled toy and control method therefor
Wang et al.	2025	ANA-Mix: A Synthetic Corpus of Mandarin Speech in Airport Noise Conditions

Legal Events

Date	Code	Title	Description
2010-05-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties