EP2158588A1 - Spektralglättungsverfahren von verrauschten signalen - Google Patents
Spektralglättungsverfahren von verrauschten signalenInfo
- Publication number
- EP2158588A1 EP2158588A1 EP08784249A EP08784249A EP2158588A1 EP 2158588 A1 EP2158588 A1 EP 2158588A1 EP 08784249 A EP08784249 A EP 08784249A EP 08784249 A EP08784249 A EP 08784249A EP 2158588 A1 EP2158588 A1 EP 2158588A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- short
- smoothing method
- smoothing
- transformation
- term
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Definitions
- the invention relates to a smoothing method for suppressing fluctuating artifacts in noise reduction.
- noise suppression is an important aspect.
- the audio signals recorded with a microphone and subsequently digitized contain, in addition to the useful signal (FIG. 1), ambient noises superimposed on the useful signal (FIG. 2).
- FOG. 1 useful signal
- FIG. 2 useful signal
- hearing aids are constantly changing ambient noise such as traffic noise or people talking in the background, such as in a restaurant.
- the noise reduction aims accordingly to a relief of Speech understanding. Therefore, reducing the noise should not audibly distort the speech signal.
- the spectral representation is a favorable representation of the signal.
- the signal is displayed in frequency broken down.
- a practical realization of the spectral representation are short-term spectra which result from a division of the signal into short frames (FIG. 3), which are subjected to spectral transformation separately from one another (FIG. 4).
- a transformed frame then consists of M so-called frequency bins.
- the squared amplitude value of a frequency bin corresponds to the energy that contains the signal in the narrow frequency slice of about 31 Hz bandwidth represented by the respective frequency bin. Due to the symmetry properties of the spectral transformation, only M / 2 + 1 of the M frequency bins, ie 129 bins in the previous example, are relevant for the signal representation. With 129 relevant bins and 31 Hz bandwidth per bin, a total of spectral band from 0 Hz to about 4000 Hz is covered. This is sufficient to describe many speech sounds with sufficient spectral resolution.
- Another common bandwidth is 8000 Hz, which can be achieved by a higher sampling rate and thus more frequency bins with the same frame duration.
- the frequency bins are indexed with ⁇ .
- the index for frames is ⁇ .
- the amplitudes of the short-term spectrum of a frame ⁇ are generally noted here as a spectral quantity G ⁇ ( ⁇ ).
- G ⁇ ( ⁇ ) G M - ⁇ ( ⁇ ).
- a common form of presentation of the short-term spectra are so-called spectrograms, which are formed by juxtaposing time-sequential short-term spectra (see, for example, FIGS. 6 to 9).
- the advantage of the spectral representation is that the essential speech energy is concentrated in a relatively small number of frequency bins (FIGS. 4 and 6), while in the time signal all digital samples are equally relevant (FIG. 3).
- the signal energy of the disturbance is in most cases distributed over a larger number of frequency bins. Since the frequency bins contain different amounts of speech energy, it is possible to suppress the noise in those bins that contain little speech energy. The narrowband the frequency bins are, the better this separation succeeds.
- a spectral weighting function is estimated, which can be calculated according to different optimization criteria. It gives low values or zero in frequency bins, which are mostly perturbations, and values close to or equal to one for bins in which voice energy dominates (Figure 5).
- the weighting function is generally re-estimated for each signal frame in each frequency bin.
- the totality of the weighting values of all the frequency bins of a frame is also referred to herein as the "short-term spectrum of the weighting function" or simply as the "weighting function".
- the isolated outliers are heard as tonal artifacts (musical noise), which are perceived as particularly disturbing because of their tonality ( Figures 10 and 11).
- a single tonal artifact has the duration of a signal frame and its frequency is determined by the frequency bin in which the outlier occurred.
- these spectral magnitudes can be smoothed by an averaging method and thus freed from excessive values.
- Spectral magnitudes of several spectrally adjacent or temporally successive frequency bins are calculated to an average, so that the amplitude of individual outliers is relativized.
- a smoothing is above the frequency [1: Tim Fingscheidt, Christophe Beaugeant and Suhadi Suhadi. Overcoming the Statistical independence assumption is frequency in speech enhancement. Proceedings, IEEE Int. Conf.
- cepstrum basically consists of a non-linear mapping, namely the logarithmization, an absolute magnitude of the spectral magnitude and a subsequent transformation of this logarithmic magnitude spectrum with a transformation.
- the advantage of a cepstral representation of the amplitudes ( Figure 14) is that speech is no longer comb-like across the frequency ( Figures 4 and 6), but the essential information about the speech signal is represented in the cepstral small-index bins. In addition, substantial speech information is still represented in the relatively easily detectable cepstral bin of higher index, which represents the so-called pitch frequency of the speaker.
- a smoothed short-term spectrum can be calculated by setting cepstral bins to zero with relatively small amounts, and then re-transforming the altered cepstrum back to a short-term spectrum.
- cepstral bins to zero with relatively small amounts
- re-transforming the altered cepstrum back to a short-term spectrum.
- strong fluctuations or outliers to correspondingly high Amplitudes in the cepstrum these artifacts can not be detected by these methods and suppressed.
- the object of the invention is to provide a smoothing method for suppressing fluctuations in the weighting function or in spectral intermediate variables or outliers in filtered short-term spectra, which neither reduces the frequency resolution of the short-term spectra nor impairs the temporal dynamics of the speech signal.
- the smoothing method according to the invention comprises the following steps:
- the smoothing method according to the invention makes use of a transformation such as cepstrum in order to describe a broadband speech signal with as few transformation coefficients as possible in its essential structure. Unlike in known methods, however, the transformation coefficients are not set to zero independently of each other if they fall below a threshold value. Instead, the values of transformation coefficients from at least two consecutive frames are offset by smoothing over time. In this case, the degree of smoothing is made dependent on the extent to which the spectral structure represented by the coefficient is decisive for the description of the useful signal. The degree of temporal smoothing of a coefficient therefore depends, for example, on whether a transformation coefficient contains a lot of speech energy or little.
- Increased temporal smoothing thus counteracts the formation of outliers without affecting the structure of the language.
- the smoothing method thus does not result in a reduced spectral resolution for speech sounds.
- the change in the fine structure of the short-term spectrum in successive frames is delayed such that only narrowband spectral changes with time constants smaller than that of un noisy speech are suppressed.
- Transformations differ in their basic functions used.
- the process of transformation means that the signal is correlated with the various basis functions.
- the resulting degree of correlation between the signal and a basis function is then the associated transformation coefficient.
- Orthogonal transformation bases contain only basis functions that are uncorrelated. In the case that the signal is identical to one of the basic functions, transform coefficients with the value zero occur in the case of orthogonal transformations, with the exception of the one coefficient which is identical to the signal. The selectivity of an orthogonal transformation is therefore high.
- Non-orthogonal transforms use function bases that are correlated with each other.
- Another feature is that the basic functions for the considered application are discrete and finite, since the processed signal frames are discrete signals of the length of one frame.
- discrete Fourier transform is a preferred transformation.
- FFT Fast Fourier Transformation
- DCT discrete cosine transform
- DST discrete sine transform
- Term "standard transformations” An already mentioned property of the standard transformations, which is decisive for the invention, is that the amplitudes of the different transformation coefficients are different. represent different levels of fine structure of the transformed signal.
- coefficients with small indices describe the coarse structures of the transformed signal because the associated basis functions are low-frequency harmonic functions.
- the invertibility of the transforms also makes it possible to interchange the transform and its inverse in the back and forth transformations.
- the use of the DFT from (2) is thus also possible, for example, if the IDFT from (1) is used in (2).
- the spectral coefficients of the short-term spectra are mapped before non-linear transformation.
- a principal feature of non-linear imaging which is advantageous for the invention is dynamic compression of relatively large amplitudes and dynamic expansion of relatively small amplitudes.
- the spectral coefficients of the smoothed short-term spectra after the inverse transformation can be mapped non-linearly, the non-linear mapping after the inverse transformation being the inverse of the non-linear mapping before the forward transformation.
- the spectral coefficients are imaged non-linearly by logarithmization before the forward transformation.
- a form of temporal smoothing can be achieved by a recursive system of preferably first order:
- the smoothing method is applied to the magnitude or power of the magnitude of the short-term spectra.
- time constants can be selected in such a way that the transformation coefficients, which primarily represent speech, are less smoothed.
- the transformation coefficients which mainly fluctuate background noise and Artifacts of noise reduction algorithms describe being heavily smoothed.
- the spectral weighting function of a noise reduction algorithm can be provided.
- the spectral weighting function of a postfilter for multi-channel noise reduction methods can also be used as the short-term spectrum.
- the spectral weighting function results here from the minimization of an error criterion.
- a filtered short-term spectrum can also be provided.
- a spectral weighting function of a multi-channel method for noise reduction is provided as the short-term spectrum.
- an estimated coherence or an estimated magnitude squared coherence can also be provided between at least two microphone channels.
- a spectral weighting function of a multi-channel method for speaker or source separation is provided as the short-term spectrum.
- a spectral weighting function of a multi-channel method for speaker separation on the basis of phase differences of signals in the different channels is provided as a short-term spectrum.
- GCC Generalized Cross-Correlation
- spectral magnitudes containing both speech and noise components can also be provided.
- an estimate of the signal-to-noise ratio in the individual frequency bins can also be provided as the short-term spectrum.
- an estimate of the noise power can be used as the short-term spectrum.
- the line of an image is interpreted as a signal frame that can be transformed into the spectral range.
- the resulting frequency bins are called spatial frequency bins here.
- algorithms equivalent to those used in audio signal processing are used. Possible fluctuations that generate these algorithms in the spatial frequency range result in the processed image in optical artifacts. These are equivalent to tonal noise in audio processing.
- signals are derived from the human body that can be noisy, such as acoustic signals.
- the noisy signal can be transformed into the spectral range frame by frame.
- the resulting spectrograms can be processed like audio spectra.
- the smoothing method can be used in a telecommunications network and / or in broadcasting to improve speech and / or picture quality and to suppress artifacts.
- Distortions of the speech signal occur in mobile voice communication, which are caused, on the one hand, by the speech coding method used (redundant speech compression) and the associated quantization noise and, on the other hand, by the interference caused by the transmission channel.
- the latter in turn, fluctuate greatly in terms of time and spectral and lead to a noticeable deterioration of the voice quality.
- the receiver side or network must be put signal processing to ensure that the quasi-random artifacts are reduced.
- so-called post-filters and error concealment methods have hitherto been used.
- the smoothing method can thus be used as a postfilter, in a postfilter, in combination with a postfilter, as part of an error concealment method or in connection with a method for speech and / or picture coding (decompression method or decoding method), in particular on the receiver side.
- a postfilter it is meant that the method is used for post-filtering, that is to say that the data resulting from the applications are processed with an algorithm implementing the method.
- it is possible to improve the quality of the speech signal in the telecommunications network by smoothing the speech signal spectrum or a variable derived therefrom with the smoothing method according to the invention.
- FIG. 1 shows an unrisen time signal
- FIG. 2 shows a noisy time signal
- FIG. 3 shows a single signal frame in the time domain
- FIG. 4 shows a single signal frame in the spectral range
- Figure 5 shows a weighting function for a single frame
- Figure 6 shows the spectrogram of a noisy signal
- Figure 7 shows the spectrogram of a noisy signal
- Figure 8 shows the spectrogram of a signal filtered with the unsmoothed weighting function
- FIG. 9 shows the spectrogram of a signal filtered with a smoothed weighting function according to the invention.
- FIG. 10 shows a filtered time signal with tonal artifacts
- FIG. 11 shows a time signal filtered according to the invention
- Figure 12 is the spectrogram of an unsmoothed weighting function
- FIG. 13 shows the spectrogram of a weighting function smoothed according to the invention
- FIG. 14 shows the magnitude of the cepstrum of a noisy speech signal
- FIG. 15 shows the signal flow graph according to a preferred embodiment of the invention.
- FIG. 1 shows an undrawn signal in the form of the amplitude over time.
- the duration of the signal is 4 seconds, the amplitudes range from about -0.18 to about 0.18.
- the signal is shown in noisy form. One recognizes a random background noise over the entire time course.
- FIG. 3 shows the signal of a single signal frame ⁇ .
- the signal frame has a segment duration of 32 milliseconds.
- the amplitude of both graphs ranges between -0.1 and 0.1.
- the individual samples of the digital signals are connected to graphs.
- the noisy graph represents the input signal containing the noisy signal. A separation of signal and noise in the noisy signal is hardly possible in this representation of the signal.
- Figure 4 is a representation of the same signal frame after the transformation into the frequency domain.
- the individual frequency bins ⁇ are connected to graphs.
- the frequency bins are noisy and noisy, again with the noisy signal in the noisy signal is included speech signal.
- the abscissa shows the frequency bins ⁇ from 0 to 128. They have amplitudes of about -40 decibels (dB) to about 10 dB. From the comparison of the graphs, it can be seen that the energy of the speech signal in some frequency bins is concentrated in a comb-like structure, while the noise is also present in the intervening bins.
- FIG. 5 shows a weighting function for the noisy frame of FIG. For each frequency bin ⁇ , a factor between 0 and 1 results depending on the ratio of speech and noise energy.
- the individual weighting factors are connected to a graph. One recognizes the comb-like structure of the speech spectrum again.
- FIGS. 6 and 7 show spectrograms from a series of noisy or noisy short-term spectra (FIG. 4).
- the frame index ⁇ is plotted, on the ordinate of the frequency bin index ⁇ .
- the amplitudes of the individual frequency bins are shown as gray values.
- Figure 6 and 7 it becomes clear how speech is concentrated in a few frequency bins. It also trains regular structures. The noise, however, is distributed over all frequency bins.
- FIG. 8 shows the spectrogram of a filtered signal.
- the axes correspond to those in FIGS. 6 and 7. It can be seen from a comparison with FIG. 6 that estimation errors in the weighting function leave high amplitudes in frequency bins which contain no speech. To suppress these outliers is the aim of the method according to the invention.
- FIG. 9 shows the spectrogram of a signal which has been filtered with a smoothed weighting function in accordance with a preferred development of the method according to the invention.
- the axes correspond to those of the previous spectrograms.
- the outliers are greatly reduced.
- the speech components in the spectrogram are preserved in their essential form.
- FIGS. 10 and 11 show the time signals resulting from the filtered spectra of FIGS. 8 and 9, respectively. Plotted is the amplitude over time. The signals are 4 seconds long and have amplitudes between about -0.18 and 0.18.
- the outliers in the spectrogram from FIG. 8 produce clearly visible tonal artifacts in the associated time signal in FIG. 10, which are not present in the noise-free signal from FIG.
- the time signal in Figure 11 has a much quieter course of the residual noise. This time signal results from the spectrogram of Figure 9, which was generated by filtering with the smoothed weighting function.
- FIG. 12 shows the unsmoothed weighting function for all frames. Frequency bins ⁇ are plotted along the ordinate for each frame ⁇ . The values of the weighting function are shown as gray tones. The fluctuations resulting from estimation errors are recognizable as irregular patches.
- FIG. 13 shows the smoothed weighting function for all frames.
- the axes correspond to those in FIG. 12. Smoothing smears the fluctuations and greatly reduces their value.
- the structure of the voice frequency bins however, remains clearly recognizable.
- FIG. 14 shows the magnitude of the cepstrum of a noiseless signal over all frames. For each frame ⁇ , the cepstral bins ⁇ 'are plotted along the ordinate. The values of the amounts of cepstral
- Coefficients Gy pst ⁇ are shown as gray tones. A comparison with Figure 6 shows that speech in the cepstrum is concentrated to an even smaller number of coefficients. In addition, these coefficients are less variable in their position. Clearly recognizable is the course of the cepstral coefficient, which represents the pitch frequency.
- FIG. 15 shows a signal flow graph according to a preferred embodiment of the invention.
- a noisy input signal is transformed into a sequence of short-term spectra, and then a weighting function for filtering is then estimated via spectral intermediate quantities. It will be respectively a frame currently being edited.
- the short-term spectra of the weighting function are subjected to a non-linear, logarithmic mapping. This is followed by a transformation into the cepstral area.
- the thus transformed short-term spectra are thus represented by transformation coefficients of the basis functions.
- the transformation coefficients calculated in this way are smoothed separately using different time constants.
- the recursive nature of the smoothing is indicated by the return of the output of the smoothing to its input.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Networks Using Active Elements (AREA)
- Spectrometry And Color Measurement (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Investigating Or Analyzing Materials By The Use Of Ultrasonic Waves (AREA)
- Ultra Sonic Daignosis Equipment (AREA)
- Optical Communication System (AREA)
- Color Television Image Signal Generators (AREA)
- Holo Graphy (AREA)
- Photoreceptors In Electrophotography (AREA)
Description
Claims
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE102007030209A DE102007030209A1 (de) | 2007-06-27 | 2007-06-27 | Glättungsverfahren |
| PCT/DE2008/001047 WO2009000255A1 (de) | 2007-06-27 | 2008-06-25 | Spektralglättungsverfahren von verrauschten signalen |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EP2158588A1 true EP2158588A1 (de) | 2010-03-03 |
| EP2158588B1 EP2158588B1 (de) | 2010-10-13 |
Family
ID=39767094
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP08784249A Not-in-force EP2158588B1 (de) | 2007-06-27 | 2008-06-25 | Spektralglättungsverfahren von verrauschten signalen |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US8892431B2 (de) |
| EP (1) | EP2158588B1 (de) |
| AT (1) | ATE484822T1 (de) |
| DE (2) | DE102007030209A1 (de) |
| DK (1) | DK2158588T3 (de) |
| WO (1) | WO2009000255A1 (de) |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| ATE454696T1 (de) * | 2007-08-31 | 2010-01-15 | Harman Becker Automotive Sys | Schnelle schätzung der spektraldichte der rauschleistung zur sprachsignalverbesserung |
| US8588138B2 (en) * | 2009-07-23 | 2013-11-19 | Qualcomm Incorporated | Header compression for relay nodes |
| US8675115B1 (en) | 2011-02-14 | 2014-03-18 | DigitalOptics Corporation Europe Limited | Forward interpolation approach for constructing a second version of an image from a first version of the image |
| US8577186B1 (en) * | 2011-02-14 | 2013-11-05 | DigitalOptics Corporation Europe Limited | Forward interpolation approach using forward and backward mapping |
| WO2012128678A1 (en) * | 2011-03-21 | 2012-09-27 | Telefonaktiebolaget L M Ericsson (Publ) | Method and arrangement for damping of dominant frequencies in an audio signal |
| JP2014513320A (ja) | 2011-03-21 | 2014-05-29 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | オーディオ信号におけるドミナント周波数を減衰する方法及び装置 |
| GB201114737D0 (en) * | 2011-08-26 | 2011-10-12 | Univ Belfast | Method and apparatus for acoustic source separation |
| US9026451B1 (en) * | 2012-05-09 | 2015-05-05 | Google Inc. | Pitch post-filter |
| JP5772723B2 (ja) * | 2012-05-31 | 2015-09-02 | ヤマハ株式会社 | 音響処理装置および分離マスク生成装置 |
| CN105144290B (zh) * | 2013-04-11 | 2021-06-15 | 日本电气株式会社 | 信号处理装置、信号处理方法和信号处理程序 |
| US20150179181A1 (en) * | 2013-12-20 | 2015-06-25 | Microsoft Corporation | Adapting audio based upon detected environmental accoustics |
| DE102014210760B4 (de) * | 2014-06-05 | 2023-03-09 | Bayerische Motoren Werke Aktiengesellschaft | Betrieb einer Kommunikationsanlage |
| US11385168B2 (en) * | 2015-03-31 | 2022-07-12 | Nec Corporation | Spectroscopic analysis apparatus, spectroscopic analysis method, and readable medium |
| US9721581B2 (en) * | 2015-08-25 | 2017-08-01 | Blackberry Limited | Method and device for mitigating wind noise in a speech signal generated at a microphone of the device |
| US9972134B2 (en) | 2016-06-30 | 2018-05-15 | Microsoft Technology Licensing, Llc | Adaptive smoothing based on user focus on a target object |
| WO2019213769A1 (en) | 2018-05-09 | 2019-11-14 | Nureva Inc. | Method, apparatus, and computer-readable media utilizing residual echo estimate information to derive secondary echo reduction parameters |
| EP3573058B1 (de) * | 2018-05-23 | 2021-02-24 | Harman Becker Automotive Systems GmbH | Trocken- und raumschalltrennung |
| JP7278092B2 (ja) * | 2019-02-15 | 2023-05-19 | キヤノン株式会社 | 画像処理装置、撮像装置、画像処理方法、撮像装置の制御方法、及びプログラム |
| US12462782B2 (en) | 2021-06-25 | 2025-11-04 | Nureva, Inc. | System for dynamically adjusting a soundmask signal based on realtime ambient noise parameters while maintaining echo canceller calibration performance |
| CN113726348B (zh) * | 2021-07-21 | 2022-06-21 | 湖南艾科诺维科技有限公司 | 一种无线电信号频谱的平滑滤波方法及系统 |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH02195400A (ja) * | 1989-01-24 | 1990-08-01 | Canon Inc | 音声認識装置 |
| US5365592A (en) * | 1990-07-19 | 1994-11-15 | Hughes Aircraft Company | Digital voice detection apparatus and method using transform domain processing |
| US5737485A (en) * | 1995-03-07 | 1998-04-07 | Rutgers The State University Of New Jersey | Method and apparatus including microphone arrays and neural networks for speech/speaker recognition systems |
| US6070140A (en) * | 1995-06-05 | 2000-05-30 | Tran; Bao Q. | Speech recognizer |
| DE19629132A1 (de) * | 1996-07-19 | 1998-01-22 | Daimler Benz Ag | Verfahren zur Verringerung von Störungen eines Sprachsignals |
| US7272556B1 (en) * | 1998-09-23 | 2007-09-18 | Lucent Technologies Inc. | Scalable and embedded codec for speech and audio signals |
| US6226606B1 (en) * | 1998-11-24 | 2001-05-01 | Microsoft Corporation | Method and apparatus for pitch tracking |
| US6766292B1 (en) * | 2000-03-28 | 2004-07-20 | Tellabs Operations, Inc. | Relative noise ratio weighting techniques for adaptive noise cancellation |
| JP3566197B2 (ja) * | 2000-08-31 | 2004-09-15 | 松下電器産業株式会社 | 雑音抑圧装置及び雑音抑圧方法 |
| US7054810B2 (en) * | 2000-10-06 | 2006-05-30 | International Business Machines Corporation | Feature vector-based apparatus and method for robust pattern recognition |
| US6964023B2 (en) * | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
| US7124075B2 (en) * | 2001-10-26 | 2006-10-17 | Dmitry Edward Terez | Methods and apparatus for pitch determination |
| US20040002856A1 (en) * | 2002-03-08 | 2004-01-01 | Udaya Bhaskar | Multi-rate frequency domain interpolative speech CODEC system |
| US7725314B2 (en) * | 2004-02-16 | 2010-05-25 | Microsoft Corporation | Method and apparatus for constructing a speech filter using estimates of clean speech and noise |
| US7689419B2 (en) * | 2005-09-22 | 2010-03-30 | Microsoft Corporation | Updating hidden conditional random field model parameters after processing individual training samples |
| US7680663B2 (en) * | 2006-08-21 | 2010-03-16 | Micrsoft Corporation | Using a discretized, higher order representation of hidden dynamic variables for speech recognition |
| US8145488B2 (en) * | 2008-09-16 | 2012-03-27 | Microsoft Corporation | Parameter clustering and sharing for variable-parameter hidden markov models |
-
2007
- 2007-06-27 DE DE102007030209A patent/DE102007030209A1/de not_active Ceased
-
2008
- 2008-06-25 DE DE502008001543T patent/DE502008001543D1/de active Active
- 2008-06-25 EP EP08784249A patent/EP2158588B1/de not_active Not-in-force
- 2008-06-25 AT AT08784249T patent/ATE484822T1/de active
- 2008-06-25 DK DK08784249.8T patent/DK2158588T3/da active
- 2008-06-25 US US12/665,526 patent/US8892431B2/en not_active Expired - Fee Related
- 2008-06-25 WO PCT/DE2008/001047 patent/WO2009000255A1/de not_active Ceased
Non-Patent Citations (1)
| Title |
|---|
| See references of WO2009000255A1 * |
Also Published As
| Publication number | Publication date |
|---|---|
| DE502008001543D1 (de) | 2010-11-25 |
| EP2158588B1 (de) | 2010-10-13 |
| WO2009000255A9 (de) | 2010-05-14 |
| DK2158588T3 (da) | 2011-02-07 |
| US20100182510A1 (en) | 2010-07-22 |
| DE102007030209A1 (de) | 2009-01-08 |
| US8892431B2 (en) | 2014-11-18 |
| ATE484822T1 (de) | 2010-10-15 |
| WO2009000255A1 (de) | 2008-12-31 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP2158588B1 (de) | Spektralglättungsverfahren von verrauschten signalen | |
| DE60116255T2 (de) | Rauschunterdückungsvorrichtung und -verfahren | |
| DE60131639T2 (de) | Vorrichtungen und Verfahren zur Bestimmung von Leistungswerten für die Geräuschunterdrückung für ein Sprachkommunikationssystem | |
| DE112009000805B4 (de) | Rauschreduktion | |
| DE602005000539T2 (de) | Verstärkungsgesteuerte Geräuschunterdrückung | |
| DE60225130T2 (de) | Verbesserung der transientenleistung bei kodierern mit niedriger bitrate durch unterdrückung des vorgeräusches | |
| EP1143416B1 (de) | Geräuschunterdrückung im Zeitbereich | |
| DE69428119T2 (de) | Verringerung des hintergrundrauschens zur sprachverbesserung | |
| DE60024501T2 (de) | Verbesserung der perzeptuellen Qualität von SBR (Spektralbandreplikation) UND HFR (Hochfrequenzen-Rekonstruktion) Kodierverfahren mittels adaptivem Addieren von Grundrauschen und Begrenzung der Rauschsubstitution | |
| DE60027438T2 (de) | Verbesserung eines verrauschten akustischen signals | |
| DE19747885B4 (de) | Verfahren zur Reduktion von Störungen akustischer Signale mittels der adaptiven Filter-Methode der spektralen Subtraktion | |
| DE3689035T2 (de) | Rauschminderungssystem. | |
| DE69630580T2 (de) | Rauschunterdrücker und Verfahren zur Unterdrückung des Hintergrundrauschens in einem verrauschten Sprachsignal und eine Mobilstation | |
| DE69627580T2 (de) | Verfahren zur Rauschverminderung in einem Sprachsignal | |
| DE112011104737B4 (de) | Geräuschunterdrückungsvorrichtung | |
| DE112012006876T5 (de) | Formantabhaengige Sprachsignalverbesserung | |
| AT509570B1 (de) | Methode und apparat zur einkanal-sprachverbesserung basierend auf einem latenzzeitreduzierten gehörmodell | |
| DE112011106045B4 (de) | Audiosignal-Wiederherstellungsvorrichtung und Audiosignal-Wiederherstellungsverfahren | |
| EP3065417B1 (de) | Verfahren zur unterdrückung eines störgeräusches in einem akustischen system | |
| EP1103956A2 (de) | Exponentielle Echo- und Geräuschabsenkung in Sprachpausen | |
| EP1155561B1 (de) | Vorrichtung und verfahren zur geräuschunterdrückung in fernsprecheinrichtungen | |
| DE102018117558A1 (de) | Adaptives nachfiltern | |
| EP1239455A2 (de) | Verfahren und Anordnung zur Durchführung einer an die Übertragungsfunktion menschilcher Sinnesorgane angepassten Fourier Transformation sowie darauf basierende Vorrichtungen zur Geräuschreduktion und Spracherkennung | |
| DE602004006912T2 (de) | Verfahren zur Verarbeitung eines akustischen Signals und ein Hörgerät | |
| DE10137348A1 (de) | Verfahren und Schaltungsanordnung zur Geräuschreduktion bei der Sprachübertragung in Kommunikationssystemen |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20091210 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR |
|
| AX | Request for extension of the european patent |
Extension state: AL BA MK RS |
|
| GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
| GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
| GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
| DAX | Request for extension of the european patent (deleted) | ||
| AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: GERMAN |
|
| REF | Corresponds to: |
Ref document number: 502008001543 Country of ref document: DE Date of ref document: 20101125 Kind code of ref document: P |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: NV Representative=s name: SIEMENS SCHWEIZ AG |
|
| REG | Reference to a national code |
Ref country code: DK Ref legal event code: T3 |
|
| REG | Reference to a national code |
Ref country code: NL Ref legal event code: VDEP Effective date: 20101013 |
|
| LTIE | Lt: invalidation of european patent or patent extension |
Effective date: 20101013 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110113 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: FD4D |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110113 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110213 Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110214 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110114 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110124 Ref country code: IE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| 26N | No opposition filed |
Effective date: 20110714 |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 502008001543 Country of ref document: DE Effective date: 20110714 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| BERE | Be: lapsed |
Owner name: RUHR-UNIVERSITAT BOCHUM Effective date: 20110630 Owner name: SIEMENS AUDIOLOGISCHE TECHNIK G.M.B.H. Effective date: 20110630 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110630 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20120626 Year of fee payment: 5 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110630 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110625 Ref country code: CY Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20101013 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20101013 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20130625 |
|
| REG | Reference to a national code |
Ref country code: AT Ref legal event code: MM01 Ref document number: 484822 Country of ref document: AT Kind code of ref document: T Effective date: 20130625 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20130625 |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 502008001543 Country of ref document: DE Representative=s name: FDST PATENTANWAELTE FREIER DOERR STAMMLER TSCH, DE |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 502008001543 Country of ref document: DE Representative=s name: FDST PATENTANWAELTE FREIER DOERR STAMMLER TSCH, DE Ref country code: DE Ref legal event code: R081 Ref document number: 502008001543 Country of ref document: DE Owner name: RUHR-UNIVERSITAET BOCHUM, DE Free format text: FORMER OWNERS: RUHR-UNIVERSITAET BOCHUM, 44801 BOCHUM, DE; SIEMENS AUDIOLOGISCHE TECHNIK GMBH, 91058 ERLANGEN, DE Ref country code: DE Ref legal event code: R081 Ref document number: 502008001543 Country of ref document: DE Owner name: SIVANTOS GMBH, DE Free format text: FORMER OWNERS: RUHR-UNIVERSITAET BOCHUM, 44801 BOCHUM, DE; SIEMENS AUDIOLOGISCHE TECHNIK GMBH, 91058 ERLANGEN, DE |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 9 |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 10 |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 11 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DK Payment date: 20190624 Year of fee payment: 12 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20190626 Year of fee payment: 12 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: CH Payment date: 20190624 Year of fee payment: 12 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20190624 Year of fee payment: 12 Ref country code: GB Payment date: 20190624 Year of fee payment: 12 |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 502008001543 Country of ref document: DE |
|
| REG | Reference to a national code |
Ref country code: DK Ref legal event code: EBP Effective date: 20200630 |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
| GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20200625 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200630 Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200625 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200630 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200630 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20210101 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DK Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200630 |