SU544990A1

SU544990A1 - Speech recognition device

Info

Publication number: SU544990A1
Application number: SU2016330A
Authority: SU
Inventors: Вера Лазаревна Могильницкая; Лазарь Моисеевич Могилиницкий; Моисей Львович Ханин
Priority date: 1974-04-08
Filing date: 1974-04-08
Publication date: 1977-01-30

Description


The invention relates to computing and automation and can be used to input speech information into computers, actuating automata, and similar equipment.

Devices for the automatic recognition of speech sounds are known in which invariant features are obtained by comparing the energies of frequency bands of the speech spectrum. In these devices the sound signal is converted by a microphone into an electrical signal, amplified, compressed, and spectrally analyzed by a filter bank. The envelopes of the resulting spectral bands are then extracted and, after temporal processing, compared (usually in pairs) in comparison units. The presence of a particular invariant feature is judged from the result of the level comparison. Invariant features obtained both by this method, i.e. by comparing the energy levels of different spectral bands of the speech signal, and from other speech parameters are fed to the phoneme decision units. For example, in one such device the sound signal, converted into an electrical signal, is fed to a selective amplifier with automatic gain control, where it is amplified, limited, and compressed in dynamic range, after which its time-frequency characteristics are analyzed.
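The pairwise comparison of band envelopes described above can be illustrated with a minimal sketch. The function name, the band levels, and the zero threshold are hypothetical and chosen only for illustration; the patent does not specify which bands are paired or what thresholds are used.

```python
def invariant_features(band_levels, pairs, threshold=0.0):
    """Compare envelope levels of band pairs.

    Returns one boolean per pair: True when the first band's level
    exceeds the second band's level by more than `threshold`.
    All values here are illustrative assumptions, not patent data.
    """
    return [band_levels[i] - band_levels[j] > threshold for i, j in pairs]


# Hypothetical envelope levels at the outputs of three filters.
levels = [0.8, 0.3, 0.5]
# Hypothetical choice of which bands are compared with each other.
pairs = [(0, 1), (1, 2)]

features = invariant_features(levels, pairs)
print(features)  # [True, False]
```

Because each feature depends only on which of two bands is stronger, it does not change when both levels are scaled by the same factor, which is presumably why the text calls such features invariant.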

The dynamic range carries information about the speech sounds and is large (over 80 dB when possible intonation changes and movements of the speaker are taken into account). Limiting and compressing the dynamic range loses part of this information. In addition, nonlinear distortion arises in the amplification and temporal-processing units when the dynamic range is large. As a consequence, recognition reliability falls.
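To give a sense of scale for the 80 dB figure, the short sketch below converts decibels to a linear ratio. It assumes the figure refers to an amplitude (voltage) ratio, so that dB = 20 log10(ratio); the patent does not state whether amplitude or power is meant.

```python
import math


def db_to_amplitude_ratio(db):
    """Convert decibels to a linear amplitude ratio (assumes 20*log10 convention)."""
    return 10 ** (db / 20)


def amplitude_ratio_to_db(ratio):
    """Convert a linear amplitude ratio back to decibels."""
    return 20 * math.log10(ratio)


ratio = db_to_amplitude_ratio(80)
print(ratio)  # 10000.0
```

Under this assumption, the loudest signal the device must handle is about ten thousand times larger in amplitude than the quietest, which is difficult for a single linear amplifier and detector chain to cover.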

The closest to the invention is a speech recognition device comprising decision units and a microphone connected to a group of filters, each of which is connected in series with a first amplifier and a main detection unit; the main detection units are connected in pairs to main comparison units.

However, such a device does not provide sufficiently reliable recognition of speech signals.

The aim of the invention is to increase the reliability of speech recognition by eliminating the influence of the inertia of the control circuits and by avoiding both the loss of information caused by compressing the dynamic range of speech and the distortion that arises when signals with a wide dynamic range are processed.

To this end, the device additionally includes additional comparison units, groups of OR elements, and second amplifiers, each connected to a first amplifier and in series with an additional detection unit. The additional detection units are connected in pairs to the additional comparison units, which are connected to the first inputs of the groups of OR elements; the second inputs of the OR elements are connected to the main comparison units, and their outputs to the decision units.

The drawing shows a block diagram of the proposed device.

It consists of a microphone 1, a group of filters 2, first amplifiers 3, main detection units 4, second amplifiers 5, additional detection units 6, main comparison units 7, additional comparison units 8, groups of OR elements 9, and decision units 10.

Microphone 1 converts the sound signal into an electrical signal, which is fed to a frequency-analyzing filter bank formed by the group of filters 2 tuned to different frequencies (f1, f2, f3 ... fn). An amplifier 3 is connected to each output of the group of filters 2; from these amplifiers the selected spectral bands pass to the main detection units 4 and, through amplifiers 5, to the additional detection units 6. The resulting voltages from detection units 4 and 6 of different filters, for example f1 and f2, are applied to the main comparison units 7 and the additional comparison units 8 respectively, and the resulting signals go to the groups of OR elements 9. These are adjusted so that they operate only when a level of a certain polarity is present at the output of at least one of the main or additional comparison units. Operation of a group of OR elements 9 indicates that an invariant feature has been produced, but for this the signals from the outputs of units 7 and 8 (or one of them) must have a certain polarity and exceed the operating threshold of the OR element in the group of OR elements 9. The polarity of the output signals of units 7 and 8 is determined by the ratio of the compared levels. For normal operation of the device, the gain of amplifiers 5 must equal the specified dynamic range. Reducing the gain of amplifiers 5 below the dynamic range narrows the dynamic range of the device.
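The two-channel arrangement can be illustrated with a simplified numerical model. This is only a sketch under stated assumptions, not the circuit of the patent: the detector is idealized as giving no output below a hypothetical `floor` and clipping above a hypothetical `ceiling`, and the values of `gain2` and `threshold` are invented for illustration. Here `gain2` is set equal to the usable range of one detector channel (ceiling / floor = 100), which is one possible reading of the requirement that the gain of amplifiers 5 equal the specified dynamic range.

```python
def detect(level, floor=0.01, ceiling=1.0):
    """Idealized detector (assumption): no output below `floor`, clipping above `ceiling`."""
    if level < floor:
        return 0.0
    return min(level, ceiling)


def or_element(band_a, band_b, gain2=100.0, threshold=0.005):
    """Model one OR element 9 fed by a main and an additional comparison unit.

    band_a, band_b: levels of two bands at the outputs of first amplifiers 3.
    gain2: hypothetical gain of second amplifiers 5.
    Returns True when either comparison yields a positive difference
    that exceeds `threshold`.
    """
    # Main channel: detection units 4 feeding main comparison unit 7.
    main_diff = detect(band_a) - detect(band_b)
    # Additional channel: amplifiers 5, detection units 6, comparison unit 8.
    extra_diff = detect(gain2 * band_a) - detect(gain2 * band_b)
    return main_diff > threshold or extra_diff > threshold


loud = or_element(0.5, 0.2)         # main channel decides; additional channel clips
quiet = or_element(0.0005, 0.0002)  # main channel sees nothing; additional channel decides
reverse = or_element(0.2, 0.5)      # opposite polarity, so the element does not operate
print(loud, quiet, reverse)  # True True False
```

In this model the main channel handles strong signals while the additional channel clips, and the additional channel handles weak signals that fall below the main detector's floor. The same feature is therefore produced over a range of about 10^4 in amplitude without any gain regulation or compression in the signal path.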

The proposed device compares favorably with known devices in that it eliminates the influence of nonlinear distortion arising when speech with a wide dynamic range is recognized, which increases recognition reliability.

Claims

Invention Formula

A speech recognition device comprising decision units and a microphone connected to a group of filters, each of which is connected in series with a first amplifier and a main detection unit, the main detection units being connected in pairs to main comparison units, characterized in that, in order to increase recognition reliability, it additionally comprises additional comparison units, groups of OR elements, and second amplifiers, each connected to a first amplifier and in series with an additional detection unit, the additional detection units being connected in pairs to the additional comparison units, which are connected to the first inputs of the groups of OR elements, the second inputs of which are connected to the main comparison units and the outputs to the decision units.