KR970050118A - Automatic model determination method for speech recognition - Google Patents
- Publication number
- KR970050118A (application number KR1019950058739A)
- Authority
- KR
- South Korea
- Prior art keywords
- codebook
- model
- speech
- recognition
- models
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (1)
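- An automatic model determination method for speech recognition, characterized by comprising: a first step of extracting feature vectors from input speech; a second step of quantizing the vector sequence extracted in the first step, over a predetermined frame interval, with each of a plurality of pre-trained codebooks, then computing the quantization error and accumulating it into a cumulative distance; a third step of selecting the codebook whose cumulative distance computed in the second step is smallest; a fourth step of assigning weights to the codewords of the codebook selected in the third step, then receiving the entire input speech and quantizing it with the codewords of the selected codebook; and a fifth step of selecting the model corresponding to the codebook used for quantization in the fourth step and outputting it as the speech recognition result. ※ Note: Published on the basis of the application as originally filed.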
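The codebook selection in the second and third steps, and the full-utterance quantization in the fourth, can be illustrated with a minimal Python sketch. This is not the patent's implementation. The function names, the toy two-dimensional feature vectors, the codebook labels `model_a` and `model_b`, and the use of Euclidean distance are all assumptions made for illustration. The claim does not say how codewords are weighted in the fourth step, so the sketch leaves weighting out.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def quantization_error(frame, codebook):
    """Distance from one feature vector to its nearest codeword."""
    return min(dist(frame, codeword) for codeword in codebook)


def cumulative_distance(frames, codebook):
    """Sum of per-frame quantization errors over a run of frames."""
    return sum(quantization_error(frame, codebook) for frame in frames)


def select_codebook(frames, codebooks, interval):
    """Return the name of the codebook with the smallest cumulative
    distance over the first `interval` frames (steps 2 and 3)."""
    head = frames[:interval]
    return min(codebooks, key=lambda name: cumulative_distance(head, codebooks[name]))


def quantize(frames, codebook):
    """Map every frame to the index of its nearest codeword (step 4,
    without the unspecified codeword weighting)."""
    return [
        min(range(len(codebook)), key=lambda i: dist(frame, codebook[i]))
        for frame in frames
    ]


# Hypothetical toy data: one pre-trained codebook per candidate model.
codebooks = {
    "model_a": [(0.0, 0.0), (1.0, 1.0)],
    "model_b": [(5.0, 5.0), (6.0, 6.0)],
}
frames = [(0.1, 0.0), (0.9, 1.1), (1.0, 0.8), (0.2, 0.1)]

selected = select_codebook(frames, codebooks, interval=3)
indices = quantize(frames, codebooks[selected])
print(selected, indices)
```

Here each dictionary key doubles as the model label, so the fifth step reduces to reporting `selected`. A real recognizer would map the selected codebook to a separately trained model.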
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1019950058739A (KR0176788B1, ko) | 1995-12-27 | 1995-12-27 | Automatic model determination method for speech recognition |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1019950058739A (KR0176788B1, ko) | 1995-12-27 | 1995-12-27 | Automatic model determination method for speech recognition |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR970050118A (ko) | 1997-07-29 |
| KR0176788B1 (ko) | 1999-04-01 |
Family
ID=19445074
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1019950058739A (KR0176788B1, ko), Expired - Fee Related | Automatic model determination method for speech recognition | 1995-12-27 | 1995-12-27 |
Country Status (1)
| Country | Link |
|---|---|
| KR (1) | KR0176788B1 (ko) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024053854A1 (ko) * | 2022-09-05 | 2024-03-14 | Seoul National University R&DB Foundation | Residual vector quantization apparatus and method using Viterbi beam search, and computer-readable medium |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE521225C2 (sv) * | 1998-09-16 | 2003-10-14 | Ericsson Telefon Ab L M | Method and apparatus for CELP encoding/decoding |
1995
- 1995-12-27: KR application KR1019950058739A, patent KR0176788B1 (ko), not active, Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| KR0176788B1 (ko) | 1999-04-01 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| AU707355B2 (en) | Speech recognition | |
| Giacobello et al. | Sparse linear prediction and its applications to speech processing | |
| WO2022141678A1 (zh) | Speech synthesis method, apparatus, device, and storage medium | |
| JP2746039B2 (ja) | Speech coding system | |
| US5033087A (en) | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system | |
| US5122951A (en) | Subject and word associating devices | |
| JPH0816187A (ja) | Speech recognition method in speech analysis | |
| RU2009119491A (ru) | Method and device for coding transition frames in speech signals | |
| CN115132170B (zh) | Language classification method, apparatus, and computer-readable storage medium | |
| CN116386605B (zh) | Model training method and apparatus, speech synthesis method, device, and storage medium | |
| CN118298803B (zh) | Voice cloning method | |
| US11404045B2 (en) | Speech synthesis method and apparatus | |
| JPH05216500A (ja) | Speech coding apparatus | |
| JP2624130B2 (ja) | Speech coding system | |
| KR970050118A (ko) | Automatic model determination method for speech recognition | |
| Syrdal et al. | Perceptually-based data-driven join costs: comparing join types. | |
| JPH08292797A (ja) | Speech coding apparatus | |
| JP3256215B2 (ja) | Speech coding apparatus | |
| JP3252285B2 (ja) | Voice-band signal coding method | |
| JP3578933B2 (ja) | Method for creating a weight codebook, method for setting initial values of MA prediction coefficients during learning in codebook design, method for encoding an acoustic signal and method for decoding it, computer-readable storage medium storing an encoding program, and computer-readable storage medium storing a decoding program | |
| JP3024467B2 (ja) | Speech coding apparatus | |
| JP3194930B2 (ja) | Speech coding apparatus | |
| CN118411979B (zh) | Synthesized speech adjustment method, training method, and related apparatus | |
| JP3144194B2 (ja) | Speech coding apparatus | |
| KR930011740B1 (ko) | Method for recognizing similar words | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A201 | Request for examination | |
| | PA0109 | Patent application | St.27 status event code: A-0-1-A10-A12-nap-PA0109 |
| | PA0201 | Request for examination | St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
| | R17-X000 | Change to representative recorded | St.27 status event code: A-3-3-R10-R17-oth-X000 |
| | PG1501 | Laying open of application | St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
| | E701 | Decision to grant or registration of patent right | |
| | PE0701 | Decision of registration | St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
| | PR1002 | Payment of registration fee | St.27 status event code: A-2-2-U10-U11-oth-PR1002; Fee payment year number: 1 |
| | GRNT | Written decision to grant | |
| | PR0701 | Registration of establishment | St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
| | PN2301 | Change of applicant | St.27 status event code: A-5-5-R10-R13-asn-PN2301; St.27 status event code: A-5-5-R10-R11-asn-PN2301 |
| | PG1601 | Publication of registration | St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
| | PN2301 | Change of applicant | St.27 status event code: A-5-5-R10-R13-asn-PN2301; St.27 status event code: A-5-5-R10-R11-asn-PN2301 |
| | PN2301 | Change of applicant | St.27 status event code: A-5-5-R10-R13-asn-PN2301; St.27 status event code: A-5-5-R10-R11-asn-PN2301 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 4 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 5 |
| | PN2301 | Change of applicant | St.27 status event code: A-5-5-R10-R13-asn-PN2301; St.27 status event code: A-5-5-R10-R11-asn-PN2301 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 6 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 7 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 8 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 9 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 10 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 11 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 12 |
| | FPAY | Annual fee payment | Payment date: 20100929; Year of fee payment: 13 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 13 |
| | LAPS | Lapse due to unpaid annual fee | |
| | PC1903 | Unpaid annual fee | St.27 status event code: A-4-4-U10-U13-oth-PC1903; Not in force date: 20111115; Payment event data comment text: Termination Category: DEFAULT_OF_REGISTRATION_FEE |
| | PC1903 | Unpaid annual fee | St.27 status event code: N-4-6-H10-H13-oth-PC1903; Ip right cessation event data comment text: Termination Category: DEFAULT_OF_REGISTRATION_FEE; Not in force date: 20111115 |
| | P22-X000 | Classification modified | St.27 status event code: A-4-4-P10-P22-nap-X000 |
| | P22-X000 | Classification modified | St.27 status event code: A-4-4-P10-P22-nap-X000 |