ATE311650T1 - Correction of a text recognized by speech recognition by comparing the phoneme sequences of the recognized text with a phonetic transcription of a manually entered correction word
- Publication number
- ATE311650T1 (application AT02762708T)
- Authority
- AT
- Austria
- Prior art keywords
- recognized
- text
- correction
- entered
- manually
- Prior art date
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/08—Speech classification or search
        - G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
          - G10L2015/025—Phonemes, fenemes or fenones being the recognition units
        - G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
          - G10L2015/221—Announcement of recognition results
          - G10L2015/225—Feedback of the input speech
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Character Discrimination (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP01000468 | 2001-09-17 | | |
| PCT/IB2002/003688 WO2003025904A1 (en) | 2001-09-17 | 2002-09-10 | Correcting a text recognized by speech recognition through comparison of phonetic sequences in the recognized text with a phonetic transcription of a manually input correction word |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE311650T1 (de) | 2005-12-15 |
Family
ID=8176063
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT02762708T ATE311650T1 (de) | 2001-09-17 | 2002-09-10 | Correction of a text recognized by speech recognition by comparing the phoneme sequences of the recognized text with a phonetic transcription of a manually entered correction word |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US6735565B2 (de) |
| EP (1) | EP1430474B1 (de) |
| JP (1) | JP4241376B2 (de) |
| CN (1) | CN1235188C (de) |
| AT (1) | ATE311650T1 (de) |
| DE (1) | DE60207742T2 (de) |
| WO (1) | WO2003025904A1 (de) |
Families Citing this family (35)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7263483B2 (en) * | 2003-04-28 | 2007-08-28 | Dictaphone Corporation | USB dictation device |
| US7310602B2 (en) * | 2004-09-27 | 2007-12-18 | Kabushiki Kaisha Equos Research | Navigation apparatus |
| JP4784120B2 (ja) * | 2005-03-23 | 2011-10-05 | 日本電気株式会社 | Voice transcription support apparatus, method, and program |
| US9020811B2 (en) * | 2006-10-13 | 2015-04-28 | Syscom, Inc. | Method and system for converting text files searchable text and for processing the searchable text |
| US8543393B2 (en) * | 2008-05-20 | 2013-09-24 | Calabrio, Inc. | Systems and methods of improving automated speech recognition accuracy using statistical analysis of search terms |
| US9659559B2 (en) * | 2009-06-25 | 2017-05-23 | Adacel Systems, Inc. | Phonetic distance measurement system and related methods |
| US8494852B2 (en) | 2010-01-05 | 2013-07-23 | Google Inc. | Word-level correction of speech input |
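| CN102682763B (zh) * | 2011-03-10 | 2014-07-16 | 北京三星通信技术研究有限公司 | Method, device, and terminal for correcting named-entity words in speech-input text |
| JP5638479B2 (ja) * | 2011-07-26 | 2014-12-10 | 株式会社東芝 | Transcription support system and transcription support method |
| JP2013025299A (ja) * | 2011-07-26 | 2013-02-04 | Toshiba Corp | Transcription support system and transcription support method |
| JP5404726B2 (ja) * | 2011-09-26 | 2014-02-05 | 株式会社東芝 | Information processing apparatus, information processing method, and program |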
| US8423366B1 (en) * | 2012-07-18 | 2013-04-16 | Google Inc. | Automatically training speech synthesizers |
| CN103714048B (zh) | 2012-09-29 | 2017-07-21 | 国际商业机器公司 | Method and system for correcting text |
| KR101892734B1 (ko) * | 2013-01-04 | 2018-08-28 | 한국전자통신연구원 | Method and apparatus for correcting errors in a speech recognition system |
| US20150058006A1 (en) * | 2013-08-23 | 2015-02-26 | Xerox Corporation | Phonetic alignment for user-agent dialogue recognition |
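| CN105210147B (zh) * | 2014-04-22 | 2020-02-07 | 纳宝株式会社 | Method, device, and computer-readable recording medium for improving at least one set of semantic units |
| CN105374356B (zh) * | 2014-08-29 | 2019-07-30 | 株式会社理光 | Speech recognition method, speech scoring method, speech recognition system, and speech scoring system |
| EP3089159B1 (de) | 2015-04-28 | 2019-08-28 | Google LLC | Correcting speech recognition using selective re-speak |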
| US9978370B2 (en) | 2015-07-31 | 2018-05-22 | Lenovo (Singapore) Pte. Ltd. | Insertion of characters in speech recognition |
| US10049655B1 (en) | 2016-01-05 | 2018-08-14 | Google Llc | Biasing voice correction suggestions |
| CN105827417A (zh) * | 2016-05-31 | 2016-08-03 | 安徽声讯信息技术有限公司 | Voice shorthand device for meeting minutes that can be modified at any time |
| US10019986B2 (en) | 2016-07-29 | 2018-07-10 | Google Llc | Acoustic model training using corrected terms |
| US10062385B2 (en) | 2016-09-30 | 2018-08-28 | International Business Machines Corporation | Automatic speech-to-text engine selection |
| CN106710597B (zh) * | 2017-01-04 | 2020-12-11 | 广东小天才科技有限公司 | Method and device for recording voice data |
| CN106875949B (zh) * | 2017-04-28 | 2020-09-22 | 深圳市大乘科技股份有限公司 | Correction method and device for speech recognition |
| CN109145281B (zh) * | 2017-06-15 | 2020-12-25 | 北京嘀嘀无限科技发展有限公司 | Speech recognition method, device, and storage medium |
| WO2018228515A1 (en) | 2017-06-15 | 2018-12-20 | Beijing Didi Infinity Technology And Development Co., Ltd. | Systems and methods for speech recognition |
| JP7173049B2 (ja) * | 2018-01-10 | 2022-11-16 | ソニーグループ株式会社 | Information processing apparatus, information processing system, information processing method, and program |
| US10269376B1 (en) * | 2018-06-28 | 2019-04-23 | Invoca, Inc. | Desired signal spotting in noisy, flawed environments |
| US10832679B2 (en) | 2018-11-20 | 2020-11-10 | International Business Machines Corporation | Method and system for correcting speech-to-text auto-transcription using local context of talk |
| US11532308B2 (en) * | 2020-05-04 | 2022-12-20 | Rovi Guides, Inc. | Speech-to-text system |
| US11790916B2 (en) | 2020-05-04 | 2023-10-17 | Rovi Guides, Inc. | Speech-to-text system |
| CN112530402B (zh) * | 2020-11-30 | 2024-01-12 | 深圳市优必选科技股份有限公司 | Speech synthesis method, speech synthesis device, and intelligent device |
| CN113823265B (zh) * | 2021-07-19 | 2025-06-24 | 腾讯科技(深圳)有限公司 | Speech recognition method, apparatus, and computer device |
| US12165647B2 (en) * | 2022-05-27 | 2024-12-10 | Microsoft Technology Licensing, Llc | Phoneme-based text transcription searching |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4866778A (en) * | 1986-08-11 | 1989-09-12 | Dragon Systems, Inc. | Interactive speech recognition apparatus |
| SE513456C2 (sv) * | 1994-05-10 | 2000-09-18 | Telia Ab | Method and device for speech-to-text conversion |
| US5799276A (en) * | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
| US5864805A (en) * | 1996-12-20 | 1999-01-26 | International Business Machines Corporation | Method and apparatus for error correction in a continuous dictation system |
| US5909667A (en) * | 1997-03-05 | 1999-06-01 | International Business Machines Corporation | Method and apparatus for fast voice selection of error words in dictated text |
| US6173259B1 (en) * | 1997-03-27 | 2001-01-09 | Speech Machines Plc | Speech to text conversion |
| US6269335B1 (en) * | 1998-08-14 | 2001-07-31 | International Business Machines Corporation | Apparatus and methods for identifying homophones among words in a speech recognition system |
| US6064961A (en) * | 1998-09-02 | 2000-05-16 | International Business Machines Corporation | Display for proofreading text |
| US6457031B1 (en) * | 1998-09-02 | 2002-09-24 | International Business Machines Corp. | Method of marking previously dictated text for deferred correction in a speech recognition proofreader |
| US20020116196A1 (en) * | 1998-11-12 | 2002-08-22 | Tran Bao Q. | Speech recognizer |
| US6611802B2 (en) * | 1999-06-11 | 2003-08-26 | International Business Machines Corporation | Method and system for proofreading and correcting dictated text |
| US6418410B1 (en) * | 1999-09-27 | 2002-07-09 | International Business Machines Corporation | Smart correction of dictated speech |
| EP2261893B1 (de) * | 1999-12-20 | 2016-03-30 | Nuance Communications Austria GmbH | Audio playback for text input in a speech recognition system |
| US6912498B2 (en) * | 2000-05-02 | 2005-06-28 | Scansoft, Inc. | Error correction in speech recognition by correcting text around selected area |
| Date | Office | Application Number | Patent | Status |
|---|---|---|---|---|
| 2002-09-10 | JP | JP2003529447A | JP4241376B2 (ja) | Not active: Expired - Fee Related |
| 2002-09-10 | WO | PCT/IB2002/003688 | WO2003025904A1 (en) | Not active: Ceased |
| 2002-09-10 | EP | EP02762708A | EP1430474B1 (de) | Not active: Expired - Lifetime |
| 2002-09-10 | AT | AT02762708T | ATE311650T1 (de) | Not active: IP Right Cessation |
| 2002-09-10 | CN | CNB028181328A | CN1235188C (zh) | Not active: Expired - Fee Related |
| 2002-09-10 | DE | DE60207742T | DE60207742T2 (de) | Not active: Expired - Lifetime |
| 2002-09-13 | US | US10/242,930 | US6735565B2 (en) | Not active: Expired - Lifetime |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1430474A1 (de) | 2004-06-23 |
| WO2003025904A1 (en) | 2003-03-27 |
| CN1235188C (zh) | 2006-01-04 |
| DE60207742D1 (de) | 2006-01-05 |
| EP1430474B1 (de) | 2005-11-30 |
| JP2005503590A (ja) | 2005-02-03 |
| JP4241376B2 (ja) | 2009-03-18 |
| US6735565B2 (en) | 2004-05-11 |
| DE60207742T2 (de) | 2006-08-03 |
| US20030061043A1 (en) | 2003-03-27 |
| CN1555553A (zh) | 2004-12-15 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| ATE311650T1 (de) | Correction of a text recognized by speech recognition by comparing the phoneme sequences of the recognized text with a phonetic transcription of a manually entered correction word | |
| ATE297588T1 (de) | Adaptation of the phonetic context to improve speech recognition | |
| TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
| ATE325413T1 (de) | Method and device for converting spoken texts into written texts and correcting the recognized texts | |
| DE60203705D1 (de) | Transcription and display of an input speech signal | |
| ATE362633T1 (de) | Learning the pronunciation of new words using a pronunciation graph | |
| TW200643896A (en) | Voice nametag audio feedback for dialing a telephone call | |
| WO2007118020A3 (en) | Method and system for managing pronunciation dictionaries in a speech application | |
| DE602004018290D1 (de) | Speech recognition and correction system, correction device, and method for creating a lexicon of alternatives | |
| WO2004100638A3 (en) | Source-dependent text-to-speech system | |
| DE50209455D1 (de) | Method for training or adapting a speech recognizer | |
| TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
| ATE395685T1 (de) | Speech recognition by word-in-phrase command | |
| WO2002097590A3 (en) | Language independent and voice operated information management system | |
| EP1291848A3 (de) | Pronunciations in multiple languages for speech recognition | |
| EP1696421A3 (de) | Learning for speech recognition | |
| EP3920181A3 (de) | Text-independent speaker recognition | |
| ATE496363T1 (de) | Speech recognition device with marking of recognized text parts | |
| WO2009006081A3 (en) | Pronunciation correction of text-to-speech systems between different spoken languages | |
| DE69827667D1 (de) | Vocoder-based speech recognizer | |
| Bauer et al. | New Zealand English | |
| ATE401644T1 (de) | Method for speech recognition | |
| DE60020504D1 (de) | Adaptation of a speech recognizer to corrected texts | |
| WO2004049305A3 (en) | Discriminative training of hidden markov models for continuous speech recognition | |
| AU2003205955A1 (en) | Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |