ATE419710T1 - Sprachsqualitätsschätzung für off-line spracherkennung - Google Patents

Sprachsqualitätsschätzung für off-line spracherkennung

Info

Publication number
ATE419710T1
ATE419710T1 AT01969315T AT01969315T ATE419710T1 AT E419710 T1 ATE419710 T1 AT E419710T1 AT 01969315 T AT01969315 T AT 01969315T AT 01969315 T AT01969315 T AT 01969315T AT E419710 T1 ATE419710 T1 AT E419710T1
Authority
AT
Austria
Prior art keywords
speech
information
quality
recognition device
feedback
Prior art date
Application number
AT01969315T
Other languages
English (en)
Inventor
Heinrich Bartosik
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE419710T1 publication Critical patent/ATE419710T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/69Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42221Conversation recording systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/60Medium conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5108Secretarial services

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Testing Electric Properties And Detecting Electric Faults (AREA)
  • Machine Translation (AREA)
  • Communication Control (AREA)
AT01969315T 2000-06-29 2001-06-25 Sprachsqualitätsschätzung für off-line spracherkennung ATE419710T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP00890205 2000-06-29

Publications (1)

Publication Number Publication Date
ATE419710T1 true ATE419710T1 (de) 2009-01-15

Family

ID=8175950

Family Applications (1)

Application Number Title Priority Date Filing Date
AT01969315T ATE419710T1 (de) 2000-06-29 2001-06-25 Sprachsqualitätsschätzung für off-line spracherkennung

Country Status (7)

Country Link
US (1) US6910005B2 (de)
EP (1) EP1299996B1 (de)
JP (1) JP4917729B2 (de)
CN (1) CN1205800C (de)
AT (1) ATE419710T1 (de)
DE (1) DE60137225D1 (de)
WO (1) WO2002005537A1 (de)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2375935A (en) * 2001-05-22 2002-11-27 Motorola Inc Speech quality indication
JP2003091299A (ja) * 2001-07-13 2003-03-28 Honda Motor Co Ltd 車載用音声認識装置
DE10243955B4 (de) * 2002-09-20 2006-03-30 Kid-Systeme Gmbh Verfahren und Vorrichtung zur Übertragung von Sprachsignalen mittels einer Flugzeug-Sprachübertragungseinrichtung
GB0224806D0 (en) * 2002-10-24 2002-12-04 Ibm Method and apparatus for a interactive voice response system
DE10251113A1 (de) * 2002-11-02 2004-05-19 Philips Intellectual Property & Standards Gmbh Verfahren zum Betrieb eines Spracherkennungssystems
US8380510B2 (en) 2005-05-20 2013-02-19 Nuance Communications, Inc. System and method for multi level transcript quality checking
GB2426368A (en) * 2005-05-21 2006-11-22 Ibm Using input signal quality in speeech recognition
US7806833B2 (en) * 2006-04-27 2010-10-05 Hd Medical Group Limited Systems and methods for analysis and display of heart sounds
US8364492B2 (en) * 2006-07-13 2013-01-29 Nec Corporation Apparatus, method and program for giving warning in connection with inputting of unvoiced speech
CN101001294B (zh) * 2006-12-19 2010-10-06 中山大学 一种基于语音识别技术的智能化家居语音记录及提醒系统
US8332212B2 (en) * 2008-06-18 2012-12-11 Cogi, Inc. Method and system for efficient pacing of speech for transcription
US8639514B2 (en) * 2008-12-18 2014-01-28 At&T Intellectual Property I, L.P. Method and apparatus for accessing information identified from a broadcast audio signal
CN102044251B (zh) * 2009-10-13 2012-07-25 创杰科技股份有限公司 无线电语音数据传输系统的语音质量改善方法
CN102934160A (zh) * 2010-03-30 2013-02-13 Nvoq股份有限公司 用于提高音频质量的听写客户端反馈
CN102376303B (zh) * 2010-08-13 2014-03-12 国基电子(上海)有限公司 录音设备及利用该录音设备进行声音处理与录入的方法
US8892444B2 (en) * 2011-07-27 2014-11-18 International Business Machines Corporation Systems and methods for improving quality of user generated audio content in voice applications
US9620128B2 (en) 2012-05-31 2017-04-11 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9305565B2 (en) 2012-05-31 2016-04-05 Elwha Llc Methods and systems for speech adaptation data
US10431235B2 (en) 2012-05-31 2019-10-01 Elwha Llc Methods and systems for speech adaptation data
US9495966B2 (en) 2012-05-31 2016-11-15 Elwha Llc Speech recognition adaptation systems based on adaptation data
US20130325449A1 (en) 2012-05-31 2013-12-05 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9899040B2 (en) * 2012-05-31 2018-02-20 Elwha, Llc Methods and systems for managing adaptation data
WO2015196063A1 (en) * 2014-06-19 2015-12-23 Robert Bosch Gmbh System and method for speech-enabled personalized operation of devices and services in multiple operating environments
JP6772839B2 (ja) * 2014-12-25 2020-10-21 ソニー株式会社 情報処理装置、情報処理方法およびプログラム
JP6893150B2 (ja) * 2017-08-14 2021-06-23 株式会社ディーアンドエムホールディングス オーディオ装置およびコンピュータで読み取り可能なプログラム
WO2023050301A1 (zh) * 2021-09-30 2023-04-06 华为技术有限公司 语音质量评估、语音识别质量预测与提高的方法及装置
US12469509B2 (en) * 2023-04-04 2025-11-11 Meta Platforms Technologies, Llc Voice avatars in extended reality environments

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1169969A (en) * 1980-08-20 1984-06-26 Gregor N. Neff Dictation system and method
JPH0434499A (ja) * 1990-05-30 1992-02-05 Sharp Corp 発声法指示装置
US5243149A (en) * 1992-04-10 1993-09-07 International Business Machines Corp. Method and apparatus for improving the paper interface to computing systems
JPH0675588A (ja) * 1992-08-27 1994-03-18 Fujitsu Ltd 音声認識装置
DE4434255A1 (de) * 1994-09-24 1996-03-28 Sel Alcatel Ag Vorrichtung zur Sprachaufzeichnung mit anschließender Texterstellung
US5835667A (en) * 1994-10-14 1998-11-10 Carnegie Mellon University Method and apparatus for creating a searchable digital video library and a system and method of using such a library
US5684921A (en) * 1995-07-13 1997-11-04 U S West Technologies, Inc. Method and system for identifying a corrupted speech message signal
JPH0944183A (ja) * 1995-07-26 1997-02-14 Sony Corp レベル表示装置、音声認識装置およびナビゲーション装置
JPH1070613A (ja) * 1996-08-28 1998-03-10 Bell Syst Nijiyuuyon:Kk 住所・氏名のオフライン認識サポートシステム、通信販売等の電話注文受けシステム、および聞き起し画面
JPH10240291A (ja) * 1996-12-26 1998-09-11 Seiko Epson Corp 音声認識装置における音声入力可能状態報知方法及びその装置
JP3402100B2 (ja) * 1996-12-27 2003-04-28 カシオ計算機株式会社 音声制御ホスト装置
GB2323693B (en) * 1997-03-27 2001-09-26 Forum Technology Ltd Speech to text conversion
JP3886024B2 (ja) * 1997-11-19 2007-02-28 富士通株式会社 音声認識装置及びそれを用いた情報処理装置
JPH11194795A (ja) * 1997-12-26 1999-07-21 Kyocera Corp 音声認識作動装置
JPH11352995A (ja) * 1998-06-08 1999-12-24 Toshiba Tec Corp 音声認識装置
JP2000075893A (ja) * 1998-08-31 2000-03-14 Olympus Optical Co Ltd 音声認識装置
US6336091B1 (en) * 1999-01-22 2002-01-01 Motorola, Inc. Communication device for screening speech recognizer input
US6477493B1 (en) * 1999-07-15 2002-11-05 International Business Machines Corporation Off site voice enrollment on a transcription device for speech recognition
EP1171869B1 (de) * 2000-01-27 2010-11-24 Nuance Communications Austria GmbH Sprachdetektiongsgerät mit zwei abschaltkriterien

Also Published As

Publication number Publication date
CN1205800C (zh) 2005-06-08
WO2002005537A1 (en) 2002-01-17
US6910005B2 (en) 2005-06-21
DE60137225D1 (de) 2009-02-12
WO2002005537A8 (en) 2002-02-28
EP1299996A1 (de) 2003-04-09
US20020019734A1 (en) 2002-02-14
JP2004502985A (ja) 2004-01-29
EP1299996B1 (de) 2008-12-31
JP4917729B2 (ja) 2012-04-18
CN1389059A (zh) 2003-01-01

Similar Documents

Publication Publication Date Title
ATE419710T1 (de) Sprachsqualitätsschätzung für off-line spracherkennung
US6700953B1 (en) System, apparatus, method and article of manufacture for evaluating the quality of a transmission channel using voice recognition technology
CA2706046C (en) Method for determining the on-hold status in a call
Rix et al. Objective assessment of speech and audio quality—technology and applications
KR960014222B1 (ko) 전화의 부재중 응답 장치
NO970727L (no) Analyse av audio-kvalitet
EP1407399B1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
US8417524B2 (en) Analysis of the temporal evolution of emotions in an audio interaction in a service delivery environment
CN103366729B (zh) 语音对话系统、终端装置和数据中心装置
US20120303369A1 (en) Energy-Efficient Unobtrusive Identification of a Speaker
ATE556405T1 (de) Erfassungssystem für audio-, video- und gerätedaten mit echtzeitspracherkennungsbefehls- und steuersystem
CN101490741A (zh) 使用语音识别来检测应答机
WO2005040966A3 (en) Voice tagging, voice annotation, and speech recognition for portable devices with optional post processing
WO2007091453A1 (ja) モニタリング装置、評価データ選別装置、応対者評価装置、応対者評価システムおよびプログラム
WO2006076078A3 (en) Interactive apparatus with recording and playback capability usable with encoded writing medium
CN108229441A (zh) 一种基于图像和语音分析的课堂教学自动反馈系统和反馈方法
CN104835504A (zh) 一种消除语音互动过程中录音评测噪声干扰的方法及装置
CN103258544A (zh) 一种录音检测方法、装置及考试终端和考试系统
CN208461820U (zh) 一种智能对讲检测系统
Kazemzadeh et al. Acoustic correlates of user response to error in human-computer dialogues
Padmanabhan et al. Issues involved in voicemail data collection
CN116778965A (zh) 一种语音质检的装置及系统
CN118136021A (zh) 一种基于振动信号的录音角色切换装置及方法
DE602004001945D1 (de) Verfahren und System zur Benutzungssteuerung eines Netzzugangspunktes sowie Speichermedien, Netzzugangspunkt und Kontrolleinrichtung zur Durchführung des Verfahrens
TANAKA et al. ANALYSIS OF NONLINEARITIES IN NONVERBAL VOICES Toward Extraction of Kansei Information via Nonverbal Voices

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties