|
US6675027B1
(en)
*
|
1999-11-22 |
2004-01-06 |
Microsoft Corp |
Personal mobile computing device having antenna microphone for improved speech recognition
|
|
US7274800B2
(en)
*
|
2001-07-18 |
2007-09-25 |
Intel Corporation |
Dynamic gesture recognition from stereo sequences
|
|
US20030212552A1
(en)
*
|
2002-05-09 |
2003-11-13 |
Liang Lu Hong |
Face recognition procedure useful for audiovisual speech recognition
|
|
US7209883B2
(en)
*
|
2002-05-09 |
2007-04-24 |
Intel Corporation |
Factorial hidden markov model for audiovisual speech recognition
|
|
US7165029B2
(en)
*
|
2002-05-09 |
2007-01-16 |
Intel Corporation |
Coupled hidden Markov model for audiovisual speech recognition
|
|
US7587318B2
(en)
*
|
2002-09-12 |
2009-09-08 |
Broadcom Corporation |
Correlating video images of lip movements with audio signals to improve speech recognition
|
|
CN1685380A
(zh)
*
|
2002-09-27 |
2005-10-19 |
银河网路股份有限公司 |
远程教育系统、听讲确认方法及听讲确认程序
|
|
US7171043B2
(en)
*
|
2002-10-11 |
2007-01-30 |
Intel Corporation |
Image recognition using hidden markov models and coupled hidden markov models
|
|
US7472063B2
(en)
*
|
2002-12-19 |
2008-12-30 |
Intel Corporation |
Audio-visual feature fusion and support vector machine useful for continuous speech recognition
|
|
US7203368B2
(en)
*
|
2003-01-06 |
2007-04-10 |
Intel Corporation |
Embedded bayesian network for pattern recognition
|
|
US20050033571A1
(en)
*
|
2003-08-07 |
2005-02-10 |
Microsoft Corporation |
Head mounted multi-sensory audio input system
|
|
CA2473195C
(en)
*
|
2003-07-29 |
2014-02-04 |
Microsoft Corporation |
Head mounted multi-sensory audio input system
|
|
US7383181B2
(en)
*
|
2003-07-29 |
2008-06-03 |
Microsoft Corporation |
Multi-sensory speech detection system
|
|
US7447630B2
(en)
*
|
2003-11-26 |
2008-11-04 |
Microsoft Corporation |
Method and apparatus for multi-sensory speech enhancement
|
|
US20050154593A1
(en)
*
|
2004-01-14 |
2005-07-14 |
International Business Machines Corporation |
Method and apparatus employing electromyographic sensors to initiate oral communications with a voice-based device
|
|
US7499686B2
(en)
*
|
2004-02-24 |
2009-03-03 |
Microsoft Corporation |
Method and apparatus for multi-sensory speech enhancement on a mobile device
|
|
US20050228673A1
(en)
*
|
2004-03-30 |
2005-10-13 |
Nefian Ara V |
Techniques for separating and evaluating audio and video source data
|
|
US8244542B2
(en)
*
|
2004-07-01 |
2012-08-14 |
Emc Corporation |
Video surveillance
|
|
US20060046845A1
(en)
*
|
2004-08-26 |
2006-03-02 |
Alexandre Armand |
Device for the acoustic control of a game system and application
|
|
US7574008B2
(en)
*
|
2004-09-17 |
2009-08-11 |
Microsoft Corporation |
Method and apparatus for multi-sensory speech enhancement
|
|
US7283850B2
(en)
*
|
2004-10-12 |
2007-10-16 |
Microsoft Corporation |
Method and apparatus for multi-sensory speech enhancement on a mobile device
|
|
US7346504B2
(en)
*
|
2005-06-20 |
2008-03-18 |
Microsoft Corporation |
Multi-sensory speech enhancement using a clean speech prior
|
|
US7680656B2
(en)
*
|
2005-06-28 |
2010-03-16 |
Microsoft Corporation |
Multi-sensory speech enhancement using a speech-state model
|
|
US7406303B2
(en)
|
2005-07-05 |
2008-07-29 |
Microsoft Corporation |
Multi-sensory speech enhancement using synthesized sensor signal
|
|
US20070033042A1
(en)
*
|
2005-08-03 |
2007-02-08 |
International Business Machines Corporation |
Speech detection fusing multi-class acoustic-phonetic, and energy features
|
|
US7962340B2
(en)
*
|
2005-08-22 |
2011-06-14 |
Nuance Communications, Inc. |
Methods and apparatus for buffering data for use in accordance with a speech recognition system
|
|
WO2007026280A1
(en)
*
|
2005-08-31 |
2007-03-08 |
Philips Intellectual Property & Standards Gmbh |
A dialogue system for interacting with a person by making use of both visual and speech-based recognition
|
|
US7697827B2
(en)
|
2005-10-17 |
2010-04-13 |
Konicek Jeffrey C |
User-friendlier interfaces for a camera
|
|
US7860718B2
(en)
*
|
2005-12-08 |
2010-12-28 |
Electronics And Telecommunications Research Institute |
Apparatus and method for speech segment detection and system for speech recognition
|
|
US7930178B2
(en)
*
|
2005-12-23 |
2011-04-19 |
Microsoft Corporation |
Speech modeling and enhancement based on magnitude-normalized spectra
|
|
US8326636B2
(en)
*
|
2008-01-16 |
2012-12-04 |
Canyon Ip Holdings Llc |
Using a physical phenomenon detector to control operation of a speech recognition engine
|
|
JP3139277U
(ja)
*
|
2007-11-26 |
2008-02-07 |
株式会社Srj |
バーチャルスクールシステムおよびスクールシティシステム
|
|
JP2011186351A
(ja)
*
|
2010-03-11 |
2011-09-22 |
Sony Corp |
情報処理装置、および情報処理方法、並びにプログラム
|
|
US8635066B2
(en)
*
|
2010-04-14 |
2014-01-21 |
T-Mobile Usa, Inc. |
Camera-assisted noise cancellation and speech recognition
|
|
US20110311144A1
(en)
*
|
2010-06-17 |
2011-12-22 |
Microsoft Corporation |
Rgb/depth camera for improving speech recognition
|
|
US8856212B1
(en)
|
2011-02-08 |
2014-10-07 |
Google Inc. |
Web-based configurable pipeline for media processing
|
|
US8681866B1
(en)
|
2011-04-28 |
2014-03-25 |
Google Inc. |
Method and apparatus for encoding video by downsampling frame resolution
|
|
US9106787B1
(en)
|
2011-05-09 |
2015-08-11 |
Google Inc. |
Apparatus and method for media transmission bandwidth control using bandwidth estimation
|
|
US8863042B2
(en)
*
|
2012-01-24 |
2014-10-14 |
Charles J. Kulas |
Handheld device with touch controls that reconfigure in response to the way a user operates the device
|
|
US8913103B1
(en)
|
2012-02-01 |
2014-12-16 |
Google Inc. |
Method and apparatus for focus-of-attention control
|
|
US8782271B1
(en)
|
2012-03-19 |
2014-07-15 |
Google, Inc. |
Video mixing using video speech detection
|
|
US9185429B1
(en)
|
2012-04-30 |
2015-11-10 |
Google Inc. |
Video encoding and decoding using un-equal error protection
|
|
KR101992676B1
(ko)
|
2012-07-26 |
2019-06-25 |
삼성전자주식회사 |
영상 인식을 이용하여 음성 인식을 하는 방법 및 장치
|
|
WO2014025012A1
(ja)
*
|
2012-08-10 |
2014-02-13 |
株式会社ホンダアクセス |
音声認識方法及び音声認識装置
|
|
US9704486B2
(en)
|
2012-12-11 |
2017-07-11 |
Amazon Technologies, Inc. |
Speech recognition power management
|
|
US9172740B1
(en)
|
2013-01-15 |
2015-10-27 |
Google Inc. |
Adjustable buffer remote access
|
|
US9311692B1
(en)
|
2013-01-25 |
2016-04-12 |
Google Inc. |
Scalable buffer remote access
|
|
US9225979B1
(en)
|
2013-01-30 |
2015-12-29 |
Google Inc. |
Remote access encoding
|
|
JP2016526331A
(ja)
|
2013-05-23 |
2016-09-01 |
ノールズ エレクトロニクス,リミテッド ライアビリティ カンパニー |
Vad検出マイク及びその動作方法
|
|
US9711166B2
(en)
|
2013-05-23 |
2017-07-18 |
Knowles Electronics, Llc |
Decimation synchronization in a microphone
|
|
US10020008B2
(en)
|
2013-05-23 |
2018-07-10 |
Knowles Electronics, Llc |
Microphone and corresponding digital interface
|
|
US9165182B2
(en)
*
|
2013-08-19 |
2015-10-20 |
Cisco Technology, Inc. |
Method and apparatus for using face detection information to improve speaker segmentation
|
|
JP6221535B2
(ja)
*
|
2013-09-11 |
2017-11-01 |
ソニー株式会社 |
情報処理装置、情報処理方法、およびプログラム
|
|
US9502028B2
(en)
*
|
2013-10-18 |
2016-11-22 |
Knowles Electronics, Llc |
Acoustic activity detection apparatus and method
|
|
US9147397B2
(en)
|
2013-10-29 |
2015-09-29 |
Knowles Electronics, Llc |
VAD detection apparatus and method of operating the same
|
|
WO2015094369A1
(en)
*
|
2013-12-20 |
2015-06-25 |
Intel Corporation |
Transition from low power always listening mode to high power speech recognition mode
|
|
US10304458B1
(en)
|
2014-03-06 |
2019-05-28 |
Board of Trustees of the University of Alabama and the University of Alabama in Huntsville |
Systems and methods for transcribing videos using speaker identification
|
|
US9966079B2
(en)
*
|
2014-03-24 |
2018-05-08 |
Lenovo (Singapore) Pte. Ltd. |
Directing voice input based on eye tracking
|
|
WO2016118480A1
(en)
|
2015-01-21 |
2016-07-28 |
Knowles Electronics, Llc |
Low power voice trigger for acoustic apparatus and method
|
|
US10121472B2
(en)
|
2015-02-13 |
2018-11-06 |
Knowles Electronics, Llc |
Audio buffer catch-up apparatus and method with two microphones
|
|
CN105991851A
(zh)
|
2015-02-17 |
2016-10-05 |
杜比实验室特许公司 |
处理电话会议系统中的烦扰
|
|
DE102015206566A1
(de)
*
|
2015-04-13 |
2016-10-13 |
BSH Hausgeräte GmbH |
Haushaltsgerät und Verfahren zum Betreiben eines Haushaltsgeräts
|
|
US9478234B1
(en)
|
2015-07-13 |
2016-10-25 |
Knowles Electronics, Llc |
Microphone apparatus and method with catch-up buffer
|
|
EP3185244B1
(de)
*
|
2015-12-22 |
2019-02-20 |
Nxp B.V. |
Sprachaktivierungssystem
|
|
US20190066676A1
(en)
*
|
2016-05-16 |
2019-02-28 |
Sony Corporation |
Information processing apparatus
|
|
CN107437420A
(zh)
*
|
2016-05-27 |
2017-12-05 |
富泰华工业(深圳)有限公司 |
语音信息的接收方法、系统及装置
|
|
JP6744025B2
(ja)
*
|
2016-06-21 |
2020-08-19 |
日本電気株式会社 |
作業支援システム、管理サーバ、携帯端末、作業支援方法およびプログラム
|
|
US10621992B2
(en)
*
|
2016-07-22 |
2020-04-14 |
Lenovo (Singapore) Pte. Ltd. |
Activating voice assistant based on at least one of user proximity and context
|
|
CN106373568A
(zh)
*
|
2016-08-30 |
2017-02-01 |
深圳市元征科技股份有限公司 |
智能车载单元控制方法和装置
|
|
KR102591413B1
(ko)
*
|
2016-11-16 |
2023-10-19 |
엘지전자 주식회사 |
이동단말기 및 그 제어방법
|
|
US10332515B2
(en)
|
2017-03-14 |
2019-06-25 |
Google Llc |
Query endpointing based on lip detection
|
|
US10664533B2
(en)
|
2017-05-24 |
2020-05-26 |
Lenovo (Singapore) Pte. Ltd. |
Systems and methods to determine response cue for digital assistant based on context
|
|
CN109102801A
(zh)
*
|
2017-06-20 |
2018-12-28 |
京东方科技集团股份有限公司 |
语音识别方法和语音识别装置
|
|
WO2019175960A1
(ja)
|
2018-03-13 |
2019-09-19 |
三菱電機株式会社 |
音声処理装置および音声処理方法
|
|
KR102512446B1
(ko)
*
|
2018-05-04 |
2023-03-22 |
구글 엘엘씨 |
자동화된 어시스턴트 기능(들)의 핫-워드 프리 적응
|
|
CN109147779A
(zh)
*
|
2018-08-14 |
2019-01-04 |
苏州思必驰信息科技有限公司 |
语音数据处理方法和装置
|
|
US11151993B2
(en)
*
|
2018-12-28 |
2021-10-19 |
Baidu Usa Llc |
Activating voice commands of a smart display device based on a vision-based mechanism
|
|
US12322386B2
(en)
*
|
2019-07-22 |
2025-06-03 |
Lg Electronics Inc. |
Display device and operation method for same
|
|
KR20210042520A
(ko)
*
|
2019-10-10 |
2021-04-20 |
삼성전자주식회사 |
전자 장치 및 이의 제어 방법
|
|
US11615781B2
(en)
*
|
2019-10-18 |
2023-03-28 |
Google Llc |
End-to-end multi-speaker audio-visual automatic speech recognition
|
|
US12522142B2
(en)
|
2019-10-23 |
2026-01-13 |
Sony Group Corporation |
Display system, display device, display method, and mobile apparatus
|
|
US11590929B2
(en)
*
|
2020-05-05 |
2023-02-28 |
Nvidia Corporation |
Systems and methods for performing commands in a vehicle using speech and image recognition
|
|
CN111768760B
(zh)
*
|
2020-05-26 |
2023-04-18 |
云知声智能科技股份有限公司 |
一种多模态语音端点检测方法及装置
|
|
CN113345472B
(zh)
*
|
2021-05-08 |
2022-03-25 |
北京百度网讯科技有限公司 |
语音端点检测方法、装置、电子设备及存储介质
|