CN105900169B - 音频内容的空间误差度量 - Google Patents
音频内容的空间误差度量 Download PDFInfo
- Publication number
- CN105900169B CN105900169B CN201580004002.0A CN201580004002A CN105900169B CN 105900169 B CN105900169 B CN 105900169B CN 201580004002 A CN201580004002 A CN 201580004002A CN 105900169 B CN105900169 B CN 105900169B
- Authority
- CN
- China
- Prior art keywords
- audio
- output
- clusters
- objects
- spatial error
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F24—HEATING; RANGES; VENTILATING
- F24C—DOMESTIC STOVES OR RANGES ; DETAILS OF DOMESTIC STOVES OR RANGES, OF GENERAL APPLICATION
- F24C15/00—Details
- F24C15/20—Removing cooking fumes
- F24C15/2028—Removing cooking fumes using an air curtain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/008—Visual indication of individual signal levels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/13—Aspects of volume control, not necessarily automatic, in stereophonic sound systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Chemical & Material Sciences (AREA)
- Combustion & Propulsion (AREA)
- Mechanical Engineering (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Stereophonic System (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| ES201430016 | 2014-01-09 | ||
| ESP201430016 | 2014-01-09 | ||
| US201461951048P | 2014-03-11 | 2014-03-11 | |
| US61/951,048 | 2014-03-11 | ||
| PCT/US2015/010126 WO2015105748A1 (en) | 2014-01-09 | 2015-01-05 | Spatial error metrics of audio content |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN105900169A CN105900169A (zh) | 2016-08-24 |
| CN105900169B true CN105900169B (zh) | 2020-01-03 |
Family
ID=52469071
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201580004002.0A Active CN105900169B (zh) | 2014-01-09 | 2015-01-05 | 音频内容的空间误差度量 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US10492014B2 (2) |
| EP (1) | EP3092642B1 (2) |
| JP (1) | JP6518254B2 (2) |
| CN (1) | CN105900169B (2) |
| WO (1) | WO2015105748A1 (2) |
Families Citing this family (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2015017037A1 (en) | 2013-07-30 | 2015-02-05 | Dolby International Ab | Panning of audio objects to arbitrary speaker layouts |
| CN105336335B (zh) * | 2014-07-25 | 2020-12-08 | 杜比实验室特许公司 | 利用子带对象概率估计的音频对象提取 |
| CN105895086B (zh) | 2014-12-11 | 2021-01-12 | 杜比实验室特许公司 | 元数据保留的音频对象聚类 |
| MX379477B (es) | 2015-06-17 | 2025-03-10 | Fraunhofer Ges Zur Foerderung Der Angewandten Foerschung E V | Control de intensidad para interacción del usuario en sistemas de codificación de audio |
| CN106385660B (zh) * | 2015-08-07 | 2020-10-16 | 杜比实验室特许公司 | 处理基于对象的音频信号 |
| US10277997B2 (en) | 2015-08-07 | 2019-04-30 | Dolby Laboratories Licensing Corporation | Processing object-based audio signals |
| US10278000B2 (en) | 2015-12-14 | 2019-04-30 | Dolby Laboratories Licensing Corporation | Audio object clustering with single channel quality preservation |
| US9949052B2 (en) | 2016-03-22 | 2018-04-17 | Dolby Laboratories Licensing Corporation | Adaptive panner of audio objects |
| WO2018017394A1 (en) * | 2016-07-20 | 2018-01-25 | Dolby Laboratories Licensing Corporation | Audio object clustering based on renderer-aware perceptual difference |
| CN109479178B (zh) * | 2016-07-20 | 2021-02-26 | 杜比实验室特许公司 | 基于呈现器意识感知差异的音频对象聚集 |
| US10861436B1 (en) * | 2016-08-24 | 2020-12-08 | Gridspace Inc. | Audio call classification and survey system |
| US11601552B2 (en) | 2016-08-24 | 2023-03-07 | Gridspace Inc. | Hierarchical interface for adaptive closed loop communication system |
| US11721356B2 (en) | 2016-08-24 | 2023-08-08 | Gridspace Inc. | Adaptive closed loop communication system |
| US11715459B2 (en) | 2016-08-24 | 2023-08-01 | Gridspace Inc. | Alert generator for adaptive closed loop communication system |
| US12132866B2 (en) | 2016-08-24 | 2024-10-29 | Gridspace Inc. | Configurable dynamic call routing and matching system |
| US20200126582A1 (en) * | 2017-04-25 | 2020-04-23 | Sony Corporation | Signal processing device and method, and program |
| EP4358085A3 (en) | 2017-04-26 | 2024-07-10 | Sony Group Corporation | Signal processing device, method, and program |
| CN110800048B (zh) * | 2017-05-09 | 2023-07-28 | 杜比实验室特许公司 | 多通道空间音频格式输入信号的处理 |
| WO2019067620A1 (en) * | 2017-09-29 | 2019-04-04 | Zermatt Technologies Llc | SPEECH REDUCTION AUDIO MIXING |
| US10628486B2 (en) * | 2017-11-15 | 2020-04-21 | Google Llc | Partitioning videos |
| WO2019106221A1 (en) * | 2017-11-28 | 2019-06-06 | Nokia Technologies Oy | Processing of spatial audio parameters |
| CN108984628B (zh) * | 2018-06-20 | 2020-01-24 | 北京达佳互联信息技术有限公司 | 内容描述生成模型的损失值获取方法及装置 |
| US11929082B2 (en) * | 2018-11-02 | 2024-03-12 | Dolby International Ab | Audio encoder and an audio decoder |
| EP4641561A3 (en) * | 2019-03-29 | 2026-01-21 | Telefonaktiebolaget LM Ericsson (publ) | Method and apparatus for error recovery in predictive coding in multichannel audio frames |
| KR20240046634A (ko) * | 2019-03-29 | 2024-04-09 | 텔레폰악티에볼라겟엘엠에릭슨(펍) | 예측 코딩에서 저비용 에러 복구를 위한 방법 및 장치 |
| CN110493649B (zh) * | 2019-09-12 | 2021-08-20 | 重庆市群众艺术馆 | 基于群众满意度的文化馆数字资源加工方法 |
| EP4553831A3 (en) * | 2019-12-09 | 2025-05-21 | Dolby Laboratories Licensing Corporation | Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics |
| CN113096671B (zh) * | 2020-01-09 | 2022-05-13 | 齐鲁工业大学 | 一种大容量音频文件可逆信息隐藏方法及系统 |
| US11704087B2 (en) * | 2020-02-03 | 2023-07-18 | Google Llc | Video-informed spatial audio expansion |
| EP4346234A1 (en) * | 2022-09-29 | 2024-04-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for perception-based clustering of object-based audio scenes |
| WO2025199350A1 (en) * | 2024-03-22 | 2025-09-25 | Dolby Laboratories Licensing Corporation | Low-latency gain interpolation for audio object clustering |
| WO2026006172A1 (en) * | 2024-06-25 | 2026-01-02 | Dolby Laboratories Licensing Corporation | Audio object clustering system |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101485202A (zh) * | 2005-05-11 | 2009-07-15 | 高通股份有限公司 | 一种用于统一的错误隐匿框架的方法及设备 |
| CN101547000A (zh) * | 2009-05-08 | 2009-09-30 | 炬力集成电路设计有限公司 | 一种信号转换电路、数模转换装置和音频输出设备 |
| GB2459012A (en) * | 2008-03-20 | 2009-10-14 | Univ Surrey | Predicting the perceived spatial quality of sound processing and reproducing equipment |
| CN101582262A (zh) * | 2009-06-16 | 2009-11-18 | 武汉大学 | 一种空间音频参数帧间预测编解码方法 |
| CN101859563A (zh) * | 2009-04-09 | 2010-10-13 | 哈曼国际工业有限公司 | 基于音频系统输出的有源噪声控制系统 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7617099B2 (en) * | 2001-02-12 | 2009-11-10 | FortMedia Inc. | Noise suppression by two-channel tandem spectrum modification for speech signal in an automobile |
| DE60206269T2 (de) * | 2001-06-08 | 2006-06-29 | Koninklijke Philips Electronics N.V. | Editieren von audiosignalen |
| KR100479478B1 (ko) | 2002-07-26 | 2005-03-31 | 연세대학교 산학협력단 | 객체별 중요도를 고려한 객체 기반의 트랜스코딩 방법 및그 장치 |
| FR2862799B1 (fr) * | 2003-11-26 | 2006-02-24 | Inst Nat Rech Inf Automat | Dispositif et methode perfectionnes de spatialisation du son |
| US8363865B1 (en) | 2004-05-24 | 2013-01-29 | Heather Bottum | Multiple channel sound system using multi-speaker arrays |
| US8509313B2 (en) * | 2006-10-10 | 2013-08-13 | Texas Instruments Incorporated | Video error concealment |
| KR101012259B1 (ko) | 2006-10-16 | 2011-02-08 | 돌비 스웨덴 에이비 | 멀티채널 다운믹스된 객체 코딩의 개선된 코딩 및 파라미터 표현 |
| EP2095365A4 (en) * | 2006-11-24 | 2009-11-18 | Lg Electronics Inc | METHOD FOR ENCODING AND DECODING AUDIO SIGNALS BASED ON OBJECTS AND APPARATUS THEREOF |
| EP2123047A2 (en) | 2007-01-04 | 2009-11-25 | BRITISH TELECOMMUNICATIONS public limited company | Video signal encoding |
| EP2111617B1 (en) | 2007-02-14 | 2013-09-04 | LG Electronics Inc. | Audio decoding method and corresponding apparatus |
| US7945119B2 (en) | 2007-06-26 | 2011-05-17 | Microsoft Corporation | Optimizing character rendering |
| US8295494B2 (en) | 2007-08-13 | 2012-10-23 | Lg Electronics Inc. | Enhancing audio with remixing capability |
| JP5883561B2 (ja) | 2007-10-17 | 2016-03-15 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | アップミックスを使用した音声符号器 |
| MX2011011399A (es) | 2008-10-17 | 2012-06-27 | Univ Friedrich Alexander Er | Aparato para suministrar uno o más parámetros ajustados para un suministro de una representación de señal de mezcla ascendente sobre la base de una representación de señal de mezcla descendete, decodificador de señal de audio, transcodificador de señal de audio, codificador de señal de audio, flujo de bits de audio, método y programa de computación que utiliza información paramétrica relacionada con el objeto. |
| JP5604933B2 (ja) | 2010-03-30 | 2014-10-15 | 富士通株式会社 | ダウンミクス装置およびダウンミクス方法 |
| JP5740531B2 (ja) | 2011-07-01 | 2015-06-24 | ドルビー ラボラトリーズ ライセンシング コーポレイション | オブジェクトベースオーディオのアップミキシング |
| US9516446B2 (en) * | 2012-07-20 | 2016-12-06 | Qualcomm Incorporated | Scalable downmix design for object-based surround codec with cluster analysis by synthesis |
| JP6186435B2 (ja) * | 2012-08-07 | 2017-08-23 | ドルビー ラボラトリーズ ライセンシング コーポレイション | ゲームオーディオコンテンツを示すオブジェクトベースオーディオの符号化及びレンダリング |
| US9805725B2 (en) | 2012-12-21 | 2017-10-31 | Dolby Laboratories Licensing Corporation | Object clustering for rendering object-based audio content based on perceptual criteria |
-
2015
- 2015-01-05 EP EP15700522.4A patent/EP3092642B1/en active Active
- 2015-01-05 US US15/110,371 patent/US10492014B2/en active Active
- 2015-01-05 WO PCT/US2015/010126 patent/WO2015105748A1/en not_active Ceased
- 2015-01-05 CN CN201580004002.0A patent/CN105900169B/zh active Active
- 2015-01-05 JP JP2016544661A patent/JP6518254B2/ja active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101485202A (zh) * | 2005-05-11 | 2009-07-15 | 高通股份有限公司 | 一种用于统一的错误隐匿框架的方法及设备 |
| GB2459012A (en) * | 2008-03-20 | 2009-10-14 | Univ Surrey | Predicting the perceived spatial quality of sound processing and reproducing equipment |
| CN101859563A (zh) * | 2009-04-09 | 2010-10-13 | 哈曼国际工业有限公司 | 基于音频系统输出的有源噪声控制系统 |
| CN101547000A (zh) * | 2009-05-08 | 2009-09-30 | 炬力集成电路设计有限公司 | 一种信号转换电路、数模转换装置和音频输出设备 |
| CN101582262A (zh) * | 2009-06-16 | 2009-11-18 | 武汉大学 | 一种空间音频参数帧间预测编解码方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| US10492014B2 (en) | 2019-11-26 |
| EP3092642B1 (en) | 2018-05-16 |
| US20160337776A1 (en) | 2016-11-17 |
| EP3092642A1 (en) | 2016-11-16 |
| WO2015105748A1 (en) | 2015-07-16 |
| JP6518254B2 (ja) | 2019-05-22 |
| JP2017508175A (ja) | 2017-03-23 |
| CN105900169A (zh) | 2016-08-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN105900169B (zh) | 音频内容的空间误差度量 | |
| US9479886B2 (en) | Scalable downmix design with feedback for object-based surround codec | |
| US9761229B2 (en) | Systems, methods, apparatus, and computer-readable media for audio object clustering | |
| CN103403800B (zh) | 确定多声道音频信号的声道间时间差 | |
| US11138989B2 (en) | Sound quality prediction and interface to facilitate high-quality voice recordings | |
| US20240249737A1 (en) | Audio encoding and decoding method and related product | |
| US9451304B2 (en) | Sound feature priority alignment | |
| CN105874533A (zh) | 音频对象提取 | |
| MX2013013261A (es) | Asignacion de bits, codificacion y decodificacion de audio. | |
| US11269589B2 (en) | Inter-channel audio feature measurement and usages | |
| CN102165519A (zh) | 处理信号的方法和装置 | |
| CN104900236A (zh) | 音频信号处理 | |
| US9936328B2 (en) | Apparatus and method for estimating an overall mixing time based on at least a first pair of room impulse responses, as well as corresponding computer program | |
| US10734006B2 (en) | Audio coding based on audio pattern recognition | |
| US10984811B2 (en) | Audio coding method and related apparatus | |
| US12424225B2 (en) | Lecturer speech signal processing | |
| JP2026511174A (ja) | フレームレベルの非同期メタデータの符号化 | |
| JP2025540764A (ja) | パラメトリック空間オーディオ符号化 | |
| CN116978360A (zh) | 语音端点检测方法、装置和计算机设备 | |
| CN102760442B (zh) | 一种3d音频中水平方位参数量化方法 | |
| CN117321680A (zh) | 用于处理多声道音频信号的装置和方法 | |
| CN120641979A (zh) | 用于参数化空间音频编码的优先级值 | |
| HK1220803A1 (en) | Adaptive audio content generation | |
| HK1220803B (en) | Adaptive audio content generation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |