PL4189674T3 - Urządzenie, sposób oraz program komputerowy do enkodowania sceny audio - Google Patents
Urządzenie, sposób oraz program komputerowy do enkodowania sceny audioInfo
- Publication number
- PL4189674T3 PL4189674T3 PL21729320.8T PL21729320T PL4189674T3 PL 4189674 T3 PL4189674 T3 PL 4189674T3 PL 21729320 T PL21729320 T PL 21729320T PL 4189674 T3 PL4189674 T3 PL 4189674T3
- Authority
- PL
- Poland
- Prior art keywords
- encoding
- computer program
- audio scene
- scene
- audio
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP20188707 | 2020-07-30 | ||
| PCT/EP2021/064576 WO2022022876A1 (en) | 2020-07-30 | 2021-05-31 | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| PL4189674T3 true PL4189674T3 (pl) | 2025-05-26 |
Family
ID=71894727
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PL21729320.8T PL4189674T3 (pl) | 2020-07-30 | 2021-05-31 | Urządzenie, sposób oraz program komputerowy do enkodowania sceny audio |
Country Status (13)
| Country | Link |
|---|---|
| US (1) | US12586595B2 (pl) |
| EP (2) | EP4550322A3 (pl) |
| JP (1) | JP7614328B2 (pl) |
| KR (1) | KR20230049660A (pl) |
| CN (1) | CN116348951A (pl) |
| AU (2) | AU2021317755B2 (pl) |
| CA (1) | CA3187342A1 (pl) |
| ES (1) | ES3013669T3 (pl) |
| MX (1) | MX2023001152A (pl) |
| PL (1) | PL4189674T3 (pl) |
| TW (2) | TWI794911B (pl) |
| WO (1) | WO2022022876A1 (pl) |
| ZA (1) | ZA202301024B (pl) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3719799A1 (en) * | 2019-04-04 | 2020-10-07 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | A multi-channel audio encoder, decoder, methods and computer program for switching between a parametric multi-channel operation and an individual channel operation |
| US20230110255A1 (en) * | 2021-10-12 | 2023-04-13 | Zoom Video Communications, Inc. | Audio super resolution |
| CN115150718A (zh) * | 2022-06-30 | 2022-10-04 | 雷欧尼斯(北京)信息技术有限公司 | 一种车载沉浸式音频的播放方法和制作方法 |
| WO2024051954A1 (en) * | 2022-09-09 | 2024-03-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata |
| WO2024051955A1 (en) | 2022-09-09 | 2024-03-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata |
| KR20250067839A (ko) * | 2022-09-13 | 2025-05-15 | 텔레폰악티에볼라겟엘엠에릭슨(펍) | 적응형 채널 간 시간 차이 추정 |
| CN116368460A (zh) * | 2023-02-14 | 2023-06-30 | 北京小米移动软件有限公司 | 音频处理方法、装置 |
| TWI907957B (zh) * | 2023-02-23 | 2025-12-11 | 弗勞恩霍夫爾協會 | 音訊訊號表示解碼單元和音訊訊號表示編碼單元 |
| WO2024208964A1 (en) * | 2023-04-06 | 2024-10-10 | Telefonaktiebolaget Lm Ericsson (Publ) | Stabilization of rendering with varying detail |
| GB2640667A (en) * | 2024-04-30 | 2025-11-05 | Nokia Technologies Oy | Apparatus and methods |
Family Cites Families (28)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FR2739995B1 (fr) | 1995-10-13 | 1997-12-12 | Massaloux Dominique | Procede et dispositif de creation d'un bruit de confort dans un systeme de transmission numerique de parole |
| US5960389A (en) | 1996-11-15 | 1999-09-28 | Nokia Mobile Phones Limited | Methods for generating comfort noise during discontinuous transmission |
| JPH113099A (ja) * | 1997-04-16 | 1999-01-06 | Mitsubishi Electric Corp | 音声符号化復号化システム、音声符号化装置及び音声復号化装置 |
| SE0004187D0 (sv) | 2000-11-15 | 2000-11-15 | Coding Technologies Sweden Ab | Enhancing the performance of coding systems that use high frequency reconstruction methods |
| CN101213591B (zh) | 2005-06-18 | 2013-07-24 | 诺基亚公司 | 用于非连续语音传输期间的舒适噪声参数自适应传输的系统和方法 |
| EP2205007B1 (en) | 2008-12-30 | 2019-01-09 | Dolby International AB | Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction |
| US8898058B2 (en) * | 2010-10-25 | 2014-11-25 | Qualcomm Incorporated | Systems, methods, and apparatus for voice activity detection |
| CN103180899B (zh) * | 2010-11-17 | 2015-07-22 | 松下电器(美国)知识产权公司 | 立体声信号的编码装置、解码装置、编码方法及解码方法 |
| PL2676264T3 (pl) * | 2011-02-14 | 2015-06-30 | Fraunhofer Ges Forschung | Koder audio estymujący szum tła podczas faz aktywnych |
| HUE054452T2 (hu) | 2011-07-01 | 2021-09-28 | Dolby Laboratories Licensing Corp | Rendszer és eljárás adaptív hangjel elõállítására, kódolására és renderelésére |
| SG11201500595TA (en) * | 2012-09-11 | 2015-04-29 | Ericsson Telefon Ab L M | Generation of comfort noise |
| KR101690899B1 (ko) * | 2012-12-21 | 2016-12-28 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 오디오 신호의 불연속 전송에서 높은 스펙트럼-시간 해상도를 가진 편안한 잡음의 생성 |
| CN104050969A (zh) * | 2013-03-14 | 2014-09-17 | 杜比实验室特许公司 | 空间舒适噪声 |
| CN104282309A (zh) | 2013-07-05 | 2015-01-14 | 杜比实验室特许公司 | 丢包掩蔽装置和方法以及音频处理系统 |
| CN103680509B (zh) * | 2013-12-16 | 2016-04-06 | 重庆邮电大学 | 一种语音信号非连续传输及背景噪声生成方法 |
| US9489955B2 (en) | 2014-01-30 | 2016-11-08 | Qualcomm Incorporated | Indicating frame parameter reusability for coding vectors |
| US10861470B2 (en) | 2014-02-14 | 2020-12-08 | Telefonaktiebolaget Lm Ericsson (Publ) | Comfort noise generation |
| CN110459229B (zh) * | 2014-06-27 | 2023-01-10 | 杜比国际公司 | 用于解码声音或声场的高阶高保真度立体声响复制(hoa)表示的方法 |
| US10140996B2 (en) * | 2014-10-10 | 2018-11-27 | Qualcomm Incorporated | Signaling layers for scalable coding of higher order ambisonic audio data |
| CN104318927A (zh) * | 2014-11-04 | 2015-01-28 | 东莞市北斗时空通信科技有限公司 | 一种抗噪声的低速率语音编码方法及解码方法 |
| PL3503097T3 (pl) | 2016-01-22 | 2024-03-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Urządzenie oraz sposób do enkodowania lub dekodowania sygnału wielokanałowego z wykorzystaniem ponownego próbkowania w dziedzinie widmowej |
| CN107742521B (zh) * | 2016-08-10 | 2021-08-13 | 华为技术有限公司 | 多声道信号的编码方法和编码器 |
| WO2018058379A1 (zh) | 2016-09-28 | 2018-04-05 | 华为技术有限公司 | 一种处理多声道音频信号的方法、装置和系统 |
| CN117133297A (zh) | 2017-08-10 | 2023-11-28 | 华为技术有限公司 | 时域立体声参数的编码方法和相关产品 |
| US11417348B2 (en) | 2018-04-05 | 2022-08-16 | Telefonaktiebolaget Lm Erisson (Publ) | Truncateable predictive coding |
| EP3815082B1 (en) * | 2018-06-28 | 2023-08-02 | Telefonaktiebolaget Lm Ericsson (Publ) | Adaptive comfort noise parameter determination |
| GB201818959D0 (en) | 2018-11-21 | 2019-01-09 | Nokia Technologies Oy | Ambience audio representation and associated rendering |
| CN109448741B (zh) * | 2018-11-22 | 2021-05-11 | 广州广晟数码技术有限公司 | 一种3d音频编码、解码方法及装置 |
-
2021
- 2021-05-31 JP JP2023506177A patent/JP7614328B2/ja active Active
- 2021-05-31 MX MX2023001152A patent/MX2023001152A/es unknown
- 2021-05-31 PL PL21729320.8T patent/PL4189674T3/pl unknown
- 2021-05-31 EP EP25151257.0A patent/EP4550322A3/en active Pending
- 2021-05-31 EP EP21729320.8A patent/EP4189674B1/en active Active
- 2021-05-31 WO PCT/EP2021/064576 patent/WO2022022876A1/en not_active Ceased
- 2021-05-31 AU AU2021317755A patent/AU2021317755B2/en active Active
- 2021-05-31 CN CN202180067397.4A patent/CN116348951A/zh active Pending
- 2021-05-31 ES ES21729320T patent/ES3013669T3/es active Active
- 2021-05-31 CA CA3187342A patent/CA3187342A1/en active Pending
- 2021-05-31 KR KR1020237006968A patent/KR20230049660A/ko active Pending
- 2021-07-29 TW TW110127932A patent/TWI794911B/zh active
- 2021-07-29 TW TW112106853A patent/TWI884423B/zh active
-
2023
- 2023-01-24 ZA ZA2023/01024A patent/ZA202301024B/en unknown
- 2023-01-27 US US18/160,894 patent/US12586595B2/en active Active
- 2023-12-27 AU AU2023286009A patent/AU2023286009B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| KR20230049660A (ko) | 2023-04-13 |
| CA3187342A1 (en) | 2022-02-03 |
| EP4189674A1 (en) | 2023-06-07 |
| EP4550322A2 (en) | 2025-05-07 |
| EP4550322A3 (en) | 2025-05-21 |
| EP4189674B1 (en) | 2025-01-15 |
| ES3013669T3 (en) | 2025-04-14 |
| JP7614328B2 (ja) | 2025-01-15 |
| WO2022022876A1 (en) | 2022-02-03 |
| TW202347316A (zh) | 2023-12-01 |
| TWI794911B (zh) | 2023-03-01 |
| AU2021317755A1 (en) | 2023-03-02 |
| CN116348951A (zh) | 2023-06-27 |
| MX2023001152A (es) | 2023-04-05 |
| TWI884423B (zh) | 2025-05-21 |
| JP2023536156A (ja) | 2023-08-23 |
| AU2023286009A1 (en) | 2024-01-25 |
| ZA202301024B (en) | 2024-04-24 |
| EP4189674C0 (en) | 2025-01-15 |
| BR112023001616A2 (pt) | 2023-02-23 |
| AU2021317755B2 (en) | 2023-11-09 |
| AU2023286009B2 (en) | 2025-07-24 |
| US12586595B2 (en) | 2026-03-24 |
| TW202230333A (zh) | 2022-08-01 |
| US20230306975A1 (en) | 2023-09-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| PL4189674T3 (pl) | Urządzenie, sposób oraz program komputerowy do enkodowania sceny audio | |
| ZA202200585B (en) | An apparatus, a method and a computer program for video encoding and decoding | |
| EP3906699A4 (en) | APPARATUS, METHOD AND COMPUTER PROGRAM FOR CODING AND DECODING VIDEO | |
| EP3776477A4 (en) | APPARATUS, METHOD, AND COMPUTER PROGRAM FOR ENCODING AND DECODING VIDEO | |
| ZA202110472B (en) | An apparatus, a method and a computer program for video coding and decoding | |
| EP3906675A4 (en) | DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING | |
| ZA202001726B (en) | Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding | |
| EP4085645A4 (en) | METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR VIDEO CODING AND VIDEO CODING | |
| EP3566445A4 (en) | APPARATUS, PROCESS AND COMPUTER PROGRAM FOR VIDEO CODING AND DECODING | |
| PL3695602T3 (pl) | Urządzenie, sposób i program komputerowy do kodowania i dekodowania wideo | |
| EP3539291A4 (en) | APPARATUS, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING | |
| EP3535977A4 (en) | APPARATUS, METHOD, AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING | |
| PL3346709T3 (pl) | Urządzenie, sposób i program komputerowy do kodowania oraz dekodowania wideo | |
| ZA202103741B (en) | Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding using diffuse compensation | |
| GB201807537D0 (en) | An apparatus, method and computer program for audio signal processing | |
| ZA201908191B (en) | An apparatus, a method and a computer program for video coding and decoding | |
| EP3939329A4 (en) | DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING | |
| GB202101657D0 (en) | Appartus, method and computer programs for enabling audio rendering | |
| EP3891989A4 (en) | DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING | |
| GB201707792D0 (en) | An apparatus, a method and a computer program for video coding and decoding | |
| EP3942803A4 (en) | METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR VIDEO CODING AND DECODING | |
| EP3580935A4 (en) | DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO CODING AND DECODING | |
| GB202019567D0 (en) | Apparatus, Methods and Computer Programs for Providing Spatial Audio | |
| EP4162691A4 (en) | METHOD, DEVICE AND COMPUTER PROGRAM PRODUCT FOR VIDEO CODING AND VIDEO DECODING | |
| GB202308064D0 (en) | Method, apparatus and computer program field |