PL4189674T3 - Apparatus, method and computer program for encoding an audio scene - Google Patents

Apparatus, method and computer program for encoding an audio scene

Info

Publication number
PL4189674T3
Authority
PL
Poland
Prior art keywords
encoding
computer program
audio scene
scene
audio
Prior art date
Application number
PL21729320.8T
Other languages
English (en)
Inventor
Guillaume Fuchs
Archit TAMARAPU
Andrea EICHENSEER
Srikanth KORSE
Stefan DÖHLA
Markus Multrus
Original Assignee
Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-07-30
Filing date
2021-05-31
Publication date
2025-05-26
Application filed by Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
Publication of PL4189674T3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012 Comfort noise or silence coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/173 Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
PL21729320.8T 2020-07-30 2021-05-31 Apparatus, method and computer program for encoding an audio scene PL4189674T3 (pl)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP20188707 2020-07-30
PCT/EP2021/064576 WO2022022876A1 (en) 2020-07-30 2021-05-31 Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene

Publications (1)

Publication Number Publication Date
PL4189674T3 (pl) 2025-05-26

Family

ID=71894727

Family Applications (1)

Application Number Title Priority Date Filing Date
PL21729320.8T PL4189674T3 (pl) 2020-07-30 2021-05-31 Apparatus, method and computer program for encoding an audio scene

Country Status (13)

Country Link
US (1) US12586595B2 (pl)
EP (2) EP4550322A3 (pl)
JP (1) JP7614328B2 (pl)
KR (1) KR20230049660A (pl)
CN (1) CN116348951A (pl)
AU (2) AU2021317755B2 (pl)
CA (1) CA3187342A1 (pl)
ES (1) ES3013669T3 (pl)
MX (1) MX2023001152A (pl)
PL (1) PL4189674T3 (pl)
TW (2) TWI794911B (pl)
WO (1) WO2022022876A1 (pl)
ZA (1) ZA202301024B (pl)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3719799A1 (en) * 2019-04-04 2020-10-07 FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. A multi-channel audio encoder, decoder, methods and computer program for switching between a parametric multi-channel operation and an individual channel operation
US20230110255A1 (en) * 2021-10-12 2023-04-13 Zoom Video Communications, Inc. Audio super resolution
CN115150718A (zh) * 2022-06-30 2022-10-04 雷欧尼斯(北京)信息技术有限公司 Playback method and production method for vehicle-mounted immersive audio
WO2024051954A1 (en) * 2022-09-09 2024-03-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2024051955A1 (en) 2022-09-09 2024-03-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata
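KR20250067839A (ko) * 2022-09-13 2025-05-15 텔레폰악티에볼라겟엘엠에릭슨(펍) Adaptive inter-channel time difference estimation
CN116368460A (zh) * 2023-02-14 2023-06-30 北京小米移动软件有限公司 Audio processing method and apparatus
TWI907957B (zh) * 2023-02-23 2025-12-11 弗勞恩霍夫爾協會 Audio signal representation decoding unit and audio signal representation encoding unit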
WO2024208964A1 (en) * 2023-04-06 2024-10-10 Telefonaktiebolaget Lm Ericsson (Publ) Stabilization of rendering with varying detail
GB2640667A (en) * 2024-04-30 2025-11-05 Nokia Technologies Oy Apparatus and methods

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2739995B1 (fr) 1995-10-13 1997-12-12 Massaloux Dominique Method and device for creating comfort noise in a digital speech transmission system
US5960389A (en) 1996-11-15 1999-09-28 Nokia Mobile Phones Limited Methods for generating comfort noise during discontinuous transmission
JPH113099A (ja) * 1997-04-16 1999-01-06 Mitsubishi Electric Corp Speech encoding/decoding system, speech encoding device, and speech decoding device
SE0004187D0 (sv) 2000-11-15 2000-11-15 Coding Technologies Sweden Ab Enhancing the performance of coding systems that use high frequency reconstruction methods
CN101213591B (zh) 2005-06-18 2013-07-24 诺基亚公司 System and method for adaptive transmission of comfort noise parameters during discontinuous speech transmission
EP2205007B1 (en) 2008-12-30 2019-01-09 Dolby International AB Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction
US8898058B2 (en) * 2010-10-25 2014-11-25 Qualcomm Incorporated Systems, methods, and apparatus for voice activity detection
CN103180899B (zh) * 2010-11-17 2015-07-22 松下电器(美国)知识产权公司 Stereo signal encoding device, decoding device, encoding method, and decoding method
PL2676264T3 (pl) * 2011-02-14 2015-06-30 Fraunhofer Ges Forschung Audio encoder estimating background noise during active phases
HUE054452T2 (hu) 2011-07-01 2021-09-28 Dolby Laboratories Licensing Corp System and method for adaptive audio signal generation, coding and rendering
SG11201500595TA (en) * 2012-09-11 2015-04-29 Ericsson Telefon Ab L M Generation of comfort noise
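KR101690899B1 (ko) * 2012-12-21 2016-12-28 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Generation of comfort noise with high spectro-temporal resolution in discontinuous transmission of audio signals
CN104050969A (zh) * 2013-03-14 2014-09-17 杜比实验室特许公司 Spatial comfort noise
CN104282309A (zh) 2013-07-05 2015-01-14 杜比实验室特许公司 Packet loss concealment apparatus and method, and audio processing system
CN103680509B (zh) * 2013-12-16 2016-04-06 重庆邮电大学 Method for discontinuous transmission of speech signals and background noise generation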
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US10861470B2 (en) 2014-02-14 2020-12-08 Telefonaktiebolaget Lm Ericsson (Publ) Comfort noise generation
CN110459229B (zh) * 2014-06-27 2023-01-10 杜比国际公司 Method for decoding a higher order ambisonics (HOA) representation of a sound or sound field
US10140996B2 (en) * 2014-10-10 2018-11-27 Qualcomm Incorporated Signaling layers for scalable coding of higher order ambisonic audio data
CN104318927A (zh) * 2014-11-04 2015-01-28 东莞市北斗时空通信科技有限公司 Noise-robust low-bit-rate speech encoding method and decoding method
PL3503097T3 (pl) 2016-01-22 2024-03-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding or decoding a multi-channel signal using spectral-domain resampling
CN107742521B (zh) * 2016-08-10 2021-08-13 华为技术有限公司 Multi-channel signal encoding method and encoder
WO2018058379A1 (zh) 2016-09-28 2018-04-05 华为技术有限公司 Method, apparatus and system for processing a multi-channel audio signal
CN117133297A (zh) 2017-08-10 2023-11-28 华为技术有限公司 Encoding method for time-domain stereo parameters and related product
US11417348B2 (en) 2018-04-05 2022-08-16 Telefonaktiebolaget Lm Ericsson (Publ) Truncateable predictive coding
EP3815082B1 (en) * 2018-06-28 2023-08-02 Telefonaktiebolaget Lm Ericsson (Publ) Adaptive comfort noise parameter determination
GB201818959D0 (en) 2018-11-21 2019-01-09 Nokia Technologies Oy Ambience audio representation and associated rendering
CN109448741B (zh) * 2018-11-22 2021-05-11 广州广晟数码技术有限公司 3D audio encoding and decoding method and apparatus

Also Published As

Publication number Publication date
KR20230049660A (ko) 2023-04-13
CA3187342A1 (en) 2022-02-03
EP4189674A1 (en) 2023-06-07
EP4550322A2 (en) 2025-05-07
EP4550322A3 (en) 2025-05-21
EP4189674B1 (en) 2025-01-15
ES3013669T3 (en) 2025-04-14
JP7614328B2 (ja) 2025-01-15
WO2022022876A1 (en) 2022-02-03
TW202347316A (zh) 2023-12-01
TWI794911B (zh) 2023-03-01
AU2021317755A1 (en) 2023-03-02
CN116348951A (zh) 2023-06-27
MX2023001152A (es) 2023-04-05
TWI884423B (zh) 2025-05-21
JP2023536156A (ja) 2023-08-23
AU2023286009A1 (en) 2024-01-25
ZA202301024B (en) 2024-04-24
EP4189674C0 (en) 2025-01-15
BR112023001616A2 (pt) 2023-02-23
AU2021317755B2 (en) 2023-11-09
AU2023286009B2 (en) 2025-07-24
US12586595B2 (en) 2026-03-24
TW202230333A (zh) 2022-08-01
US20230306975A1 (en) 2023-09-28

Similar Documents

Publication Publication Date Title
PL4189674T3 (pl) Apparatus, method and computer program for encoding an audio scene
ZA202200585B (en) An apparatus, a method and a computer program for video encoding and decoding
EP3906699A4 (en) APPARATUS, METHOD AND COMPUTER PROGRAM FOR CODING AND DECODING VIDEO
EP3776477A4 (en) APPARATUS, METHOD, AND COMPUTER PROGRAM FOR ENCODING AND DECODING VIDEO
ZA202110472B (en) An apparatus, a method and a computer program for video coding and decoding
EP3906675A4 (en) DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING
ZA202001726B (en) Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding
EP4085645A4 (en) METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR VIDEO CODING AND VIDEO CODING
EP3566445A4 (en) APPARATUS, PROCESS AND COMPUTER PROGRAM FOR VIDEO CODING AND DECODING
PL3695602T3 (pl) Apparatus, method and computer program for video coding and decoding
EP3539291A4 (en) APPARATUS, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING
EP3535977A4 (en) APPARATUS, METHOD, AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING
PL3346709T3 (pl) Apparatus, method and computer program for video coding and decoding
ZA202103741B (en) Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding using diffuse compensation
GB201807537D0 (en) An apparatus, method and computer program for audio signal processing
ZA201908191B (en) An apparatus, a method and a computer program for video coding and decoding
EP3939329A4 (en) DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING
GB202101657D0 (en) Apparatus, method and computer programs for enabling audio rendering
EP3891989A4 (en) DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO ENCODING AND DECODING
GB201707792D0 (en) An apparatus, a method and a computer program for video coding and decoding
EP3942803A4 (en) METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR VIDEO CODING AND DECODING
EP3580935A4 (en) DEVICE, METHOD AND COMPUTER PROGRAM FOR VIDEO CODING AND DECODING
GB202019567D0 (en) Apparatus, Methods and Computer Programs for Providing Spatial Audio
EP4162691A4 (en) METHOD, DEVICE AND COMPUTER PROGRAM PRODUCT FOR VIDEO CODING AND VIDEO DECODING
GB202308064D0 (en) Method, apparatus and computer program field