EP4571741A4 - Verfahren zur trennung einer zielschallquelle von einer gemischten schallquelle und elektronische vorrichtung dafür - Google Patents

Verfahren zur trennung einer zielschallquelle von einer gemischten schallquelle und elektronische vorrichtung dafür

Info

Publication number
EP4571741A4
EP4571741A4 EP23855072.7A EP23855072A EP4571741A4 EP 4571741 A4 EP4571741 A4 EP 4571741A4 EP 23855072 A EP23855072 A EP 23855072A EP 4571741 A4 EP4571741 A4 EP 4571741A4
Authority
EP
European Patent Office
Prior art keywords
sound source
separating
electronic device
device therefor
mixed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP23855072.7A
Other languages
English (en)
French (fr)
Other versions
EP4571741A1 (de
Inventor
Jaemo Yang
Joonhyuk Chang
Geeyeun Kim
Hangil Moon
Kyoungho Bang
Dail Kim
Yungyeo Kim
Minsang Baek
Wongook Choi
Jeonghwan Choi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Industry University Cooperation Foundation IUCF HYU
Original Assignee
Samsung Electronics Co Ltd
Industry University Cooperation Foundation IUCF HYU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020220125096A external-priority patent/KR20240025427A/ko
Application filed by Samsung Electronics Co Ltd, Industry University Cooperation Foundation IUCF HYU filed Critical Samsung Electronics Co Ltd
Priority claimed from PCT/KR2023/010971 external-priority patent/WO2024039102A1/ko
Publication of EP4571741A1 publication Critical patent/EP4571741A1/de
Publication of EP4571741A4 publication Critical patent/EP4571741A4/de
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • G10L21/0308Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Databases & Information Systems (AREA)
  • Quality & Reliability (AREA)
  • Telephone Function (AREA)
EP23855072.7A 2022-08-18 2023-07-27 Verfahren zur trennung einer zielschallquelle von einer gemischten schallquelle und elektronische vorrichtung dafür Pending EP4571741A4 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR20220103538 2022-08-18
KR1020220125096A KR20240025427A (ko) 2022-08-18 2022-09-30 혼합 음원으로부터 타겟 음원을 분리하는 방법 및 그 전자 장치
PCT/KR2023/010971 WO2024039102A1 (ko) 2022-08-18 2023-07-27 혼합 음원으로부터 타겟 음원을 분리하는 방법 및 그 전자 장치

Publications (2)

Publication Number Publication Date
EP4571741A1 EP4571741A1 (de) 2025-06-18
EP4571741A4 true EP4571741A4 (de) 2025-07-30

Family

ID=89907129

Family Applications (1)

Application Number Title Priority Date Filing Date
EP23855072.7A Pending EP4571741A4 (de) 2022-08-18 2023-07-27 Verfahren zur trennung einer zielschallquelle von einer gemischten schallquelle und elektronische vorrichtung dafür

Country Status (2)

Country Link
US (1) US12424241B2 (de)
EP (1) EP4571741A4 (de)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200160878A1 (en) * 2018-11-16 2020-05-21 Samsung Electronics Co., Ltd. Electronic device and method of recognizing audio scene

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3074952B2 (ja) 1992-08-18 2000-08-07 日本電気株式会社 雑音除去装置
JP3434730B2 (ja) 1999-05-21 2003-08-11 Necエレクトロニクス株式会社 音声認識方法および装置
KR100413797B1 (ko) 2001-08-23 2003-12-31 삼성전자주식회사 음성 신호 보상 방법 및 그 장치
KR101456866B1 (ko) 2007-10-12 2014-11-03 삼성전자주식회사 혼합 사운드로부터 목표 음원 신호를 추출하는 방법 및장치
KR101620866B1 (ko) 2014-12-17 2016-05-13 서울대학교산학협력단 학습 기법을 적용한 사전 학습 알고리즘 기반의 음원 분리 방법
US11568731B2 (en) 2019-07-15 2023-01-31 Apple Inc. Systems and methods for identifying an acoustic source based on observed sound
US10930301B1 (en) 2019-08-27 2021-02-23 Nec Corporation Sequence models for audio scene recognition
KR102740717B1 (ko) 2019-08-30 2024-12-11 엘지전자 주식회사 지능형 음원 분리 방법 및 장치
KR102845224B1 (ko) 2019-12-09 2025-08-12 삼성전자주식회사 전자 장치 및 이의 제어 방법
CN111179961B (zh) 2020-01-02 2022-10-25 腾讯科技(深圳)有限公司 音频信号处理方法、装置、电子设备及存储介质
CN111243620B (zh) 2020-01-07 2022-07-19 腾讯科技(深圳)有限公司 语音分离模型训练方法、装置、存储介质和计算机设备
US11114108B1 (en) 2020-05-11 2021-09-07 Cirrus Logic, Inc. Acoustic source classification using hyperset of fused voice biometric and spatial features
KR102410850B1 (ko) 2020-08-18 2022-06-20 부산대학교 산학협력단 잔향 제거 오토 인코더를 이용한 잔향 환경 임베딩 추출 방법 및 장치
US11257503B1 (en) 2021-03-10 2022-02-22 Vikram Ramesh Lakkavalli Speaker recognition using domain independent embedding

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200160878A1 (en) * 2018-11-16 2020-05-21 Samsung Electronics Co., Ltd. Electronic device and method of recognizing audio scene

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LEE JOO-HYUN ET AL: "NAS-TasNet: Neural Architecture Search for Time-Domain Speech Separation", IEEE ACCESS, vol. 10, 18 May 2022 (2022-05-18), USA, pages 56031 - 56043, XP093287735, ISSN: 2169-3536, Retrieved from the Internet <URL:https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9777717> [retrieved on 20250617], DOI: 10.1109/ACCESS.2022.3176003 *
See also references of WO2024039102A1 *
TZINIS EFTHYMIOS ET AL: "Improving Universal Sound Separation Using Sound Classification", ICASSP 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), IEEE, 4 May 2020 (2020-05-04), pages 96 - 100, XP033793569, [retrieved on 20200401], DOI: 10.1109/ICASSP40776.2020.9053921 *

Also Published As

Publication number Publication date
EP4571741A1 (de) 2025-06-18
US20240062773A1 (en) 2024-02-22
US12424241B2 (en) 2025-09-23

Similar Documents

Publication Publication Date Title
EP4325487A4 (de) Verfahren und vorrichtung zur verbesserung von sprachsignalen und elektronische vorrichtung
EP4177832A4 (de) Verfahren zur bereitstellung einer erfassungsfunktion und elektronische vorrichtung dafür
EP4283974A4 (de) Verfahren und vorrichtung zur fokussierung und elektronische vorrichtung
EP4280586A4 (de) Verfahren zur erkennung von punktlichtquellenbildern und elektronische vorrichtung
EP4133300C0 (de) Elektronische vorrichtung zur positionierung und verfahren dafür
EP4459430A4 (de) Verfahren zur erkennung von gelenkoperationen und elektronische vorrichtung
EP4387251A4 (de) Verfahren und vorrichtung zur belichtungssteuerung und elektronische vorrichtung
EP4173510A4 (de) Verfahren zur erzeugung von aerosol und elektronische vorrichtung zur durchführung davon
EP4647906A4 (de) Verfahren zur anpassung von containerressourcen und elektronische vorrichtung
EP4370999A4 (de) Verfahren und elektronische vorrichtung zur bereitstellung einer steuerungsfunktion einer anzeige
EP4459879A4 (de) Verfahren und vorrichtung zur signalinterferenzunterdrückung und elektronische vorrichtung
EP4258778A4 (de) Verfahren und vorrichtung zur konfiguration von ro-zeit-domänenressourcen und elektronische vorrichtung
EP4240046A4 (de) Verfahren und vorrichtung zur verwaltung von übertragungsparametern und elektronische vorrichtung
EP4571741A4 (de) Verfahren zur trennung einer zielschallquelle von einer gemischten schallquelle und elektronische vorrichtung dafür
EP4582917A4 (de) Verfahren zur verwaltung von dienstwidgets und elektronische vorrichtung
EP4611446A4 (de) Elektronische vorrichtung und verfahren zur phasenausrichtung
EP4557220A4 (de) Verfahren zur erzeugung einer punktwolke und elektronische vorrichtung
EP4513326A4 (de) Verfahren zur wiederherstellung differentieller dateien und elektronische vorrichtung
EP4356362A4 (de) Verfahren und elektronische vorrichtung zur verwaltung von objekten
EP4483565A4 (de) Verfahren und elektronische vorrichtung zur verkehrsweiterleitung
EP4456616A4 (de) Elektronische vorrichtung zur durchführung von backoff und betriebsverfahren dafür
EP4216054C0 (de) Erweiterbare elektronische vorrichtung und verfahren zur aktualisierung einer elektronischen vorrichtung
EP4283921A4 (de) Verfahren zur übertragung von dienstinformationen und elektronische vorrichtung
EP4610792A4 (de) Verfahren und elektronische vorrichtung zur identifizierung von eingabebewegung
EP4456067A4 (de) Elektronische vorrichtung und verfahren zur tonerkennung

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20250313

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Free format text: PREVIOUS MAIN CLASS: G10L0021028000

Ipc: G10L0021030800

A4 Supplementary search report drawn up and despatched

Effective date: 20250630

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 21/0308 20130101AFI20250624BHEP

Ipc: G06F 18/2413 20230101ALI20250624BHEP

Ipc: G06N 20/00 20190101ALI20250624BHEP

Ipc: G10L 25/30 20130101ALN20250624BHEP

Ipc: G10L 25/51 20130101ALN20250624BHEP

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)