WO2014146547A1 - 声音信号处理方法及设备 - Google Patents

声音信号处理方法及设备 Download PDF

Info

Publication number
WO2014146547A1
WO2014146547A1 PCT/CN2014/073124 CN2014073124W WO2014146547A1 WO 2014146547 A1 WO2014146547 A1 WO 2014146547A1 CN 2014073124 W CN2014073124 W CN 2014073124W WO 2014146547 A1 WO2014146547 A1 WO 2014146547A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound signal
sound
signal processing
user
indication information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2014/073124
Other languages
English (en)
French (fr)
Inventor
张德明
刘媛媛
龙志明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to EP14767360.2A priority Critical patent/EP2977985A4/en
Publication of WO2014146547A1 publication Critical patent/WO2014146547A1/zh
Priority to US14/859,743 priority patent/US20160011851A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/323Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only for loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers
    • H04R3/005Circuits for transducers for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers
    • H04R3/12Circuits for transducers for distributing signals to two or more loudspeakers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/4012D or 3D arrays of transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/403Linear arrays of transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2203/00Details of circuits for transducers, loudspeakers or microphones covered by H04R3/00 but not provided for in any of its subgroups
    • H04R2203/12Beamforming aspects for stereophonic sound reproduction with loudspeaker arrays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/11Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's

Definitions

  • the embodiments of the present invention relate to communication technologies, and in particular, to a voice signal processing method and device. Background technique
  • a microphone array is arranged on the terminal device for sound signal pickup, or a speaker array is arranged on the terminal device for sound signal transmission.
  • a speaker array is arranged on the terminal device for sound signal transmission.
  • the terminal device can calculate the spatial information of the sound source according to factors such as the strength of the sound signal collected by each microphone in the microphone array and the position of each microphone, and then pass the sound source.
  • the spatial information estimates the location of the primary sound source, and adaptively adjusts the position of the primary sound source, and adjusts the acquisition pointing to acquire the sound beam of the primary sound source. In the end, only the sound signal of the main sound source is retained or enhanced, so that the sound of the main sound source can be collected or transmitted more clearly.
  • Embodiments of the present invention provide a method and a device for processing a sound signal.
  • a method for processing a sound signal includes: receiving direction indication information input by a user for indicating a target direction; and adjusting a beam direction of the sound signal processing array according to the direction indication information to A state corresponding to the target direction.
  • the adjusting a beam direction of the sound signal processing array to a state corresponding to the target direction according to the direction indication information including: indicating according to the direction Information, determining a signal delay corresponding to each sound signal processing unit in the sound signal processing array; and delaying the sound signal to be processed by the sound signal processing unit corresponding to the signal delay according to each of the signal delays Processing to obtain a delayed processed sound signal, and transmitting the delayed processed sound signal to a beamformer to adjust beam orientation of the sound signal processing array to correspond to the target direction status.
  • determining, according to the direction indication information, a signal delay corresponding to each sound signal processing unit in the sound signal processing array The specific time includes: obtaining, according to the direction indication information, a signal delay corresponding to each sound processing unit n in the sound signal processing array by using the following formula Wherein d n represents a distance between the sound signal processing unit n of one of the sound signal processing units included in the sound signal processing array arranged in a line and the center of the sound signal processing array, and represents the center of the sound signal processing array Pointing at an approximate angle between the target direction indicated by the direction indication information and a reference coordinate, c represents a speed of sound, and f s represents a sound signal sampling frequency of the sound signal processing array; correspondingly, according to each The signal delay, the delay processing of the sound signal to be processed by the sound signal processing unit corresponding to the signal delay to obtain the sound signal after the delay processing specifically includes: according to the signal corresponding to the sound signal
  • a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit n, where ⁇ represents a frequency index.
  • determining, according to the direction indication information, a signal delay corresponding to each sound signal processing unit in the sound signal processing array includes: guiding, according to the direction indication information, The following formula obtains a set ⁇ of signal delays corresponding to respective sound processing units in the sound signal processing array or
  • ⁇ ⁇ represents a signal delay corresponding to the third sound processing unit in the sound processing array
  • r m represents a certain target ⁇ to the mth target in the target direction indicated by the direction indication information
  • r a represents the distance between a certain target A and a coordinate origin of a reference coordinate in the target direction indicated by the direction indication letter
  • the coordinate origin of the reference coordinates is the circular arrangement a center of the sound processing array
  • b represents a radius of the sound processing array uniformly arranged in a ring shape
  • a m represents a projection A of the target A on a plane of the sound processing array
  • An angle between a line connecting and a line connecting the mth sound processing unit to the coordinate origin; the a m is obtained by the following formula:
  • the processed sound signal specifically includes: delaying processing the sound signal corresponding to the sound signal processing unit M according to the signal delay T M or M in the signal delay set T or ⁇ , and delay processing
  • the subsequent sound signal can be expressed as:
  • ⁇ ⁇ ⁇ , ⁇ ⁇ ⁇ (i )*e- jiyr " or
  • is a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit ⁇ , "representing the frequency index.
  • the sound signal processing array includes a sound signal collecting array
  • the sound signal acquisition array includes a plurality of sound signal acquisition units; correspondingly, Determining, according to the direction indication information, a signal delay corresponding to each sound signal processing unit in the sound signal processing array, comprising: determining, according to the direction indication information, a corresponding to each sound signal collection unit in the sound signal collection array Correspondingly, according to each signal delay, performing delay processing on the sound signal to be processed by the sound signal processing unit corresponding to the signal delay, including: according to each signal delay, The sound signal collected by the sound signal collecting unit corresponding to the signal delay is subjected to delay processing; or the sound signal processing array includes a sound signal transmitting array, and the sound signal transmitting array includes a plurality of sound signal transmitting units; correspondingly, Determining, according to the direction indication information, the corresponding signal delay of each sound signal processing unit in the sound signal processing array,
  • the receiving user input is used to indicate a target Before the direction indication information of the direction, the method further includes: receiving a control mode display instruction input by the user; acquiring and displaying, according to the control mode display instruction, a plurality of control modes selectable, each of the plurality of control modes The control mode corresponds to at least one direction indication information input manner; receiving a control mode selection instruction sent by the user for the multiple control modes, and turning on a control mode selected by the user; and receiving the user input input indicating the target direction
  • the direction indication information includes: receiving the direction indication information used by the user to indicate a target direction by using a direction indication information input manner corresponding to the selected control mode.
  • the multiple control modes include a user autonomous control mode, where the user autonomous control mode is to allow a user to input autonomously The mode of the direction indication information; if the control mode selected by the user is the user autonomous control mode, the receiving user inputs the direction indication information input mode corresponding to the selected control mode to indicate the target direction
  • the direction indication information includes: receiving a voice signal input by the user by using a voice input manner, parsing the voice signal to obtain direction indication information included in the voice signal; or receiving the user by using a touch input manner Slide the input track information on the touch screen to parse the track information Determining the target direction, and generating direction indication information to indicate the target direction; or receiving gesture image information input by the user through the projection screen, parsing the target direction indicated by the gesture image information, and generating Direction indicating information indicating the target direction; or receiving a control instruction for indicating a target direction input by the user through an instruction corresponding to the selected control mode, the
  • a seventh possible implementation manner of the first aspect if the mode corresponding to the control mode selected by the user indicates that the information input mode is a touch input mode, where: Receiving, by the user, the trajectory information input on the touch screen by the touch input manner, comprising: receiving a trajectory information that the user slides and inputs on the touch screen; correspondingly, the parsing the target direction indicated by the trajectory information, and generating
  • the direction indication information used to indicate the target direction includes: determining, according to the start position information and the end position information of the sliding input on the touch screen included in the one piece of track information, the indication indicated by the sliding input a direction of the target direction, and generating direction indication information to indicate the target direction; or receiving the track information input by the user on the touch screen in a touch input manner, including: receiving two of the user sliding input on the touch screen Trajectory information; corresponding to the parsing of the trace information
  • the target direction, and the direction indication information for indicating the target direction includes: determining,
  • the sliding input station is determined according to an angular range defined by an extending direction of the two tracks Determining the target direction, comprising: determining a direction in which the angle bisector of the angle formed by the two tracks extends according to the extending direction of the two tracks; determining the sliding input station according to the direction in which the angle bisector extends
  • the indicated target direction is in a first angular range centered on the direction in which the angle bisector extends, with the half of the angle formed by the two tracks being the main lobe width.
  • a ninth possible implementation manner of the first aspect before the receiving the direction indication information input by the user for indicating the target direction And further comprising: sensing a change in orientation of the sound signal processing array, and determining a change amount of orientation; determining the sound signal processing according to the change amount of the orientation and a target direction before the orientation change of the sound signal processing array The target direction after the orientation of the array is changed; prompting the user to input the direction indication information for indicating the target direction after the orientation change.
  • the control mode selected by the user is a scan mode, where the scan mode is prompting a user input by scanning The mode of the direction indication information; before the receiving the direction indication information input by the direction indication information input mode corresponding to the selected control mode, the method further includes: collecting sound signals in at least two directions by scanning; The energy of the sound signal determines a sound source direction of the sound signal with the largest energy; and prompts the user to input the direction indication information according to the sound source direction.
  • another method for processing a sound signal includes: determining a receiving direction of a sound signal processing array according to a direction of a primary sound source, and determining a beam direction according to the receiving direction and the sound signal processing array a target direction of the processing array, where the target direction refers to an orientation of the processing array when a receiving direction of the processing array is consistent with a direction of a main sound source; determining an orientation of the sound signal processing array and the target direction The angle is prompted to prompt the user to adjust the orientation of the sound signal processing array according to the angle to make the receiving direction of the sound signal processing array coincide with the direction of the main sound source.
  • a third aspect of the present invention provides a sound signal processing device, including: a receiving module, configured to receive direction indication information input by a user for indicating a target direction; and an adjustment module, configured to display information according to the direction, The beam pointing of the sound signal processing array is adjusted to a state corresponding to the target direction.
  • the adjusting module is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal processing unit in the sound signal processing array; a signal delay, delaying processing of the sound signal to be processed by the sound signal processing unit corresponding to the signal delay to obtain a delayed processed sound signal, and the delayed processed sound Signal is transmitted to the beamformer to The beam direction of the sound signal processing array is adjusted to a state corresponding to the target direction.
  • the adjusting module is specifically configured to:
  • d n represents a distance between the sound signal processing unit n of one of the sound signal processing units included in the sound signal processing array arranged in a line and the center of the sound signal processing array, and represents the center of the sound signal processing array Pointing to an approximate angle between the target direction indicated by the direction indication information and a reference coordinate, c represents a speed of sound, and f s represents a sound signal sampling frequency of the sound signal processing array;
  • a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit ⁇ , where ⁇ represents a frequency index.
  • the adjusting module is specifically configured to:
  • r a represents a certain target in the target direction indicated by the direction indication letter
  • the distance between the coordinate A and the coordinate origin of a reference coordinate, the coordinate origin of the reference coordinate is the center of the sound processing array arranged in a ring shape
  • b represents the radius of the sound processing array uniformly arranged in a ring shape
  • a m represents a projection A of the certain target A on the plane of the sound processing array, a connection between the coordinate origin and the mth sound processing unit to the coordinate origin The angle between the lines;
  • ai represents a connection between the projection of the target A on the plane of the sound processing array and the coordinate origin and the connection between the first sound processing unit and the coordinate origin The angle between the lines;
  • the sound signal corresponding to the sound signal processing unit ⁇ is delayed, and the delayed sound signal can be Expressed as:
  • is a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit ⁇ , where ⁇ represents a frequency index.
  • the sound signal processing array includes a sound signal collecting array, and the sound signal collecting The array includes a plurality of sound signal collecting units.
  • the adjusting module is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal collecting unit in the sound signal collecting array; And delaying the sound signal collected by the sound signal collecting unit corresponding to the signal delay; or the sound signal processing array includes a sound signal sending array, and the sound signal sending array includes a plurality of sound signal sending units
  • the adjusting module is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal sending unit in the sound signal sending array; and delay the signal according to each signal delay The sound signal sent by the corresponding sound signal transmitting unit is delayed. Reason.
  • the receiving module is further configured to: receive user input The control mode displays the command;
  • the device further includes: a display module, configured to: according to the control mode display instruction, acquire and display to the user a plurality of control modes that are selectable, and each of the multiple control modes controls The mode corresponds to at least one direction indicating information input manner;
  • the receiving module is further configured to: receive a control mode selection instruction sent by the user for the multiple control modes, and enable a control mode selected by the user, and receive the user through the device
  • the direction corresponding to the selected control mode indicates the direction indication information input by the information input mode for indicating the target direction.
  • the multiple control modes include a user autonomous control mode, where the user autonomous control mode allows the user to input autonomously.
  • the direction indicates the mode of the information; if the control mode selected by the user is the user autonomous control mode; the receiving module is specifically configured to: receive the voice signal input by the user through the voice input mode, and parse the voice signal And acquiring, by the receiving module, the trajectory information that is slid and input on the touch screen by the user in a touch input manner, and parsing the target direction indicated by the trajectory information.
  • the receiving module is specifically configured to: receive the gesture image information that is input by the user through the projection screen, and parse the target direction indicated by the gesture image information And generating a direction to indicate the target direction
  • the receiving module is configured to: receive a control instruction for indicating a target direction input by the user through an instruction input terminal device corresponding to the selected control mode, where the control is used to indicate a target direction
  • the instruction includes the direction indication information for indicating the target direction
  • the receiving module is specifically configured to: receive, by the user, scroll information input by a keyboard wheel corresponding to the selected control mode, where the scroll information includes The direction indication information for indicating a target direction.
  • the receiving module is specifically configured to: receive a trajectory information that is slidably input by the user on the touch screen; and determine, according to the starting position information and the end position information of the sliding input on the touch screen, the trajectory information included in the one trajectory information
  • the sliding direction inputs the indicated target direction, and generates direction indication information to indicate the target direction
  • the receiving module is specifically configured to: receive two pieces of track information that the user slides on the touch screen; a starting point of the sliding input included in the two track information on the touch screen
  • the position information and the end position information respectively determine an extending direction of the two tracks; determining, according to an angular range defined by the extending directions of the two tracks, a target direction indicated by the sliding input, and generating to indicate the target The direction indication of the direction.
  • the receiving module is specifically configured to: determine the two tracks according to an extending direction of the two tracks a direction in which the angle bisector of the included angle extends; determining, according to a direction in which the angle bisector extends, determining that the target direction indicated by the sliding input is in a first angular range, the first angular range being equally divided by the angle
  • the line extending direction is centered, and half of the angle formed by the two trajectories is the main lobe width.
  • the method further includes: a sensing module: configured to sense the sound signal processing array The orientation of the change is changed, and the amount of change toward the orientation is determined; the determining module is configured to determine the target after the orientation change of the sound signal processing array according to the orientation change amount and the target direction before the orientation change of the sound signal processing array Direction; a prompting module: configured to prompt the user to input the direction indication information for indicating a target direction after the orientation change.
  • the control mode selected by the user is a scan mode, where the scan mode is prompting a user input by scanning
  • the mode of the direction indication information correspondingly, the method further includes: a scanning module, configured to acquire sound signals in at least two directions by scanning; and an analysis module, configured to analyze energy of the sound signal, and determine a sound signal with the largest energy a sound source direction; the prompting module is further configured to prompt the user to input the direction indication information according to the sound source direction.
  • another sound signal processing device includes: a determining module, configured to determine a receiving direction of the sound signal processing array according to the direction of the main sound source, and process the sound signal according to the receiving direction and the sound signal
  • the beam direction of the array determines a target direction of the processing array, the target direction refers to an orientation of the processing array when a receiving direction of the processing array is consistent with a direction of a main sound source; determining an orientation and a position of the sound signal processing array The angle of the target direction, the beam direction of the sound signal processing array is fixed; the prompting module is configured to prompt the user to adjust the orientation of the sound signal processing array according to the angle to enable the receiving of the sound signal processing array The direction is consistent with the direction of the main sound source.
  • the sound signal processing method and device provided by the embodiment of the present invention are provided according to a user provided Indicates the direction indication of the target direction, and adjusts the beam direction of the sound signal processing array, so that the sound signal can be accurately processed in a noisy environment, thereby enhancing the processed sound signal.
  • a sound signal processing method and apparatus according to another embodiment of the present invention may determine a receiving direction of a signal processing array according to a direction of a primary sound source, and determine a processing array according to the receiving direction and beam pointing of the sound signal processing array.
  • FIG. 1 is a flow chart of FIG audio signal processing method provided by the embodiment of the present invention
  • FIG. 2 provides an audio signal processing method according to a second embodiment of a schematic diagram of the present invention
  • FIG. 4 is a schematic diagram of Embodiment 4 of a sound signal processing method according to the present invention
  • FIG. 5 is a schematic diagram of Embodiment 5 of a sound signal processing method provided by the present invention
  • FIG. Figure 7 is a flowchart of Embodiment 7 of a sound signal processing method provided by the present invention
  • Figure 8 is a schematic structural diagram of Embodiment 1 of a sound signal processing device provided by the present invention
  • Figure 9 is a sound signal processing device provided by the present invention
  • FIG. 1 is a flow chart of FIG audio signal processing method provided by the embodiment of the present invention
  • FIG. 2 provides an audio signal processing method according to a second embodiment of a schematic diagram of the present invention
  • FIG. 4 is a schematic diagram of Embodiment 4
  • FIG. 10 is a schematic structural diagram of Embodiment 3 of a sound signal processing device according to the present invention
  • FIG. 1 is a schematic structural diagram of Embodiment 4 of a sound signal processing device provided by the present invention.
  • DETAILED DESCRIPTION OF THE EMBODIMENTS In order to make the objects, technical solutions and advantages of the embodiments of the present invention more clear, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. It is apparent that the described embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
  • FIG. 1 is a flowchart of a first embodiment of a sound signal processing method according to the present invention. As shown in FIG. 1 , the sound signal processing method provided in this embodiment may include:
  • the target direction may be the direction of the main sound source or other microphone arrays need to be collected.
  • the direction in which the sound signal is located; and in the case where the embodiment of the present invention is applied to the sound signal transmitting array to transmit the sound signal, for example, in the case where the speaker array transmits the sound signal, the target direction may be the direction in which the sound signal is sent.
  • the sound signal acquisition array and the sound signal transmission array are collectively referred to as a sound signal processing array.
  • the present invention is applied to the case where the sound signal collecting array is used to collect the sound signal, and the embodiment of the present invention is described as an example. Those skilled in the art can obtain the embodiment of the present invention without using creative labor. An embodiment in the case where the sound signal transmitting array transmits a sound signal.
  • the apparatus for performing the sound signal processing method may receive direction indication information input by the user for indicating a target direction, and the direction indication information may be information indicating a target direction input by the user by using a voice, a gesture, or a rotary knob. .
  • the beam pointing of the sound signal processing array can be understood as the source or transmission direction of the sound signal enhanced by the sound signal processing array.
  • the beam pointing of the microphone array can be understood as the source direction of the sound signal mainly collected after the microphone array is enhanced;
  • the beam pointing of the speaker array can be understood as the main direction of the sound signal transmission after the speaker array is enhanced.
  • the sound signal processing device may adjust the beam direction of the sound signal processing array to a state corresponding to the target direction according to the direction indication information for indicating the target direction, for example, adjusting the beam direction of the microphone array to the desired sound source.
  • the sound signal processing method provided by the embodiment adjusts the beam direction of the sound signal processing array according to the direction indication information provided by the user for indicating the target direction, so that the sound signal can be accurately processed in a noisy environment. Further, the processed sound signal has an enhanced effect.
  • adjusting the beam direction of the sound signal processing array to a state corresponding to the target direction according to the direction indication information includes: determining, according to the direction indication information, corresponding to each sound signal processing unit in the sound signal processing array Signal delay; according to each signal delay, delaying processing of the sound signal to be processed by the sound signal processing unit corresponding to the signal delay to obtain the delayed processed sound signal, and delay processing
  • the sound signal is transmitted to a beamformer to adjust the beam pointing of the sound signal processing array to a state corresponding to the target direction.
  • the sound signal processing array includes a sound signal collecting array, and the sound signal collecting array includes a plurality of sound signal collecting units.
  • determining a signal delay corresponding to each sound signal processing unit in the sound signal processing array according to the direction indication information including: Determining, according to the direction indication information, a signal delay corresponding to each sound signal acquisition unit in the sound signal acquisition array; correspondingly, according to each signal delay, delaying the sound signal to be processed by the sound signal processing unit corresponding to the signal delay
  • the time processing includes: delaying processing, according to each signal delay, a sound signal collected by the sound signal collecting unit corresponding to the signal delay; or, the sound signal processing array includes a sound signal sending array, and the sound signal is sent
  • the array includes a plurality of sound signal sending units; correspondingly, determining, according to the direction indication information, the corresponding signal delay of each sound signal processing unit in the sound signal processing array, comprising: determining the sound signal according to the direction indication information a signal delay corresponding to each sound signal sending unit in the array;
  • the direction indication information determining a signal delay corresponding to each sound signal processing unit in the sound signal processing array, for example, according to direction indication information for indicating a target direction, and a sound signal processing unit included in the microphone array, that is, a microphone Arrangement position of the unit Obtaining a signal delay corresponding to each microphone unit according to the information; performing delay processing on the sound signal to be processed by the sound signal processing unit corresponding to the signal delay according to each signal delay, for example, acquiring each microphone included in the microphone array After the signal corresponding to the unit is delayed, the sound signal collected by the corresponding microphone unit may be delayed according to each signal delay, so that the microphone unit only retains or enhances the collected sound in the target direction. .
  • the sound signal to be sent by the corresponding speaker unit may be delayed according to each signal delay, so that the speaker unit only retains or enhances the sound sent to the target direction. purpose.
  • the sound signal processing method provided by the embodiment according to the direction indication information provided by the user for indicating the target direction, first determines the delay of the sound signal, and then delays the sound signal according to the delay of the sound signal, thereby using the sound signal.
  • the beam pointing of the processing array is adjusted, so that the sound signal can be accurately processed in a noisy environment, thereby enhancing the processed sound signal.
  • determining the signal delay corresponding to each sound signal processing unit in the sound signal processing array according to the direction indication information may include: determining, according to each direction indication information, the sound signal processing array The signal delay of each of the sound signal processing units arranged in a line or ring.
  • obtaining a signal delay corresponding to the microphone unit included in the microphone array according to the direction indication information may include: acquiring, according to the direction indication information, a signal delay corresponding to one or more microphone units included in the microphone array
  • the microphone unit may be arranged in various forms such as a line shape, a ring shape or other irregular shapes.
  • FIG. 2 is a schematic diagram of Embodiment 2 of a method for processing a sound signal according to the present invention.
  • the microphone units 1 to N included in the microphone array of the mobile phone are linear.
  • the center point of the mobile phone is taken as the coordinate origin 0, and the target is A.
  • the mobile phone receives the direction indication information BC that the user slides on the touch screen, and the direction indication information BC is used to indicate the direction of the target A.
  • the angle between the direction indication information BC and the X axis of the abscissa axis is ⁇ , and the center D of the microphone array points to the target.
  • the distance between the microphone unit ⁇ of one of the microphone units included in the linear array of microphones and the center D of the microphone array is d n . Then the signal delay ⁇ ) corresponding to the microphone unit n can be expressed as:
  • Equation (1) c represents the speed of sound, and f s represents the sound signal sampling frequency of the microphone array.
  • ⁇ ⁇ is a frequency domain signal obtained by time-frequency changing the collected sound signal, and ⁇ represents a frequency index.
  • the delayed processed sound signal is sent to the beamformer to adjust the beam pointing of the microphone array to preserve or enhance the sound signal from the target ⁇ according to the direction indication information BC.
  • FIG. 3 is a schematic diagram of Embodiment 3 of a method for processing a sound signal according to the present invention.
  • the M microphone units included in the microphone array of the mobile phone are evenly arranged in a ring shape.
  • the center point of the mobile phone is the coordinate origin 0
  • the target is A
  • the radius of the ring array composed of M microphone units is b
  • the angle between the angles is ⁇
  • the distance from the target A to the coordinate origin 0 is r a
  • the sound signal sampling frequency of the microphone array is f a
  • the sound speed is c
  • the projection of the target A on the plane of the microphone array is A
  • the OA The angle between the line connecting the microphone unit 1 and the coordinate origin 0 is defined as ap
  • the angle a m between the OA and the microphone unit m to the coordinate origin 0 can be expressed as: 2 ⁇ ( ⁇ - 1)
  • Equation (5) According to the signal delay set T or in the signal delay T M or M will be collected corresponding to the sound signal processing unit M The sound signal is delayed, and the delayed sound signal can be expressed as:
  • ⁇ ⁇ ( ⁇ , ⁇ ) ⁇ ⁇ ( ⁇ , ⁇ )* ⁇ - ⁇
  • is a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit ⁇ , where ⁇ represents a frequency index.
  • the delayed sound signal is sent to the beamformer, and The beam pointing of the microphone array is adjusted to preserve or enhance the purpose of the sound signal originating from the target pupil.
  • the embodiment of the sound signal processing method provided by FIG. 2 and FIG. 3 is exemplified by the linear and circular arrangement of the microphone unit, and the sound signal processing method provided by the embodiment of the present invention is described. Not limited to this, the calculation method is not limited to this.
  • the signal delay in addition to calculating the signal delay in a sample-based manner, the signal delay can also be expressed in the form of a phase difference as follows:
  • the corresponding delay processing is performed to adjust the beam direction of the microphone array, so that the sound signal can be accurately processed in a noisy environment, thereby enhancing the processed sound signal.
  • the method further includes: receiving a control mode display instruction input by the user; acquiring and displaying to the user, the plurality of control modes that are selectable according to the control mode display instruction, Each of the plurality of control modes corresponds to at least one direction indication information input mode; receiving a control mode selection instruction sent by the user, turning on a control mode selected by the user; receiving a direction indication input by the user for indicating the target direction
  • the information includes: receiving direction indication information indicating a target direction input by the user through a direction indication information input manner corresponding to the selected control mode.
  • the embodiment of the present invention can provide the user with various control modes for controlling the manner of processing the sound signal for the user to select.
  • the sound signal processing device can acquire various control modes that can be provided to the user and display to the user.
  • the control mode may have one or more direction indication information input modes, that is, the user may input direction indication information indicating the direction of the target in one or more modes in the control mode.
  • the sound signal processing device turns on the control mode selected by the user, and the user inputs the direction indication information by using the direction indication information input mode in the selected control mode to control the processing of the sound signal.
  • the embodiment of the present invention provides a plurality of control modes for the user to control the processing of the sound signal, and aims to provide the user with a plurality of different options to implement a plurality of methods for processing the sound signal, and at the same time increase the convenience of the user.
  • the control mode selected by the user is the user autonomous control mode, and the user autonomous control mode is a mode that allows the user to input the direction indication information autonomously;
  • the voice signal can be processed in the following ways:
  • Receiving the direction indication information for indicating the target direction that is input by the user through the direction indication information input manner corresponding to the selected control mode including: receiving a voice signal input by the user through the voice input manner, and parsing the voice signal to obtain the voice signal Directional information included.
  • the user can input a voice signal for indicating the target direction through voice input, and the voice signal can be recognized by the voice recognizer, and the voice recognizer is in the word
  • the words indicating the azimuth or angle are strengthened to ensure that the recognition rate of similar words is above 95%. According to relevant verification, the recognition rate of key words of small vocabulary can reach 95%. the above.
  • the user can say the target direction of the desired microphone array beam of the mobile phone, for example, "the right front thirty-five degrees.”
  • the voice recognizer of the mobile phone obtains the orientations indicated by "right”, “front” and “fifty-five degrees” in the voice signal according to the pre-established rule
  • the beam direction of the sound signal processing array is adjusted to the target direction corresponding to the target direction information.
  • the embodiment of the sound signal processing method for inputting a voice signal for indicating a target direction by voice does not need to be based on a user's manual input operation, and can acquire a corresponding microphone unit included in the microphone array according to a voice signal provided by the user.
  • the signal delay is performed by performing corresponding delay processing on the sound signal collected by the microphone unit, thereby realizing the retention or enhancement of the sound signal in the direction indicated by the voice signal by the microphone unit, and then the collected sound can still be obtained in a noisy environment.
  • the signal has an enhanced effect.
  • Receiving the trajectory information input by the user on the touch screen by touch input, analyzing the target direction indicated by the trajectory information, and generating direction indication information indicating the target direction comprising: receiving track information that the user slides on the touch screen; correspondingly, analyzing the target direction indicated by the track information, and generating direction indication information indicating the target direction
  • the method includes: determining, according to the start position information and the end position information of the sliding input on the touch screen, the target direction indicated by the sliding input, and generating direction indication information for indicating the target direction; or receiving the user by touch input
  • the method of sliding the input track information on the touch screen includes: receiving two track information that the user slides on the touch screen; correspondingly, analyzing the target direction indicated by the track information, and generating direction indication information indicating the target direction, including: According to the starting position information and the end position information of the sliding input included in the two track information on the touch screen, the extending directions of the two tracks are respectively determined, and the sliding
  • the user slides the input direction finger toward the target direction on the touch screen of the mobile phone.
  • the information that is, the trajectory information BC, is used to control the acquisition of the sound signal of the target array by the microphone array, and the sound signal processing method provided by the embodiment is described in detail:
  • the first step receiving a track information BC that the user slides on the touch screen of the mobile phone toward the target direction;
  • the second step determining the coordinates (x b , y b ) of the starting point B of the track information BC in the coordinate system of the mobile phone, and the coordinates (x c , y c ) of the end point C of the track information BC in the coordinate system of the mobile phone, Obtain an angle between the trajectory information BC and the X coordinate of the abscissa axis of the coordinate system of the mobile phone: - arctan ⁇ c ⁇ b
  • the method of obtaining ⁇ is not limited thereto.
  • the a when the user inputs the track information BC through a voice signal or through a knob, the a can be obtained according to a predefined voice indication rule, or ⁇ can be directly learned according to the knob scale value;
  • Step 4 According to the angle ⁇ of the center of the microphone array pointing to the target ⁇ and the angle X of the abscissa axis, the signal delay corresponding to the microphone unit n included in the microphone array is obtained :
  • the distance between the microphone unit n of one of the microphone units included in the linear array of microphones and the center D of the microphone array is d n
  • c represents the speed of sound, representing the sound signal sampling of the microphone array. frequency
  • Step 5 Pre-processing the sound signal collected by the microphone array, including noise reduction and echo suppression, and then changing the time-frequency of the sound signal to obtain the frequency domain signal ⁇ ⁇ , where "representing the frequency index, for the collected The sound signal is delayed, and the delayed sound signal is:
  • Step 6 Send the delayed processed sound signal to the beamformer to achieve The trajectory information BC adjusts the beam pointing of the microphone array to preserve or enhance the purpose of the sound signal originating from the target A.
  • the user Before performing the above-described embodiment, or after obtaining the angle ⁇ between the direction in which the microphone array center D points to the target A and the axis of the abscissa axis X, the user may be provided with a confirmation prompt to prompt the user to confirm whether to change the direction for indicating the target direction. Indicates the information, or prompts the user to confirm whether the newly entered direction indication information is correct, etc., to avoid the user's false start or misoperation of the input direction indication information.
  • the embodiment of the sound signal processing method for inputting the trajectory information for indicating the target direction through the touch screen provides the user with an intuitive direction indication information input mode, and the user can conveniently provide the trajectory information to the mobile phone according to the target direction,
  • the mobile phone determines the target direction according to the trajectory information provided by the user, thereby adjusting the beam direction of the sound signal processing array, so that the sound source signal can be accurately processed in the noisy environment, and then the processed sound signal is played.
  • Enhanced effect is e.
  • FIG. 4 is a schematic diagram of a method for processing a sound signal according to a fourth embodiment of the present invention. As shown in FIG. 4, the method for processing a sound signal according to the embodiment is different from the previous embodiment in the method for processing a sound signal provided by the embodiment.
  • the track information input by the user input and input through the touch screen for indicating the target direction may include two or more pieces of track information.
  • the sound signal processing method provided by the embodiment is described in detail by taking the input track information and the track information 8 (3 ⁇ 4 to control the beam pointing of the microphone array) on the touch screen of the mobile phone as an example.
  • First step receiving the track information input by the user on the touch screen of the mobile phone toward the target direction, the track information includes the track information having the same starting point B and the track information BC 2 , and the track information Bd and the track information BC 2 should be synchronously input, ie The trajectory information Bd and the trajectory information BC 2 should not be two pieces of trajectory information input at intervals.
  • the trajectory information and the trajectory information BC 2 are used to indicate the range of the source of the sound signal to be collected, that is, the target A is located within the range of the trajectory information BC 1 and the trajectory information BC 2 ;
  • Step 2 prompting the user to confirm the specific trajectory information for controlling the beam pointing of the microphone array. If the user confirms that the specific trajectory information for controlling the beam pointing of the microphone array contains only one of the trajectory information Bd or the trajectory information BC 2 , Indicates that the user synchronizes the input track information and the track information BC 2 as an erroneous input, and subsequently obtains a signal delay according to the specific trajectory information.
  • the method is similar to that in the embodiment shown in FIG. 2, and will not be described again; if the user confirms that the specific trajectory information for controlling the beam pointing of the microphone array includes the trajectory information and the trajectory information BC 2 , the next step is performed;
  • the third step determining the coordinates (x b , y b ) of the start point B of the track information Bd and the track information BC 2 in the coordinate system of the mobile phone, and the coordinates of the end point of the track information ⁇ in the coordinate system of the mobile phone (x el , Yci ) and the coordinates (x c2 , y c2 ) of the end point C 2 of the trajectory information BC 2 in the coordinate system of the mobile phone, and the angle ⁇ 1 of the trajectory information and the axis coordinate of the coordinate system of the mobile phone in the X direction, and the trajectory information BC 2
  • the method of obtaining ⁇ and ⁇ 2 is not limited thereto;
  • the fourth step: the angle bisector BC 3 of the trajectory information and the trajectory information BC 2 and the angle X of the abscissa axis of the coordinate system are ⁇ 3 : , - 3 ⁇ 4
  • the sound signal collected by the microphone unit 1 ⁇ N included in the microphone array is centered on the angle bisector 8 3 as the main lobe width, that is, the range of the track information and the track information BC 2 is retained or enhanced.
  • the sound signal inside, the sound signal in other directions is suppressed.
  • the embodiment of the sound signal processing method for inputting a plurality of track information for indicating the target direction through the touch screen provides the user with an intuitive direction indication information input mode, and the user can conveniently provide the mobile phone according to the approximate orientation of the target. Trajectory information, so that the mobile phone determines the target direction according to the extending direction of the trajectory information provided by the user, thereby adjusting the beam direction of the sound signal processing array, so that the sound signal can be accurately processed in a noisy environment, and then The processed sound signal has an enhanced effect.
  • Receiving the direction indication information for indicating the target direction input by the user through the direction indication information input manner corresponding to the selected control mode including: receiving the gesture image information input by the user through the projection screen, and parsing the target indicated by the gesture image information Direction, and generating direction indication information for indicating a target direction; receiving a control instruction for indicating a target direction input by the user through an instruction input terminal device corresponding to the selected control mode, and the control instruction for indicating the target direction includes Direction information indicating the direction of the target.
  • the user can first slide on the projection screen with a finger or other pointing tool to form gesture image information, which is used to indicate the orientation of the speaker or other sound source to be collected, or to indicate the destination location of the sound transmission;
  • the camera can continuously take photos of the gesture image information that the user slides on the projection screen, collect feature information including position information, gray information and change information, and extract the feature information; and extract the extracted feature information and pre-established
  • the gesture information base is matched, and the image information corresponding to the gesture and the gesture corresponding to the feature information is searched; the signal delay corresponding to the sound processing unit included in the sound processing array is obtained according to the image information; and the sound signal is made according to the signal delay Delay processing.
  • Receiving direction indication information for indicating a target direction input by the user through the direction indication information input manner corresponding to the selected control mode comprising: receiving the user input by the instruction input terminal device corresponding to the selected control mode for indicating A control instruction of the target direction, the control instruction for indicating the target direction includes direction indication information for indicating the target direction.
  • the command input terminal device may be a wireless control device or a remote control device or the like.
  • the user can input a control command for indicating the target direction by using a terminal device separately provided with the sound signal processing array or other kinds of wireless control devices that can control the sound signal processing array to control the sound signal in a short-distance remote control or a remote control manner.
  • a terminal device separately provided with the sound signal processing array or other kinds of wireless control devices that can control the sound signal processing array to control the sound signal in a short-distance remote control or a remote control manner.
  • Processing the acquisition or transmission of sound signals by the array such as controlling the acquisition of sound signals by the microphone array.
  • FIG. 5 is a schematic diagram of a fifth embodiment of a sound signal processing method according to the present invention. As shown in FIG. 5, the embodiment shows a specific method for a short-distance remote control input control command, which may include:
  • the wireless control device establishes a wireless control connection with the receiving end.
  • the wireless control device establishes a wireless control connection with the receiving end according to the basic principle of the wireless remote control system, and the wireless control device can be a mobile phone or other terminal device having a wireless signal transmitting function, and the receiving end includes a microphone.
  • a sound signal processing array in the form of a wind array or a speaker array.
  • the wireless control device receives a control instruction input by the user for indicating a target direction.
  • the manner in which the user inputs the control commands may include various types, such as by voice indication input, or by touch screen input provided on the wireless control device, and the like.
  • the wireless control device sends a control command to the receiving end.
  • the wireless control device can encode, modulate, and amplify the control commands to the receiving end according to the basic principle of the wireless remote control system.
  • the receiving end processes the sound signal according to the received control instruction.
  • the microphone array After receiving the processed information including the control command sent by the wireless control device, the microphone array amplifies, detects, and decodes the information, and acquires the microphone array according to the obtained control command.
  • the signal delay corresponding to each microphone unit included in the microphone unit is subjected to corresponding delay processing for the collected sound signal to achieve the purpose of adjusting the acquisition pointing angle.
  • the wireless transmission mode between the wireless control device and the receiving end may be infrared or Bluetooth, and the present invention does not limit this.
  • FIG. 6 is a schematic diagram of Embodiment 6 of the sound signal processing method provided by the present invention. As shown in FIG. 6, the embodiment shows a specific method for remotely inputting a control command, which may include:
  • the remote control device establishes a connection with the receiving end via the Internet according to a communication protocol.
  • the remote control device may be a terminal device having a wireless communication function such as a computer or a mobile phone and capable of accessing the Internet, and the receiving end includes an audio signal processing array in the form of a microphone array or a speaker array.
  • the remote control device receives a control instruction input by the user for indicating a target direction.
  • the manner in which the user inputs the control commands may include a plurality of types, such as by voice indication input, or by a touch screen input provided on the remote control device, or by a wheel input provided on the remote control device, or the like.
  • the remote control device sends a control command to the receiving end.
  • the remote control device may package and transmit the data including the control instruction for indicating the target direction to the receiving according to the protocol.
  • the receiving end processes the sound signal according to the received control instruction. Taking the receiving end as a microphone array as an example, after receiving the processed information including the control instruction sent by the remote control device, the microphone array acquires the control instruction according to the protocol unpacking according to the protocol; the receiving end acquires the sound signal processing array according to the control instruction.
  • the signal delay corresponding to the processing unit included in the processing unit is subjected to corresponding delay processing for the sound signal to achieve the purpose of adjusting the acquisition or transmission of the pointing angle.
  • Manner 5 receiving direction indication information input by the user through the direction indication information input manner corresponding to the selected control mode, including: receiving scroll information input by the user through a keyboard wheel corresponding to the selected control mode, and using the scroll information to include In the direction indicating the direction of the target, it means no information.
  • the user can input the direction indication information in various manners, and is intended to provide the user with different selections of the input direction indication information, so as to increase the convenience of the user.
  • the method before receiving the direction indication information input by the user for indicating the target direction, the method further includes: changing a direction of the sensing sound signal processing array, and determining a change amount of the orientation; changing according to the orientation change amount and the orientation of the sound signal processing array The previous target direction determines the target direction after the orientation of the sound signal processing array is changed; prompting the user to input direction indication information for indicating the target direction after the change.
  • the embodiment may be applied to a sound signal processing device or a sound signal processing device where the sound signal processing array is provided with sensors such as a motion sensor, a position sensor, an angle sensor, and the like.
  • the sound signal processing array itself is a mobile phone with a three-axis gyroscope.
  • the specific embodiment may include:
  • Step 1 The user rotates the phone at a certain angle
  • Step 2 The three-axis gyroscope of the mobile phone senses the change of the position of the mobile phone, that is, the change of the orientation of the microphone array on the mobile phone, assuming that the orientation of the microphone array changes by an angle ⁇ before being rotated, that is, the change amount of the orientation of the microphone array is ⁇ ;
  • the third step according to the change amount ⁇ of the orientation, and the orientation of the sound signal processing array is changed before a target direction, determining a direction of the target relative to the rotated microphone array, that is, determining a target direction after the orientation of the sound signal processing array is changed;
  • Step 4 prompting the user to confirm the target direction after the orientation change of the sound signal processing array determined in the third step, the user inputting direction indication information indicating the target direction after the orientation change of the sound signal processing array, and the mobile phone according to the direction indication
  • the information adjusts the beam direction of the microphone array; of course, the mobile phone can also adjust the beam direction of the microphone array according to the target direction without determining by the user after determining the target direction.
  • the orientation change is sensed, and the target direction after the orientation change of the sound signal processing array is determined according to the change amount of the orientation, according to the sound.
  • the target signal direction of the signal processing array is processed to process the sound signal, so that the sound signal can be accurately processed in a noisy environment, thereby enhancing the processed sound signal.
  • control mode selected by the user is a scan mode
  • scan mode is a mode for prompting the user to input direction indication information by scanning; before receiving the direction indication information input by the user through the direction indication information input manner corresponding to the selected control mode,
  • the method further includes: collecting sound signals in at least two directions by scanning; analyzing energy of the sound signal, determining a sound source direction of the sound signal with the largest energy; and prompting the user to input direction indication information according to the direction of the sound source.
  • the sound signal processing array is a microphone array, and the sound signal processing unit of the microphone array is arranged in a line.
  • the specific implementation may include:
  • Step 1 The microphone array collects multiple sound signals in multiple directions.
  • the resolution ⁇ can be set as needed, and the resolution ⁇ is used to indicate the width of the beam of the sound signal processing array. If the resolution ⁇ is 30°, the respective sound signal processing units arranged in a line are in the range of 180° which can be oriented. Within the six directions of the beam acquisition, the sound signals in all directions can be collected to all the sound signals in the range of 180°. When collecting the sound signal in a certain direction, the beam can be kept pointing in the direction for a certain time, for example, 500ms.
  • Step 2 Analyze multiple sound signals in multiple directions. For example, the energy of the sound signal is calculated by using a frame of 20 ms, and the sound signal of 500 frames is acquired in each direction, and the sound signal of 25 frames is calculated, and the average value of the energy of the 25 frame sound signals in one direction is calculated, and all the six directions are sequentially calculated. The average value of the energy of the sound signal, and the direction in which the average value of the energy is the largest is the target direction.
  • Step 3 Prompt the user to input direction indication information according to the target direction determined in the second step.
  • Step 4 Adjust the beam pointing of the sound signal processing array according to the direction indication information.
  • the sound signal processing method provided in this embodiment acquires a target direction by collecting and analyzing sound signals in multiple directions, and prompts the user to input corresponding direction indication information for indicating the target direction, and processes the sound signal according to the direction indication information. Therefore, the sound signal can be accurately processed in a noisy environment, thereby enhancing the processed sound signal.
  • the control mode of the sound signal processing method in the embodiment of the present invention may further include other different kinds of modes other than the above modes for the user to select, and is intended to provide the user with various differences including the user autonomous control mode and the scan mode. Select to achieve a variety of ways to process the sound signal, while increasing the convenience of the user.
  • receiving the direction indication information input by the user for indicating the target direction may further include receiving direction indication information for indicating the target direction input by the user through the knob.
  • the knob may be a knob provided on a device for controlling the sound signal processing array, or may be a software-style knob on a device for controlling the sound signal processing array, which is not limited in the present invention. The way of inputting the direction indication through the knob enriches the implementation of the user's manipulation of the sound signal to process the array beam pointing.
  • FIG. 7 is a flowchart of a seventh embodiment of a sound signal processing method according to the present invention. As shown in FIG. 7, the sound signal processing method provided in this embodiment may include:
  • S710 Determine a receiving direction of the sound signal processing array according to the direction of the main sound source, and determine a target direction of the processing array according to the receiving direction and the beam pointing of the sound signal processing array.
  • the target direction refers to processing when the receiving direction of the processing array is consistent with the direction of the main sound source.
  • the orientation of the array determining an angle between the orientation of the sound signal processing array and the target direction;
  • the user in the situation that the beam direction of the sound signal processing array is fixed, the user may be prompted to adjust the receiving direction of the sound signal processing array according to the direction of the main sound source, thereby enhancing the processed sound signal. Effect.
  • Step 1 Fix the beam pointing of the microphone array, that is, keep the beam pointing of the microphone array unchanged with respect to the microphone array itself.
  • Figure 2 that is, ⁇ remains unchanged, and the beam pointing direction of the fixed microphone array can be controlled by software or fixed by hardware, for example, selecting a microphone array with directivity, such as a heart-shaped microphone, heart-shaped
  • the microphone works well for sound signal acquisition in a certain direction.
  • the invention does not limit the form of beam pointing of the fixed microphone array;
  • Step 2 Determine the target direction in an adaptive manner, and obtain an angle of the microphone array toward the desired adjustment according to the angle between the target direction and the fixed beam direction of the microphone array;
  • the third step prompting the user to input direction indication information according to the target direction, that is, prompting the user to adjust the orientation of the microphone array, and suggesting that the user adjust the angle of the microphone array in the prompt;
  • Step 4 The user adjusts the orientation of the microphone array according to the prompt. For example, if the microphone array is set on the mobile phone, the user can rotate the mobile phone according to the prompt given by the mobile phone, so that the beam direction of the microphone array of the mobile phone is directed. Target direction.
  • the user in the case where the beam direction of the sound signal processing array is fixed, the user may be prompted to adjust the direction of the sound signal processing array according to the target direction, thereby enhancing the processed sound signal.
  • FIG. 8 is a schematic structural diagram of Embodiment 1 of a sound signal processing device according to the present invention.
  • the sound signal processing device 800 provided in this embodiment may include: a receiving module 810, an adjusting module 820, a display module 830, and a sense.
  • the measurement module 840, the determination module 850, the prompting module 860, the scanning module 870, and the analysis module 880 specifically:
  • the receiving module 810 is configured to receive direction indication information input by the user for indicating a target direction.
  • the adjusting module 820 is configured to adjust a beam direction of the sound signal processing array to a state corresponding to the target direction according to the direction indication information.
  • the adjusting module 820 is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal processing unit in the sound signal processing array; and process the sound signal corresponding to the signal delay according to each signal delay The sound signal to be processed by the unit is subjected to delay processing to obtain the delayed processed sound signal, and the delayed processed sound signal is transmitted to the beamformer to adjust the beam pointing of the sound signal processing array to The state in which the target direction corresponds. Further, the adjustment module 820 is specifically configured to:
  • d n represents a distance between the sound signal processing unit n of one of the sound signal processing units included in the sound signal processing array arranged in a line array and the center of the sound signal processing array, and represents the sound signal processing array
  • the center points to an approximate angle between the target direction indicated by the direction indication information and a reference coordinate
  • c represents a speed of sound
  • f s represents a sound signal sampling frequency of the sound signal processing array
  • is a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit ⁇ , and ⁇ represents a frequency index.
  • adjustment module 820 is specifically configured to:
  • T M the signal delay corresponding to the Mth sound processing unit in the sound processing array
  • r m the target indicated by the direction indication information a distance from a certain target A in the direction to the mth sound processing unit
  • f a a sound signal sampling frequency of the sound processing array
  • c a sound speed
  • r a represents the distance between a certain target A and a coordinate origin of a reference coordinate in the target direction indicated by the direction indication letter
  • the coordinate origin of the reference coordinates is the circular arrangement
  • the center of the sound processing array b represents the radius of the sound processing array uniformly arranged in a ring shape
  • a m represents that the target A is at the level of the sound processing array
  • ai represents a connection between the projection of the target A on the plane of the sound processing array and the coordinate origin and the connection between the first sound processing unit and the coordinate origin The angle between the lines;
  • the sound signal corresponding to the sound signal processing unit ⁇ is subjected to delay processing, and the delayed sound signal can be expressed as :
  • ⁇ ⁇ ⁇ , ⁇ ⁇ ⁇ (i )* e- jiyr " or
  • ⁇ ⁇ ( ⁇ , ⁇ ) ⁇ ⁇ ( ⁇ , ⁇ ) * ⁇ -' ⁇
  • ( ⁇ is a frequency domain signal obtained by time-frequency changing the sound signal collected by the sound signal processing unit ⁇ , and ⁇ represents a frequency index.
  • the sound signal processing array includes a sound signal collecting array
  • the sound signal collecting array includes a plurality of sound signal collecting units.
  • the adjusting module 820 is specifically configured to: determine, according to the direction indication information, corresponding to each sound signal collecting unit in the sound signal collecting array Signal delay; according to each signal delay, delaying the sound signal collected by the sound signal acquisition unit corresponding to the signal delay; or the sound signal processing array includes a sound signal transmission array, and the sound signal is sent
  • the array includes a plurality of sound signal sending units.
  • the adjusting module 820 is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal sending unit in the sound signal sending array; The sound signal sent by the sound signal transmitting unit corresponding to the signal delay is delayed.
  • the receiving module 810 is further configured to: receive a control mode display instruction input by the user; the display module 830 is configured to: according to the control mode display instruction, obtain and display to the user, a plurality of control modes that are selectable, in the multiple control modes Each control mode corresponds to at least one direction indication information input manner; the receiving module 810 is further configured to: receive a control mode selection instruction sent by the user for multiple control modes, and enable a control mode selected by the user, and receive the user to pass The direction indication information corresponding to the selected control mode indicates the direction indication information for indicating the target direction.
  • the plurality of control modes include a user autonomous control mode, and the user autonomous control mode is a mode that allows the user to input the direction indication information autonomously; if the control mode selected by the user is the user autonomous control mode, the receiving module 810 is specifically configured to: receive the user The voice signal input by the voice input method is used to parse the voice signal to obtain the direction indication information included in the voice signal. Alternatively, the receiving module 810 is specifically configured to: receive the track information that the user slides on the touch screen by the touch input manner, and analyze the track. The target direction indicated by the information is generated, and the direction indication information is used to indicate the target direction.
  • the receiving module 810 is specifically configured to: receive the gesture image information input by the user through the projection screen, and parse the target direction indicated by the gesture image information, And generating the direction indication information to indicate the target direction; or, the receiving module 810 is specifically configured to: receive, by the instruction input terminal device corresponding to the selected control mode, a control instruction for indicating a target direction, The control instruction indicating the target direction includes direction indication information for indicating the target direction; or the receiving module 810 is specifically configured to: receive scroll information input by the user through a keyboard wheel corresponding to the selected control mode, and use the scroll information to include Direction indication information indicating the direction of the target.
  • the receiving module 810 is specifically configured to: receive a track information that the user slides on the touch screen; and according to a track information Sliding input start position information and end position information on the touch screen, determining a target direction indicated by the sliding input, and generating direction indication information indicating the target direction; or, the receiving module 810 is specifically configured to: receive the user to slide on the touch screen Two track information input; according to the start position information and the end position information of the sliding input included in the two track information on the touch screen, respectively determine the extending direction of the two tracks; the angle range defined according to the extending direction of the two tracks , determining a target direction indicated by the sliding input, and generating direction indication information indicating the target direction.
  • the receiving module 810 is specifically configured to: determine, according to the extending direction of the two trajectories, a direction in which the angle bisector of the angle formed by the two trajectories extends; according to the direction in which the bisector extends, determine that the target direction indicated by the sliding input is In the first angular range, the first angular range is centered on the direction in which the angle bisector extends, and the half of the angle formed by the two tracks is the main lobe width.
  • the sensing module 840 changes the orientation of the sensing sound signal processing array, and determines the orientation change amount
  • the determining module 850 configured to change the target direction according to the orientation change amount and the orientation of the sound signal processing array, Determine the orientation of the sound signal processing array The target direction after the change
  • the prompting module 860 for prompting the user to input direction indication information for indicating the target direction after the change.
  • control mode selected by the user is a scan mode
  • the scan mode is a mode for prompting the user to input direction indication information by scanning
  • the scan module 870 is configured to collect sound signals in at least two directions by scanning
  • 880 for analyzing the energy of the sound signal, determining the sound source direction of the sound signal with the largest energy
  • the prompting module 860 is further configured to prompt the user to input the direction indication information according to the direction of the sound source.
  • the foregoing embodiment of the sound signal processing device 800 provided in this embodiment may be used to perform the technical solution of the method embodiment shown in any of FIG. 1 to FIG. 6.
  • the implementation principle and technical effects are similar, and details are not described herein again.
  • FIG. 9 is a schematic structural diagram of a second embodiment of a sound signal processing device according to the present invention.
  • the sound signal processing device 900 provided in this embodiment may include:
  • a determining module 910 configured to determine a receiving direction of the sound signal processing array according to the direction of the primary sound source, and determine a target direction of the processing array according to the receiving direction and a beam direction of the sound signal processing array, where the target direction refers to Determining the orientation of the processing array when the receiving direction of the processing array is consistent with the direction of the main sound source; determining an angle between the orientation of the sound signal processing array and the target direction;
  • the prompting module 920 is configured to prompt the user to adjust the orientation of the sound signal processing array according to the angle, so that the receiving direction of the sound signal processing array is consistent with the direction of the main sound source.
  • the foregoing embodiment of the sound signal processing device 900 provided in this embodiment may be used to implement the technical solution of the method embodiment shown in FIG. 7.
  • the implementation principle and technical effects are similar, and details are not described herein again.
  • FIG. 10 is a schematic structural diagram of Embodiment 3 of a sound signal processing device according to the present invention.
  • the sound signal processing device 1000 provided in this embodiment may include:
  • the processor 1020 can be a central processing unit (CPU);
  • the memory 1010 may include a high speed random access memory, a nonvolatile solid storage device, or the like;
  • the memory controller 1015 can control access to the memory 1010 by other components of the sound signal processing device 1000, such as a processor, to call each module in the memory 1010 to perform a corresponding function;
  • the sound signal processing array 1050 can be a microphone array or a speaker array.
  • the beam pointing adjustment command receiver 1060 can be configured to receive direction indication information input by the user for indicating the direction of the target;
  • the processor 1020 is configured to adjust a beam direction of the sound signal processing array to a state corresponding to the target direction according to the direction indication information.
  • the processor 1020 is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal processing unit in the sound signal processing array; and process the sound signal corresponding to the signal delay according to each signal delay
  • the sound signal to be processed by the unit is subjected to delay processing to obtain the delayed processed sound signal, and the delayed processed sound signal is transmitted to the beamformer to adjust the beam pointing of the sound signal processing array to The state in which the target direction corresponds.
  • processor 1020 is specifically configured to:
  • d n represents a distance between the sound signal processing unit n of one of the sound signal processing units included in the sound signal processing array arranged in a line array and the center of the sound signal processing array, and represents the sound signal processing array
  • the center points to an approximate angle between the target direction indicated by the direction indication information and a reference coordinate
  • c represents a speed of sound
  • f s represents a sound signal sampling frequency of the sound signal processing array
  • the processor 1020 is specifically configured to:
  • r m represents a distance from a certain target A to the mth sound processing unit in a target direction indicated by the direction indication information
  • f a represents a sound of the sound processing array
  • the signal sampling frequency is, c represents the speed of sound;
  • r a represents the distance between a certain target A and a coordinate origin of a reference coordinate in the target direction indicated by the direction indication letter
  • the coordinate origin of the reference coordinates is the circular arrangement a center of the sound processing array
  • b represents a radius of the sound processing array uniformly arranged in a ring shape
  • a m represents a projection A of the target A on a plane of the sound processing array
  • the coordinate origin An angle between a line connecting and a line connecting the mth sound processing unit to the coordinate origin;
  • ai represents a connection between the projection of the target A on the plane of the sound processing array and the coordinate origin and the connection between the first sound processing unit and the coordinate origin The angle between the lines;
  • the sound signal corresponding to the sound signal processing unit ⁇ is subjected to delay processing, and the delayed sound signal can be expressed as :
  • the array includes a plurality of sound signal collecting units.
  • the processor 1020 is specifically configured to: determine, according to the direction indication information, a signal delay corresponding to each sound signal collecting unit in the sound signal collecting array; and delay the signal according to each signal delay The sound signal collected by the corresponding sound signal collecting unit is subjected to delay processing; or the sound signal processing array includes a sound signal transmitting array, and the sound signal transmitting array includes a plurality of sound signal transmitting units; correspondingly, the processor 1020 is specifically For: determining, according to the direction indication information, a signal delay corresponding to each sound signal sending unit in the sound signal sending array; delaying the sound signal sent by the sound signal sending unit corresponding to the signal delay according to each signal delay deal with.
  • the beam pointing adjustment command receiver 1060 is further configured to: receive a control mode display instruction input by the user; the processor 1020 is further configured to: according to the control mode display instruction, acquire and display to the user multiple control modes that are selectable, and Each of the control modes corresponds to at least one direction indication information input mode; the beam pointing adjustment command receiver 1060 is further configured to: receive a control mode selection instruction sent by the user for multiple control modes, and enable the user to The selected control mode receives direction indication information for indicating a target direction input by the user through a direction indication information input manner corresponding to the selected control mode.
  • the plurality of control modes include a user autonomous control mode, wherein the user autonomous control mode is a mode that allows the user to input the direction indication information autonomously; if the control mode selected by the user is the user autonomous control mode; the beam pointing adjustment command receiver 1060 is specifically used And receiving the voice signal input by the user through the voice input manner, and parsing the voice signal to obtain the direction indication information included in the voice signal; or, the beam pointing adjustment command receiver 1060 is specifically configured to: receive the user by touch input on the touch screen Sliding the input trajectory information, parsing the target direction indicated by the trajectory information, and generating direction indication information for indicating the target direction; or, the beam pointing adjustment command receiver 1060 is specifically configured to: receive a gesture image that the user projects the input through the projection screen The information, the target direction indicated by the gesture image information is parsed, and the direction indication information is used to indicate the target direction.
  • the user autonomous control mode is a mode that allows the user to input the direction indication information autonomously; if the control mode selected by the user is the
  • the beam pointing adjustment command receiver 1060 is specifically configured to: receive the user through the selected control mode.
  • the command input terminal device inputs a control command for indicating a target direction, and the control command for indicating the target direction includes direction indication information for indicating a target direction.
  • the beam pointing adjustment command receiver 1060 is specifically configured to: receive The scroll information included by the user through the keyboard wheel corresponding to the selected control mode includes scroll direction information indicating direction indication information indicating the target direction.
  • the beam pointing adjustment command receiver 1060 is specifically configured to: receive a track information that the user slides on the touch screen; The information includes the starting position information and the end position information of the sliding input on the touch screen, determining the target direction indicated by the sliding input, and generating direction indicating information for indicating the target direction; or, the beam pointing adjustment command receiver 1060 is specifically used.
  • the beam pointing adjustment command receiver 1060 is specifically configured to: determine a direction in which an angle bisector of the angle formed by the two tracks extends according to the extending direction of the two tracks; determine the sliding input according to the direction in which the angle bisector extends
  • the indicated target direction is in the first angular range, the first angular range is centered on the direction in which the angle bisector extends, and the half of the angle formed by the two tracks is the main lobe width.
  • the senor 1070 changes the orientation of the sensing sound signal processing array, and determines the orientation change amount; the processor 1020: determines, according to the orientation change amount and the target direction before the orientation of the sound signal processing array is changed, The target direction after the orientation of the sound signal processing array is changed; prompting the user to input direction indication information for indicating the target direction after the change.
  • control mode selected by the user is a scan mode
  • scan mode is a mode for prompting the user to input direction indication information by scanning
  • the sound signal processing array 1050 is configured to collect sound signals in at least two directions by scanning
  • the processor 1020 is further configured to analyze the energy of the sound signal, determine the sound source direction of the sound signal with the largest energy, and prompt the user to input the direction indication information according to the direction of the sound source.
  • FIG. 11 is a schematic structural diagram of Embodiment 4 of a sound signal processing device according to the present invention.
  • the sound signal processing device 1100 provided in this embodiment may include: Memory 1 110, memory controller 1115, processor 1 120, peripheral interface 1 130, audio subsystem 1 140, sound signal processing array 1150, and the like.
  • the processor 1 120 can be a central processing unit CPU;
  • the memory 1 110 may include a high speed random access memory, a nonvolatile solid storage device, etc.; the memory controller 1 115 may control access to the memory 11 10 by other components of the sound signal processing device 1100, such as a processor, to call the memory 11 Each module in 10 performs the corresponding function;
  • the sound signal processing array 1 150 may be a microphone array or a speaker array.
  • the processor 1120 is configured to determine a receiving direction of the sound signal processing array according to the direction of the primary sound source, and determine a target direction of the processing array according to the receiving direction and a beam direction of the sound signal processing array,
  • the target direction refers to the orientation of the processing array when the receiving direction of the processing array is consistent with the direction of the main sound source; determining the angle between the orientation of the sound signal processing array and the target direction; prompting the user to adjust the orientation of the sound signal processing array according to the angle So that the receiving direction of the sound signal processing array is consistent with the direction of the main sound source.
  • the sound signal processing method and device provided by the embodiments of the present invention can adjust the beam direction of the sound signal processing array according to the direction indication provided by the user for indicating the target direction, so that the noise direction can still be adjusted in a noisy environment.
  • the sound signal is accurately processed to enhance the processed sound signal.
  • the aforementioned program can be stored in a computer readable storage medium.
  • the program when executed, performs the steps including the above method embodiments; and the foregoing storage medium includes: a medium that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Otolaryngology (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

一种声音信号处理方法,包括:接收用户输入的用于指示目标方向的方向指示信息(S110);根据方向指示信息,将声音信号处理阵列的波束指向调整至与目标方向相对应的状态(S120)。一种声音信号处理设备,包括:接收模块(810),用于接收用户输入的用于指示目标方向的方向指示信息;调整模块(820),用于根据方向指示信息,将声音信号处理阵列的波束指向调整至与目标方向相对应的状态。该声音信号处理方法及设备,根据用户提供的用于指示目标方向的方向指示信息,对声音信号处理阵列的波束指向进行调整,从而在嘈杂环境下仍可以对声音信号进行准确处理,进而对所处理的声音信号起到增强的效果。

Description

声音信号处理方法及设备
技术领域
本发明实施例涉及通信技术, 尤其涉及一种声音信号处理方法及设 备。 背景技术
随着手机或电脑等各种终端设备在会议通讯或者声音收录等场景下 的普遍应用, 在终端设备上配置麦克风阵列进行声音信号的拾取, 或在终 端设备上配置扬声器阵列进行声音信号的放送, 已经成为研究与应用的热 点。
以终端设备中配置的麦克风阵列为例, 终端设备可以根据麦克风阵列 中的各个麦克风采集到的声音信号的强弱和各麦克风设置位置等因素, 进 行声源空间信息的计算, 再通过声源的空间信息估计出主声源所在位置, 并针对主声源位置进行自适应调整, 调整采集指向以采集主声源的声音波 束。 最终只保留或增强主声源的声音信号, 从而使主声源的声音能够更加 清晰地被采集或传送。
现有技术中, 如果在嘈杂环境下使用终端设备采集声音, 终端设备无 法准确估计主声源所在位置; 如果使用终端设备放送声音, 终端设备无法 获知需要将声音信号主要放送至哪个方向。所以终端设备无法对声音信号 进行准确的处理,无法实现对所处理的声音信号在主声源方向或主要放送 方向上的增强。 发明内容 本发明实施例提供一种声音信号处理方法及设备。
第一方面, 本发明实施例提供的一种声音信号处理方法, 包括: 接收用户输入的用于指示目标方向的方向指示信息; 根据所述方向指 示信息, 将声音信号处理阵列的波束指向调整至与所述目标方向相对应的 状态。 在第一方面的第一种可能的实现方式中, 所述根据所述方向指示信 息, 将声音信号处理阵列的波束指向调整至与所述目标方向相对应的状 态, 包括: 根据所述方向指示信息, 确定声音信号处理阵列中各个声音信 号处理单元所对应的信号延时; 根据各个所述信号延时, 对与所述信号延 时相对应的声音信号处理单元需要处理的声音信号进行延时处理以获取 经过延时处理的声音信号, 并将所述经延时处理的所述声音信号传输至波 束形成器以将所述声音信号处理阵列的波束指向调整至与所述目标方向 相对应的状态。
根据第一方面的第一种可能的实现方式, 在第一方面的第二种可能的 实现方式中, 根据所述方向指示信息, 确定声音信号处理阵列中各个声音 信号处理单元所对应的信号延时具体包括: 根据所述方向指示信息通过以 下公式获取声音信号处理阵列中各个声音处理单元 n对应的信号延时
Figure imgf000004_0001
其中, dn代表呈线形排列的所述声音信号处理阵列中包含的声音信号 处理单元之一的声音信号处理单元 n与声音信号处理阵列中心之间的距 离, 代表由所述声音信号处理阵列中心指向所述方向指示信息所指示的 所述目标方向与一参考坐标之间的近似夹角, c代表声速, fs代表声音信 号处理阵列的声音信号采样频率; 对应地, 所述根据各个所述信号延时, 对与所述信号延时相对应的声音信号处理单元需要处理的声音信号进行 延时处理以获取经过延时处理的声音信号具体包括: 根据所述声音信号处 理单元 n对应的信号延时 τη )对所述声音信号处理单元 n所采集的声音信 号做延时处理, 经延时处理后的声音信号可以表达为:
Υη (ω,β) = Ϋη { , )* Q im P)
Ϋη , 为将由声音信号处理单元 n采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
根据第一方面的第一种可能的实现方式, 在第一方面的第三种可能的 实现方式中, 根据所述方向指示信息, 确定声音信号处理阵列中各个声音 信号处理单元所对应的信号延时具体包括: 根据所述方向指示信息通过以 下公式获取声音信号处理阵列中各个声音处理单元对应的信号延时的集 合 τ或
! = τ2...,τΜ ri-ra f,r2-ra f RM "ra
¾f,(¾-¾) a ;2 (¾-¾), ■irM
Ψ = [ψ Ψι···, ΨτΆ =
其中, ΤΜ代表所^-声音处理阵列中第 Μ个声音处理单元对应的信号 延时, rm代表在所述方向指示信息所指示的目标方向上的某一目标 Α至所 述第 m个声音处理单元的距离, fa代表所述声音处理阵列的声音信号采样 频率为, c代表声速; 所述! "m通过以下公式获取:
rm= ra +b - 2brasinft;osam, mm == l1,,22..., M
其中, ra代表所述在所述方向指示信 所指示的目标方向上的某一目 标 A与一参考坐标的坐标原点之间的距离,所述参考坐标的坐标原点为呈 环形排列的所述声音处理阵列的圆心, b代表所述呈环形均匀排列的所述 声音处理阵列的半径, am代表所述某一目标 A在所述声音处理阵列所在平 面上的投影 A、与所述坐标原点之间的连线与所述第 m个声音处理单元至 所述坐标原点之间的连线之间的夹角; 所述 am通过以下公式获取:
2π(ηι -l) , ^ Λ .
am = a, + ~ -,m = 1,2..., M
m 1 M 其中, ai代表所述某一目标 A在所述声音处理阵列所在平面上的投影 与所述坐标原点之间的连线与所述第 1个声音处理单元至所述坐标原点 之间的连线之间的夹角; 对应地, 所述根据各个所述信号延时, 对与所述 信号延时相对应的声音信号处理单元需要处理的声音信号进行延时处理 以获取经过延时处理的声音信号具体包括: 根据所述信号延时集合 T或 ^ 中的信号延时 TMM将对应于所述声音信号处理单元 M所采集的声音 信号做延时处理, 经延时处理后的声音信号可以表达为:
ΥΜ {ω,β) = ΫΜ (i )*e-jiyr "或
ΥΜ {ω, β) = ΫΜ {ω, β) * e j M
Ϋη 为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, 《代表频率索引。
根据第一方面的第一种至第三种可能的实现方式中的任意一种, 在第 一方面的第四种可能的实现方式中, 所述声音信号处理阵列包括声音信号 采集阵列, 所述声音信号采集阵列包括多个声音信号采集单元; 对应地, 所述根据所述方向指示信息, 确定声音信号处理阵列中各个声音信号处理 单元所对应的信号延时, 包括: 根据所述方向指示信息, 确定声音信号采 集阵列中各个声音信号采集单元所对应的信号延时; 对应地, 所述根据各 个信号延时, 对与所述信号延时对应的声音信号处理单元需要处理的声音 信号进行延时处理, 包括: 根据各信号延时, 对与所述信号延时对应的声 音信号采集单元采集的声音信号进行延时处理; 或者, 所述声音信号处理 阵列包括声音信号发送阵列, 所述声音信号发送阵列包括多个声音信号发 送单元; 对应地, 所述根据所述方向指示信息, 确定声音信号处理阵列中 各个声音信号处理单元的所对应的信号延时, 包括: 根据所述方向指示信 息, 确定声音信号发送阵列中各个声音信号发送单元所对应的信号延时; 对应地, 所述根据各个信号延时, 对与所述信号延时对应的声音信号处理 单元需要处理的声音信号进行延时处理, 包括: 根据各信号延时, 对与所 述信号延时对应的声音信号发送单元发送的声音信号进行延时处理。
根据第一方面或第一方面的第一种至第三种可能的实现方式中的任 意一种, 在第一方面的第五种可能的实现方式中, 所述接收用户输入的用 于指示目标方向的方向指示信息之前, 还包括: 接收用户输入的控制模式 显示指令; 根据所述控制模式显示指令, 获取并显示可供选择的多种控制 模式, 所述多种控制模式中的每一种控制模式对应至少一种方向指示信息 输入方式; 接收用户发送的针对所述多种控制模式的控制模式选择指令, 并开启用户所选择的控制模式; 所述接收用户输入的用于指示目标方向的 方向指示信息, 包括: 接收用户通过与所选择的控制模式对应的方向指示 信息输入方式所输入的用于指示目标方向的所述方向指示信息。
根据第一方面的第五种可能的实现方式, 在第一方面的第六种可能的 实现方式中, 所述多种控制模式包括用户自主控制模式, 所述用户自主控 制模式为允许用户自主输入所述方向指示信息的模式; 若所述用户所选择 的控制模式为用户自主控制模式, 所述接收用户通过所选择的控制模式对 应的方向指示信息输入方式所输入的用于指示目标方向的所述方向指示 信息, 包括: 接收所述用户通过语音输入方式所输入的语音信号, 解析所 述语音信号以获取所述语音信号中包含的方向指示信息; 或者, 接收所述 用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 解析所述轨迹信息 所指示的目标方向,并生成用以指示所述目标方向的方向指示信息;或者, 接收所述用户通过投影屏幕投射输入的手势图像信息, 解析所述手势图像 信息所指示的目标方向, 并生成用以指示所述目标方向的方向指示信息; 或者, 接收所述用户通过与所选择的控制模式对应的指令输入终端装置输 入的用于指示目标方向的控制指令, 所述用于指示目标方向的控制指令包 括所述用于指示目标方向的方向指示信息; 或者, 接收所述用户通过与所 选择的控制模式对应的键盘滚轮输入的滚动信息, 所述用滚动信息包括所 述用于指示目标方向的方向指示信息。
根据第一方面的第六种可能的实现方式, 在第一方面的第七种可能的 实现方式中, 若所述用户选择的控制模式所对应的方式指示信息输入方式 为触摸输入方式, 其中: 接收所述用户以触摸输入方式在触摸屏上滑动输 入的轨迹信息,包括:接收所述用户在触摸屏上滑动输入的一条轨迹信息; 对应, 所述解析所述轨迹信息所指示的目标方向, 并生成用以指示所述目 标方向的方向指示信息包括: 根据所述一条轨迹信息所包含的所述滑动输 入在所述触摸屏上的起点位置信息和终点位置信息, 确定所述滑动输入所 指示的所述目标方向, 并生成用以指示所述目标方向的方向指示信息; 或 者,接收所述用户以触摸输入方式在触摸屏上滑动输入的轨迹信息,包括: 接收所述用户在触摸屏上滑动输入的两条轨迹信息; 对应, 所述解析所述 轨迹信息所指示的目标方向, 并生成用以指示所述目标方向的方向指示信 息, 包括: 根据所述两条轨迹信息所包含的所述滑动输入在所述触摸屏上 的起点位置信息和终点位置信息, 分别确定两条轨迹的延伸方向; 根据所 述两条轨迹的延伸方向所限定的角度范围, 确定所述滑动输入所指示的目 标方向, 并生成用以指示所述目标方向的方向指示信息。
根据第一方面的第七种可能的实现方式, 在第一方面的第八种可能的 实现方式中, 所述根据所述两条轨迹的延伸方向所限定的角度范围, 确定 所述滑动输入所指示的目标方向, 包括: 根据所述两条轨迹的延伸方向确 定所述两条轨迹所构成夹角的角平分线延伸的方向; 根据所述角平分线延 伸的方向, 确定所述滑动输入所指示的目标方向处于第一角度范围中, 所 述第一角度范围以所述角平分线延伸方向为中心, 以所述两条轨迹所构成 夹角的角度的一半为主瓣宽度。 根据第一方面或第一方面的前八种可能的实现方式之一, 在第一方面 的第九种可能的实现方式中, 在所述接收用户输入的用于指示目标方向的 方向指示信息之前,还包括:感测所述声音信号处理阵列的朝向发生改变, 并确定朝向变化量; 根据所述朝向变化量以及所述声音信号处理阵列的朝 向改变之前的目标方向, 确定所述声音信号处理阵列的朝向改变之后的目 标方向; 提示用户输入用于指示所述朝向改变之后的目标方向的所述方向 指示信息。
根据第一方面的第五种可能的实现方式, 在第一方面的第十种可能的 实现方式中, 所述用户所选择的控制模式为扫描模式, 所述扫描模式为通 过扫描提示用户输入所述方向指示信息的模式; 所述接收用户通过所选择 的控制模式对应的方向指示信息输入方式所输入的所述方向指示信息之 前, 还包括: 通过扫描采集至少两个方向上的声音信号; 分析所述声音信 号的能量, 确定能量最大的声音信号的声源方向; 提示用户根据所述声源 方向输入所述方向指示信息。
第二方面, 本发明实施例提供的另一种声音信号处理方法, 包括: 根据主声源方向确定声音信号处理阵列的接收方向, 根据所述接收方 向及所述声音信号处理阵列的波束指向确定所述处理阵列的目标方向,所 述目标方向指所述处理阵列的接收方向与主声源方向一致时所述处理阵 列的朝向; 确定所述声音信号处理阵列的朝向与所述目标方向之间的夹 角; 提示用户根据所述夹角调整所述声音信号处理阵列的朝向, 以使所述 声音信号处理阵列的接收方向与所述主声源方向一致。
第三方面, 本发明实施例提供的一种声音信号处理设备, 包括: 接收模块, 用于接收用户输入的用于指示目标方向的方向指示信息; 调整模块, 用于根据所述方向指示信息, 将声音信号处理阵列的波束指向 调整至与所述目标方向相对应的状态。
在第三方面的第一种可能的实现方式中, 所述调整模块具体用于: 根 据所述方向指示信息, 确定声音信号处理阵列中各个声音信号处理单元所 对应的信号延时; 根据各个所述信号延时, 对与所述信号延时相对应的声 音信号处理单元需要处理的声音信号进行延时处理以获取经过延时处理 的声音信号, 并将所述经延时处理的所述声音信号传输至波束形成器以将 所述声音信号处理阵列的波束指向调整至与所述目标方向相对应的状态。 根据第三方面的第一种可能的实现方式, 在第三方面的第二种可能的 实现方式中, 所述调整模块具体用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各 声音处理单元 n对应的信号延时 0^:
Figure imgf000009_0001
其中, dn代表呈线形排列的所述声音信号处理阵列中包含的声音信号 处理单元之一的声音信号处理单元 n与声音信号处理阵列中心之间的距 离, 代表由所述声音信号处理阵列中心指向所述方向指示信息所指示的 所述目标方向与一参考坐标之间的近似夹角, c代表声速, fs代表声音信 号处理阵列的声音信号采样频率;
根据所述声音信号处理单元 n对应的信号延时 τη0 ^对所述声音信号处 理单元 n所采集的声音信号做延时处理, 经延时处理后的声音信号可以表 达为:
Figure imgf000009_0002
Ϋη , 为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
根据第三方面的第一种可能的实现方式, 在第三方面的第三种可能的 实现方式中, 所述调整模块具体用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声
Figure imgf000009_0003
单元对应的信号 延时, rm代表在所述方向指示信息所指示的目标方向上的某一目标 A至所 述第 m个声音处理单元的距离, fa代表所述声音处理阵列的声音信号采样 频率为, c代表声速;
所述 rm通过以下公式获取:
rm = ra + b - 2brasinft;osam, m = 1,2. M
其中, ra代表所述在所述方向指示信 所指示的目标方向上的某一目 标 A与一参考坐标的坐标原点之间的距离,所述参考坐标的坐标原点为呈 环形排列的所述声音处理阵列的圆心, b代表所述呈环形均匀排列的所述 声音处理阵列的半径, am代表所述某一目标 A在所述声音处理阵列所在平 面上的投影 A、与所述坐标原点之间的连线与所述第 m个声音处理单元至 所述坐标原点之间的连线之间的夹角;
所述 am通过以下公式获取:
2π(ηι -l) , ^ ιί
am = a, + ~ ^ L , m = 1,2..., M
M 其中, ai代表所述某一目标 A在所述声音处理阵列所在平面上的投影 与所述坐标原点之间的连线与所述第 1个声音处理单元至所述坐标原点 之间的连线之间的夹角;
根据所述信号延时集合 T或 ^中的信号延时 TM或 ζ^ Μ将对应于所述声 音信号处理单元 Μ所采集的声音信号做延时处理, 经延时处理后的声音 信号可以表达为:
ΎΜ {ω, β) = ΫΜ 或
Figure imgf000010_0001
Ϋη 为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
根据第三方面的第一种至第三种可能的实现方式之一, 在第三方面的 第四种可能的实现方式中, 所述声音信号处理阵列包括声音信号采集阵 歹 所述声音信号采集阵列包括多个声音信号采集单元; 对应地, 所述调 整模块具体用于: 根据所述方向指示信息, 确定声音信号采集阵列中各个 声音信号采集单元所对应的信号延时; 根据各信号延时, 对与所述信号延 时对应的声音信号采集单元采集的声音信号进行延时处理; 或者, 所述声 音信号处理阵列包括声音信号发送阵列, 所述声音信号发送阵列包括多个 声音信号发送单元; 对应地, 所述调整模块具体用于: 根据所述方向指示 信息, 确定声音信号发送阵列中各个声音信号发送单元所对应的信号延 时; 根据各信号延时, 对与所述信号延时对应的声音信号发送单元发送的 声音信号进行延时处理。
根据第三方面或第三方面的第一种至第三种可能的实现方式, 在第三 方面的第五种可能的实现方式中, 所述接收模块还用于: 接收用户输入的 控制模式显示指令; 所述设备还包括: 显示模块, 用于根据所述控制模式 显示指令, 获取并向用户显示可供选择的多种控制模式, 所述多种控制模 式中的每一种控制模式对应至少一种方向指示信息输入方式; 所述接收模 块还用于: 接收用户发送的针对所述多种控制模式的控制模式选择指令, 并开启用户所选择的控制模式, 接收用户通过与所选择的控制模式对应的 方向指示信息输入方式所输入的用于指示目标方向的所述方向指示信息。
根据第三方面的第五种可能的实现方式, 在第三方面的第六种可能的 实现方式中, 所述多种控制模式包括用户自主控制模式, 所述用户自主控 制模式为允许用户自主输入所述方向指示信息的模式; 若所述用户所选择 的控制模式为用户自主控制模式; 所述接收模块具体用于: 接收所述用户 通过语音输入方式所输入的语音信号, 解析所述语音信号以获取所述语音 信号中包含的方向指示信息; 或者, 所述接收模块具体用于: 接收所述用 户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 解析所述轨迹信息所 指示的目标方向, 并生成用以指示所述目标方向的方向指示信息; 或者, 所述接收模块具体用于: 接收所述用户通过投影屏幕投射输入的手势图像 信息, 解析所述手势图像信息所指示的目标方向, 并生成用以指示所述目 标方向的方向指示信息; 或者, 所述接收模块具体用于: 接收所述用户通 过与所选择的控制模式对应的指令输入终端装置输入的用于指示目标方 向的控制指令, 所述用于指示目标方向的控制指令包括所述用于指示目标 方向的方向指示信息; 或者, 所述接收模块具体用于: 接收所述用户通过 与所选择的控制模式对应的键盘滚轮输入的滚动信息, 所述用滚动信息包 括所述用于指示目标方向的方向指示信息。
根据第三方面的第六种可能的实现方式, 在第三方面的第七种可能的 实现方式中, 若所述用户选择的控制模式所对应的方式指示信息输入方式 为触摸输入方式, 其中: 所述接收模块具体用于: 接收所述用户在触摸屏 上滑动输入的一条轨迹信息; 根据所述一条轨迹信息所包含的所述滑动输 入在所述触摸屏上的起点位置信息和终点位置信息, 确定所述滑动输入所 指示的目标方向, 并生成用以指示所述目标方向的方向指示信息; 或者, 所述接收模块具体用于: 接收所述用户在触摸屏上滑动输入的两条轨迹信 息; 根据所述两条轨迹信息所包含的所述滑动输入在所述触摸屏上的起点 位置信息和终点位置信息, 分别确定两条轨迹的延伸方向; 根据所述两条 轨迹的延伸方向所限定的角度范围, 确定所述滑动输入所指示的目标方 向, 并生成用以指示所述目标方向的方向指示信息。
根据第三方面的第七种可能的实现方式, 在第三方面的第八种可能的 实现方式中, 所述接收模块具体用于: 根据所述两条轨迹的延伸方向确定 所述两条轨迹所构成夹角的角平分线延伸的方向; 根据所述角平分线延伸 的方向, 确定所述滑动输入所指示的目标方向处于第一角度范围中, 所述 第一角度范围以所述角平分线延伸方向为中心, 以所述两条轨迹所构成夹 角的角度的一半为主瓣宽度。
根据第三方面或第一方面的前八种可能的实现方式之一, 在第三方面 的第九种可能的实现方式中, 还包括: 感测模块: 用于感测所述声音信号 处理阵列的朝向发生改变, 并确定朝向变化量; 确定模块: 用于根据所述 朝向变化量以及所述声音信号处理阵列的朝向改变之前的目标方向, 确定 所述声音信号处理阵列的朝向改变之后的目标方向; 提示模块: 用于提示 用户输入用于指示所述朝向改变之后的目标方向的所述方向指示信息。
根据第三方面的第五种可能的实现方式, 在第三方面的第十种可能的 实现方式中, 所述用户所选择的控制模式为扫描模式, 所述扫描模式为通 过扫描提示用户输入所述方向指示信息的模式; 对应地, 还包括: 扫描模 块, 用于通过扫描采集至少两个方向上的声音信号; 分析模块, 用于分析 所述声音信号的能量, 确定能量最大的声音信号的声源方向; 所述提示模 块还用于提示用户根据所述声源方向输入所述方向指示信息。
第四方面, 本发明实施例提供的另一种声音信号处理设备, 包括: 确定模块, 用于根据主声源方向确定声音信号处理阵列的接收方向, 根据所述接收方向及所述声音信号处理阵列的波束指向确定所述处理阵 列的目标方向,所述目标方向指所述处理阵列的接收方向与主声源方向一 致时所述处理阵列的朝向; 确定所述声音信号处理阵列的朝向与所述目标 方向的夹角, 所述声音信号处理阵列的波束指向固定; 提示模块, 用于提 示用户根据所述夹角调整所述声音信号处理阵列的朝向, 以使所述声音信 号处理阵列的接收方向与所述主声源方向一致。
本发明实施例提供的声音信号处理方法及设备, 根据用户提供的用于 指示目标方向的方向指示, 对声音信号处理阵列的波束指向进行调整, 从 而在嘈杂环境下仍可以对声音信号进行准确的处理, 进而对所处理的声音 信号起到增强的效果。 本发明另一实施例提供的声音信号处理方法及设 备, 可根据主声源方向确定信号处理阵列的接收方向, 并根据所述接收方 向及所述声音信号处理阵列的波束指向来确定处理阵列的目标方向, 以及 所述处理阵列的朝向和所述目标方向之间的夹角来提示用户通过调整所 述声音信号处理阵列的朝向以使所述声音信号处理阵列的接收方向与所 述主声源方向一致, 从而提高信号处理阵列对主声源估计的精度。 附图说明 为了更清楚地说明本发明实施例或现有技术中的技术方案, 下面将对 实施例或现有技术描述中所需要使用的附图作一简单地介绍, 显而易见 地, 下面描述中的附图是本发明的一些实施例, 对于本领域普通技术人员 来讲, 在不付出创造性劳动性的前提下, 还可以根据这些附图获得其他的 附图。
图 1为本发明提供的声音信号处理方法实施例一的流程图; 图 2为本发明提供的声音信号处理方法实施例二的示意图; 图 3为本发明提供的声音信号处理方法实施例三的示意图; 图 4为本发明提供的声音信号处理方法实施例四的示意图; 图 5为本发明提供的声音信号处理方法实施例五的示意图; 图 6为本发明提供的声音信号处理方法实施例六的示意图; 图 7为本发明提供的声音信号处理方法实施例七的流程图; 图 8为本发明提供的声音信号处理设备实施例一的结构示意图; 图 9为本发明提供的声音信号处理设备实施例二的结构示意图; 图 10为本发明提供的声音信号处理设备实施例三的结构示意图; 图 1 1为本发明提供的声音信号处理设备实施例四的结构示意图。 具体实施方式 为使本发明实施例的目的、 技术方案和优点更加清楚, 下面将结合本 发明实施例中的附图, 对本发明实施例中的技术方案进行清楚、 完整地描 述, 显然,所描述的实施例是本发明一部分实施例, 而不是全部的实施例。 基于本发明中的实施例, 本领域普通技术人员在没有作出创造性劳动前提 下所获得的所有其他实施例, 都属于本发明保护的范围。
图 1为本发明提供的声音信号处理方法实施例一的流程图, 如图 1所 示, 本实施例提供的声音信号处理方法可以包括:
S 1 10、 接收用户输入的用于指示目标方向的方向指示信息。
具体而言, 在本发明实施例应用于声音信号采集阵列采集声音信号的 情况下, 例如是麦克风阵列采集声音信号的情况下, 目标方向可以是主要 声源的方向或者其它麦克风阵列需要着重采集的声音信号所处的方向; 而 在本发明实施例应用于声音信号发送阵列发送声音信号的情况下, 例如是 扬声器阵列发送声音信号的情况下, 目标方向则可以是声音信号发送目的 地的方向。本发明实施例将声音信号采集阵列和声音信号发送阵列等统称 为声音信号处理阵列。 后续实施例多以将本发明应用于声音信号采集阵列 采集声音信号的情况为例对本发明实施方式加以说明, 本领域技术人员可 以在此基础上无需花费创造性劳动地得出本发明实施例应用于声音信号 发送阵列发送声音信号的情况下的实施方式。
用于执行声音信号处理方法的装置可以接收用户输入的用于指示目 标方向的方向指示信息, 该方向指示信息可以是用户通过语音、 手势或旋 转旋钮等多种方式输入的能够指示目标方向的信息。
S 120、 根据方向指示信息, 将声音信号处理阵列的波束指向调整至与 目标方向相对应的状态。
声音信号处理阵列的波束指向, 可以理解为经声音信号处理阵列处理 后所加强的声音信号对应的来源或者传输方向。例如麦克风阵列的波束指 向, 可以理解为经麦克风阵列增强处理后所主要采集到的声音信号的来源 方向; 扬声器阵列的波束指向, 可以理解为经扬声器阵列增强处理后声音 信号放送的主要方向。 声音信号处理装置可以根据用于指示目标方向的方 向指示信息, 将声音信号处理阵列的波束指向调整至与目标方向相对应的 状态, 例如将麦克风阵列的波束指向调整至所需采集声源所在的方向, 从 而对采集到的声音信号进行准确的处理; 或者也可以使将麦克风阵列的波 束指向调整至与所采集声源所在的方向相对应的状态, 即与所采集声源所 在的方向相同或者与所采集声源所在的方向之间的偏差在预设或者合理 的范围之内。
本实施例提供的声音信号处理方法, 根据用户提供的用于指示目标方 向的方向指示信息, 对声音信号处理阵列的波束指向进行调整, 从而在嘈 杂环境下仍可以对声音信号进行准确的处理, 进而对所处理的声音信号起 到增强的效果。
图 1实施例中, 根据方向指示信息, 将声音信号处理阵列的波束指向 调整至与目标方向相对应的状态, 包括: 根据方向指示信息, 确定声音信 号处理阵列中各个声音信号处理单元所对应的信号延时; 根据各个信号延 时, 对与信号延时相对应的声音信号处理单元需要处理的声音信号进行延 时处理以获取经过延时处理的声音信号, 并将所述经延时处理的所述声音 信号传输至波束形成器以将声音信号处理阵列的波束指向调整至与目标 方向相对应的状态。
声音信号处理阵列包括声音信号采集阵列, 声音信号采集阵列包括多 个声音信号采集单元; 对应地, 根据方向指示信息, 确定声音信号处理阵 列中各个声音信号处理单元所对应的信号延时,包括:根据方向指示信息, 确定声音信号采集阵列中各个声音信号采集单元所对应的信号延时; 对应 地, 根据各个信号延时, 对与信号延时对应的声音信号处理单元需要处理 的声音信号进行延时处理, 包括: 根据各信号延时, 对与信号延时对应的 声音信号采集单元采集的声音信号进行延时处理; 或者, 所述声音信号处 理阵列包括声音信号发送阵列, 所述声音信号发送阵列包括多个声音信号 发送单元; 对应地, 根据方向指示信息, 确定声音信号处理阵列中各个声 音信号处理单元的所对应的信号延时, 包括: 根据方向指示信息, 确定声 音信号发送阵列中各个声音信号发送单元所对应的信号延时; 对应地, 根 据各个信号延时, 对与信号延时对应的声音信号处理单元需要处理的声音 信号进行延时处理, 包括: 根据各信号延时, 对与信号延时对应的声音信 号发送单元发送的声音信号进行延时处理。
换言之, 根据方向指示信息, 确定声音信号处理阵列中各声音信号处 理单元所对应的信号延时, 例如可以根据用于指示目标方向的方向指示信 息, 以及麦克风阵列中包含的声音信号处理单元即麦克风单元的排列位置 等信息, 获取各麦克风单元所对应的信号延时; 根据各个信号延时, 对与 信号延时对应的声音信号处理单元需要处理的声音信号进行延时处理, 例 如获取麦克风阵列中包含的各麦克风单元对应的信号延时后, 可以根据各 信号延时对与其对应的麦克风单元采集的声音信号做延时处理, 以实现麦 克风单元仅对采集到的在目标方向上的声音进行保留或增强的目的。
根据方向指示信息, 确定声音信号处理阵列中各声音信号处理单元所 对应的信号延时, 例如可以根据用于指示目标方向的方向指示信息, 以及 扬声器阵列中包含的声音信号处理单元即扬声器单元的排列位置等信息, 获取各扬声器单元所对应的信号延时; 根据各信号延时, 对与信号延时对 应的声音信号处理单元需要处理的声音信号进行延时处理, 例如获取扬声 器阵列中包含的各扬声器单元对应的信号延时后, 可以根据各信号延时对 与其对应的扬声器单元将要发送的声音信号做延时处理, 以实现扬声器单 元仅对发送给目标方向上的声音进行保留或增强的目的。
本实施例提供的声音信号处理方法, 根据用户提供的用于指示目标方 向的方向指示信息, 首先确定声音信号延时, 再根据声音信号延时对声音 信号进行延时处理, 以此对声音信号处理阵列的波束指向进行调整, 从而 在嘈杂环境下仍可以对声音信号进行准确的处理, 进而对所处理的声音信 号起到增强的效果。
本发明实施例提供的声音信号处理方法中, 根据各方向指示信息, 确 定声音信号处理阵列中各声音信号处理单元对应的信号延时, 可以包括: 根据各方向指示信息确定声音信号处理阵列中包含的呈线形排列或者环 形排列的各声音信号处理单元的信号延时。
以声音信号处理阵列为麦克风阵列为例, 根据方向指示信息获取麦克 风阵列中包含的麦克风单元对应的信号延时可以包括: 根据方向指示信息 获取麦克风阵列中包含的一个以上的麦克风单元对应的信号延时; 麦克风 单元可以呈线形、 环形或者其它不规则形状等多种形式的排列。
具体地, 图 2为本发明提供的声音信号处理方法实施例二的示意图, 如图 2所示, 如果采用手机作为声音信号采集设备, 以手机的麦克风阵列 中包含的麦克风单元 1~N呈线形排列为例,其中以手机的中心点为坐标原 点 0, 目标为 A。 首先, 手机接收用户在触摸屏上滑动输入的方向指示信息 BC, 方向 指示信息 BC用于指示目标 A的方向, 方向指示信息 BC与横坐标轴 X方 向的夹角为 α,麦克风阵列中心 D指向目标 Α的方向与横坐标 X方向的夹 角为 β, 由于目标 Α与手机的距离远大于麦克风阵列本身的尺寸, 所以可 以近似认为 α=β。 呈线形排列的麦克风阵列中包含的麦克风单元之一的麦 克风单元 η与麦克风阵列中心 D之间的距离为 dn。则麦克风单元 n对应的 信号延时 ^)可以表达为:
TM = ^→x fs
C
( 1 ) , 式 (1 ) 中, c代表声速, fs代表麦克风阵列的声音信号采样频率。 获得麦克风单元 n对应的信号延时 τη0 ^后, 根据该信号延时对与该信 号延时对应的麦克风单元 n采集的声音信号做延时处理, 经延时处理后的 声音信号可以表达为:
Υη [ω,β、 = Ϋ„ [ω,β)* e j (β) 式 ( 2 ) ,
式 (2 ) 中, Ϋη , 为将采集到的声音信号进行时频变化后得到的频 域信号, ^代表频率索引。
将经延时处理后的声音信号送入波束形成器, 即可达到根据方向指示 信息 BC , 对麦克风阵列的波束指向进行调整以保留或增强源自目标 Α的 声音信号的目的。
3为本发明提供的声音信号处理方法实施例三的示意图, 如图 3所 示, 如果仍采用手机作为声音信号采集设备, 但以手机的麦克风阵列中包 含的 M个麦克风单元呈环形均匀排列为例, 其中以手机的中心点为坐标 原点 0, 目标为 A, M个麦克风单元组成的环形阵列的半径为 b, 目标 A 与坐标原点 0的连线与垂直于麦克风阵列所在平面的直线之间的夹角为 Θ , 目标 A至坐标原点 0的距离为 ra, 麦克风阵列的声音信号采样频率为 fa, 声速为 c, 目标 A在麦克风阵列所在平面上的投影为 A、 那么将 OA、 与麦克风单元 1至坐标原点 0连线的夹角定义为 ap则 OA与麦克风单元 m至坐标原点 0连线的夹角 am可以表示为: 2π(ηι - 1)
a„, = a, + m = l,2..., M
M
(3) , 则目标 A至麦克 m的距离 rm可以表示为
rm = -J ΑΑΊ +|mA'|
Figure imgf000018_0001
τ = [η, τ2,···,τΜ]: 式(5) 根据所述信号延时集合 T或 中的信号延时 TMM将对应于所述声 音信号处理单元 M所采集的声音信号做延时处理, 经延时处理后的声音 信号可以表达为:
ΎΜ {ω,β) = ΎΜ , *e r "或
ΥΜ (ω,β) = ΫΜ (ω,β)*^-ίωψΜ
( ^为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。 将经延时处理后的声音信号送入波 束形成器, 即可达到, 对麦克风阵列的波束指向进行调整以保留或增强源 自目标 Α处声音信号的目的。
图 2和图 3提供的声音信号处理方法实施例分别以麦克风单元呈线形 和环形排列为例, 对本发明提供实施例提供的声音信号处理方法加以说 明, 需要明确的是, 本发明实施例应用场景不以此为限, 计算方式也不以 此为限。 例如在图 3实施例中, 除了以基于样点的方式计算信号延时, 还 可以以相位差的形式表示信号延时如下:
Ψ = [ψι> Ψι···, ]:
Figure imgf000018_0002
(6) , 图 2和图 3提供的声音信号处理方法实施例, 根据用户自主提供的方 号进行相应的延时处理, 对麦克风阵列的波束指向进行调整, 从而在嘈杂 环境下仍可以对声音信号进行准确的处理, 进而对所处理的声音信号起到 增强的效果。
进一步地, 接收用户输入的用于指示目标方向的方向指示信息之前, 还包括: 接收用户输入的控制模式显示指令; 根据控制模式显示指令, 获 取并向用户显示可供选择的多种控制模式, 多种控制模式中的每一种控制 模式对应至少一种方向指示信息输入方式; 接收用户发送的控制模式选择 指令, 开启用户所选择的控制模式; 接收用户输入的用于指示目标方向的 方向指示信息, 包括: 接收用户通过与所选择的控制模式对应的方向指示 信息输入方式所输入的用于指示目标方向的方向指示信息。
换言之, 本发明实施例可以为用户提供用于控制声音信号处理方式的 多种控制模式供用户选择。 声音信号处理设备接收到用户输入的控制模式 显示指令后, 可以获取能够为用户提供的多种控制模式并向用户进行显 示。 控制模式可以有一种或多种的方向指示信息输入方式, 即用户在该种 控制模式下可以通过一种或多种方式输入用于指示目标方向的方向指示 信息。 用户选择好控制模式后, 声音信号处理设备开启用户所选择的控制 模式, 用户通过在所选择的控制模式下的方向指示信息输入方式输入方向 指示信息, 用以控制声音信号的处理。 本发明实施例为用户提供多种控制 模式用以控制声音信号的处理, 旨在为用户提供多种不同选择, 以实现多 种对声音信号进行处理的方式, 同时增加用户使用的方便性。
用户所选择的控制模式为用户自主控制模式, 用户自主控制模式为允 许用户自主输入方向指示信息的模式; 则可以包括下列几种方式对声音信 号进行处理:
方式一:
接收用户通过所选择的控制模式对应的方向指示信息输入方式所输 入的用于指示目标方向的方向指示信息, 包括: 接收用户通过语音输入方 式所输入的语音信号, 解析语音信号以获取语音信号中包含的方向指示信 息。
用户自主控制模式下, 用户可以通过语音输入用于指示目标方向的语 音信号, 可以通过语音识别器来实现该语音信号的识别, 语音识别器在词 表设计和模型训练的过程中, 将表示方位或角度的词语进行加强训练, 确 保对类似词汇的识别率在 95%以上, 经相关验证, 目前小词汇量的关键词 语的识别率可以达到 95%以上。 例如, 以手机作为声音采集设备为例, 用 户可以说出期望手机的麦克风阵列波束指向的目标方向, 比如为"右前方 三十五度"。 手机的语音识别器接收到该语音信号后, 根据预先的约定规 贝 |J, 解析获取该语音信号中"右"、 "前"以及"三十五度"分别表示的方位, 从而准确判断出该语音信号中包含的目标方向信息。将声音信号处理阵列 的波束指向调整至目标方向信息对应的目标方向。
这种通过语音输入用于指示目标方向的语音信号的声音信号处理方 法的实施方式, 无需基于用户的手动输入操作, 可以根据用户自主提供的 语音信号, 获取麦克风阵列中包含的各麦克风单元对应的信号延时, 通过 对麦克风单元采集的声音信号进行相应的延时处理, 实现麦克风单元对语 音信号所指示的方向上的声音信号的保留或增强, 进而在嘈杂环境下仍可 以对所采集的声音信号起到增强的效果。
方式二:
接收用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 解析轨迹 信息所指示的目标方向, 并生成用以指示目标方向的方向指示信息。 接收 用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 包括: 接收用户在 触摸屏上滑动输入的轨迹信息; 对应, 解析轨迹信息所指示的目标方向, 并生成用以指示目标方向的方向指示信息包括: 根据轨迹信息所包含的滑 动输入在触摸屏上的起点位置信息和终点位置信息, 确定滑动输入所指示 的目标方向, 并生成用以指示目标方向的方向指示信息; 或者, 接收用户 以触摸输入方式在触摸屏上滑动输入的轨迹信息, 包括: 接收用户在触摸 屏上滑动输入的两条轨迹信息; 对应, 解析轨迹信息所指示的目标方向, 并生成用以指示目标方向的方向指示信息, 包括: 根据两条轨迹信息所包 含的滑动输入在触摸屏上的起点位置信息和终点位置信息, 分别确定两条 轨迹的延伸方向, 根据两条轨迹的延伸方向所限定的角度范围, 确定滑动 输入所指示的目标方向, 并生成用以指示目标方向的方向指示信息。 具体 地:
以图 2为例, 以用户在手机的触摸屏上朝向目标方向滑动输入方向指 示信息, 也就是一条轨迹信息 BC来控制麦克风阵列对目标 A的声音信号 的采集为例, 对本实施例提供的声音信号处理方法做详细说明:
第一步: 接收用户在手机的触摸屏上朝向目标方向滑动输入的一条轨 迹信息 BC;
第二步: 确定轨迹信息 BC的起点 B在手机的坐标系中的坐标 (xb, yb) , 以及轨迹信息 BC的终点 C在手机的坐标系中的坐标 (xc, yc) , 获取轨迹信息 BC与手机的坐标系的横坐标轴 X方向的夹角 a: - arctan ^c ^b
( 7) , α的获取方法不以此为限, 例如用户通过语音信号或通过旋钮来输入 轨迹信息 BC时, 即可以根据预定义的语音指示规则获取 a, 或者根据旋 钮刻度值直接获知 α;
第三步: 得到麦克风阵列中心 D指向目标 Α的方向与横坐标轴 X方 向的夹角 β; 由于目标 Α与手机的距离远大于麦克风阵列本身的尺寸, 所 以可以近似认为 β=α;
第四步: 根据麦克风阵列中心 D指向目标 Α的方向与横坐标轴 X方 向的夹角 β,获取麦克风阵列中包含的麦克风单元 n对应的信号延时 0^ :
TM = ^→x fs
C
( 1 ) , 式 (1 ) 中, 呈线形排列的麦克风阵列中包含的麦克风单元之一麦克 风单元 n与麦克风阵列中心 D之间的距离为 dn, c代表声速, 代表麦克 风阵列的声音信号采样频率;
第五步: 可以对麦克风阵列采集到的声音信号进行预处理, 包括降噪 和回声抑制等, 再将声音信号时频变化得到频域信号 Ϋη ^, , 其中《代表 频率索引,对采集到的声音信号做延时处理,经延时处理后的声音信号为:
Υη [ω,β、 = Ϋ„ [ω,β)* e- (β)
(2 ) ;
第六步: 将经延时处理后的声音信号送入波束形成器, 即可达到根据 轨迹信息 BC对麦克风阵列的波束指向进行调整, 以保留或增强源自目标 A的声音信号的目的。
在执行上述的实施方式之前,或者得到麦克风阵列中心 D指向目标 A 的方向与横坐标轴 X方向的夹角 β之后, 可以向用户提供确认提示, 提示 用户确认是否改变用于指示目标方向的方向指示信息, 或者提示用户确认 新输入的方向指示信息是否正确等, 以避免用户对输入方向指示信息的误 启动或误操作。
这种通过触摸屏输入用于指示目标方向的轨迹信息的声音信号处理 方法的实施方式, 为用户提供了一种直观的方向指示信息输入方式, 用户 可以根据目标方向简便的向手机提供轨迹信息, 以使手机根据用户提供的 轨迹信息,确定目标方向,从而对声音信号处理阵列的波束指向进行调整, 从而在嘈杂环境下仍可以对声源信号进行准确的处理, 进而对所处理的声 音信号起到增强的效果。
图 4为本发明提供的声音信号处理方法实施例四的示意图, 如图 4所 示, 本实施例提供的声音信号处理方法与上一实施例的区别在于, 本实施 例提供的声音信号处理方法中接收用户输入的通过触摸屏输入的用于指 示目标方向的轨迹信息可以包括两条或两条以上的轨迹信息。
具体而言, 以用户在手机的触摸屏上朝向目标方向滑动输入轨迹信息 和轨迹信息 8(¾来控制麦克风阵列的波束指向为例, 对本实施例提供 的声音信号处理方法做详细说明:
第一步: 接收用户在手机的触摸屏上朝向目标方向滑动输入的轨迹信 息, 轨迹信息包含具有相同起点 B的轨迹信息 和轨迹信息 BC2, 轨迹 信息 Bd和轨迹信息 BC2应被同步输入, 即轨迹信息 Bd和轨迹信息 BC2 不应是先后间隔输入的两条轨迹信息。轨迹信息 和轨迹信息 BC2用于 指示需采集的声音信号来源的范围,即目标 A位于轨迹信息 BC1和轨迹信 息 BC2夹角的范围内;
第二步: 提示用户确认用于控制麦克风阵列的波束指向的具体轨迹信 息, 如果用户确认用于控制麦克风阵列的波束指向的具体轨迹信息只包含 轨迹信息 Bd或轨迹信息 BC2的其中之一, 表示用户同步输入轨迹信息 和轨迹信息 BC2为误输入, 后续根据具体轨迹信息获取信号延时的方 法与图 2所示实施例中类似, 不再赘述; 如果用户确认用于控制麦克风阵 列的波束指向的具体轨迹信息包含轨迹信息 和轨迹信息 BC2, 那么执 行下一步;
第三步:确定轨迹信息 Bd和轨迹信息 BC2的起点 B在手机的坐标系 中的坐标 (xb, yb) , 以及轨迹信息 的终点 ^在手机的坐标系中的坐 标 (xel, yci ) 和轨迹信息 BC2的终点 C2在手机的坐标系中的坐标 (xc2, yc2 ) , 获取轨迹信息 与手机的坐标系的横坐标轴 X方向的夹角 αι, 以 及轨迹信息 BC2与手机的坐标系的横坐标轴 X方向的夹角 α2 : ax = arctan y"1 ^b
( 8 ) , a2 = arctan— ~― 式
( 9 ) ,
^和 α2的获取方法不以此为限;
第四步:轨迹信息 和轨迹信息 BC2的角平分线 BC3与坐标系的横 坐标轴 X方向的夹角 α3 : , =— ¾
2
( 10 ) , 对麦克风阵列中包含的麦克风单元 1~N采集到的声音信号,以角平分 线 8 3为中心, 作为主瓣宽度, 即保留或增强处于轨迹信息 和轨迹信息 BC2夹角范围内的声音信号, 处于其它方位的声音信号被 抑制。
这种通过触摸屏输入多条用于指示目标方向的轨迹信息的声音信号 处理方法的实施方式, 为用户提供了一种直观的方向指示信息输入方式, 用户可以根据目标的大致方位简便的向手机提供轨迹信息, 以使手机根据 用户提供的轨迹信息的延伸方向, 确定目标方向, 从而对声音信号处理阵 列的波束指向进行调整, 从而在嘈杂环境下仍可以对声音信号进行准确的 处理, 进而对所处理的声音信号起到增强的效果。 方式三:
接收用户通过所选择的控制模式对应的方向指示信息输入方式所输 入的用于指示目标方向的方向指示信息, 包括: 接收用户通过投影屏幕投 射输入的手势图像信息, 解析手势图像信息所指示的目标方向, 并生成用 以指示目标方向的方向指示信息; 接收用户通过与所选择的控制模式对应 的指令输入终端装置输入的用于指示目标方向的控制指令, 用于指示目标 方向的控制指令包括用于指示目标方向的方向指示信息。
用户首先可以用手指或其它指示工具在投影屏幕上滑动, 形成手势图 像信息, 用以指示发言人或其它需采集的声源所处的方位, 或者指示声音 发送的目的地方位; 投影设备上配备的摄像头可以对用户在投影屏幕上滑 动的手势图像信息进行连续拍照, 采集包括位置信息、 灰度信息以及变化 信息等在内的特征信息, 并提取这些特征信息; 将提取的特征信息与预先 建立的手势信息库进行匹配, 查找与特征信息最接近的手势及手势对应的 图像信息; 根据图像信息获取声音处理阵列中包含的声音处理单元对应的 信号延时; 再根据信号延时对声音信号做延时处理。
方式四:
接收用户通过所选择的控制模式对应的方向指示信息输入方式所输 入的用于指示目标方向的方向指示信息, 包括: 接收用户通过与所选择的 控制模式对应的指令输入终端装置输入的用于指示目标方向的控制指令, 用于指示目标方向的控制指令包括用于指示目标方向的方向指示信息。
指令输入终端装置可以是无线控制装置或远程控制装置等。
用户可以通过与声音信号处理阵列分离设置的终端设备或其它种类 的可以控制声音信号处理阵列的无线控制装置, 以近距离遥控或远程控制 的方式输入用于指示目标方向的控制指令, 以控制声音信号处理阵列对声 音信号的采集或发送, 例如控制麦克风阵列对声音信号的采集。
图 5为本发明提供的声音信号处理方法实施例五的示意图, 如图 5所 示, 本实施例示出了近距离遥控输入控制指令的具体方法, 可以包括:
S510、 无线控制装置与接收端建立无线控制连接。 其中, 无线控制装 置按照无线遥控系统的基本原理与接收端建立无线控制连接, 无线控制装 置可以为手机或其它具有无线信号发射功能的终端设备, 接收端包括麦克 风阵列或者扬声器阵列等形式的声音信号处理阵列。
S520、 无线控制装置接收用户输入的用于指示目标方向的控制指令。 用户输入控制指令的方式可以包括多种, 例如通过语音指示输入, 或通过 设置在无线控制装置上的触摸屏输入等。
S530、 无线控制装置将控制指令发送给接收端。 无线控制装置可以按 照无线遥控系统的基本原理, 将控制指令进行编码、 调制以及放大后发送 给接收端。
S540、 接收端根据接收到的控制指令对声音信号进行处理。 以接收端 为麦克风阵列为例, 麦克风阵列通过天线接收到无线控制装置发送的经过 处理的包含控制指令的信息后, 对该信息进行放大、 检波以及解码, 根据 解码后得到的控制指令获取麦克风阵列中包含的各麦克风单元对应的信 号延时, 并对采集到的声音信号做相应的延时处理, 以实现调整采集指向 角度的目的。其中无线控制装置与接收端之间的无线传输方式可以采用红 外或蓝牙等, 本发明对此不做限制。
用户可以通过与声音信号处理阵列分离设置的终端设备或其它种类 的可以控制声音信号处理阵列的远程控制装置, 以远程控制的方式输入用 于指示目标方向的控制指令, 以控制声音信号处理阵列对声音信号的采集 或发送,例如控制麦克风阵列对声音信号的采集。以通过互联网控制为例: 图 6为本发明提供的声音信号处理方法实施例六的示意图, 如图 6所 示, 本实施例示出了远程输入控制指令的具体方法, 可以包括:
S610、 远程控制装置按照通信协议通过互联网与接收端建立连接。 其 中, 远程控制装置可以为电脑或手机等具有无线通信功能并可以接入互联 网的终端设备, 接收端包括麦克风阵列或者扬声器阵列等形式的声音信号 处理阵列。
S620、 远程控制装置接收用户输入的用于指示目标方向的控制指令。 用户输入控制指令的方式可以包括多种, 例如通过语音指示输入, 或通过 设置在远程控制装置上的触摸屏输入, 或通过设置在远程控制装置上的滚 轮输入等。
S630、 远程控制装置将控制指令发送给接收端。 远程控制装置可以按 照协议将包含用于指示目标方向的控制指令的数据进行打包发送给接收 S640、 接收端根据接收到的控制指令对声音信号进行处理。 以接收端 为麦克风阵列为例, 麦克风阵列通过天线接收到远程控制装置发送的经过 处理的包含控制指令的信息后, 按照协议对数据解包获取控制指令; 接收 端根据控制指令获取声音信号处理阵列中包含的处理单元对应的信号延 时, 并对声音信号做相应的延时处理, 以实现调整采集或发送指向角度的 目的。
方式五: 接收用户通过所选择的控制模式对应的方向指示信息输入方 式所输入的方向指示信息, 包括: 接收用户通过与所选择的控制模式对应 的键盘滚轮输入的滚动信息, 用滚动信息包括用于指示目标方向的方向指 不信息
上述实施例中提供的声音信号处理方法中, 用户可以通过多种方式输 入方向指示信息, 旨在为用户提供输入方向指示信息的不同选择, 以增加 用户使用的方便性。
进一步地, 接收用户输入的用于指示目标方向的方向指示信息之前, 还包括: 感测声音信号处理阵列的朝向发生改变, 并确定朝向变化量; 根 据朝向变化量以及声音信号处理阵列的朝向改变之前的目标方向, 确定声 音信号处理阵列的朝向改变之后的目标方向; 提示用户输入用于指示朝向 改变之后的目标方向的方向指示信息。
详细而言, 此实施方式可以应用于, 在声音信号处理阵列或者声音信 号处理阵列所在的声音信号处理设备上配备了运动感测器、 位置感测器、 角度感测器等传感器, 具备感测声音信号处理阵列自身朝向变化功能的场 旦 以声音信号处理设备为配置有三轴陀螺仪的手机为例, 具体实施方式 可以包括:
第一步: 用户将手机旋转一定角度;
第二步: 手机的三轴陀螺仪感测手机的位置变化, 即手机上麦克风阵 列朝向的变化, 假设麦克风阵列朝向相对于未被旋转前变化了角度 φ, 即 麦克风阵列的朝向变化量为 φ;
第三步: 根据朝向变化量 φ, 以及声音信号处理阵列的朝向改变之前 的目标方向, 确定目标相对于旋转后的麦克风阵列的方向, 即确定声音信 号处理阵列的朝向改变之后的目标方向;
第四步: 提示用户确认第三步中确定的声音信号处理阵列的朝向改变 之后的目标方向, 用户输入用于指示声音信号处理阵列的朝向改变之后的 目标方向的方向指示信息, 手机根据方向指示信息对麦克风阵列的波束指 向进行调整; 当然, 手机也可以在确定目标方向后不经用户确认, 直接根 据目标方向对麦克风阵列的波束指向进行调整。
本实施例提供的声音信号处理方法, 在用户对声音信号处理阵列朝向 改变的情况下, 感测该朝向改变, 并根据该朝向变化量确定声音信号处理 阵列的朝向改变之后的目标方向, 根据声音信号处理阵列的朝向改变之后 的目标方向对声音信号进行处理, 从而在嘈杂环境下仍可以对声音信号进 行准确的处理, 进而对所处理的声音信号起到增强的效果。
进一步地, 用户所选择的控制模式为扫描模式, 扫描模式为通过扫描 提示用户输入方向指示信息的模式; 接收用户通过所选择的控制模式对应 的方向指示信息输入方式所输入的方向指示信息之前, 还包括: 通过扫描 采集至少两个方向上的声音信号; 分析声音信号的能量, 确定能量最大的 声音信号的声源方向; 提示用户根据声源方向输入方向指示信息。
以声音信号处理阵列为麦克风阵列, 麦克风阵列的声音信号处理单元 呈线形排列为例, 具体实施方式可以包括:
第一步: 麦克风阵列采集多个方向上的多个声音信号。 可以根据需要 设定分辨率 Δ , 分辨率 Δ用于表示声音信号处理阵列的波束的宽度, 如果 分辨率 Δ为 30° , 则呈线形排列的各声音信号处理单元在其可以朝向的 180°范围内, 波束采集均布的 6个方向上的声音信号, 即可采集到 180° 范围内的所有声音信号。 采集某一方向上的声音信号时, 可以保持波束指 向该方向一定时间, 例如 500ms等。
第二步: 对采集到的多个方向上的多个声音信号进行分析。 例如, 以 20ms为一帧计算声音信号的能量, 每一方向上采集 500ms声音信号则对 应 25帧的声音信号, 计算 1个方向上 25个帧声音信号的能量平均值, 依 次计算全部 6个方向上的声音信号的能量平均值, 确定能量平均值最大的 方向为目标方向。 第三步: 提示用户根据第二步中确定的目标方向输入方向指示信息。 第四步: 根据方向指示信息, 对声音信号处理阵列的波束指向进行调 整。
本实施例提供的声音信号处理方法, 通过采集和分析多个方向上的声 音信号获取目标方向, 并提示用户输入相应的用于指示目标方向的方向指 示信息, 根据方向指示信息对声音信号进行处理, 从而在嘈杂环境下仍可 以对声音信号进行准确的处理, 进而对所处理的声音信号起到增强的效 果。
本发明实施例中声音信号处理方法的控制模式还可以包括除上述模 式之外的其它不同种类的模式以供用户选择, 旨在为用户提供包括用户自 主控制模式和扫描模式在内的多种不同选择, 以实现多种对声音信号进行 处理的方式, 同时增加用户使用的方便性。
进一步地, 接收用户输入的用于指示目标方向的方向指示信息, 还可 以包括接收用户通过旋钮输入的用于指示目标方向的方向指示信息。 旋钮 可以是设置在用于控制声音信号处理阵列的设备上的实体的旋钮, 也可以 是用于控制声音信号处理阵列的设备上的软件形式的旋钮, 本发明对此不 作限制。 采用通过旋钮输入方向指示的方式, 丰富了用户操控声音信号处 理阵列波束指向的实现方式。
图 7为本发明提供的声音信号处理方法实施例七的流程图, 如图 7所 示, 本实施例提供的声音信号处理方法可以包括:
S710、 根据主声源方向确定声音信号处理阵列的接收方向, 根据接收 方向及声音信号处理阵列的波束指向确定处理阵列的目标方向,目标方向 指处理阵列的接收方向与主声源方向一致时处理阵列的朝向; 确定声音信 号处理阵列的朝向与目标方向之间的夹角;
S720、 提示用户根据夹角调整声音信号处理阵列的朝向, 以使声音信 号处理阵列的接收方向与主声源方向一致。
本实施例提供的声音信号处理方法, 在声音信号处理阵列的波束指向 固定的情境下, 可以根据主声源方向提示用户调整声音信号处理阵列的接 收方向, 从而对所处理的声音信号起到增强的效果。
图 7所示实施例的具体实现方式可以包括: 第一步: 固定麦克风阵列的波束指向, 即保持麦克风阵列的波束指向 相对于麦克风阵列自身保持不变。 以图 2为例, 即 β保持不变, 固定麦克 风阵列的波束指向的方式可以采用软件控制方式, 也可以采用硬件方式固 定, 例如选择带有指向性的麦克风阵列, 比如心形麦克风, 心形麦克风对 于某一个方向上的声音信号采集效果较好。 本发明对固定麦克风阵列的波 束指向所采用的形式不做限制;
第二步: 以自适应方式确定目标方向, 根据目标方向和麦克风阵列被 固定的波束指向的夹角, 得到麦克风阵列朝向所需调整的角度;
第三步: 提示用户根据目标方向输入方向指示信息, 即提示用户调整 麦克风阵列的朝向, 提示中给出建议用户调整麦克风阵列的角度;
第四步: 用户根据提示对麦克风阵列的朝向进行调整, 例如如果麦克 风阵列设置在手机上, 用户可以根据手机给出的提示将手机进行相应角度 的旋转, 以使手机的麦克风阵列的波束指向朝向目标方向。
本实施例提供的声音信号处理方法, 在声音信号处理阵列的波束指向 固定的情境下, 可以根据目标方向提示用户调整声音信号处理阵列的朝 向, 从而对所处理的声音信号起到增强的效果。
图 8为本发明提供的声音信号处理设备实施例一的结构示意图, 如图 8所示, 本实施例提供的声音信号处理设备 800可以包括: 接收模块 810、 调整模块 820、显示模块 830、感测模块 840、确定模块 850、提示模块 860、 扫描模块 870以及分析模块 880, 具体地:
接收模块 810, 用于接收用户输入的用于指示目标方向的方向指示信 息;
调整模块 820, 用于根据方向指示信息, 将声音信号处理阵列的波束 指向调整至与目标方向相对应的状态。
进一步地, 调整模块 820具体用于: 根据方向指示信息, 确定声音信 号处理阵列中各个声音信号处理单元所对应的信号延时; 根据各个信号延 时, 对与信号延时相对应的声音信号处理单元需要处理的声音信号进行延 时处理以获取经过延时处理的声音信号, 并将所述经延时处理的所述声音 信号传输至波束形成器以将声音信号处理阵列的波束指向调整至与目标 方向相对应的状态。 进一步地, 所述调整模块 820具体用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声音处理单元 n对应的信号延时 0^ :
C 其中, dn代表呈线形排列的所述声音信号处理阵列中包含的声音信号 处理单元之一的声音信号处理单元 n与声音信号处理阵列中心之间的距 离, 代表由所述声音信号处理阵列中心指向所述方向指示信息所指示的 所述目标方向与一参考坐标之间的近似夹角, c代表声速, fs代表声音信 号处理阵列的声音信号采样频率;
根据所述声音信号处理单元 n对应的信号延时 τη0 ^对所述声音信号处 理单元 n所采集的声音信号做延时处理, 经延时处理后的声音信号可以表 达为:
Figure imgf000030_0001
Ϋη ,^为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
进一步地, 所述调整模块 820具体用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声音处理单元对应的信号延时的集合 Τ或
τ = [τ, τ2...,τΜ 其中, TM代表所述声音处理阵列中第 M个声普处理单元对应的信号 延时, rm代表在所述方向指示信息所指示的目标方向上的某一目标 A至所 述第 m个声音处理单元的距离, fa代表所述声音处理阵列的声音信号采样 频率为, c代表声速;
所述 rm通过以下公式获取:
rm = ^ra 2 + b2 - 2brasinft;osam, m = 1,2…, M
其中, ra代表所述在所述方向指示信 所指示的目标方向上的某一目 标 A与一参考坐标的坐标原点之间的距离,所述参考坐标的坐标原点为呈 环形排列的所述声音处理阵列的圆心, b代表所述呈环形均匀排列的所述 声音处理阵列的半径, am代表所述某一目标 A在所述声音处理阵列所在平 面上的投影 A、与所述坐标原点之间的连线与所述第 m个声音处理单元至 所述坐标原点之间的连线之间的夹角;
所述 am通过以下公式获取:
2π(ηι - l) ^
am = a, + ~ -,m = 1,2..., M
M 其中, ai代表所述某一目标 A在所述声音处理阵列所在平面上的投影 与所述坐标原点之间的连线与所述第 1个声音处理单元至所述坐标原点 之间的连线之间的夹角;
根据所述信号延时集合 T或 ^中的信号延时 TMM将对应于所述声 音信号处理单元 Μ所采集的声音信号做延时处理, 经延时处理后的声音 信号可以表达为:
ΥΜ {ω,β) = ΫΜ (i )* e-jiyr "或
ΥΜ (ω,β) = ΫΜ (ω,β) * ^-'ωψΜ
Ϋη ( ^为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
进一步地, 声音信号处理阵列包括声音信号采集阵列, 声音信号采集 阵列包括多个声音信号采集单元; 调整模块 820具体用于: 根据方向指示 信息, 确定声音信号采集阵列中各个声音信号采集单元所对应的信号延 时; 根据各信号延时, 对与信号延时对应的声音信号采集单元采集的声音 信号进行延时处理;或者,所述声音信号处理阵列包括声音信号发送阵列, 所述声音信号发送阵列包括多个声音信号发送单元;对应地,调整模块 820 具体用于: 根据方向指示信息, 确定声音信号发送阵列中各个声音信号发 送单元所对应的信号延时; 根据各信号延时, 对与信号延时对应的声音信 号发送单元发送的声音信号进行延时处理。
进一步地,接收模块 810还用于:接收用户输入的控制模式显示指令; 显示模块 830, 用于根据控制模式显示指令, 获取并向用户显示可供选择 的多种控制模式, 多种控制模式中的每一种控制模式对应至少一种方向指 示信息输入方式; 接收模块 810还用于: 接收用户发送的针对多种控制模 式的控制模式选择指令, 并开启用户所选择的控制模式, 接收用户通过与 所选择的控制模式对应的方向指示信息输入方式所输入的用于指示目标 方向的方向指示信息。 进一步地, 多种控制模式包括用户自主控制模式, 用户自主控制模式 为允许用户自主输入方向指示信息的模式; 若用户所选择的控制模式为用 户自主控制模式, 接收模块 810具体用于: 接收用户通过语音输入方式所 输入的语音信号, 解析语音信号以获取语音信号中包含的方向指示信息; 或者, 接收模块 810具体用于: 接收用户以触摸输入方式在触摸屏上滑动 输入的轨迹信息, 解析轨迹信息所指示的目标方向, 并生成用以指示目标 方向的方向指示信息; 或者, 接收模块 810具体用于: 接收用户通过投影 屏幕投射输入的手势图像信息, 解析手势图像信息所指示的目标方向, 并 生成用以指示目标方向的方向指示信息; 或者, 接收模块 810具体用于: 接收用户通过与所选择的控制模式对应的指令输入终端装置输入的用于 指示目标方向的控制指令, 用于指示目标方向的控制指令包括用于指示目 标方向的方向指示信息; 或者, 接收模块 810具体用于: 接收用户通过与 所选择的控制模式对应的键盘滚轮输入的滚动信息, 用滚动信息包括用于 指示目标方向的方向指示信息。
进一步地, 若用户选择的控制模式所对应的方式指示信息输入方式为 触摸输入方式, 其中: 接收模块 810具体用于: 接收用户在触摸屏上滑动 输入的一条轨迹信息; 根据一条轨迹信息所包含的滑动输入在触摸屏上的 起点位置信息和终点位置信息, 确定滑动输入所指示的目标方向, 并生成 用以指示目标方向的方向指示信息; 或者, 接收模块 810具体用于: 接收 用户在触摸屏上滑动输入的两条轨迹信息; 根据两条轨迹信息所包含的滑 动输入在触摸屏上的起点位置信息和终点位置信息, 分别确定两条轨迹的 延伸方向; 根据两条轨迹的延伸方向所限定的角度范围, 确定滑动输入所 指示的目标方向, 并生成用以指示目标方向的方向指示信息。
进一步地, 接收模块 810具体用于: 根据两条轨迹的延伸方向确定两 条轨迹所构成夹角的角平分线延伸的方向; 根据角平分线延伸的方向, 确 定滑动输入所指示的目标方向处于第一角度范围中, 第一角度范围以角平 分线延伸方向为中心, 以两条轨迹所构成夹角的角度的一半为主瓣宽度。
进一步地, 感测模块 840 : 用于感测声音信号处理阵列的朝向发生改 变, 并确定朝向变化量; 确定模块 850 : 用于根据朝向变化量以及声音信 号处理阵列的朝向改变之前的目标方向, 确定声音信号处理阵列的朝向改 变之后的目标方向; 提示模块 860: 用于提示用户输入用于指示朝向改变 之后的目标方向的方向指示信息。
进一步地, 用户所选择的控制模式为扫描模式, 扫描模式为通过扫描 提示用户输入方向指示信息的模式; 对应地, 扫描模块 870, 用于通过扫 描采集至少两个方向上的声音信号; 分析模块 880, 用于分析声音信号的 能量, 确定能量最大的声音信号的声源方向; 提示模块 860还用于提示用 户根据声源方向输入方向指示信息。
本实施例提供的声音信号处理设备 800的上述实施方式可以用于执行 图 1至图 6任一所示方法实施例的技术方案,其实现原理和技术效果类似, 此处不再赘述。
图 9为本发明提供的声音信号处理设备实施例二的结构示意图, 如图 9所示, 本实施例提供的声音信号处理设备 900可以包括:
确定模块 910, 用于根据主声源方向确定声音信号处理阵列的接收方 向, 根据所述接收方向及所述声音信号处理阵列的波束指向确定所述处理 阵列的目标方向,所述目标方向指所述处理阵列的接收方向与主声源方向 一致时所述处理阵列的朝向; 确定声音信号处理阵列的朝向与目标方向的 夹角;
提示模块 920,用于提示用户根据夹角调整声音信号处理阵列的朝向, 以使声音信号处理阵列的接收方向与主声源方向一致。
本实施例提供的声音信号处理设备 900的上述实施方式可以用于执行 图 7所示方法实施例的技术方案, 其实现原理和技术效果类似, 此处不再 赘述。
图 10为本发明提供的声音信号处理设备实施例三的结构示意图, 如 图 10所示, 本实施例提供的声音信号处理设备 1000可以包括:
存储器 1010、 存储器控制器 1015、 处理器 1020、 外围接口 1030、 音 频子系统 1040、 声音信号处理阵列 1050、 波束指向调整指令接收器 1060 以及传感器 1070等。
处理器 1020可以为中央处理器 CPU ( Central Processing Unit, 简称 CPU) ;
存储器 1010可包括高速随机存取存储器、 非易失性固体存储设备等; 存储器控制器 1015可控制声音信号处理设备 1000的诸如处理器等其 它组件对存储器 1010的访问, 以调用存储器 1010中的各模块执行相应功 能;
声音信号处理阵列 1050可以为麦克风阵列或者扬声器阵列。具体地: 波束指向调整指令接收器 1060,可以用于接收用户输入的用于指示目 标方向的方向指示信息;
处理器 1020, 可以用于根据方向指示信息, 将声音信号处理阵列的波 束指向调整至与目标方向相对应的状态。
进一步地, 处理器 1020具体用于: 根据方向指示信息, 确定声音信 号处理阵列中各个声音信号处理单元所对应的信号延时; 根据各个信号延 时, 对与信号延时相对应的声音信号处理单元需要处理的声音信号进行延 时处理以获取经过延时处理的声音信号, 并将所述经延时处理的所述声音 信号传输至波束形成器以将声音信号处理阵列的波束指向调整至与目标 方向相对应的状态。
进一步地, 所述处理器 1020具体用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声音处理单元 n对应的信号延时 0^ :
C 其中, dn代表呈线形排列的所述声音信号处理阵列中包含的声音信号 处理单元之一的声音信号处理单元 n与声音信号处理阵列中心之间的距 离, 代表由所述声音信号处理阵列中心指向所述方向指示信息所指示的 所述目标方向与一参考坐标之间的近似夹角, c代表声速, fs代表声音信 号处理阵列的声音信号采样频率;
根据所述声音信号处理单元 n对应的信号延时 τη0 ^对所述声音信号处 理单元 n所采集的声音信号做延时处理, 经延时处理后的声音信号可以表 达为:
Ϋη , 为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。 进一步地, 所述处理器 1020具体用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个
Figure imgf000035_0001
处理单元对应的信号 延时, rm代表在所述方向指示信息所指示的目标方向上的某一目标 A至所 述第 m个声音处理单元的距离, fa代表所述声音处理阵列的声音信号采样 频率为, c代表声速;
所述 rm通过以下公式获取:
Figure imgf000035_0002
其中, ra代表所述在所述方向指示信 所指示的目标方向上的某一目 标 A与一参考坐标的坐标原点之间的距离,所述参考坐标的坐标原点为呈 环形排列的所述声音处理阵列的圆心, b代表所述呈环形均匀排列的所述 声音处理阵列的半径, am代表所述某一目标 A在所述声音处理阵列所在平 面上的投影 A、与所述坐标原点之间的连线与所述第 m个声音处理单元至 所述坐标原点之间的连线之间的夹角;
所述 am通过以下公式获取:
2π(ηι - l) ^
am = a, + ~ -, m = 1,2..., M
M 其中, ai代表所述某一目标 A在所述声音处理阵列所在平面上的投影 与所述坐标原点之间的连线与所述第 1个声音处理单元至所述坐标原点 之间的连线之间的夹角;
根据所述信号延时集合 T或 ^中的信号延时 TMM将对应于所述声 音信号处理单元 Μ所采集的声音信号做延时处理, 经延时处理后的声音 信号可以表达为:
ΥΜ (^, ^) = ΫΜ (i )*EJ 或
Figure imgf000035_0003
Ϋη ,^为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。 阵列包括多个声音信号采集单元; 处理器 1020具体用于: 根据方向指示 信息, 确定声音信号采集阵列中各个声音信号采集单元所对应的信号延 时; 根据各信号延时, 对与信号延时对应的声音信号采集单元采集的声音 信号进行延时处理;或者,所述声音信号处理阵列包括声音信号发送阵列, 所述声音信号发送阵列包括多个声音信号发送单元; 对应地, 处理器 1020 具体用于: 根据方向指示信息, 确定声音信号发送阵列中各个声音信号发 送单元所对应的信号延时; 根据各信号延时, 对与信号延时对应的声音信 号发送单元发送的声音信号进行延时处理。
进一步地, 波束指向调整指令接收器 1060还用于: 接收用户输入的 控制模式显示指令; 处理器 1020还用于根据控制模式显示指令, 获取并 向用户显示可供选择的多种控制模式, 多种控制模式中的每一种控制模式 对应至少一种方向指示信息输入方式; 波束指向调整指令接收器 1060还 用于: 接收用户发送的针对多种控制模式的控制模式选择指令, 并开启用 户所选择的控制模式, 接收用户通过与所选择的控制模式对应的方向指示 信息输入方式所输入的用于指示目标方向的方向指示信息。
进一步地, 多种控制模式包括用户自主控制模式, 用户自主控制模式 为允许用户自主输入方向指示信息的模式; 若用户所选择的控制模式为用 户自主控制模式; 波束指向调整指令接收器 1060具体用于: 接收用户通 过语音输入方式所输入的语音信号, 解析语音信号以获取语音信号中包含 的方向指示信息; 或者, 波束指向调整指令接收器 1060具体用于: 接收 用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 解析轨迹信息所指 示的目标方向, 并生成用以指示目标方向的方向指示信息; 或者, 波束指 向调整指令接收器 1060具体用于: 接收用户通过投影屏幕投射输入的手 势图像信息, 解析手势图像信息所指示的目标方向, 并生成用以指示目标 方向的方向指示信息; 或者, 波束指向调整指令接收器 1060具体用于: 接收用户通过与所选择的控制模式对应的指令输入终端装置输入的用于 指示目标方向的控制指令, 用于指示目标方向的控制指令包括用于指示目 标方向的方向指示信息; 或者, 波束指向调整指令接收器 1060具体用于: 接收用户通过与所选择的控制模式对应的键盘滚轮输入的滚动信息, 用滚 动信息包括用于指示目标方向的方向指示信息。 进一步地, 若用户选择的控制模式所对应的方式指示信息输入方式为 触摸输入方式, 其中: 波束指向调整指令接收器 1060具体用于: 接收用 户在触摸屏上滑动输入的一条轨迹信息; 根据一条轨迹信息所包含的滑动 输入在触摸屏上的起点位置信息和终点位置信息, 确定滑动输入所指示的 目标方向, 并生成用以指示目标方向的方向指示信息; 或者, 波束指向调 整指令接收器 1060具体用于: 接收用户在触摸屏上滑动输入的两条轨迹 信息; 根据两条轨迹信息所包含的滑动输入在触摸屏上的起点位置信息和 终点位置信息, 分别确定两条轨迹的延伸方向; 根据两条轨迹的延伸方向 所限定的角度范围, 确定滑动输入所指示的目标方向, 并生成用以指示目 标方向的方向指示信息。
可选地, 波束指向调整指令接收器 1060具体用于: 根据两条轨迹的 延伸方向确定两条轨迹所构成夹角的角平分线延伸的方向; 根据角平分线 延伸的方向, 确定滑动输入所指示的目标方向处于第一角度范围中, 第一 角度范围以角平分线延伸方向为中心, 以两条轨迹所构成夹角的角度的一 半为主瓣宽度。
可选地, 传感器 1070: 用于感测声音信号处理阵列的朝向发生改变, 并确定朝向变化量; 处理器 1020 : 用于根据朝向变化量以及声音信号处理 阵列的朝向改变之前的目标方向, 确定声音信号处理阵列的朝向改变之后 的目标方向; 提示用户输入用于指示朝向改变之后的目标方向的方向指示 信息。
可选地, 用户所选择的控制模式为扫描模式, 扫描模式为通过扫描提 示用户输入方向指示信息的模式; 对应地, 声音信号处理阵列 1050, 用于 通过扫描采集至少两个方向上的声音信号; 处理器 1020还用于分析声音 信号的能量, 确定能量最大的声音信号的声源方向, 提示用户根据声源方 向输入方向指示信息。
本实施例提供的声音信号处理设备 1000的上述实施方式可以用于执 行图 1至图 6任一所示方法实施例的技术方案, 其实现原理和技术效果类 似, 此处不再赘述。
图 11为本发明提供的声音信号处理设备实施例四的结构示意图, 如 图 11所示, 本实施例提供的声音信号处理设备 1100可以包括: 存储器 1 110、 存储器控制器 1115、 处理器 1 120、 外围接口 1 130、 音 频子系统 1 140、 声音信号处理阵列 1150等。
处理器 1 120可以为中央处理器 CPU;
存储器 1 110可包括高速随机存取存储器、 非易失性固体存储设备等; 存储器控制器 1 115可控制声音信号处理设备 1100的诸如处理器等其 它组件对存储器 11 10的访问, 以调用存储器 11 10中的各模块执行相应功 能;
声音信号处理阵列 1 150可以为麦克风阵列或者扬声器阵列。具体地: 处理器 1 120, 用于根据主声源方向确定声音信号处理阵列的接收方 向, 根据所述接收方向及所述声音信号处理阵列的波束指向确定所述处理 阵列的目标方向,所述目标方向指所述处理阵列的接收方向与主声源方向 一致时所述处理阵列的朝向; 确定声音信号处理阵列的朝向与目标方向的 夹角; 提示用户根据夹角调整声音信号处理阵列的朝向, 以使声音信号处 理阵列的接收方向与主声源方向一致。
本实施例提供的声音信号处理设备 1100的上述实施方式可以用于执 行图 7所示方法实施例的技术方案, 其实现原理和技术效果类似, 此处不 再赘述。
综上所述, 本发明实施例提供的声音信号处理方法及设备, 可以根据 用户提供的用于指示目标方向的方向指示, 对声音信号处理阵列的波束指 向进行调整, 从而在嘈杂环境下仍可以对声音信号进行准确的处理, 进而 对所处理的声音信号起到增强的效果。
本领域普通技术人员可以理解: 实现上述各方法实施例的全部或部分 步骤可以通过程序指令相关的硬件来完成。 前述的程序可以存储于一计算 机可读取存储介质中。 该程序在执行时, 执行包括上述各方法实施例的步 骤; 而前述的存储介质包括: ROM、 RAM, 磁碟或者光盘等各种可以存 储程序代码的介质。
最后应说明的是: 以上各实施例仅用以说明本发明的技术方案, 而非 对其限制; 尽管参照前述各实施例对本发明进行了详细的说明, 本领域的 普通技术人员应当理解: 其依然可以对前述各实施例所记载的技术方案进 行修改, 或者对其中部分或者全部技术特征进行等同替换; 而这些修改或 者替换, 并不使相应技术方案的本质脱离本发明各实施例技术方案的范 围。

Claims

权 利 要 求 书
1、 一种声音信号处理方法, 其特征在于, 包括:
接收用户输入的用于指示目标方向的方向指示信息;
根据所述方向指示信息, 将声音信号处理阵列的波束指向调整至与所 述目标方向相对应的状态。
2、 根据权利要求 1所述的方法, 其特征在于, 所述根据所述方向指 示信息, 将声音信号处理阵列的波束指向调整至与所述目标方向相对应的 状态, 包括:
根据所述方向指示信息, 确定声音信号处理阵列中各个声音信号处理 单元所对应的信号延时;
根据各个所述信号延时, 对与所述信号延时相对应的声音信号处理单 元需要处理的声音信号进行延时处理以获取经过延时处理的声音信号, 并 将所述经延时处理的所述声音信号传输至波束形成器以将所述声音信号 处理阵列的波束指向调整至与所述目标方向相对应的状态。
3、 根据权利要求 2所述的方法, 其特征在于, 根据所述方向指示信 息, 确定声音信号处理阵列中各个声音信号处理单元所对应的信号延时具 体包括:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声音处理单元 n对应的信号延时 0^: rn( ) = ^ xfs
C 其中, dn代表呈线形排列的所述声音信号处理阵列中包含的声音信号 处理单元之一的声音信号处理单元 n与声音信号处理阵列中心之间的距 离, 代表由所述声音信号处理阵列中心指向所述方向指示信息所指示的 所述目标方向与一参考坐标之间的近似夹角, c代表声速, fs代表声音信 号处理阵列的声音信号采样频率;
对应地, 所述根据各个所述信号延时, 对与所述信号延时相对应的声 音信号处理单元需要处理的声音信号进行延时处理以获取经过延时处理 的声音信号具体包括:
根据所述声音信号处理单元 n对应的信号延时 τη0 ^对所述声音信号处 理单元 n所采集的声音信号做延时处理, 经延时处理后的声音信号可以表 达为:
Υη {ω,β) = Ύΐί (ω,β)* &-^Λβ
Ϋη ( ^为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, 《代表频率索引。
4、 根据权利要求 2所述的方法, 其特征在于, 根据所述方向指示信 息, 确定声音信号处理阵列中各个声音信号处理单元所对应的信号延时具 体包括:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声音处理单元对应的信号延时的集合 Τ或
τ = [η, , ,
Ψ = ψ、, e c , e c
其中, TM
Figure imgf000041_0001
表所述 音处理阵列中第 理单元对应的信号 延时, rm代表在所述方向指示信息所指示的目标方向上的某一目标 A至所 述第 m个声音处理单元的距离, fa代表所述声音处理阵列的声音信号采样 频率为, c代表声速;
所述 rm通过以下公式获取:
rm = + b - 2brasinft;osam , m = 1,2. M
其中, ra代表所述在所述方向指示信息所指示的目标方向上的某一目 标 A与一参考坐标的坐标原点之间的距离,所述参考坐标的坐标原点为呈 环形排列的所述声音处理阵列的圆心, b代表所述呈环形均匀排列的所述 声音处理阵列的半径, am代表所述某一目标 A在所述声音处理阵列所在平 面上的投影 A、与所述坐标原点之间的连线与所述第 m个声音处理单元至 所述坐标原点之间的连线之间的夹角;
2π(ηι - 1)
a„ = a, + m = l,2. Μ
Μ 其中, ai代表所述某一目标 A在所述声音处理阵列所在平面上的投影 与所述坐标原点之间的连线与所述第 1个声音处理单元至所述坐标原点 之间的连线之间的夹角;
对应地, 所述根据各个所述信号延时, 对与所述信号延时相对应的声 音信号处理单元需要处理的声音信号进行延时处理以获取经过延时处理 的声音信号具体包括:
根据所述信号延时集合 T或 ^中的信号延时 TMM将对应于所述声 音信号处理单元 Μ所采集的声音信号做延时处理, 经延时处理后的声音 信号可以表达为:
ΥΜ (^, ^) = ΫΜ (i ) * E J 或
ΥΜ (ω, β) = ΫΜ (ω, β) * &-'ωψΜ
ΫΗ ,^为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
5、 根据权利要求 2至 4任意一项所述的方法, 其特征在于, 所述声 音信号处理阵列包括声音信号采集阵列, 所述声音信号采集阵列包括多个 声音信号采集单元; 对应地,
所述根据所述方向指示信息, 确定声音信号处理阵列中各个声音信号 处理单元所对应的信号延时, 包括: 根据所述方向指示信息, 确定声音信 号采集阵列中各个声音信号采集单元所对应的信号延时; 对应地,
所述根据各个信号延时, 对与所述信号延时对应的声音信号处理单元 需要处理的声音信号进行延时处理, 包括: 根据各信号延时, 对与所述信 号延时对应的声音信号采集单元采集的声音信号进行延时处理;
或者,
所述声音信号处理阵列包括声音信号发送阵列, 所述声音信号发送阵 列包括多个声音信号发送单元; 对应地,
所述根据所述方向指示信息, 确定声音信号处理阵列中各个声音信号 处理单元的所对应的信号延时, 包括: 根据所述方向指示信息, 确定声音 信号发送阵列中各个声音信号发送单元所对应的信号延时; 对应地,
所述根据各个信号延时, 对与所述信号延时对应的声音信号处理单元 需要处理的声音信号进行延时处理, 包括: 根据各信号延时, 对与所述信 号延时对应的声音信号发送单元发送的声音信号进行延时处理。
6、 根据权利要求 1至 4任意一项所述的方法, 其特征在于, 所述接 收用户输入的用于指示目标方向的方向指示信息之前, 还包括:
接收用户输入的控制模式显示指令;
根据所述控制模式显示指令, 获取并显示可供选择的多种控制模式, 所述多种控制模式中的每一种控制模式对应至少一种方向指示信息输入 方式;
接收用户发送的针对所述多种控制模式的控制模式选择指令, 并开启 用户所选择的控制模式;
所述接收用户输入的用于指示目标方向的方向指示信息, 包括: 接收用户通过与所选择的控制模式对应的方向指示信息输入方式所 输入的用于指示目标方向的所述方向指示信息。
7、 根据权利要求 6所述的方法, 其特征在于, 所述多种控制模式包 括用户自主控制模式, 所述用户自主控制模式为允许用户自主输入所述方 向指示信息的模式; 若所述用户所选择的控制模式为用户自主控制模式, 所述接收用户通过所选择的控制模式对应的方向指示信息输入方式 所输入的用于指示目标方向的所述方向指示信息, 包括:
接收所述用户通过语音输入方式所输入的语音信号, 解析所述语音信 号以获取所述语音信号中包含的方向指示信息;
或者,
接收所述用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 解析 所述轨迹信息所指示的目标方向, 并生成用以指示所述目标方向的方向指 不信息;
或者,
接收所述用户通过投影屏幕投射输入的手势图像信息, 解析所述手势 图像信息所指示的目标方向, 并生成用以指示所述目标方向的方向指示信 息;
或者,
接收所述用户通过与所选择的控制模式对应的指令输入终端装置输 入的用于指示目标方向的控制指令, 所述用于指示目标方向的控制指令包 括所述用于指示目标方向的方向指示信息;
或者,
接收所述用户通过与所选择的控制模式对应的键盘滚轮输入的滚动 信息, 所述用滚动信息包括所述用于指示目标方向的方向指示信息。
8、 根据权利要求 7所述的方法, 其特征在于, 若所述用户选择的控 制模式所对应的方式指示信息输入方式为触摸输入方式, 其中:
接收所述用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 包 括:
接收所述用户在触摸屏上滑动输入的一条轨迹信息;
对应, 所述解析所述轨迹信息所指示的目标方向, 并生成用以指示所 述目标方向的方向指示信息包括:
根据所述一条轨迹信息所包含的所述滑动输入在所述触摸屏上的起 点位置信息和终点位置信息, 确定所述滑动输入所指示的所述目标方向, 并生成用以指示所述目标方向的方向指示信息;
或者,
接收所述用户以触摸输入方式在触摸屏上滑动输入的轨迹信息, 包 括:
接收所述用户在触摸屏上滑动输入的两条轨迹信息;
对应, 所述解析所述轨迹信息所指示的目标方向, 并生成用以指示所 述目标方向的方向指示信息, 包括:
根据所述两条轨迹信息所包含的所述滑动输入在所述触摸屏上的起 点位置信息和终点位置信息, 分别确定两条轨迹的延伸方向; 根据所述两 条轨迹的延伸方向所限定的角度范围, 确定所述滑动输入所指示的目标方 向, 并生成用以指示所述目标方向的方向指示信息。
9、 根据权利要求 8所述的方法, 其特征在于, 所述根据所述两条轨 迹的延伸方向所限定的角度范围, 确定所述滑动输入所指示的目标方向, 包括:
根据所述两条轨迹的延伸方向确定所述两条轨迹所构成夹角的角平 分线延伸的方向;
根据所述角平分线延伸的方向, 确定所述滑动输入所指示的目标方向 处于第一角度范围中, 所述第一角度范围以所述角平分线延伸方向为中 心, 以所述两条轨迹所构成夹角的角度的一半为主瓣宽度。
10、 根据权利要求 1至 9任意一项所述的方法, 其特征在于, 在所述 接收用户输入的用于指示目标方向的方向指示信息之前, 还包括:
感测所述声音信号处理阵列的朝向发生改变, 并确定朝向变化量; 根据所述朝向变化量以及所述声音信号处理阵列的朝向改变之前的 目标方向, 确定所述声音信号处理阵列的朝向改变之后的目标方向;
提示用户输入用于指示所述朝向改变之后的目标方向的所述方向指 不信息
1 1、 根据权利要求 6所述的方法, 其特征在于, 所述用户所选择的控 制模式为扫描模式, 所述扫描模式为通过扫描提示用户输入所述方向指示 信息的模式;
所述接收用户通过所选择的控制模式对应的方向指示信息输入方式 所输入的所述方向指示信息之前, 还包括:
通过扫描采集至少两个方向上的声音信号;
分析所述声音信号的能量, 确定能量最大的声音信号的声源方向; 提示用户根据所述声源方向输入所述方向指示信息。
12、 一种声音信号处理方法, 其特征在于, 包括:
根据主声源方向确定声音信号处理阵列的接收方向, 根据所述接收方 向及所述声音信号处理阵列的波束指向确定所述处理阵列的目标方向, 所 述目标方向指所述处理阵列的接收方向与主声源方向一致时所述处理阵 列的朝向;
确定所述声音信号处理阵列的朝向与所述目标方向之间的夹角; 提示用户根据所述夹角调整所述声音信号处理阵列的朝向, 以使所述 声音信号处理阵列的接收方向与所述主声源方向一致。
13、 一种声音信号处理设备, 其特征在于, 包括:
接收模块, 用于接收用户输入的用于指示目标方向的方向指示信息; 调整模块, 用于根据所述方向指示信息, 将声音信号处理阵列的波束 指向调整至与所述目标方向相对应的状态。
14、 根据权利要求 13所述的设备, 其特征在于, 所述调整模块具体 用于:
根据所述方向指示信息, 确定声音信号处理阵列中各个声音信号处理 单元所对应的信号延时;
根据各个所述信号延时, 对与所述信号延时相对应的声音信号处理单 元需要处理的声音信号进行延时处理以获取经过延时处理的声音信号, 并 将所述经延时处理的所述声音信号传输至波束形成器以将所述声音信号 处理阵列的波束指向调整至与所述目标方向相对应的状态。
15、 根据权利要求 14所述的设备, 其特征在于, 所述调整模块具体 用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声音处理 n对应的信号延时 0^:
Figure imgf000046_0001
其中, dn代表呈线形排列的所述声音信号处理阵列中包含的声音信号 处理单元之一的声音信号处理单元 n与声音信号处理阵列中心之间的距 离, 代表由所述声音信号处理阵列中心指向所述方向指示信息所指示的 所述目标方向与一参考坐标之间的近似夹角, c代表声速, fs代表声音信 号处理阵列的声音信号采样频率;
根据所述声音信号处理单元 n对应的信号延时 τη0 ^对所述声音信号处 理单元 n所采集的声音信号做延时处理, 经延时处理后的声音信号可以表 达为:
Figure imgf000046_0002
Ϋη 为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, ^代表频率索引。
16、 根据权利要求 14所述的设备, 其特征在于, 所述调整模块具体 用于:
根据所述方向指示信息通过以下公式获取声音信号处理阵列中各个 声
Figure imgf000046_0003
的信号 延时, rm代表在所述方向指示信息所指示的目标方向上的某一目标 A至所 述第 m个声音处理单元的距离, fa代表所述声音处理阵列的声音信号采样 频率为, c代表声速;
所述 rm通过以下公式获取:
rm = ^ra 2 + b2 - 2brasinft;osam , m = 1,2…, M 其中, ra代表所述在所述方向指示信息所指示的目标方向上的某一目 标 A与一参考坐标的坐标原点之间的距离,所述参考坐标的坐标原点为呈 环形排列的所述声音处理阵列的圆心, b代表所述呈环形均匀排列的所述 声音处理阵列的半径, am代表所述某一目标 A在所述声音处理阵列所在平 面上的投影 A、与所述坐标原点之间的连线与所述第 m个声音处理单元至 所述坐标原点之间的连线之间的夹角;
所述 am通过以下公式获取:
2π(ηι - l) , ^ ιί
am = a, + ~ ^ L , m = 1,2... , M
M 其中, ai代表所述某一目标 A在所述声音处理阵列所在平面上的投影 与所述坐标原点之间的连线与所述第 1个声音处理单元至所述坐标原点 之间的连线之间的夹角;
根据所述信号延时集合 T或 ^中的信号延时 TMM将对应于所述声 音信号处理单元 Μ所采集的声音信号做延时处理, 经延时处理后的声音 信号可以表达为:
ΥΜ (ω, β) = ΫΜ (ω, β) * e jiu7»或
ΥΜ {ω, β) = ΎΜ {ω, β) * '^
ΫΗ 为将由声音信号处理单元 η采集到的声音信号进行时频变化 后得到的频域信号, 代表频率索引。
17、 根据权利要求 14至 16任意一项所述的设备, 其特征在于, 所述 声音信号处理阵列包括声音信号采集阵列, 所述声音信号采集阵列包括多 个声音信号采集单元; 对应地,
所述调整模块具体用于: 根据所述方向指示信息, 确定声音信号采集 阵列中各个声音信号采集单元所对应的信号延时; 根据各信号延时, 对与 所述信号延时对应的声音信号采集单元采集的声音信号进行延时处理; 或者,
所述声音信号处理阵列包括声音信号发送阵列, 所述声音信号发送阵 列包括多个声音信号发送单元; 对应地,
所述调整模块具体用于: 根据所述方向指示信息, 确定声音信号发送 阵列中各个声音信号发送单元所对应的信号延时; 根据各信号延时, 对与 所述信号延时对应的声音信号发送单元发送的声音信号进行延时处理。
18、 根据权利要求 13至 16任意一项所述的设备, 其特征在于, 所述接收模块还用于: 接收用户输入的控制模式显示指令;
所述设备还包括: 显示模块, 用于根据所述控制模式显示指令, 获取 并向用户显示可供选择的多种控制模式, 所述多种控制模式中的每一种控 制模式对应至少一种方向指示信息输入方式;
所述接收模块还用于: 接收用户发送的针对所述多种控制模式的控制 模式选择指令, 并开启用户所选择的控制模式, 接收用户通过与所选择的 控制模式对应的方向指示信息输入方式所输入的用于指示目标方向的所 述方向指示信息。
19、 根据权利要求 18所述的设备, 其特征在于, 所述多种控制模式 包括用户自主控制模式, 所述用户自主控制模式为允许用户自主输入所述 方向指示信息的模式; 若所述用户所选择的控制模式为用户自主控制模 式;
所述接收模块具体用于: 接收所述用户通过语音输入方式所输入的语 音信号, 解析所述语音信号以获取所述语音信号中包含的方向指示信息; 或者,
所述接收模块具体用于: 接收所述用户以触摸输入方式在触摸屏上滑 动输入的轨迹信息, 解析所述轨迹信息所指示的目标方向, 并生成用以指 示所述目标方向的方向指示信息;
或者,
所述接收模块具体用于: 接收所述用户通过投影屏幕投射输入的手势 图像信息, 解析所述手势图像信息所指示的目标方向, 并生成用以指示所 述目标方向的方向指示信息;
或者,
所述接收模块具体用于: 接收所述用户通过与所选择的控制模式对应 的指令输入终端装置输入的用于指示目标方向的控制指令, 所述用于指示 目标方向的控制指令包括所述用于指示目标方向的方向指示信息;
或者,
所述接收模块具体用于: 接收所述用户通过与所选择的控制模式对应 的键盘滚轮输入的滚动信息, 所述用滚动信息包括所述用于指示目标方向 的方向指示信息。
20、 根据权利要求 19所述的设备, 其特征在于, 若所述用户选择的 控制模式所对应的方式指示信息输入方式为触摸输入方式, 其中:
所述接收模块具体用于: 接收所述用户在触摸屏上滑动输入的一条轨 迹信息; 根据所述一条轨迹信息所包含的所述滑动输入在所述触摸屏上的 起点位置信息和终点位置信息, 确定所述滑动输入所指示的目标方向, 并 生成用以指示所述目标方向的方向指示信息;
或者,
所述接收模块具体用于: 接收所述用户在触摸屏上滑动输入的两条轨 迹信息; 根据所述两条轨迹信息所包含的所述滑动输入在所述触摸屏上的 起点位置信息和终点位置信息, 分别确定两条轨迹的延伸方向; 根据所述 两条轨迹的延伸方向所限定的角度范围, 确定所述滑动输入所指示的目标 方向, 并生成用以指示所述目标方向的方向指示信息。
21、 根据权利要求 20所述的设备, 其特征在于, 所述接收模块具体 用于:
根据所述两条轨迹的延伸方向确定所述两条轨迹所构成夹角的角平 分线延伸的方向; 根据所述角平分线延伸的方向, 确定所述滑动输入所指 示的目标方向处于第一角度范围中, 所述第一角度范围以所述角平分线延 伸方向为中心, 以所述两条轨迹所构成夹角的角度的一半为主瓣宽度。
22、 根据权利要求 13至 21任意一项所述的设备, 其特征在于, 还包 括:
感测模块: 用于感测所述声音信号处理阵列的朝向发生改变, 并确定 朝向变化量;
确定模块: 用于根据所述朝向变化量以及所述声音信号处理阵列的朝 向改变之前的目标方向, 确定所述声音信号处理阵列的朝向改变之后的目 标方向;
提示模块: 用于提示用户输入用于指示所述朝向改变之后的目标方向 的所述方向指示信息。
23、 根据权利要求 18所述的设备, 其特征在于, 所述用户所选择的 控制模式为扫描模式, 所述扫描模式为通过扫描提示用户输入所述方向指 示信息的模式; 对应地, 还包括:
扫描模块, 用于通过扫描采集至少两个方向上的声音信号;
分析模块, 用于分析所述声音信号的能量, 确定能量最大的声音信号 的声源方向;
所述提示模块还用于提示用户根据所述声源方向输入所述方向指示 信息。
24、 一种声音信号处理设备, 其特征在于, 包括:
确定模块, 用于根据主声源方向确定声音信号处理阵列的接收方向, 根据所述接收方向及所述声音信号处理阵列的波束指向确定所述处理阵 列的目标方向,所述目标方向指所述处理阵列的接收方向与主声源方向一 致时所述处理阵列的朝向; 确定所述声音信号处理阵列的朝向与所述目标 方向的夹角;
提示模块, 用于提示用户根据所述夹角调整所述声音信号处理阵列的 朝向, 以使所述声音信号处理阵列的接收方向与所述主声源方向一致。
PCT/CN2014/073124 2013-03-21 2014-03-10 声音信号处理方法及设备 Ceased WO2014146547A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP14767360.2A EP2977985A4 (en) 2013-03-21 2014-03-10 Sound signal processing method and device
US14/859,743 US20160011851A1 (en) 2013-03-21 2015-09-21 Sound signal processing method and device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310092267.3 2013-03-21
CN201310092267.3A CN104065798B (zh) 2013-03-21 2013-03-21 声音信号处理方法及设备

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US14/859,743 Continuation US20160011851A1 (en) 2013-03-21 2015-09-21 Sound signal processing method and device

Publications (1)

Publication Number Publication Date
WO2014146547A1 true WO2014146547A1 (zh) 2014-09-25

Family

ID=51553321

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/073124 Ceased WO2014146547A1 (zh) 2013-03-21 2014-03-10 声音信号处理方法及设备

Country Status (4)

Country Link
US (1) US20160011851A1 (zh)
EP (1) EP2977985A4 (zh)
CN (1) CN104065798B (zh)
WO (1) WO2014146547A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110119264A (zh) * 2019-05-21 2019-08-13 北京达佳互联信息技术有限公司 音效调整方法、装置及存储介质

Families Citing this family (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104378570A (zh) * 2014-09-28 2015-02-25 小米科技有限责任公司 录音方法及装置
US9716944B2 (en) * 2015-03-30 2017-07-25 Microsoft Technology Licensing, Llc Adjustable audio beamforming
US9565493B2 (en) 2015-04-30 2017-02-07 Shure Acquisition Holdings, Inc. Array microphone system and method of assembling the same
US9554207B2 (en) 2015-04-30 2017-01-24 Shure Acquisition Holdings, Inc. Offset cartridge microphones
CN106302974B (zh) * 2015-06-12 2020-01-31 联想(北京)有限公司 一种信息处理的方法及电子设备
KR102362121B1 (ko) * 2015-07-10 2022-02-11 삼성전자주식회사 전자 장치 및 그 입출력 방법
CN105611458B (zh) * 2015-11-11 2019-07-16 Oppo广东移动通信有限公司 一种移动终端的定向录音控制方法及装置
CN106935246A (zh) * 2015-12-31 2017-07-07 芋头科技(杭州)有限公司 一种基于麦克风阵列的人声获取方法及电子设备
CN107404684A (zh) * 2016-05-19 2017-11-28 华为终端(东莞)有限公司 一种采集声音信号的方法和装置
JP2017228080A (ja) * 2016-06-22 2017-12-28 ソニー株式会社 情報処理装置、情報処理方法、及び、プログラム
CN106098075B (zh) * 2016-08-08 2018-02-02 腾讯科技(深圳)有限公司 基于麦克风阵列的音频采集方法和装置
CN106611600A (zh) * 2016-12-02 2017-05-03 广州音书科技有限公司 用于远场拾音及移动充电的音频处理装置及系统
CN107016710B (zh) * 2016-12-08 2021-05-18 创新先进技术有限公司 一种坐标轴的绘制方法及装置
CN106548783B (zh) * 2016-12-09 2020-07-14 西安Tcl软件开发有限公司 语音增强方法、装置及智能音箱、智能电视
US10367948B2 (en) 2017-01-13 2019-07-30 Shure Acquisition Holdings, Inc. Post-mixing acoustic echo cancellation systems and methods
CN106816156B (zh) * 2017-02-04 2020-06-30 北京时代拓灵科技有限公司 一种音频质量增强的方法及装置
US10435148B2 (en) * 2017-05-08 2019-10-08 Aurora Flight Sciences Corporation Systems and methods for acoustic radiation control
CN107437063A (zh) * 2017-07-04 2017-12-05 上海小蚁科技有限公司 用于感测环境的装置和方法、非暂态计算机可读介质
JP6644197B2 (ja) * 2017-09-07 2020-02-12 三菱電機株式会社 雑音除去装置および雑音除去方法
CN108172236B (zh) * 2018-01-12 2021-08-20 歌尔科技有限公司 一种拾音降噪方法以及智能电子设备
CN108181992A (zh) * 2018-01-22 2018-06-19 北京百度网讯科技有限公司 基于手势的语音唤醒方法、装置、设备及计算机可读介质
CN110121048A (zh) * 2018-02-05 2019-08-13 青岛海尔多媒体有限公司 一种会议一体机的控制方法及控制系统和会议一体机
CN112335261B (zh) 2018-06-01 2023-07-18 舒尔获得控股公司 图案形成麦克风阵列
US11297423B2 (en) 2018-06-15 2022-04-05 Shure Acquisition Holdings, Inc. Endfire linear array microphone
KR102466134B1 (ko) * 2018-06-26 2022-11-10 엘지디스플레이 주식회사 표시 장치
US11310596B2 (en) 2018-09-20 2022-04-19 Shure Acquisition Holdings, Inc. Adjustable lobe shape for array microphones
CN109286790B (zh) * 2018-10-17 2021-05-07 杭州兆华电子有限公司 一种基于声源定位的定向监听系统及其监听方法
JP7115216B2 (ja) 2018-10-24 2022-08-09 トヨタ自動車株式会社 情報処理装置および情報処理方法
CN109785835A (zh) * 2019-01-25 2019-05-21 广州富港万嘉智能科技有限公司 一种通过移动终端实现声音录制的方法及装置
JP7572964B2 (ja) 2019-03-21 2024-10-24 シュアー アクイジッション ホールディングス インコーポレイテッド 阻止機能を伴うビーム形成マイクロフォンローブの自動集束、領域内自動集束、および自動配置
US11303981B2 (en) 2019-03-21 2022-04-12 Shure Acquisition Holdings, Inc. Housings and associated design features for ceiling array microphones
US11558693B2 (en) 2019-03-21 2023-01-17 Shure Acquisition Holdings, Inc. Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition and voice activity detection functionality
TW202101422A (zh) 2019-05-23 2021-01-01 美商舒爾獲得控股公司 可操縱揚聲器陣列、系統及其方法
GB2584629A (en) 2019-05-29 2020-12-16 Nokia Technologies Oy Audio processing
WO2020243471A1 (en) 2019-05-31 2020-12-03 Shure Acquisition Holdings, Inc. Low latency automixer integrated with voice and noise activity detection
CN110121129B (zh) * 2019-06-20 2021-04-20 歌尔股份有限公司 耳机的麦克风阵列降噪方法、装置、耳机及tws耳机
CN114467312A (zh) 2019-08-23 2022-05-10 舒尔获得控股公司 具有改进方向性的二维麦克风阵列
US12028678B2 (en) 2019-11-01 2024-07-02 Shure Acquisition Holdings, Inc. Proximity microphone
US11552611B2 (en) 2020-02-07 2023-01-10 Shure Acquisition Holdings, Inc. System and method for automatic adjustment of reference gain
US11706562B2 (en) 2020-05-29 2023-07-18 Shure Acquisition Holdings, Inc. Transducer steering and configuration systems and methods using a local positioning system
CN111688580B (zh) * 2020-05-29 2023-03-14 阿波罗智联(北京)科技有限公司 智能后视镜进行拾音的方法以及装置
US11122364B1 (en) * 2020-08-31 2021-09-14 Nanning Fugui Precision Industrial Co., Ltd. Footsteps tracking method and system thereof
CN114449393B (zh) * 2020-10-31 2023-10-13 华为技术有限公司 一种声音增强方法、耳机控制方法、装置及耳机
CN112466305B (zh) * 2020-12-16 2022-09-02 思必驰科技股份有限公司 饮水机的语音控制方法及装置
JP7814400B2 (ja) 2021-01-28 2026-02-16 シュアー アクイジッション ホールディングス インコーポレイテッド ハイブリッドオーディオビーム形成システム
US12452584B2 (en) 2021-01-29 2025-10-21 Shure Acquisition Holdings, Inc. Scalable conferencing systems and methods
US12542123B2 (en) 2021-08-31 2026-02-03 Shure Acquisition Holdings, Inc. Mask non-linear processor for acoustic echo cancellation
US12289584B2 (en) 2021-10-04 2025-04-29 Shure Acquisition Holdings, Inc. Networked automixer systems and methods
EP4427465A1 (en) 2021-11-05 2024-09-11 Shure Acquisition Holdings, Inc. Distributed algorithm for automixing speech over wireless networks
US12250526B2 (en) 2022-01-07 2025-03-11 Shure Acquisition Holdings, Inc. Audio beamforming with nulling control system and methods
CN115499342B (zh) * 2022-08-10 2025-06-03 新华三技术有限公司 一种时延计算方法及装置
US12598261B2 (en) 2022-09-28 2026-04-07 Shure Acquisition Holdings, Inc. Wideband doubletalk detection for optimization of acoustic echo cancellation
CN115762519A (zh) * 2022-10-28 2023-03-07 歌尔科技有限公司 语音识别方法、装置、设备及存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1762179A (zh) * 2003-01-17 2006-04-19 1...有限公司 阵列型声系统配置方法
CN101242677A (zh) * 2007-02-05 2008-08-13 索尼株式会社 耳机设备、声音再现系统和声音再现方法
CN102421050A (zh) * 2010-09-17 2012-04-18 三星电子株式会社 使用麦克风的非均匀布局来增强音频质量的设备和方法
WO2012061151A1 (en) * 2010-10-25 2012-05-10 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for orientation-sensitive recording control
CN102804806A (zh) * 2009-06-23 2012-11-28 诺基亚公司 用于处理音频信号的方法及装置

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6049607A (en) * 1998-09-18 2000-04-11 Lamar Signal Processing Interference canceling method and apparatus
US6594367B1 (en) * 1999-10-25 2003-07-15 Andrea Electronics Corporation Super directional beamforming design and implementation
JP3824920B2 (ja) * 2001-12-07 2006-09-20 ヤマハ発動機株式会社 マイクロホンユニット及び音源方向同定システム
WO2005076661A1 (ja) * 2004-02-10 2005-08-18 Mitsubishi Denki Engineering Kabushiki Kaisha 超指向性スピーカ搭載型移動体
US20080259731A1 (en) * 2007-04-17 2008-10-23 Happonen Aki P Methods and apparatuses for user controlled beamforming
US20100123785A1 (en) * 2008-11-17 2010-05-20 Apple Inc. Graphic Control for Directional Audio Input
US20100254543A1 (en) * 2009-02-03 2010-10-07 Squarehead Technology As Conference microphone system
US8861756B2 (en) * 2010-09-24 2014-10-14 LI Creative Technologies, Inc. Microphone array system
CN102164328B (zh) * 2010-12-29 2013-12-11 中国科学院声学研究所 一种用于家庭环境的基于传声器阵列的音频输入系统
US8525868B2 (en) * 2011-01-13 2013-09-03 Qualcomm Incorporated Variable beamforming with a mobile platform
US20130332156A1 (en) * 2012-06-11 2013-12-12 Apple Inc. Sensor Fusion to Improve Speech/Audio Processing in a Mobile Device
US9286898B2 (en) * 2012-11-14 2016-03-15 Qualcomm Incorporated Methods and apparatuses for providing tangible control of sound
KR102150013B1 (ko) * 2013-06-11 2020-08-31 삼성전자주식회사 음향신호를 위한 빔포밍 방법 및 장치

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1762179A (zh) * 2003-01-17 2006-04-19 1...有限公司 阵列型声系统配置方法
CN101242677A (zh) * 2007-02-05 2008-08-13 索尼株式会社 耳机设备、声音再现系统和声音再现方法
CN102804806A (zh) * 2009-06-23 2012-11-28 诺基亚公司 用于处理音频信号的方法及装置
CN102421050A (zh) * 2010-09-17 2012-04-18 三星电子株式会社 使用麦克风的非均匀布局来增强音频质量的设备和方法
WO2012061151A1 (en) * 2010-10-25 2012-05-10 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for orientation-sensitive recording control

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110119264A (zh) * 2019-05-21 2019-08-13 北京达佳互联信息技术有限公司 音效调整方法、装置及存储介质

Also Published As

Publication number Publication date
CN104065798A (zh) 2014-09-24
EP2977985A1 (en) 2016-01-27
CN104065798B (zh) 2016-08-03
EP2977985A4 (en) 2017-06-28
US20160011851A1 (en) 2016-01-14

Similar Documents

Publication Publication Date Title
WO2014146547A1 (zh) 声音信号处理方法及设备
US9870775B2 (en) Method and device for voice recognition and electronic device thereof
US10909988B2 (en) Systems and methods for displaying a user interface
US9532140B2 (en) Listen to people you recognize
KR102146462B1 (ko) 음성 인식 시스템 및 방법
EP3001414B1 (en) Method for executing voice command and electronic device
CN106713598B (zh) 基于指示方向的指令传输方法及装置、智能设备
US9330666B2 (en) Gesture-based messaging method, system, and device
KR20170013264A (ko) 방향성 인터페이스를 갖는 핸즈 프리 디바이스
WO2021037129A1 (zh) 一种声音采集方法及装置
CN104811862A (zh) 一种音箱音量控制方法及终端
CN112098934B (zh) 一种智能设备的定位方法和智能设备
CN109545217B (zh) 语音信号接收方法、装置、智能终端及可读存储介质
JP2016170584A (ja) 情報処理装置、情報処理方法およびプログラム
CN116193319A (zh) 语音增强处理方法、语音增强处理装置及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14767360

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2014767360

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE