WO2024046416A1 - 一种音量调节方法、电子设备及系统 - Google Patents
一种音量调节方法、电子设备及系统 Download PDFInfo
- Publication number
- WO2024046416A1 WO2024046416A1 PCT/CN2023/116067 CN2023116067W WO2024046416A1 WO 2024046416 A1 WO2024046416 A1 WO 2024046416A1 CN 2023116067 W CN2023116067 W CN 2023116067W WO 2024046416 A1 WO2024046416 A1 WO 2024046416A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- volume
- sound
- electronic device
- level
- sound level
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/12—Audiometering
- A61B5/121—Audiometering evaluating hearing capacity
- A61B5/123—Audiometering evaluating hearing capacity subjective methods
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/68—Arrangements of detecting, measuring or recording means, e.g. sensors, in relation to patient
- A61B5/6801—Arrangements of detecting, measuring or recording means, e.g. sensors, in relation to patient specially adapted to be attached to or worn on the body surface
- A61B5/6802—Sensor mounted on worn items
- A61B5/6803—Head-worn items, e.g. helmets, masks, headphones or goggles
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/68—Arrangements of detecting, measuring or recording means, e.g. sensors, in relation to patient
- A61B5/6887—Arrangements of detecting, measuring or recording means, e.g. sensors, in relation to patient mounted on external non-worn devices, e.g. non-medical devices
- A61B5/6898—Portable consumer electronic devices, e.g. music players, telephones, tablet computers
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7203—Signal processing specially adapted for physiological signals or for diagnostic purposes for noise prevention, reduction or removal
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
- A61B5/7267—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers
- H03G3/002—Control of digital or coded signals
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers
- H03G3/20—Automatic control
- H03G3/30—Automatic control in amplifiers having semiconductor devices
- H03G3/32—Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/414—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
- H04N21/41407—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/485—End-user interface for client configuration
- H04N21/4852—End-user interface for client configuration for modifying audio parameters, e.g. switching between mono and stereo
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/04—Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R2205/00—Details of stereophonic arrangements covered by H04R5/00 but not provided for in any of its subgroups
- H04R2205/041—Adaptation of stereophonic signal reproduction for the hearing impaired
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
- H04R2430/01—Aspects of volume control, not necessarily automatic, in sound systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/11—Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Electric hearing aids
- H04R25/70—Adaptation of deaf aid to hearing loss, e.g. initial electronic fitting
Definitions
- Embodiments of the present application relate to terminal technology, and in particular, to a volume adjustment method, electronic device, and system.
- terminal devices including but not limited to mobile phones, glasses, open headphones, etc.
- music in public scenes such as elevators, conference rooms, etc.
- the calls or music are often heard by people next to them.
- Content there is a risk of personal privacy being leaked or interfering with others.
- the present application provides a volume adjustment method, electronic device and system.
- the method can adjust the playback volume of the downlink signal based on the environmental noise signal, the downlink signal and the user's hearing level to improve the user experience.
- inventions of the present application provide a volume adjustment method applied to electronic equipment.
- the electronic equipment includes a sound-generating device and at least one microphone.
- the method includes:
- the electronic device responds to the user operation of starting the hearing test and plays the preset audio through multiple volumes respectively; the multiple volumes include the first volume;
- the electronic device determines that the user's hearing level is the first hearing level corresponding to the first volume based on the determination operation for the first volume;
- the electronic device plays the downlink signal through the sound-generating device, it collects the downlink signal and the environmental noise signal through at least one microphone;
- the electronic device adjusts the playback volume of the sound-generating device from the first sound level to the second sound level based on the first hearing level.
- the electronic device can determine the user's hearing level based on the user's determination operation in the hearing test; further, based on the user's hearing level, when at least one of the collected downlink signal and the collected environmental noise signal changes, Adjust the playback volume of the sound-generating device. It can be seen that the playback volume is a volume determined by comprehensively considering the environmental noise signal, the downlink signal and the user's hearing level. It can meet the individual needs of the user, reduce the impact of environmental noise, and improve the user experience.
- the greater the first volume the higher the level of the second sound level.
- the greater the first volume selected by the user in the hearing level test the higher the sound level of the downlink signal played by the electronic device when the user actually uses the electronic device. This method can meet the listening needs of different users.
- the first volume is the user's comfortable volume in the current environment or the lowest volume that can be heard clearly.
- the user can select the first volume based on the user's personal needs during the hearing level test. For example, if the user wants the volume of the downlink signal played by the electronic device to be a volume that is comfortable for the user, the user can determine the volume that the user feels comfortable in the current environment as the first volume; for another example, the user wants the volume of the downlink signal played by the electronic device to be The user can determine the lowest volume that the user can hear clearly in the current environment as the first volume. This method can meet users' different needs for volume.
- the greater the energy of the changed environmental noise signal the higher the level of the second sound level.
- the electronic device can automatically adjust the sound level of the sound-emitting device when the energy of the environmental noise signal changes. For example, if the energy of the environmental noise signal becomes larger, that is, the noise is enhanced, the electronic device can adjust the sound level of the sound-generating device to a higher level to prevent the user from hearing the downlink signal due to the environmental noise signal.
- the electronic device adjusts the playback volume of the sound-generating device from the first sound level to the second sound level, including:
- the electronic device adjusts the playback volume of the sound-generating device to the second sound level. Two sound levels.
- the electronic device adjusts the playback volume of the sound-generating device from the first sound level to the second sound level, including:
- the electronic device adjusts the playback volume of the sound-generating device to the third sound level; the third sound level is higher than the first sound level and lower than the second sound level. grade.
- the electronic device can gradually adjust the first sound level to the second sound level when the first sound level and the second sound level differ by a second preset level number. Assuming that the first sound level is lower than the second sound level, the electronic device can first adjust the playback volume of the sound-generating device to the third sound level; the third sound level is higher than the first sound level and lower than the second sound level; and then the The playback volume of the sound-generating device is adjusted from the third sound level to the second sound level.
- the third sound level is only used as one intermediate adjustment as an example.
- the sound-generating device may also undergo multiple intermediate adjustments when adjusting from the first sound level to the second sound level. The situation where the first sound level is higher than the second sound level is similar to the above adjustment, and the first sound level can be gradually lowered to the second sound level. This method can avoid sudden volume adjustments causing discomfort to the user's hearing experience.
- the method also includes
- the electronic device plays the downlink signal corresponding to the first application through the sound-generating device at the second volume under the first environmental condition;
- the electronic device plays the downlink signal corresponding to the second application through the sound-generating device at a third volume under the first environmental condition;
- the second volume is not equal to the third volume.
- the electronic device can have different volume adjustments for different applications. This method can meet the user's hearing needs for different applications.
- the first application is a call application
- the second application is a music application
- the second volume is greater than the third volume
- the volume of the downlink signal of the call application is adjusted to a larger value. Therefore, in the same environment, the electronic device plays the call application The volume is greater than the playback volume of music applications.
- the method also includes:
- the electronic device determines the preset model corresponding to the first hearing level as the target model; the target model is obtained through user data training of users who meet the first hearing level; the user data includes users with the first hearing level under the second environmental condition.
- the electronic device inputs the energy difference between the changed downlink signal and the changed environmental noise signal into the target model to obtain the second sound level.
- the changed downlink signal includes N first sub-band signals, and the changed environmental noise signal includes N second sub-band signals, N is a positive integer; N The first subband signal has a one-to-one correspondence with N second subband signals; the first subband signal and the second subband signal corresponding to each other form a set of signals;
- the energy difference between the changed downlink signal and the changed environmental noise signal includes the energy difference between N groups of signals.
- the energy difference between the changed downlink signal and the changed environmental noise signal is used as the input data of the target model.
- the energy difference may include the sub-frequency band signal of the above-mentioned noise signal and the sub-frequency band of the above-mentioned downlink signal.
- the energy difference of the signal If the decibel value of the signal in any frequency band of the sub-band signal in the downlink signal is greater than the masking threshold in the masking effect, then the downlink signal can be heard by the user. This method is based on the masking effect, which can ensure that the adjusted volume meets the hearing needs of the human ear during the adjustment process and avoids interference from masking sounds.
- the user data also includes the playback volume of the sound-emitting device at several moments before the target sound level; the electronic device inputs the energy difference between the changed downlink signal and the changed environmental noise signal.
- the target model to obtain the second sound level includes:
- the electronic device inputs the energy difference between the changed downlink signal and the changed ambient noise signal and the playback volume of the sound-generating device at several previous moments into the target model to obtain the second sound level.
- the method further includes:
- the electronic device responds to the user's operation of adjusting the volume key and adjusts the playback volume of the sound-generating device to the target sound level;
- the electronic device stores the energy difference between the changed downlink signal and the changed ambient noise signal and the target sound level
- the electronic device uses the energy difference between the changed downlink signal and the changed environmental noise signal and the target sound level as sample data to train the target model.
- the electronic device can collect the target sound level adjusted when the user manually adjusts the volume key and the above-mentioned volume difference as sample data to train the target model.
- This method can make electronic devices more and more in line with user habits and improve user experience.
- At least one microphone includes a first microphone and a second microphone, the distance between the first microphone and the sound-generating device is smaller than the distance between the second microphone and the sound-generating device, and the first microphone is used for collecting For downlink signals, the second microphone is used to collect environmental noise signals.
- embodiments of the present application provide another volume adjustment method, applied to electronic equipment.
- the electronic equipment includes a sound-generating device and at least one microphone.
- the method includes:
- the electronic device plays the downlink signal through the sound-generating device, it collects the downlink signal and the environmental noise signal through at least one microphone;
- the electronic device adjusts the playback volume of the sound-generating device from the first sound level to the second sound level based on the first hearing level, and the first hearing level is the electronic device The listening level corresponding to the user.
- embodiments of the present application provide a volume adjustment device, which includes a processing module, a sound-generating device, and at least one microphone:
- a sound-generating device for playing preset audio through multiple volumes respectively; the multiple volumes include a first volume;
- a processing module configured to determine the user's hearing level to be the first hearing level corresponding to the first volume based on the determination operation for the first volume;
- At least one microphone is used to collect the downlink signal and the environmental noise signal when the sound-generating device plays the downlink signal
- the sound-generating device is configured to adjust the playback volume from the first sound level to the second sound level based on the first hearing level when at least one of the collected downlink signal and the collected environmental noise signal changes.
- the first volume is the lowest volume or the comfortable volume that the user can hear clearly in the current environment.
- the greater the energy of the changed environmental noise signal the higher the level of the second sound level.
- the sound-generating device is specifically configured to play a downlink signal through the second sound level when the first sound level and the second sound level differ by a first preset level number.
- the sound-generating device is specifically configured to play the downlink signal through the third sound level when the first sound level and the second sound level differ by a second preset level number; third The sound level is higher than the first sound level and lower than the second sound level.
- the sound-generating device is used to play the downlink signal corresponding to the first application according to the second volume under the first environmental condition;
- the sound-generating device is used to play the downlink signal corresponding to the second application at a third volume under the first environmental condition; the second volume is not equal to the third volume.
- the first application is a call application
- the second application is a music application
- the second volume is greater than the third volume
- the processing module is used for:
- the preset model corresponding to the first hearing level is determined as the target model; the target model is obtained through user data training of users who meet the first hearing level; the user data includes the goals of users with the first hearing level under the second environmental conditions. Sound level, and the energy difference between the downlink signal and the ambient noise signal collected under the second environmental condition;
- the energy difference between the changed downlink signal and the changed environmental noise signal is input into the target model to obtain the second sound level.
- the changed downlink signal includes N first sub-band signals, and the changed environmental noise signal includes N second sub-band signals, N is a positive integer; N The first subband signal has a one-to-one correspondence with N second subband signals; the first subband signal and the second subband signal corresponding to each other form a set of signals;
- the energy difference between the changed downlink signal and the changed environmental noise signal includes the energy difference between N groups of signals.
- the user data also includes the playback volume of the sound-emitting device at several moments before the target sound level
- the processing module is used to input the energy difference between the changed downlink signal and the changed environmental noise signal and the playback volume of the sound-generating device at several previous moments into the target model to obtain the second sound level.
- the sound-generating device is used to adjust the playback volume of the sound-generating device to the target sound level in response to the user's operation of adjusting the volume key;
- the processing module is used to store the energy difference and target sound level between the changed downlink signal and the changed environmental noise signal
- the processing module is used to train the target model using the energy difference between the changed downlink signal and the changed environmental noise signal and the target sound level as sample data.
- At least one microphone includes a first microphone and a second microphone, the distance between the first microphone and the sound-generating device is smaller than the distance between the second microphone and the sound-generating device, and the first microphone is used for collecting For downlink signals, the second microphone is used to collect environmental noise signals.
- embodiments of the present application provide an electronic device, including one or more functional modules.
- the one or more functional modules can be used to perform any of the above aspects or any possible implementation of any aspect.
- the present application provides a computer storage medium, including computer instructions.
- the communication device causes the communication device to perform any of the above aspects or any possible implementation of any aspect. Volume adjustment method.
- the present application provides a computer program product.
- the computer program product When the computer program product is run on a computer, it causes the computer to execute the volume adjustment method in any of the above aspects or any possible implementation of any aspect.
- the present application provides a chip, including: a processor and an interface.
- the processor and the interface cooperate with each other so that the chip executes any of the above aspects or any possible implementation of any aspect. volume adjustment method.
- the electronic device provided by the fourth aspect the computer-readable storage medium provided by the fifth aspect, the computer program product provided by the sixth aspect, and the chip provided by the seventh aspect are all used to execute the tasks provided by the embodiments of the present application. method. Therefore, the beneficial effects it can achieve can be referred to the beneficial effects in the corresponding methods, and will not be described again here.
- Figure 1 is a schematic diagram of windowing a speech signal provided by an embodiment of the present application.
- Figure 2 is a schematic diagram of a masking effect provided by an embodiment of the present application.
- Figure 3 is a schematic diagram of the hardware structure of the electronic device 100 provided by the embodiment of the present application.
- Figure 4 is a schematic diagram of a hardware form of a mobile phone provided by an embodiment of the present application.
- Figure 5 is a schematic diagram of a hardware form of glasses provided by an embodiment of the present application.
- Figure 6 is a software structure block diagram of an electronic device 100 provided by an embodiment of the present application.
- Figure 7 is an overall flow diagram of a volume adjustment method provided by an embodiment of the present application.
- Figure 8 is a diagram showing the correspondence between a preset model and hearing levels provided by an embodiment of the present application.
- Figure 9 is a schematic diagram of the training process of a preset model provided by an embodiment of the present application.
- Figure 10 is a schematic diagram of a training process provided by the embodiment of the present application.
- Figure 11 is a schematic diagram of the overall flow of a volume adjustment method provided by this application.
- Figures 12A to 12F are some interface diagrams illustrating exemplary embodiments of the present application when a user performs a hearing test on an electronic device;
- Figure 13 is a schematic diagram of framing and windowing the signals collected by the bottom microphone provided by the embodiment of the present application.
- Figure 14 is a schematic diagram of framing and windowing the signals collected by the top microphone provided by the embodiment of the present application.
- Figure 15 is a schematic diagram of a molecular band processing provided by an embodiment of the present application.
- Figure 16 is a schematic diagram of the corresponding relationship between sound level and playback volume provided by an embodiment of the present application.
- Figure 17 is a schematic diagram of the output sound level changing with the environment provided by the embodiment of the present application.
- Figure 18 is a comparison chart of the playback volume of different electronic devices during calls in the same environment provided by the embodiment of the present application.
- Figure 19 is a schematic diagram of a volume adjustment process during a user call provided by an embodiment of the present application.
- Figure 20 is a comparison chart of the sound levels adjusted when an electronic device is used in different applications according to an embodiment of the present application
- Figure 21 is a partial application interface of the electronic device provided by the embodiment of the present application for making a call under the first environmental condition before self-learning;
- Figure 22 is a partial application interface of the electronic device provided by the embodiment of the present application for making calls under the first environmental condition after self-learning.
- first and second are used for descriptive purposes only and shall not be understood as implying or implying relative importance or implicitly specifying the quantity of indicated technical features. Therefore, the features defined as “first” and “second” may explicitly or implicitly include one or more of the features. In the description of the embodiments of this application, unless otherwise specified, “plurality” The meaning is two or more.
- GUI graphical user interface
- speech has time-varying properties, that is, the speech signal characteristics change with time and is a non-stationary random process.
- the speech signal has time-varying properties, its characteristics can be regarded as stable in a short period of time. Therefore, the speech signal can be divided into shorter frames, and each frame can be regarded as a steady-state signal.
- the frame length can generally be 25 milliseconds.
- the frame shift can be half the frame length, or fixed to 10 milliseconds.
- the speech signal needs to be windowed to reduce spectrum leakage.
- the windowing operation is to multiply the speech signal with a window function.
- window functions include rectangular window, Hamming window and Hanning window. Different window functions can be selected according to different situations.
- Figure 1 is a schematic diagram illustrating windowing of a frame of speech signal.
- the horizontal axis of the coordinates in Figure 1 is time and the vertical axis is amplitude;
- (a) in Figure 1 is the waveform diagram of the speech signal before adding the window;
- (b) in Figure 1 is the window function Waveform diagram;
- (c) in Figure 1 is the waveform diagram of the windowed speech signal.
- the phenomenon in which the auditory perception of a weaker sound (the masked sound) is affected by another stronger sound (the masking sound) is called the "masking effect" of the human ear.
- the masking effect is another important physiological characteristic of the human ear. If there are two sound signals in a narrow frequency band, when one is stronger than the other, the hearing threshold of the human ear will increase, and the human ear can hear the high-volume sound signal, while the low-volume sound at nearby frequencies The signal cannot be heard, it seems that the low volume signal is masked by the high volume signal.
- the hearing threshold of the human ear i.e., the hearing threshold
- the hearing threshold is the lowest sound pressure level at which sound can be perceived by the human ear.
- Figure 2 is a schematic diagram of a masking effect provided by an embodiment of the present application.
- the horizontal axis of this coordinate system is frequency in Hertz (Hz), and the vertical axis is sound pressure level in decibels (dB).
- the solid line represents the audible threshold
- the diagonal rectangle represents the audible threshold.
- the dotted line represents the masking threshold
- the blank rectangle represents the masked sound.
- the sound pressure level of the sound is greater than the audible threshold and can be heard by the human ear.
- the sound pressure level of the masked sound is greater than the audible threshold.
- the sound pressure level of the sound is greater than the audible threshold.
- the human ear can hear the masked sound; and when the masking sound appears, the sound pressure level of the sound near the masking sound must be greater than the masking threshold before it can be heard by the human ear.
- the sound pressure level of the masked sound is smaller than the masking sound. Threshold, so when the masking sound appears, the masked sound that the human ear originally heard becomes inaudible.
- FIG. 3 shows a schematic diagram of the hardware structure of the electronic device 100.
- electronic device 100 may have more or fewer components than shown in the figures, may combine two or more components, or may have a different component configuration.
- the various components shown in the figures may be implemented in hardware, software, or a combination of hardware and software including one or more signal processing and/or application specific integrated circuits.
- the electronic device 100 may include: a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2.
- Mobile communication module 150 wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone interface 170D, sensor module 180, button 190, motor 191, indicator 192, camera 193, display screen 194 and Subscriber identification module (SIM) card interface 195, etc.
- SIM Subscriber identification module
- the sensor module 180 may include a pressure sensor 180A, a gyro sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
- the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the electronic device 100 .
- the electronic device 100 may include more or fewer components than shown in the figures, or some components may be combined, some components may be separated, or some components may be arranged differently.
- the components illustrated may be implemented in hardware, software, or a combination of software and hardware.
- the processor 110 may include one or more processing units.
- the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processing unit (GPU), and an image signal processor. (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (NPU) wait.
- application processor application processor, AP
- modem processor graphics processing unit
- GPU graphics processing unit
- image signal processor image signal processor
- ISP image signal processor
- controller memory
- video codec digital signal processor
- DSP digital signal processor
- baseband processor baseband processor
- NPU neural-network processing unit
- different processing units can be independent devices or integrated in one or more processors.
- the controller may be the nerve center and command center of the electronic device 100 .
- the controller can generate operation control signals based on the instruction operation code and timing signals to complete the control of fetching and executing instructions.
- the processor 110 may also be provided with a memory for storing instructions and data.
- the memory in processor 110 is cache memory. This memory may hold instructions or data that have been recently used or recycled by processor 110 . If the processor 110 needs to use the instructions or data again, it can be called directly from the memory. Repeated access is avoided and the waiting time of the processor 110 is reduced, thus improving the efficiency of the system.
- processor 110 may include one or more interfaces.
- Interfaces may include integrated circuit (inter-integrated circuit, I2C) interface, integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, pulse code modulation (pulse code modulation, PCM) interface, universal asynchronous receiver and transmitter (universal asynchronous receiver/transmitter (UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and /or universal serial bus (USB) interface, etc.
- I2C integrated circuit
- I2S integrated circuit built-in audio
- PCM pulse code modulation
- UART universal asynchronous receiver and transmitter
- MIPI mobile industry processor interface
- GPIO general-purpose input/output
- SIM subscriber identity module
- USB universal serial bus
- the I2C interface is a bidirectional synchronous serial bus, including a serial data line (SDA) and a serial clock line (derail clock line, SCL).
- processor 110 may include multiple sets of I2C buses.
- the processor 110 can separately couple the touch sensor 180K, charger, flash, camera 193, etc. through different I2C bus interfaces.
- the processor 110 can be coupled to the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate through the I2C bus interface to implement the touch function of the electronic device 100 .
- the I2S interface can be used for audio communication.
- processor 110 may include multiple sets of I2S buses.
- the processor 110 can be coupled with the audio module 170 through the I2S bus to implement communication between the processor 110 and the audio module 170 .
- the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface to implement the function of answering calls through a Bluetooth headset.
- the PCM interface can also be used for audio communications to sample, quantize and encode analog signals.
- the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
- the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface to implement the function of answering calls through a Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
- the UART interface is a universal serial data bus used for asynchronous communication.
- the bus can be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
- a UART interface is generally used to connect the processor 110 and the wireless communication module 160 .
- the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function.
- the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface to implement the function of playing music through a Bluetooth headset.
- the MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 .
- MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc.
- the processor 110 and the camera 193 communicate through the CSI interface to implement the shooting function of the electronic device 100 .
- the processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100 .
- the GPIO interface can be configured through software.
- the GPIO interface can be configured as a control signal or as a data signal.
- the GPIO interface can be used to connect the processor 110 with the camera 193, display screen 194, wireless communication module 160, audio module 170, sensor module 180, etc.
- the GPIO interface can also be configured as an I2C interface, I2S interface, UART interface, MIPI interface, etc.
- the SIM card interface can be used to communicate with the SIM card interface 195 to implement the function of transmitting data to the SIM card or reading data in the SIM card.
- the USB interface 130 is an interface that complies with the USB standard specification, and may be a Mini USB interface, a Micro USB interface, a USB Type C interface, etc.
- the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through them. This interface can also be used to connect other electronic devices, such as AR devices, etc.
- the interface connection relationships between the modules illustrated in the embodiments of the present application are only schematic illustrations and do not constitute a structural limitation of the electronic device 100 .
- the electronic device 100 may also adopt different interface connection methods in the above embodiments, or a combination of multiple interface connection methods.
- the charging management module 140 is used to receive charging input from the charger.
- the charger can be a wireless charger or a wired charger.
- the power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110.
- the power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, internal memory 121, external memory, display screen 194, camera 193, wireless communication module 160, etc.
- the wireless communication function of the electronic device 100 can be implemented through the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor and the baseband processor.
- Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
- Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization. For example: Antenna 1 can be reused as a diversity antenna for a wireless LAN. In other embodiments, antennas may be used in conjunction with tuning switches.
- the mobile communication module 150 can provide solutions for wireless communication including 2G/3G/4G/5G applied on the electronic device 100 .
- the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), etc.
- the mobile communication module 150 can receive electromagnetic waves through the antenna 1, perform filtering, amplification and other processing on the received electromagnetic waves, and transmit them to the modem processor for demodulation.
- the mobile communication module 150 can also amplify the signal modulated by the modem processor and convert it into electromagnetic waves through the antenna 1 for radiation.
- at least part of the functional modules of the mobile communication module 150 may be disposed in the processor 110 .
- at least part of the functional modules of the mobile communication module 150 and at least part of the modules of the processor 110 may be provided in the same device.
- a modem processor may include a modulator and a demodulator.
- the modulator is used to modulate the low-frequency baseband signal to be sent into a medium-high frequency signal.
- the demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal.
- the demodulator then transmits the demodulated low-frequency baseband signal to the baseband processor for processing.
- the application processor outputs sound signals through audio devices (not limited to speaker 170A, receiver 170B, etc.), or displays images or videos through display screen 194.
- the modem processor may be a stand-alone device.
- the modem processor may be independent of the processor 110 and may be provided in the same device as the mobile communication module 150 or other functional modules.
- the wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) network), Bluetooth (bluetooth, BT), and global navigation satellites.
- WLAN wireless local area networks
- System global navigation satellite system, GNSS
- frequency modulation frequency modulation, FM
- near field communication technology near field communication, NFC
- infrared technology infrared, IR
- the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
- the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
- the wireless communication module 160 can also receive the signal to be sent from the processor 110, frequency modulate it, amplify it, and convert it into electromagnetic waves through the antenna 2 for radiation.
- the antenna 1 of the electronic device 100 is coupled to the mobile communication module 150, and the antenna 2 is coupled to the wireless communication module 160, This allows the electronic device 100 to communicate with the network and other devices through wireless communication technology.
- the wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code division multiple access (wideband code division multiple access, WCDMA), time-division code division multiple access (TD-SCDMA), long term evolution (long term evolution, LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc.
- the GNSS may include global positioning system (GPS), global navigation satellite system (GLONASS), Beidou navigation satellite system (BDS), quasi-zenith satellite system (quasi -zenith satellite system (QZSS) and/or satellite based augmentation systems (SBAS).
- GPS global positioning system
- GLONASS global navigation satellite system
- BDS Beidou navigation satellite system
- QZSS quasi-zenith satellite system
- SBAS satellite based augmentation systems
- the electronic device 100 implements display functions through a GPU, a display screen 194, an application processor, and the like.
- the GPU is an image processing microprocessor and is connected to the display screen 194 and the application processor. GPUs are used to perform mathematical and geometric calculations for graphics rendering.
- Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
- the display screen 194 is used to display images, videos, etc.
- Display 194 includes a display panel.
- the display panel can use a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active matrix organic light emitting diode or an active matrix organic light emitting diode (active-matrix organic light emitting diode).
- LCD liquid crystal display
- OLED organic light-emitting diode
- AMOLED organic light-emitting diode
- FLED flexible light-emitting diode
- Miniled MicroLed, Micro-oLed, quantum dot light emitting diode (QLED), etc.
- the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1.
- the electronic device 100 can implement the shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
- the ISP is used to process the data fed back by the camera 193. For example, when taking a photo, the shutter is opened, the light is transmitted to the camera sensor through the lens, the optical signal is converted into an electrical signal, and the camera sensor passes the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin color. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be provided in the camera 193.
- Camera 193 is used to capture still images or video.
- the object passes through the lens to produce an optical image that is projected onto the photosensitive element.
- the photosensitive element can be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
- CMOS complementary metal-oxide-semiconductor
- the photosensitive element converts the optical signal into an electrical signal, and then passes the electrical signal to the ISP to convert it into a digital image signal.
- ISP outputs digital image signals to DSP for processing.
- DSP converts digital image signals into standard RGB, YUV and other format image signals.
- the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
- Digital signal processors are used to process digital signals. In addition to digital image signals, they can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy.
- Video codecs are used to compress or decompress digital video.
- Electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos in multiple encoding formats, such as moving picture experts group (MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
- MPEG moving picture experts group
- MPEG2 MPEG2, MPEG3, MPEG4, etc.
- NPU is a neural network (NN) computing processor.
- NN neural network
- Intelligent cognitive applications of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, etc.
- the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100.
- the external memory card communicates with the processor 110 through the external memory interface 120 to implement the data storage function. Such as saving music, videos, etc. files in external memory card.
- Internal memory 121 may be used to store computer executable program code, which includes instructions.
- the processor 110 executes instructions stored in the internal memory 121 to execute various functional applications and data processing of the electronic device 100 .
- the internal memory 121 may include a program storage area and a data storage area.
- the stored program area can store the operating system, at least one application required for the function (such as face recognition function, fingerprint recognition function, mobile payment function, etc.).
- the storage data area can store data created during the use of the electronic device 100 (such as face information template data, fingerprint information templates, etc.).
- the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (UFS), etc.
- the electronic device 100 can implement audio functions through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone interface 170D, and the application processor. Such as music playback, recording, etc.
- the audio module 170 is used to convert digital audio information into analog audio signal output, and is also used to convert analog audio input into digital audio signals. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
- the electronic device can play the sound signal through the sound-generating device.
- the sound-generating device may be a speaker 170A described below, a receiver 170B described below, or an external device connected to an electronic device, such as headphones and glasses, which are not limited here.
- Speaker 170A also called “speaker” is used to convert audio electrical signals into sound signals.
- the electronic device 100 can listen to music through the speaker 170A, or listen to hands-free calls.
- Receiver 170B also called “earpiece” is used to convert audio electrical signals into sound signals.
- the electronic device 100 answers a call or a voice message, the voice can be heard by bringing the receiver 170B close to the human ear.
- Microphone 170C also called “microphone” or “microphone” is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can speak close to the microphone 170C with the human mouth and input the sound signal to the microphone 170C.
- the electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which in addition to collecting sound signals, may also implement a noise reduction function. In other embodiments, the electronic device 100 can also be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions, etc.
- the headphone interface 170D is used to connect wired headphones.
- the headphone interface 170D may be a USB interface 130, or may be a 3.5mm open mobile terminal platform (OMTP) standard interface, or a Cellular Telecommunications Industry Association of the USA (CTIA) standard interface.
- OMTP open mobile terminal platform
- CTIA Cellular Telecommunications Industry Association of the USA
- the pressure sensor 180A is used to sense pressure signals and can convert the pressure signals into electrical signals.
- pressure sensor 180A may be disposed on display screen 194 .
- pressure sensors 180A there are many types of pressure sensors 180A, such as resistive pressure sensors, inductive pressure sensors, capacitive pressure sensors, etc.
- a capacitive pressure sensor may include at least two parallel plates of conductive material.
- the electronic device 100 determines the intensity of the pressure based on the change in capacitance.
- the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
- the electronic device 100 may also calculate the touched position based on the detection signal of the pressure sensor 180A.
- touch operations acting on the same touch location but with different touch operation intensities may correspond to different operation instructions. For example: when a touch operation with a touch operation intensity less than the first pressure threshold is applied to the short message application icon, an instruction to view the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold is applied to the short message application icon, an instruction to create a new short message is executed.
- the gyro sensor 180B may be used to determine the motion posture of the electronic device 100 .
- the angular velocity of electronic device 100 about three axes ie, x, y, and z axes
- the gyro sensor 180B can be used for image stabilization. For example, when the shutter is pressed, the gyro sensor 180B detects the angle at which the electronic device 100 shakes, calculates the distance that the lens module needs to compensate based on the angle, and allows the lens to offset the shake of the electronic device 100 through reverse movement to achieve anti-shake.
- the gyro sensor 180B can also be used for navigation and somatosensory game scenes.
- Air pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist positioning and navigation.
- Magnetic sensor 180D includes a Hall sensor.
- the electronic device 100 may utilize the magnetic sensor 180D to detect opening and closing of the flip holster.
- the electronic device 100 may detect the opening and closing of the flip according to the magnetic sensor 180D. Then, based on the detected opening and closing status of the leather case or the opening and closing status of the flip cover, features such as automatic unlocking of the flip cover are set.
- the acceleration sensor 180E can detect the acceleration of the electronic device 100 in various directions (generally three axes). When the electronic device 100 is stationary, the magnitude and direction of gravity can be detected. It can also be used to identify the posture of electronic devices and be used in horizontal and vertical screen switching, pedometer and other applications.
- Distance sensor 180F for measuring distance.
- Electronic device 100 can measure distance via infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 may utilize the distance sensor 180F to measure distance to achieve fast focusing.
- Proximity light sensor 180G may include, for example, a light emitting diode (LED) and a light detector, such as a photodiode.
- the light emitting diode may be an infrared light emitting diode.
- the electronic device 100 emits infrared light outwardly through the light emitting diode.
- Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 .
- the electronic device 100 can use the proximity light sensor 180G to detect when the user holds the electronic device 100 close to the ear for talking, so as to automatically turn off the screen to save power.
- proximity light sensor 180G can also be used in leather case mode, and pocket mode automatically unlocks and locks the screen.
- the ambient light sensor 180L is used to sense ambient light brightness.
- the electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness.
- the ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures.
- the ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in the pocket to prevent accidental touching.
- Fingerprint sensor 180H is used to collect fingerprints.
- the electronic device 100 can use the collected fingerprint characteristics to achieve fingerprint unlocking, access to application locks, fingerprint photography, fingerprint answering of incoming calls, etc.
- Temperature sensor 180J is used to detect temperature.
- the electronic device 100 utilizes the temperature detected by the temperature sensor 180J to execute the temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device 100 reduces the performance of a processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to prevent the low temperature from causing the electronic device 100 to shut down abnormally. In some other embodiments, when the temperature is lower than another threshold, the electronic device 100 performs boosting on the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
- Touch sensor 180K also called “touch panel”.
- the touch sensor 180K can be disposed on the display screen 194.
- the touch sensor 180K and the display screen 194 form a touch screen, which is also called a "touch screen”.
- the touch sensor 180K is used to detect a touch operation on or near the touch sensor 180K.
- the touch sensor can pass the detected touch operation to the application processor to determine the touch event type.
- Visual output related to the touch operation may be provided through display screen 194 .
- the touch sensor 180K may also be disposed on the surface of the electronic device 100 at a location different from that of the display screen 194 .
- the buttons 190 include a power button, a volume button, etc.
- Key 190 may be a mechanical key. It can also be a touch button.
- the electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
- the motor 191 can generate vibration prompts.
- the motor 191 can be used for vibration prompts for incoming calls and can also be used for touch vibration feedback.
- touch operations for different applications can correspond to different vibration feedback effects.
- the motor 191 can also respond to different vibration feedback effects for touch operations in different areas of the display screen 194 .
- Different application scenarios such as time reminders, receiving information, alarm clocks, games, etc.
- the touch vibration feedback effect can also be customized.
- the indicator 192 may be an indicator light, which may be used to indicate charging status, power changes, or may be used to synthesize requests, missed calls, notifications, etc.
- the SIM card interface 195 is used to connect a SIM card.
- the SIM card can be connected to or separated from the electronic device 100 by inserting it into the SIM card interface 195 or pulling it out from the SIM card interface 195 .
- the electronic device 100 can support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
- SIM card interface 195 can support Nano SIM card, Micro SIM card, SIM card, etc. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different.
- the SIM card interface 195 is also compatible with different types of SIM cards.
- the SIM card interface 195 is also compatible with external memory cards.
- the electronic device 100 interacts with the network through the SIM card to implement functions such as calls and data communications.
- the electronic device 100 can execute the volume adjustment method provided by the embodiment of the present application through the processor 110 .
- the above-mentioned electronic device 100 may include two microphones, where one microphone is used to acquire noise signals; the other microphone is used to acquire downlink signals.
- the electronic device 100 may be the mobile phone shown in FIG. 4 or the glasses shown in FIG. 5 .
- the downlink signal is the audio played by the electronic device through the sound-generating device; for example, during a call, the downlink signal is the voice of the other party played by the electronic device through the earpiece; and for example, when listening to a song, the downlink signal is the song played by the electronic device.
- FIG 4 illustrates a hardware form of a mobile phone.
- Mobile phones can play downlink signals through multiple playback modes, such as earpieces and headphones. Different playback modes can correspond to different preset models or volume adjustment methods.
- the following uses the earpiece as an example to introduce the process of collecting downlink signals and noise signals by a mobile phone.
- one microphone of the mobile phone is located on one side of the earpiece; the other microphone of the mobile phone is located on the opposite side of the earpiece.
- the microphone located on one side of the earpiece is called a top microphone (referred to as top microphone); the microphone located on the opposite side of the earpiece is called a bottom microphone (referred to as bottom microphone).
- top microphone the microphone located on one side of the earpiece
- bottom microphone the microphone located on the opposite side of the earpiece.
- the signal collected by the top microphone can be used as the downlink signal
- the signal collected by the bottom microphone can be used as the noise signal.
- the mobile phone can determine the downlink signal from the signal collected by the top microphone. For example, the mobile phone identifies the noise signal from the signal collected by the top microphone and removes the noise signal to obtain the downlink signal; and For example, the mobile phone compares the signal collected by the top microphone with the signal collected by the bottom microphone, and uses the difference between the signal collected by the top microphone and the signal collected by the bottom microphone as the downlink signal.
- the mobile phone may also include a volume key, where the side marked (+) is the volume increase key of the volume key;
- the (-) side is the volume down key of the volume key.
- the volume keys are used to adjust the sound level of the earpiece or speaker.
- the mobile phone includes 10 sound levels. The higher the level, the louder the playback volume; the user can press the volume increase key to determine a higher sound level, making the playback volume louder.
- FIG. 5 illustrates a hardware form of glasses.
- the two temples of the glasses can be respectively provided with two microphones and earpieces.
- the second microphone is located close to the earpiece, the second microphone is used to obtain downlink signals, and the first microphone is used to obtain noise signals.
- the signal collected by the second microphone can be used as the downlink signal
- the signal collected by the first microphone can be used as the noise signal.
- the downlink signal can be determined from the signal collected by the second microphone, for example, the noise signal is identified and removed from the signal collected by the second microphone to obtain the downlink signal; for another example, the second The signal collected by the microphone is compared with the signal collected by the bottom microphone, and the difference between the signal collected by the second microphone and the signal collected by the bottom microphone is used as the downlink signal.
- FIG. 6 is a software structure block diagram of an electronic device 100 disclosed in the embodiment of the present application.
- the layered architecture divides the software into several layers, and each layer has clear roles and division of labor.
- the layers communicate through software interfaces.
- the system is divided into four layers, from top to bottom: application layer, application framework layer, runtime and system library, and kernel layer.
- the application layer can include a series of application packages.
- the application layer also includes a call anti-leakage module.
- the application package can include camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message and other applications (also called an application).
- the call anti-leakage module is used to adjust the playback volume of the downlink signal during the call.
- the call sound leakage prevention module can specifically include a hearing level test model, an audio calculation module and a self-learning module as shown in Figure 7. The specific contents of the hearing level test model, audio calculation module and self-learning module can be found below. The relevant content will not be described here.
- the application framework layer provides an application programming interface (API) and programming framework for applications in the application layer.
- API application programming interface
- the application framework layer includes some predefined functions.
- the application framework layer can include window manager, content provider, view system, phone manager, resource manager, notification manager, etc.
- a window manager is used to manage window programs.
- the window manager can obtain the display size, determine whether there is a status bar, lock the screen, capture the screen, etc.
- Content providers are used to store and retrieve data and make this data accessible to applications.
- Said data can include videos, images, audio, calls made and received, browsing history and bookmarks, phone books, etc.
- the view system includes visual controls, such as controls that display text, controls that display pictures, etc.
- a view system can be used to build applications.
- the display interface can be composed of one or more views.
- a display interface including a text message notification icon may include a view for displaying text and a view for displaying pictures.
- the phone manager is used to provide communication functions of the electronic device 100 .
- call status management including connected, hung up, etc.
- the resource manager provides various resources to applications, such as localized strings, icons, pictures, layout files, video files, etc.
- the notification manager allows applications to display notification information in the status bar, which can be used to convey notification-type messages and can automatically disappear after a short stay without user interaction.
- the notification manager is used to notify download completion, message reminders, etc.
- the notification manager can also be notifications that appear in the status bar at the top of the system in the form of charts or scroll bar text, such as notifications for applications running in the background, or notifications that appear on the screen in the form of a conversation interface. For example, text information is prompted in the status bar, a beep sounds, the electronic device vibrates, the indicator light flashes, etc.
- Runtime includes core libraries and virtual machines. Runtime is responsible for the scheduling and management of the system.
- the core library contains two parts: one part is the functional functions that the programming language (for example, Java language) needs to call, and the other part is the core library of the system.
- one part is the functional functions that the programming language (for example, Java language) needs to call
- the other part is the core library of the system.
- the application layer and application framework layer run in virtual machines.
- the virtual machine executes the programming files (for example, jave files) of the application layer and application framework layer as binary files.
- the virtual machine is used to perform object life cycle management, stack management, thread management, security and exception management, and garbage collection and other functions.
- System libraries can include multiple functional modules. For example: surface manager, media libraries, three-dimensional graphics processing library (for example: OpenGL ES), two-dimensional graphics engine (for example: SGL), etc.
- the surface manager is used to manage the display subsystem and provides the fusion of two-dimensional (2-Dimensional, 2D) and three-dimensional (3-Dimensional, 3D) layers for multiple applications.
- the media library supports playback and recording of a variety of commonly used audio and video formats, as well as static image files, etc.
- the media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
- the 3D graphics processing library is used to implement 3D graphics drawing, image rendering, composition, and layer processing.
- 2D Graphics Engine is a drawing engine for 2D drawing.
- the kernel layer is the layer between hardware and software.
- the kernel layer at least includes display driver, camera driver, audio driver, sensor driver, and virtual card driver.
- the following exemplifies the workflow of the software and hardware of the electronic device 100 in conjunction with capturing the photographing scene.
- the corresponding hardware interrupt is sent to the kernel layer.
- the kernel layer processes touch operations into raw input events (including touch coordinates, timestamps of touch operations, and other information). Raw input events are stored in the kernel layer.
- the application framework layer obtains the original input event from the kernel layer and identifies the control corresponding to the input event. Taking the touch operation as a touch click operation and the control corresponding to the click operation as a camera application icon control as an example, the camera application calls the interface of the application framework layer to start the camera application, and then starts the camera driver by calling the kernel layer. Camera 193 captures still images or video.
- the electronic device may include a signal acquisition module, a hearing level testing module, a volume calculation module, a self-learning module and a volume adjustment module.
- the signal acquisition module may include a noise signal acquirer and a downlink signal acquirer.
- the noise signal acquirer and the downlink signal acquirer may be microphones of the electronic device 100; for another example, the downlink signal acquirer may be as shown in Figure 4
- the top microphone and the noise signal acquirer of the mobile phone can be the bottom microphone of the mobile phone shown in Figure 4; for another example, the downlink signal acquirer can be the first microphone of the glasses shown in Figure 5, and the noise signal acquirer can be the first microphone of the glasses shown in Figure 5.
- the volume calculation module can determine the feature vector based on the difference between the noise signal acquired by the noise signal acquirer and the downlink signal acquired by the downlink signal acquirer, where the feature vector can include the sub-band signal of the above-mentioned noise signal and the sub-band signal of the above-mentioned downlink signal.
- the energy difference of the downlink signal If the decibel value of any frequency band signal in the sub-band signal in the downlink signal is greater than the masking threshold in the masking effect, the downlink signal can be heard by the user; input the feature vector into the target model to get the output sound Level, the sound level of the electronic device includes the output sound level.
- the target model may be determined based on the application corresponding to the audio playback, the playback method of the audio playback, and the user. For details, please refer to the relevant content below.
- the volume adjustment module can adjust the playback volume of the downstream signal based on the output sound level of the electronic device.
- the sound level is used to adjust the audio playback volume; electronic devices can play audio according to different sound levels. When the electronic device plays audio according to different sound levels, the audio playback volume is different. For details, see the relevant description in Figure 16.
- the target model can be a preset model determined by the hearing level testing module from several preset models based on the user's hearing level; the target model can also play the preset model corresponding to the application corresponding to the audio.
- the hearing level test module is used to test the user's hearing level, and the preset model corresponding to the user's hearing level is the target model.
- the self-learning module is used to train the target model based on the user's usage data to update the target model so that the output results of the target model are more in line with the user's usage habits.
- the listening level test module can divide the user's listening level into K listening levels, which are listening level 1, listening level 2 and listening level K respectively.
- K listening levels which are listening level 1, listening level 2 and listening level K respectively.
- Each hearing level corresponds to a preset model.
- the preset model corresponding to hearing level 1 is preset model 1
- the preset model corresponding to hearing level 2 is preset model 2
- the preset model corresponding to hearing level K is is the preset model K.
- preset model 1 can output a model that meets the listening needs of age group 1; if hearing level 2 is the hearing level of age group 2, then preset model 2 can output Model to meet the hearing needs of age group 2.
- each preset model can be trained based on the user training set corresponding to the hearing level, the preset module can meet the listening needs of its corresponding user.
- the user training set includes multiple feature vectors and the target sound level corresponding to each feature vector.
- Each feature vector is obtained based on the difference between the uplink signal and the noise signal.
- the target sound level may be the user's comfortable sound level or the lowest sound level that the user can hear clearly. For example, if the electronic device plays audio according to the first sound level, and the user thinks the audio playback volume is comfortable, then the first sound level is the user's comfortable sound level; the electronic device plays audio according to the first sound level, and the user thinks the audio playback volume is comfortable. is the lowest volume that the user can hear clearly, then the first sound level is the lowest sound level that the user can hear clearly.
- the target sound level is a comfortable sound level
- the output sound level of the preset model trained based on the target sound level is the user's comfortable sound level. If the target sound level is the lowest sound level that the user can hear clearly, then the output sound level of the preset model trained based on the target sound level is the lowest sound level that the user can hear clearly or a lower sound level.
- the following takes the comfort sound level as an example to introduce the process of obtaining the user training set.
- the comfortable sound levels of several people in different scenarios and the downstream signals and noise signals corresponding to this volume can be collected.
- the scene dimensions described here include the size and type of environmental noise, playing different sound sources, etc.
- the feature vector corresponding to each comfort sound level is calculated.
- a comfort sound level and the feature vector corresponding to the comfort sound level can be used as a piece of training data for the user training set.
- the comfort sound level is is the above target sound level, thereby obtaining the user training set. It should be noted that in order to avoid short-term fluctuations in volume, the comfortable sound levels of the two frames of speech before and after can also be smoothed.
- user training sets corresponding to different hearing levels can be obtained based on user data of different hearing levels. For example, if the hearing level of age group 1 is hearing level 1, then the user training set corresponding to hearing level 1 is obtained based on the comfortable sound level of age group 1 and the downstream signal and noise signal corresponding to this volume; another example is that of age group 2 The hearing level is hearing level 2, then the user training set corresponding to hearing level 2 is obtained based on the comfortable sound level of age group 2 and the downlink signal and noise signal corresponding to this volume.
- the following takes preset model 1 as an example to introduce the training process of the preset model.
- the electronic device can first obtain a user training set with a hearing level of 1.
- the user training set can include multiple pieces of training data.
- the multiple pieces of training data include the j-th piece of training data, and the j-th piece of training data includes the j-th frame.
- the electronic device can train the initial preset model based on each piece of training data, and end the training when the loss meets the preset conditions to obtain the preset model 1.
- the jth training data includes the input data of the jth frame downlink signal and the target sound level a, where the input data of the jth frame downlink signal may include the sound level of the earpiece several moments before the jth frame downlink signal and the jth frame downlink signal
- the feature vector of the j-th frame downlink signal is obtained based on the difference between the j-th frame downlink signal and the j-th frame noise signal.
- the electronic device inputs the j-th training data into the initial preset model to obtain the output sound level b; the electronic device compares the target sound level a and the output sound level b to obtain the loss of the target sound level a and the output sound level b; Loss feedback value initial preset model.
- the input data of the j-th frame downlink signal may not include the sound level of the earpiece at several moments before the j-th frame downlink signal.
- the input data includes the sound level of the earpiece at several previous moments in order to smooth the volume change and avoid sudden volume increase or decrease too much.
- the above preset model is only an average modeling of the user's volume usage habits of big data; the volume adjustment method provided by the embodiment of the present application takes into account the large differences in hearing levels and listening habits between actual individual users.
- the electronic device can record the feature vector each time the user manually adjusts the volume, the sound level after the adjustment, and the sound levels at several moments before the adjustment, where the feature vector is the sound level when the user manually adjusts the volume.
- the difference between the collected downlink signal and the noise signal; the electronic device can use the feature vector, the adjusted sound level and the sound level at several moments before adjustment as a piece of training data and store it in the self-learning training set, where the feature vector and the sound level before adjustment
- the sound levels at several moments are the input data, and the adjusted sound levels are the target sound levels.
- the electronic device can train the target model based on the self-learning training set at an appropriate time (for example, at night). For example, when the user clicks the consent control 212 in the user interface 21 shown in FIG. 12A, the electronic device detects the user's volume adjustment operation during each call, and when the user performs the volume adjustment operation, the electronic device The feature vectors obtained from the downlink signal and noise signal at this time, the sound level after user adjustment, and the sound level at several moments before adjustment are saved as a piece of training data.
- the electronic device can also send the target model and the self-learning training set to the server, and the server uses the self-learning training set to train the target model; the server then sends the trained target model to the electronic device.
- different applications on the electronic device may correspond to different preset models.
- the preset models corresponding to application 1 are model 1, model 2 and model 3, and the preset models corresponding to application 2 are model 4, model 3. 5 and model 6; furthermore, the electronic device can determine the target model corresponding to the application from the preset model corresponding to the application based on the user's hearing level; the audio played for the application based on the target model Make volume adjustments.
- model 1, model 2 and model 3 are trained based on user data when users with different hearing levels use application 1
- model 4 model 5 and model 6 are based on user data when users with different hearing levels use application 2. Obtained by training, the training process can be found in the relevant description above and will not be repeated here.
- Application 1 can be a call application, which has a call function. Users can use the software to conduct video communication or voice communication with other users, such as the call application or Changlian that comes with the mobile phone system;
- Application 2 can be a music application. application, which has the function of playing audio, such as the audio software that comes with the mobile phone system or Huawei Music, Huawei Video, etc. It is understandable that under normal circumstances, users have higher confidentiality requirements for call applications and will adjust the volume of the downlink signal of the call application to be smaller. Therefore, under the same environment, the sound output by the target model of the call application will The level is lower than the sound level output by music applications.
- the hearing test results in the electronic device can be used for multiple applications, that is, one hearing level can correspond to preset models for different applications. For example, when the electronic device determines that user A's hearing level is hearing level 1, it can determine that the target model corresponding to application 1 is model 1, and the target model corresponding to application 2 is model 3.
- the electronic device can be configured with different hearing level tests for different applications.
- the listening test corresponding to application 1 is listening test 1
- the listening test corresponding to application 2 is listening test 2.
- the test result of user A in listening test 1 is listening level 3, and the test result in listening test 2 is listening level 2.
- the target model corresponding to hearing level 3 is model 3
- the target model corresponding to hearing level 2 is model 5
- the target model corresponding to application 1 is model 3
- the target model corresponding to application 2 is model 5.
- the volume adjustment method provided by the embodiment of the present application can be applied to a variety of scenarios, such as a call scenario, where the user answers an incoming call through a mobile phone or glasses; or a music listening scenario, where the user wears glasses to listen to music, etc. There are no limitations here.
- the following takes a scenario in which the electronic device is a mobile phone and the user answers an incoming call through the mobile phone as an example to introduce the volume adjustment method provided by the embodiment of the present application in detail.
- Figure 11 is an overall flow diagram of a volume adjustment method provided by this application.
- the method may include the following steps:
- Step S101 The electronic device responds to a user operation to start a hearing test and plays preset audio at different volumes.
- the volume can be the playback volume or sound level; each volume corresponds to a preset hearing level.
- electronic devices play preset audio at playback volumes of 5 dB, 15 dB, 25 dB and 35 dB respectively, and each playback volume corresponds to a hearing level.
- an electronic device plays preset audio according to different preset sound levels.
- the electronic device includes 10 sound levels, and the preset sound level is 5 sound levels among the 10 sound levels.
- the 5 audios are respectively related to 5 hearing levels. Corresponding. Among them, the higher the sound level, the louder the audio played by the earpiece.
- the system settings of the electronic device may include a call sound leakage prevention function option; therefore, the user can turn on the call sound leakage prevention function through user operations for the call sound leakage prevention function option.
- the electronic device detects When the user operates, the preset audio is played at different volumes in response to the user operation.
- the default audio can be a piece of news or music, etc., which is not limited here.
- the electronic device can play preset audio of the same content according to the volume from low to high. It should be noted that if the electronic device plays the preset audio of the same content at a volume from high to low, and the user has already heard the audio content clearly when playing the preset audio at a high volume, this method will judge whether the user can hear it at a low volume. The accuracy of the clear audio content is affected.
- Figures 12A to 12F are some exemplary interface diagrams of a user performing a hearing test on an electronic device according to embodiments of the present application.
- FIG. 12A exemplarily shows a user interface 21 in a system setting of an electronic device.
- the user interface 21 includes an anti-sound leakage switch 211 and a consent control 212 .
- the anti-leakage switch 211 is used to turn on the anti-leakage function during calls.
- a prompt message "In order to match your recent hearing curve, turning on this function requires a hearing level test first" is displayed below the anti-leakage switch 211; the consent control 212 is used for Obtain the user's volume usage information, which is used to train the target model.
- the electronic device when the electronic device detects a user operation on the anti-sound leakage switch 211 , in response to the user operation, the electronic device may display the user interface 22 as shown in FIG. 12B .
- the user interface 22 displays a prompt message "The preset voice is about to be played at volume 1. Please maintain your normal calling posture to listen to the voice played by the receiver, and determine whether to use this volume after the voice playback is completed.” Among them, volume 1 can be the lowest volume. It is understandable that after seeing the prompt information of the user interface 22, the user can put the mobile phone to his ear and wait for the voice playback.
- the electronic device can start playing the preset audio at the volume of 1; after the playback is completed, the electronic device can display the user interface 23 as shown in Figure 12C, and the user interface 23 can include satisfaction controls. 231 and dissatisfied controls 232.
- the satisfaction control 231 is used to indicate that the volume 1 is a volume that the user is satisfied with;
- the dissatisfaction control 232 is used to indicate that the volume 1 is a volume that the user is not satisfied with. It is understandable that the user can check the mobile phone interface when the voice playback ends, and when the user sees the mobile phone display as shown in Figure 12C At interface 23, select the corresponding option based on the user experience during voice playback.
- volume 2 can be one level higher than volume 1.
- volume 1 For example, the volume of an electronic device is divided into five levels. From low to high, the volume is volume 1, volume 2, volume 3, volume 4, and volume 5.
- the electronic device can start playing the preset audio at the volume of 2; after the playback is completed, the electronic device can display the user interface 25 as shown in Figure 12E, and the user interface 25 can include satisfaction controls. 251 and dissatisfied controls 252.
- the satisfaction control 251 is used to indicate that the volume 2 is a volume that the user is satisfied with;
- the dissatisfaction control 252 is used to indicate that the volume 2 is a volume that the user is not satisfied with.
- the user interface shown in Figure 12F is displayed. 26.
- the user interface 26 displays a prompt message "Thank you for your cooperation, the hearing test has been completed! The volume of your earpiece during the call will be intelligently adjusted based on volume 2 to ensure the security of your information."
- the electronic device when the electronic device detects a user operation for an unsatisfactory control, it can play the preset audio according to the next volume. For example, when the electronic device detects a user operation for the unsatisfactory control 232 of Volume 1, Play the preset voice at volume 2 to continue the hearing test.
- the electronic device detects a user operation on a satisfactory control, it determines that the volume indicated by the satisfactory control is a volume that the user is satisfied with, and the electronic device can end the hearing test. For example, when the electronic device detects a user operation on the satisfactory control 251 for Volume 2 , it is determined that the volume 2 is the volume that the user is satisfied with, the user interface 26 shown in Figure 12F is displayed, and the hearing test is no longer performed.
- all volumes include volume 1 to volume 5, with volume 1 being the lowest volume and volume 5 being the highest volume.
- the electronic device can play presets according to all volumes and display a user interface 23 similar to that shown in Figure 12C for the user to adjust each volume. Select a volume, and end the listening test after detecting that the user has completed selecting all volumes.
- Step S102 The electronic device determines the user's hearing level based on user operations for different volumes.
- the electronic device determines the hearing level corresponding to the volume as the user's hearing level. For example, as shown in FIGS. 12A to 12F , when the electronic device detects the user's operation on the satisfaction control 251 on the user interface 25 , it determines the volume 2 as the volume that the user is satisfied with; then the electronic device can determine the hearing volume corresponding to the volume 2 . Level 2 is determined as the user's hearing level.
- volume 1 to volume 5 are playback volumes from low to high, a user with better hearing may choose volume 1, while a user with poor hearing may choose volume 5.
- the hearing level corresponding to volume 1 is the hearing level of the user with the worst hearing; the hearing level corresponding to the volume 5 can be the hearing level of the user with the best hearing.
- Step S103 The electronic device determines a target model from multiple preset models based on the user's hearing level.
- each listening level corresponds to a preset model.
- the preset model corresponding to hearing level 2 is preset model 2
- the electronic device can determine preset model 2 as the target model.
- the electronic device can be preset with 5 preset models, and the 5 preset models correspond to the hearing levels corresponding to the 5 volumes respectively. Furthermore, when the user selects one of the volumes, the target model is selected from the five preset models.
- the embodiments of this application do not limit the number of volumes and the number of preset models for the electronic device hearing test.
- Step S104 When the electronic device detects the user's operation to answer the incoming call, it turns on the top microphone and the bottom microphone of the mobile phone to obtain the noise signal and the downlink signal.
- top microphone i.e. top microphone
- bottom microphone the positions of the top microphone (i.e. top microphone) and the bottom microphone (bottom microphone) can be shown in Figure 4.
- the call interface when the electronic device receives an incoming call, can be displayed; furthermore, when the electronic device detects a user operation to answer the incoming call, the electronic device can respond to the user operation by playing a downlink signal through the earpiece and turning on the top of the mobile phone.
- Microphone and bottom microphone; electronic equipment can use the signal collected by the top microphone as a downlink signal and the signal collected by the bottom microphone as a noise signal.
- the signal collected by the top microphone is used as the collected downlink signal.
- Step S105 The electronic device obtains the feature vector of at least one frame of downlink signal based on the noise signal and the downlink signal.
- the electronic device can calculate the energy difference between the downlink signal and the noise signal corresponding to the downlink signal of the frame, and use the energy difference as the feature vector of the downlink signal of the frame.
- the downlink signal of the frame and the noise signal of the frame are the signals collected by the top microphone and the bottom microphone at the same time respectively; the energy difference can be the energy difference of the downlink signal of the frame and the noise signal of the frame in multiple frequency bands.
- the electronic device can perform frame segmentation and windowing processing on the noise signal and downlink signal collected by the microphone, respectively, to obtain M frames of noise signals and M frames of downlink signals, that is, M sets of audio signals, where M is a positive integer, and each set of audio signals includes downlink signals and noise signals collected at the same time; furthermore, the electronic equipment performs molecular band processing on the downlink signals and noise signals in each set of audio signals to obtain N signal sub-bands of the downlink signals.
- N N signal sub-bands with and noise signals, where N is an integer greater than 1; furthermore, the electronic device calculates the energy difference between the noise signal and the downlink signal corresponding to each signal sub-band signal in each frame of audio signal, Obtain the N-dimensional feature vector corresponding to each set of audio signals.
- molecular band processing is to divide a frame signal into multiple signals with different frequency bands, and each sub-band signal has a different frequency band.
- the M group of audio signals includes the i-th group of audio signals.
- the following takes the i-th group of audio signals as an example to introduce the process of calculating the feature vector.
- Figure 13 is a schematic diagram of framing and windowing the signals collected by the bottom microphone provided by an embodiment of the present application. As shown in Figure 13, after the electronic equipment divides the signals collected by the bottom microphone into frames and adds windows, M frames of noise signals can be obtained, where the M frames of noise signals include the i-th frame of noise signals, i ⁇ M.
- Figure 14 is a schematic diagram of framing and windowing the signals collected by the top microphone according to an embodiment of the present application. As shown in Figure 14, after the electronic device divides the signals collected by the top microphone into frames and adds windows, M frames of downlink signals can be obtained, where the M frames of downlink signals include the i-th frame of downlink signals.
- the electronic device can perform molecular band processing on the noise signal of the i-th frame and the downlink signal of the i-th frame respectively, to obtain the N sub-band signals corresponding to the noise signal of the i-th frame and the downlink signal of the i-th frame. N subband signals.
- the number of subbands of the noise signal of the i-th frame and the downlink signal of the i-th frame are equal, and the frequency band of a sub-band signal of the noise signal of the i-th frame and the sub-band signal of the downlink signal of the i-th frame are the same, for example, the noise signal of the i-th frame
- the first sub-band signal of the i-th frame downlink signal has the same frequency band as the first sub-band signal of the i-th frame downlink signal
- the second sub-band signal of the i-th frame noise signal has the same frequency band as the i-th frame downlink signal
- the Nth sub-band signal of the i-th frame noise signal is the same as the N-th sub-band signal of the i-th frame downlink signal.
- the electronic device can respectively subtract the energy of the sub-band signal in the same frequency band of the i-th frame noise signal and the i-th frame downlink signal. For example, subtract the i-th frame noise signal from the energy of the first sub-band signal of the i-th frame noise signal.
- the energy of the first sub-band signal of the frame downlink signal is d 1
- the energy of the second sub-band signal of the i-th frame noise signal is subtracted from the energy of the second sub-band signal of the i-th frame downlink signal to get d 2
- the energy of the N-th sub-band signal of the i-th frame noise signal is subtracted from the energy of the N-th sub-band signal of the i-th frame downlink signal to obtain d N
- the N-dimensional feature vector of the i-th frame downlink signal can be obtained as [ d 1 ,d 2 ,...,d N ] T .
- the superscript T represents the transpose of a vector or matrix.
- Step S106 The electronic device inputs the feature vector of at least one frame of downlink signal into the target model to obtain the output sound level.
- the sound level of the electronic device includes the output sound level; the sound level is used to adjust the playback volume of the earpiece.
- the sound level may be the sound pressure level of the downlink signal, for example, in decibels. Among them, the higher the sound level, the louder the playback volume of the earpiece.
- the volume that can be adjusted by the volume keys on a mobile phone includes 10 levels, that is, the user can press the volume up button 10 times to adjust the phone from the lowest volume to the highest volume; the 10 levels can be respectively sound level 1, sound level 2, and sound level 10. Then the output sound level is a sound level from sound level 1 to sound level 10. Among them, sound level 1 is the lowest and has the smallest playback volume; sound level 10 is the highest and has the largest playback volume.
- the electronic device can input the feature vector of the frame of downlink signal and the sound level of the earpiece at several moments before the frame of downlink signal.
- Target model to get the output sound level. For example, if the feature vector of at least one frame of downlink signal is [d 1 , d 2 ,..., d N ] T , the feature vector of the i-th frame downlink signal is [d 1 , d 2 ,..., d N ] T , then the electronic device can input the feature vector of the i-th frame downlink signal into the target model. , get the output sound level.
- the feature vector of at least one frame of downlink signal includes the feature vector of multiple frames of downlink signals
- the electronic device can first select the feature vector of one frame of downlink signal from the feature vectors of multiple frames of downlink signals, and then add the feature vector of the frame to the downlink signal.
- the feature vector of the downlink signal and the sound level of the earpiece at the first few moments of the downlink signal of the frame are input into the target model to obtain the output sound level.
- the output sound level is the lowest sound level or comfortable sound level predicted by the electronic device that the user can hear clearly in the current environment.
- the user cannot hear the voice played at volume 1 clearly but can clearly hear the voice played at volume 2.
- the user selects volume 2.
- the electronic device determines preset model 2 as the target model. Then the output sound level of preset model 2 is the lowest sound level that the user can hear clearly.
- the downlink signals and noise signals collected by electronic devices in different environments are different, and the feature vectors input to the target model are different. Therefore, the target model outputs different volumes.
- the output sound level is the lowest sound that the user can hear clearly in the current environment. level, this method can ensure the security of the user's information during the call and prevent outsiders from hearing the call content.
- the output sound level of preset model 2 is the user's comfortable sound level.
- the downlink signals and noise signals collected by electronic devices in different environments are different, and the feature vectors input to the target model are different. Therefore, the target model outputs different volumes.
- the output sound level is a comfortable sound that the user can hear clearly in the current environment. level, which can improve the user experience during calls.
- Step S107 The electronic device plays the downlink signal based on the output sound level.
- the electronic device compares the output sound level with the current sound level of the electronic device playing the downlink signal; when the difference between the output sound level and the current sound level is greater than the preset threshold, Adjust the current sound level to the output sound level, that is, adjust the current sound level to the output sound level.
- the sound level of electronic equipment includes 10 levels, ranging from sound level 1 to sound level 10 from low to high, and the decibel difference of each sound level is 5 dB.
- the electronic device adjusts the current sound level from sound level 1 to sound level 3. That is to say, the electronic device can Increase the decibel value of the downlink signal by 10 decibels and play it through the earpiece.
- the volumes of downlink signals in each frequency band of the same sound level are different.
- Figure 16 illustrates the downlink signals of some frequency bands.
- Figure 16 exemplarily shows the downlink signals in three frequency bands.
- the three triangles located on the ordinate are used to identify the sound pressure levels of the downlink signals in the three frequency bands when the sound level is 1; the three circles located on the ordinate are used to identify the sound pressure levels of the downlink signals in the three frequency bands when the sound level is 1.
- the sound pressure level of the downlink signals in the three frequency bands when the sound level is 1.
- Figure 16 exemplarily shows that the downlink signals of three frequency bands have different volumes at the same sound level.
- the level of sound level 3 is higher than the level of sound level 1; assuming that the difference between each sound level is 5 decibels, then the current sound level
- the playback volume of the downlink signal in each frequency band increases by 10 decibels. It is understandable that if the electronic device is a mobile phone and the volume button of the mobile phone can adjust the volume to 10 steps, the mobile phone will automatically adjust the current sound level from sound level 1 to sound level 3, and the user will manually press the volume twice to increase the volume. keys have the same effect.
- the sound level of the electronic device can also be 10 or 15, which is not limited here.
- the electronic device may adjust the current sound level to the output sound level only when the differences between multiple output sound levels within the preset time period and the current sound level are greater than a preset threshold. Understandably, this method can avoid the negative impact of frequent volume adjustments caused by data fluctuations.
- the playback volume can be different when playing the same downlink signal at the same sound level.
- the electronic device plays the downlink signal of frequency band A through sound level 1.
- the playback volume of the downlink signal is volume 1;
- the playback volume of the downstream signal is volume 2; the decibel value of volume 1 is not equal to the decibel value of volume 2.
- the preset model corresponding to the satisfactory volume is the target model. Then, the target model is used to output the lowest sound level that user A can hear clearly in different environments.
- the electronic device can adjust the earpiece playback volume to the lowest volume that user A can hear clearly in different environments.
- the electronic device When user A makes calls in different environments, due to the different sizes of noise signals in the environment, the electronic device obtains different feature vectors in different scenarios.
- the electronic device inputs different feature vectors into the target model to obtain the output sound level. different.
- FIG. 17 is a schematic diagram of the output sound level changing with the environment according to an embodiment of the present application.
- FIG. 17 takes the downlink signal of frequency band A as an example. After the electronic device adjusts the sound level of the earpiece to sound level 1, the playback volume of the downlink signal of frequency band A is playback volume 1. It can be seen that the decibel value of the playback volume 1 is greater than the masking threshold caused by the first noise signal at this time.
- the noise signal with the highest decibel value at this time is the second noise signal
- the second noise signal is The volume
- the electronic device can The standard model obtains sound level 2, which is specifically used to adjust the playback volume of the downlink signals in each frequency band.
- Figure 17 takes the downlink signal of frequency band A as an example. After the electronic device adjusts the sound level of the earpiece to sound level 1, the playback volume of the downlink signal of frequency band A is playback volume 2. It can be seen that the decibel value of the playback volume 2 is greater than the masking threshold caused by the second noise signal at this time.
- the noise volume 2 of the second noise signal is a decibel smaller than the noise volume 1 of the first noise signal, and the playback volume 2 is b decibels smaller than the playback volume 1; the playback volume 1 and playback volume 2 are both higher than the masking threshold in the current environment. It can be seen that when user A is in a quieter environment, the playback volume obtained based on the target model is lower than the playback volume obtained based on the target model in a noisier environment. In other words, the electronic device can play in a quieter environment based on the target model.
- volume 3 is greater than volume 1.
- Figure 18 exemplarily shows the downlink signal in frequency band A.
- the decibels of the playback volume obtained by user A's electronic device based on preset model 3 The value is greater than the decibel value of the playback volume obtained by user B based on preset model 1; the playback volume of user A's electronic device and the playback volume of user B's electronic device are both greater than the masking threshold.
- the electronic device can obtain a sound level that meets the user's hearing needs based on the preset model selected by the user, and adjust the playback volume of the downlink signal to the sound level. Corresponding playback volume to meet the hearing needs of different groups of people.
- the electronic device can slowly adjust the volume through the target model when the user is in two environments with a large noise gap.
- the target model can be found in the description above and will not be described again here.
- the electronic device can slowly increase the sound level of the downlink signal played by the earpiece until the sound level of the earpiece is raised to the corresponding sound level in the noisy environment; for another example, When the user moves from a noisy environment to a quiet environment during a call, the electronic device can slowly reduce the sound level of the downlink signal played by the earpiece until the sound level of the earpiece is reduced to the corresponding sound level in a quiet environment.
- FIG 19 illustrates an example of the volume adjustment process during a user's call.
- the user interface 31, user interface 32 and user interface 33 shown in Figure 19 are respectively interface diagrams displayed on the user's electronic device during the conversation between the user and Bob.
- user interface 31 is the user interface at 00:07 of the call
- user interface 32 is the user interface at 00:09 of the call
- user interface 33 is the user interface at 00:11 of the call.
- the electronic device can obtain the sound level 2 through the target model based on the downlink signals and noise signals collected in this environment. It can be seen from the user interface 31 that the electronic device plays the downlink signal through sound level 2 when the user is talking at 00:07.
- the electronic device obtains the sound level 4 through the target model based on the downlink signals and noise signals collected in the noisy environment.
- the electronic device can first adjust the playback sound level of the earpiece to sound level 3 at 00:09 of the call, as shown in the user interface 32; and then adjust the playback sound level of the earpiece to sound level 4 at 00:11 of the call. As shown in user interface 33. It is understandable that this method can avoid the user's auditory discomfort when the volume of multiple levels is suddenly adjusted.
- the user interface 31 to 33 is only an exemplary interface provided by the embodiment of the present application, and its purpose is only to clearly display the sound level of the current interface, and should not cause limitations to the embodiment of the present application;
- the electronic device may not display the volume adjustment portion of the user interface 31 to 33 in Figure 19 during the actual volume adjustment process. In other words, the electronic device may not display the sound level of the earpiece when automatically adjusting the volume.
- different applications on the electronic device can correspond to different target models; furthermore, the electronic device can adjust the volume of audio played by the application based on the target model corresponding to the application.
- the electronic device can adjust the volume of audio played by the application based on the target model corresponding to the application.
- the playback volume of the other party's voice during the call is obtained based on the first target model.
- the volume of the songs played by the music application is obtained based on the second target model.
- the playback volume of the other party's voice mentioned above is The volume and the above song volume can be different volumes.
- FIG. 20 exemplarily shows the sound level adjusted by the electronic device when the user uses different applications through the same electronic device.
- the following example takes a call application and a music application as examples. It is assumed that the first target application corresponding to the call application is the default model corresponding to the lowest volume that the user can hear clearly; the second target application corresponding to the music application is the most comfortable volume corresponding to the user. default model. It is understandable that the user may believe that the confidentiality of the call content is higher than that of the music, and therefore wants the volume during the call to be as low as possible.
- the sound level of the earpiece playback volume obtained by the electronic device through the first target application is sound level 3. That is, as shown in the user interface 41, the electronic device plays through sound level 3.
- the voice of Bob, the other party on the call; under the same environmental conditions as the user interface 41, the electronic device based on the collected downlink signals and noise signals, the sound level of the playing song volume obtained by the second target application is sound level 4, that is, as in the user interface 42
- the electronic device plays songs in the music application through sound level 4. That is to say, when users use electronic devices to play audio through different applications in the same environment, the volume of the audio they hear is different. For example, when making a call in the same environment, the volume of the other party's voice played by the earpiece is higher than that played by the other party in the same environment. The volume of the music heard through the headphones using the music app is low.
- user interface 41 to user interface 41 is only an exemplary interface provided by the embodiment of the present application. Its purpose is only to clearly display the sound level of the current interface and should not limit the embodiment of the present application. That is to say, the electronic device may not display the volume adjustment portion of the user interface 41 to 42 in Figure 20 during the actual volume adjustment process. That is, the electronic device may not display the sound level when automatically adjusting the volume.
- Electronic devices can adjust the target model based on user habits.
- the electronic device may record user data each time the user manually adjusts the volume during actual use.
- the data may include feature vectors, adjusted sound levels, and sound levels at several moments before adjustment. etc.; furthermore, the electronic device trains the target model based on the above user data as training data, so that the target model conforms to the user's habits.
- Figure 21 exemplarily shows part of the application interface for the user to talk to Bob through the electronic device under the first environmental condition on August 1, 2022.
- the user interface 51 exemplarily shows the sound level 4 of the electronic device after being adjusted by the target model under the first environmental condition; because the user feels that the volume is loud, as shown in the user interface 52 , the user passes Manually press the volume down (-) key in the volume key, and the electronic device responds to the user operation by adjusting the volume of the downlink signal played by the earpiece to sound level 3.
- the electronic device can determine a piece of training data based on the user data corresponding to the scene shown in Figure 21, such as the feature vector determined at that time, the adjusted sound level and the sound levels at several moments before adjustment, etc., and based on this piece of training data, the target model Conduct training.
- the electronic device trains the target model based on at least one piece of training data between August 1, 2022, and August 5, 2022, to obtain the trained target model.
- the user spoke to Bob through the electronic device under the first environmental condition.
- the electronic device could obtain a sound level of 3 based on the trained target model.
- the electronic device has adjusted the playback volume of the earpiece to sound level 3 when the user talks to Bob for 00:02 seconds. Therefore, the user does not need to manually adjust the volume. It is understandable that based on usage experience, users may find that the number of times they manually adjust the volume gradually decreases, and the volume played by electronic devices becomes more and more in line with their own usage habits.
- Embodiments of the present application also provide an electronic device.
- the electronic device includes one or more processors and one or more memories; wherein the one or more memories are coupled to the one or more processors, and the one or more memories are used to For storing computer program code, the computer program code includes computer instructions.
- the electronic device When one or more processors execute the computer instructions, the electronic device causes the electronic device to perform the method described in the above embodiments.
- Embodiments of the present application also provide a computer program product containing instructions, which when the computer program product is run on an electronic device, causes the electronic device to execute the method described in the above embodiment.
- Embodiments of the present application also provide a computer-readable storage medium, which includes instructions.
- the electronic device When the instructions are run on an electronic device, the electronic device causes the electronic device to execute the method described in the above embodiment.
- the computer program product includes one or more computer instructions.
- the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device.
- the computer instructions may be stored in or transmitted from one computer-readable storage medium to another, e.g., the computer instructions may be transferred from a website, computer, server, or data center Transmission to another website, computer, server or data center through wired (such as coaxial cable, optical fiber, digital subscriber line) or wireless (such as infrared, wireless, microwave, etc.) means.
- the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that contains one or more available media integrated.
- the available media may be magnetic media (eg, floppy disk, hard disk, magnetic tape), optical media (eg, DVD), or semiconductor media (eg, Solid State Disk), etc.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Animal Behavior & Ethology (AREA)
- Biophysics (AREA)
- Heart & Thoracic Surgery (AREA)
- Medical Informatics (AREA)
- Molecular Biology (AREA)
- Surgery (AREA)
- Pathology (AREA)
- Biomedical Technology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- General Physics & Mathematics (AREA)
- Psychiatry (AREA)
- Physiology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Mathematical Physics (AREA)
- Otolaryngology (AREA)
- Fuzzy Systems (AREA)
- Evolutionary Computation (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
Abstract
Description
Claims (22)
- 一种音量调节方法,其特征在于,应用于电子设备,所述电子设备包括发声装置和至少一个麦克风,所述方法包括:所述电子设备响应于开始听力测试的用户操作,分别通过多个音量播放预设音频;所述多个音量包括第一音量;所述电子设备基于针对所述第一音量的确定操作,确定用户的听力水平为所述第一音量对应的第一听力水平;所述电子设备在通过所述发声装置播放下行信号时,通过所述至少一个麦克风采集下行信号和环境噪音信号;所述电子设备在采集的下行信号和采集的环境噪音信号中的至少一个改变时,基于所述第一听力水平将所述发声装置的播放音量由第一声音等级调节为第二声音等级。
- 根据权利要求1所述的方法,其特征在于,所述第一音量越大,所述第二声音等级的等级越高。
- 根据权利要求1或2所述的方法,其特征在于,所述第一音量为所述用户在当前环境中的舒适音量或者能够听清的最低音量。
- 根据权利要求1-3任一项所述的方法,其特征在于,改变后的环境噪音信号的能量越大,所述第二声音等级的等级越高。
- 根据权利要求1-4任一项所述的方法,其特征在于,所述电子设备在下行信号和环境噪音信号中的至少一个改变时,将所述发声装置的播放音量由第一声音等级调节为第二声音等级,包括:所述电子设备在所述第一声音等级与所述第二声音等级相差第一预设等级数时,将所述发声装置的播放音量调整为所述第二声音等级。
- 根据权利要求1-5任一项所述的方法,其特征在于,所述电子设备在下行信号和环境噪音信号中的至少一个改变时,将所述发声装置的播放音量由第一声音等级调节为第二声音等级,包括:所述电子设备在所述第一声音等级与所述第二声音等级相差第二预设等级数时,将所述发声装置的播放音量调整为第三声音等级;所述第三声音等级高于所述第一声音等级且低于所述第二声音等级。
- 根据权利要求1-6任一项所述的方法,其特征在于,所述方法还包括:所述电子设备响应于针对第一应用的用户操作,在第一环境条件下通过所述发声装置按照第二音量播放所述第一应用对应的下行信号;所述电子设备响应于针对第二应用的用户操作,在所述第一环境条件下通过所述发声装置按照第三音量播放所述第二应用对应的下行信号;所述第二音量不等于所述第三音量。
- 根据权利要求7所述的方法,其特征在于,所述第一应用为通话类应用,所述第二应用为音乐类应用,所述第二音量大于所述第三音量。
- 根据权利要求1-8任一项所述的方法,其特征在于,所述方法还包括:所述电子设备将所述第一听力水平对应的预置模型确定为目标模型;所述目标模型为通过符合所述第一听力水平的用户的用户数据训练得到的;所述用户数据包括所述第一听力水平的用户在第二环境条件下的目标声音等级,以及所述第二环境条件下采集的下行信号和环境噪音信号的能量差值;所述电子设备将改变后的下行信号和所述改变后的环境噪音信号的能量差值输入所述目标模型,得到所述第二声音等级。
- 根据权利要求9所述的方法,所述改变后的下行信号包括N个第一子带信号,所述改变后的环境噪音信号包括所述N个第二子带信号,所述N为正整数;所述N个第一子带信号与所述N个第二子带信号一一对应;相互对应的第一子带信号和第二子带信号组成一组信号;所述改变后的下行信号和所述改变后的环境噪音信号的能量差值包括所述N组信号的能量差值。
- 根据权利要求9或10所述的方法,其特征在于,所述用户数据还包括所述目标声音等级前若干时刻的发声装置播放音量;所述电子设备将所述改变后的下行信号和所述改变后的环境噪音信号的能量差值输入所述目标模型,得到所述第二声音等级,包括:所述电子设备将所述改变后的下行信号和所述改变后的环境噪音信号的能量差值和所述前若干时刻的发声装置播放音量输入所述目标模型,得到所述第二声音等级。
- 根据权利要求9-11所述的方法,其特征在于,在所述将所述发声装置的播放音量调整为所述第二声音等级之后,所述方法还包括:所述电子设备响应于用户调节音量键的操作,将所述发声装置的播放音量调整为目标声音等级;所述电子设备存储所述改变后的下行信号和所述改变后的环境噪音信号的能量差值和所述目标声音等级;所述电子设备以所述改变后的下行信号和所述改变后的环境噪音信号的能量差值和所述目标声音等级为样本数据训练所述目标模型。
- 根据权利要求1-12任一项所述的方法,其特征在于,所述至少一个麦克风包括第一麦克风和第二麦克风,所述第一麦克风与所述发声装置的距离小于所述第二麦克风与所述发声装置的距离,所述第一麦克风用于采集下行信号,所述第二麦克风用于采集环境噪音信号。
- 一种音量调节装置,其特征在于,所述装置包括处理模块、发声装置和至少一个麦克风:所述发声装置,用于分别通过多个音量播放预设音频;所述多个音量包括第一音量;所述处理模块,用于基于针对所述第一音量的确定操作,确定用户的听力水平为所述第一音量对应的第一听力水平;所述至少一个麦克风用于,在所述发声装置播放下行信号时采集下行信号和环境噪音信号;所述发声装置,用于在采集的下行信号和采集的环境噪音信号中的至少一个改变时,基于所述第一听力水平将播放音量由第一声音等级调节为第二声音等级。
- 根据权利要求14所述的装置,其特征在于,所述第一音量越大,所述第二声音等级的等级越高。
- 根据权利要求15所述的装置,其特征在于,所述第一音量为所述用户在当前环境中能够听清的最低音量或者舒适音量。
- 根据权利要求14-16任一项所述的装置,其特征在于,改变后的环境噪音信号的能量越大,所述第二声音等级的等级越高。
- 根据权利要求14-17任一项所述的装置,其特征在于,所述发声装置,具体用于在所述第一声音等级与所述第二声音等级相差第一预设等级数时通过所述第二声音等级播放下行信号。
- 根据权利要求14-18任一项所述的装置,其特征在于,所述发声装置,具体用于在所述第一声音等级与所述第二声音等级相差第二预设等级数时,通过所述第三声音等级播放下行信号;所述第三声音等级高于所述第一声音等级且低于所述第二声音等级。
- 根据权利要求14-19任一项所述的装置,其特征在于,所述发声装置用于在第一环境条件下通过按照第二音量播放所述第一应用对应的下行信号;所述发声装置用于在所述第一环境条件下按照第三音量播放所述第二应用对应的下行信号;所述第二音量不等于所述第三音量。
- 一种电子设备,其特征在于,所述电子设备包括一个或多个处理器和一个或多个存储器;其中,所述一个或多个存储器与所述一个或多个处理器耦合,所述一个或多个存储器用于存储计算机程序代码,所述计算机程序代码包括计算机指令,当所述一个或多个处理器执行所述计算机指令时,使得所述电子设备执行如权利要求1-13中任一项所述的方法。
- 一种计算机可读存储介质,包括指令,其特征在于,当所述指令在电子设备上运行时,使得所述电子设备执行如权利要求1-13中任一项所述的方法。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP23859445.1A EP4530821A4 (en) | 2022-09-01 | 2023-08-31 | VOLUME ADJUSTMENT METHOD, ELECTRONIC DEVICE AND SYSTEM |
| US18/994,912 US20260019055A1 (en) | 2022-09-01 | 2023-08-31 | Volume Adjustment Method, Electronic Device, and System |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202211067066.3A CN117667001A (zh) | 2022-09-01 | 2022-09-01 | 一种音量调节方法、电子设备及系统 |
| CN202211067066.3 | 2022-09-01 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024046416A1 true WO2024046416A1 (zh) | 2024-03-07 |
Family
ID=90083331
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/116067 Ceased WO2024046416A1 (zh) | 2022-09-01 | 2023-08-31 | 一种音量调节方法、电子设备及系统 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20260019055A1 (zh) |
| EP (1) | EP4530821A4 (zh) |
| CN (1) | CN117667001A (zh) |
| WO (1) | WO2024046416A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118170341A (zh) * | 2024-03-08 | 2024-06-11 | 未来城市爱家智能科技(上海)有限公司 | 一种适老化空间用智能手环的控制系统 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118519603B (zh) * | 2024-05-16 | 2024-12-27 | 广州市志欣电子科技股份有限公司 | 基于按摩椅与移动终端的音量调控方法、装置及存储介质 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120027227A1 (en) * | 2010-07-27 | 2012-02-02 | Bitwave Pte Ltd | Personalized adjustment of an audio device |
| WO2017139218A1 (en) * | 2016-02-08 | 2017-08-17 | Nar Special Global, Llc. | Hearing augmentation systems and methods |
| CN109819375A (zh) * | 2019-01-11 | 2019-05-28 | 平安科技(深圳)有限公司 | 调节音量的方法与装置、存储介质、电子设备 |
| CN110347366A (zh) * | 2019-07-15 | 2019-10-18 | 百度在线网络技术(北京)有限公司 | 音量调节方法、终端设备、存储介质及电子设备 |
| CN112118485A (zh) * | 2020-09-22 | 2020-12-22 | 英华达(上海)科技有限公司 | 音量自适应调整方法、系统、设备及存储介质 |
| CN113890936A (zh) * | 2021-11-09 | 2022-01-04 | Oppo广东移动通信有限公司 | 音量调整方法、装置及存储介质 |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10838686B2 (en) * | 2017-07-13 | 2020-11-17 | Josh Kovacevic | Artificial intelligence to enhance a listening experience |
| CN110459212A (zh) * | 2019-06-05 | 2019-11-15 | 西安易朴通讯技术有限公司 | 音量控制方法及设备 |
| CN111031445B (zh) * | 2019-11-29 | 2021-06-29 | 深圳普罗声声学科技有限公司 | 音量补偿方法、装置、计算机设备和存储介质 |
| CN113827228B (zh) * | 2021-10-22 | 2024-04-16 | 武汉知童教育科技有限公司 | 一种音量控制方法与装置 |
-
2022
- 2022-09-01 CN CN202211067066.3A patent/CN117667001A/zh active Pending
-
2023
- 2023-08-31 EP EP23859445.1A patent/EP4530821A4/en active Pending
- 2023-08-31 US US18/994,912 patent/US20260019055A1/en active Pending
- 2023-08-31 WO PCT/CN2023/116067 patent/WO2024046416A1/zh not_active Ceased
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120027227A1 (en) * | 2010-07-27 | 2012-02-02 | Bitwave Pte Ltd | Personalized adjustment of an audio device |
| WO2017139218A1 (en) * | 2016-02-08 | 2017-08-17 | Nar Special Global, Llc. | Hearing augmentation systems and methods |
| CN109819375A (zh) * | 2019-01-11 | 2019-05-28 | 平安科技(深圳)有限公司 | 调节音量的方法与装置、存储介质、电子设备 |
| CN110347366A (zh) * | 2019-07-15 | 2019-10-18 | 百度在线网络技术(北京)有限公司 | 音量调节方法、终端设备、存储介质及电子设备 |
| CN112118485A (zh) * | 2020-09-22 | 2020-12-22 | 英华达(上海)科技有限公司 | 音量自适应调整方法、系统、设备及存储介质 |
| CN113890936A (zh) * | 2021-11-09 | 2022-01-04 | Oppo广东移动通信有限公司 | 音量调整方法、装置及存储介质 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4530821A4 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118170341A (zh) * | 2024-03-08 | 2024-06-11 | 未来城市爱家智能科技(上海)有限公司 | 一种适老化空间用智能手环的控制系统 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN117667001A (zh) | 2024-03-08 |
| EP4530821A1 (en) | 2025-04-02 |
| EP4530821A4 (en) | 2025-10-08 |
| US20260019055A1 (en) | 2026-01-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12342122B2 (en) | Headset noise processing method, apparatus, and headset | |
| CN113873379B (zh) | 一种模式控制方法、装置及终端设备 | |
| CN111294438A (zh) | 实现立体声输出的方法及终端 | |
| WO2021083128A1 (zh) | 一种声音处理方法及其装置 | |
| WO2021000817A1 (zh) | 环境音处理方法及相关装置 | |
| CN114115770A (zh) | 显示控制的方法及相关装置 | |
| WO2022089563A1 (zh) | 一种声音增强方法、耳机控制方法、装置及耳机 | |
| US20260019055A1 (en) | Volume Adjustment Method, Electronic Device, and System | |
| CN111930335A (zh) | 声音调节方法及装置、计算机可读介质及终端设备 | |
| WO2023197997A1 (zh) | 穿戴设备、拾音方法及装置 | |
| WO2024051638A1 (zh) | 声场校准方法、电子设备及系统 | |
| WO2024027259A1 (zh) | 信号处理方法及装置、设备控制方法及装置 | |
| CN113467747B (zh) | 音量调节方法、电子设备及存储介质 | |
| CN113438364B (zh) | 振动调节方法、电子设备、存储介质 | |
| CN114120987B (zh) | 一种语音唤醒方法、电子设备及芯片系统 | |
| WO2024234983A1 (zh) | 调整通话音量的方法及电子设备 | |
| CN119182849A (zh) | 音频处理方法、电子设备及计算机可读存储介质 | |
| CN115695640B (zh) | 一种防关机保护方法及电子设备 | |
| CN116320123B (zh) | 一种语音信号的输出方法和电子设备 | |
| EP4451109B1 (en) | Volume adjustment method and electronic device | |
| CN118555327A (zh) | 一种音频处理方法及电子设备 | |
| CN113938556A (zh) | 来电提示方法、装置和电子设备 | |
| CN119383520A (zh) | 音量控制方法、图形界面及相关装置 | |
| CN114420160A (zh) | 处理音频信号的方法和装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23859445 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2023859445 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2023859445 Country of ref document: EP Effective date: 20241223 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 18994912 Country of ref document: US |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWP | Wipo information: published in national office |
Ref document number: 2023859445 Country of ref document: EP |
|
| WWP | Wipo information: published in national office |
Ref document number: 18994912 Country of ref document: US |