EP4404584A1 - Vorrichtung, verfahren und computerprogramme zur analyse der kopfhörerversiegelung - Google Patents
Vorrichtung, verfahren und computerprogramme zur analyse der kopfhörerversiegelung Download PDFInfo
- Publication number
- EP4404584A1 EP4404584A1 EP23152475.2A EP23152475A EP4404584A1 EP 4404584 A1 EP4404584 A1 EP 4404584A1 EP 23152475 A EP23152475 A EP 23152475A EP 4404584 A1 EP4404584 A1 EP 4404584A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- microphone
- earphone
- microphone signal
- ear
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1016—Earpieces of the intra-aural type
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1008—Earpieces of the supra-aural or circum-aural type
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
- H04R2460/00—Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
- H04R2460/15—Determination of the acoustic seal of ear moulds or ear tips of hearing devices
Definitions
- Examples of the disclosure relate to apparatus, methods and computer programs for analyzing earphone sealing. Some relate to apparatus, methods and computer programs for determining a quality of sealing between the earphone and a user's ear.
- Earphones are audio output devices that are configured to be worn in or on a user's ear.
- a seal, or partial seal can be created between the earphone and the user's ear.
- the quality of this seal and/or the air leakage through this seal can affect the sound level within the user's ear canal. This can affect measurements made by sensors in the earphones, audio signal quality, pressure levels within the user's ear and/or other functions of the earphones.
- an apparatus comprising means for:
- the model of the at least one microphone and at least one speaker may be trained with the at least one microphone and the at least one speaker not positioned in a user's ear.
- the model of the at least one microphone and at least one speaker may comprise a non-linear model.
- a machine learning program may be used to model the at least one microphone and at least one speaker.
- the processing of the correlation may determines two or more parameters and the two or more parameters may be used to determine the quality of sealing between the earphone and a user's ear.
- the means may be for estimating a sealing coefficient by normalizing the correlation wherein the sealing coefficient is used to determine the quality of sealing between the earphone and the user's ear.
- the means may be for calculating a proportion of energy in a low frequency range of the obtained microphone signal.
- the means may be for estimating a correlation coefficient between the obtained microphone signal and the reference microphone signal.
- the microphone signal obtained from at least one microphone may comprise at least one of; external noise, reflections, distortions from at least part of an ear.
- the earphone may be configured to be worn inside a user's ear.
- the earphone may be configured to be worn over a user's ear.
- an earphone comprising an apparatus as described herein.
- a computer program comprising instructions which, when executed by an apparatus, cause the apparatus to perform:
- Earphones can be worn by a user to provide audio signals to a user's ears.
- the earphones can be worn in a user's ear, for example the earphones could comprise ear bud devices.
- the earphones could be worn over a user's ear. For instance, they could comprise head phones or headsets.
- the earphones can also comprise additional sensors that can be configured to monitor one or more characteristics of the user.
- the earphones could comprise a microphone, vibration sensors, infrared sensors, air pressure sensors, motion sensors or other suitable types of sensors. These sensors can be used for monitoring health characteristics of a user, identifying a user, identifying activities or actions of a user or for any other suitable purpose.
- the earphone can form a seal with the user's ear.
- the quality of this seal and/or the air leakage through this seal can affect the sound level within the user's ear canal. This can affect measurements made by sensors in the earphones, audio signal quality, pressure levels within the user's ear and/or other functions of the earphones. It is therefore useful to determine the quality of the seal between the earphone and the user's ear.
- Examples of the disclosure provide a reliable method for determining the quality of the seal.
- Fig. 1 schematically shows an example system 101 that can be used to implement some examples of the disclosure.
- the system 101 comprises an earphone 103 and an apparatus 105. Only components that are referred to in the following description are shown in Fig. 1 .
- the system 101 could comprise other components in other examples.
- the earphone 103 can be a device that is configured to be worn by a user 111.
- the earphone 103 can be configured to be worn in or over a user's ear. This can enable the earphone 103 to be used to provide acoustic signals to the user 111.
- the earphone 103 can be used for the playback of audio content.
- the earphone 103 comprises at least one speaker 107 and at least one microphone 109.
- the earphone 103 could comprise additional components in some examples.
- the speakers 107 can comprise any means that can be configured to generate an acoustic signal.
- the speakers 107 can be configured to convert an electrical input signal into an output acoustic signal.
- the speakers 107 can be positioned within the earphone 103 so that when a user 111 is using the earphone the speaker 107 is positioned in, or close to, the user's ear.
- the microphones 109 can comprise any means that can be configured to detect acoustic signals.
- the microphones 109 can be configured to detect acoustic signals and convert the acoustic signals into an output electric signal.
- the microphones 109 therefore provide microphone signals as an output.
- the microphone signals can comprise audio signals.
- the microphones 109 can be positioned within the earphones so that they can detect acoustic signals generated by the speaker 107 and/or reflections and reverberations of the acoustic signals.
- the earphone 103 and the apparatus 105 are shown as separate devices.
- the apparatus 105 could be provided within a different device to the earphone 103.
- Example devices that could comprise the apparatus 105 could be a personal device, such as a mobile phone, belonging to the user 111 or any other suitable device.
- the apparatus 105 can be configured to communicate with the earphone via a wireless communication link or by any other suitable means.
- the apparatus 105 could be comprised within the earphone.
- the earphone could be part of a headset or other wearable device that comprises an apparatus 105.
- the apparatus 105 can comprise a controller comprising a processor and memory. Examples of an apparatus 105 are shown in Fig. 9 .
- the apparatus 105 can be configured to process microphone signals or to perform any other suitable function.
- the system 101 could comprise additional components that are not shown in Fig. 1 in some examples of the disclosure.
- the system 101 could comprise one or more sensors within the earphone 103.
- the sensors could comprise any means that are configured to detect a physical characteristic of a user 111.
- the sensors could be located in any suitable position within the earphone 103.
- the sensors could be positioned so that, when the earphone 103 is in use, the sensors are located within the ear of the user 111.
- the sensors could comprise microphones, vibration sensors, infrared sensors, air pressure sensors, motion sensors or other suitable types of sensors. These sensors can be used for monitoring health characteristics of a user, identifying a user, identifying activities or actions of a user, or for any other suitable purpose.
- Fig. 1 only one earphone 103 is shown. In some examples there could be two earphones 103 so that one is provided for each ear of the user 111.
- Fig. 2 shows an example method. The method could be implemented using the system of Fig.1 or any other suitable system.
- the method comprises, at block 201, comparing a microphone signal obtained from at least one microphone with a reference microphone signal.
- the microphone signal can be obtained from a microphone 109 within an earphone 103.
- the reference microphone signal can be obtained using a model of the microphone 109 and the speaker 107.
- the reference microphone signal is an estimated microphone signal rather than one that has been captured using a microphone.
- the microphone that is used to obtain the microphone signal can be positioned within the earphone so that it is facing inwards.
- the microphone 109 could be an in ear microphone 109.
- the microphone signal obtained from the microphone 109 comprises external noise, and also reflections and/or distortions from at least part of an ear.
- the mount of external noise in the microphone signal will be increased if the seal is of poor quality.
- the part of the ear that causes the reflections and/or distortions will be determined by the type of earphones used and the position of the microphones 109 within the earphone 103. If the earphone 103 is an in ear device, then the reflections and distortions will come from the user's ear canal while if the earphone is an over ear device the reflections and distortions could come from other parts of the ear.
- the model that is used for the reference microphone signal can be a pre-trained model.
- the model can be trained using a microphone 109 and a speaker 107 that are not positioned in a user's ear.
- the model can comprise a non-linear model.
- the model can comprise a machine learning program such as a neural network, or any other suitable type of machine learning program.
- a correlation between the obtained microphone signal and the reference microphone signal can be used to obtain a difference between the obtained microphone signal and the reference microphone signal.
- the correlation gives an indication of the differences between the respective signals. These differences can be due to leakage of external noise into the microphone 109 and reflections and reverberations of the acoustic signal within the user's ear.
- the method comprises processing the correlation between the obtained microphone signal and the reference microphone signal to determine a quality of sealing between the earphone and a user's ear.
- the amount of leakage of external noise into the microphone 109 and the relative levels of reflections and reverberations are affected by the quality of the seal between the earphone 103 and the user's ear. Therefore, by determining and analysing these components of the signal a quality level of the seal can be determined.
- any suitable methods can be used to process the correlation between the obtained microphone signal and the reference microphone signal.
- the process could comprise determining two or more parameters.
- the parameters could comprise a measurement of a physical characteristic of the response of the user's ear to the acoustic signal. The parameters could then be used to determine the quality of sealing between the earphone and a user's ear.
- the processing of the correlation could comprise estimating a sealing coefficient by normalizing the correlation. In some examples the processing of the correlation could comprise calculating a proportion of energy in a low frequency range. This could be calculated as a ratio of low frequency energy to all frequency energy or could be calculated using any other suitable method. In some examples the processing of the correlation could comprise estimating a correlation coefficient such as a Pearson coefficient or any other suitable coefficient.
- multiple parameters can be determined and used in combination to determine the quality of sealing between the earphone and a user's ear. Using multiple different parameters can provide a more reliable estimate of the quality of the seal.
- Fig. 3 shows training of an example model 303.
- the model 303 can be a model of the speaker 107 and the microphone 109.
- the model 303 can account for non-linearities within the system of the speaker 107 and the microphone 109.
- the model 303 can be trained with the speaker 107 and the microphone 109 in open air so that they are not positioned within a user's ear during training. In this case there is no seal.
- the model 303 can be trained to predict an output microphone signal for a given input audio signal.
- the model 303 can be trained to account for nonlinearities of the speaker 107 and/or the microphone 109.
- the model 303 can account for intermodulation distortions.
- the intermodulation distortions are multi-tone distortion products of two or more signals being present in the output of the non-linear speaker 107
- the intermodulation distortions arise due to the non-linearity of the speaker.
- the intermodulation distortions arise due to the interaction of the respective frequency components with each other.
- the effect of intermodulation distortions can be particularly pronounced for devices such as earbuds which have to be small enough to fit into a user's ear and so only have space for a small speaker.
- the small speakers have limitations on the movement of the cone and so can show higher non-linear behaviours than bigger speakers.
- the model 303 could comprise a machine learning program.
- the machine learning program can comprise a neural network or any other suitable type of trainable model.
- the term "Machine Learning program” refers to any kind of artificial intelligence (Al), intelligent or other method that is trainable or tunable using data.
- the machine learning program can comprise a computer program.
- the machine learning program can be trained to perform a task, such as obtaining a reference a microphone signal for given input to a speaker 107, without being explicitly programmed to perform that task.
- the machine learning program can be configured to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E. In these examples the machine learning program can often learn from reference data to make estimations on future data.
- the machine learning program can also be a trainable computer program. Other types of machine learning programs could be used in other examples.
- the training of the machine learning program can be performed by any suitable apparatus or system.
- the machine learning program could be trained by a system or other apparatus that has a high processing capacity.
- the machine learning program could be trained by a system comprising one or more graphical processing units (GPUs) or any other suitable type of processor.
- GPUs graphical processing units
- an audio signal 301 is provided as an input to the model 303.
- the audio signal 301 is denoted as x[n] in Fig. 3 .
- the model 303 provides a reference microphone signal as an output.
- the reference microphone signal is denoted f ⁇ (. ) in Fig. 3 .
- the audio signal 301 can comprise any suitable type of audio content.
- the audio signals could comprise sound signals with varying magnitude.
- the sound signals could comprise different languages or other types of sound signals.
- the different languages could be Vietnamese, English, Italian, Arabic, or any other suitable languages.
- the model 303 Before the model 303 is trained it can be configured in an initial state. In the initial state the weights or other parameters of the model 303 can be set to any suitable value. In some examples the initial weights of the model 303 can be set to unity.
- the audio signal 301 is also provided to a digital to analogue converter 305 to generate the analogue audio signal x ⁇ ( t ).
- the analogue audio signal x ⁇ ( t ) is then provided as an input to a speaker 107.
- the speaker 107 is configured to convert an input electrical signal into an output acoustic signal 307.
- the acoustic signal 307 comprises a pressure wave that could be detected by a user's ear.
- the output signal can be any suitable acoustic signal.
- the output acoustic signal could comprise content that a user might be listening to such as music or people talking or any other suitable type of audio.
- the output acoustic signal 307 is denoted ⁇ ( t ) in this example.
- one or more microphones 109 are configured to detect the acoustic signal 307 that is output from the speaker 107.
- the microphone 109 can be an inward facing microphone so that when the earphone 103 is in use the microphone 109 would be facing inwards to the user's ear canal.
- the one or more microphones 109 can comprise any means that can be configured to convert an acoustic input signal into a corresponding electrical output signal.
- the microphones 109 can be part of an earphone 103.
- the output microphone signal from the microphone 109 is provided to an analogue to digital converter 309.
- the analogue to digital converter 309 is configured to convert the analogue output from the microphone 109 into a digital signal.
- the digital microphone signal is denoted y [ n ] in Fig. 3 . This provides an obtained microphone signal that can be compared to the reference microphone signal of the model 303.
- the output of the microphone 109 is the ground truth and the audio signal 301 is the stimuli.
- the comparison between the predicted microphone signal f ⁇ (.) from the model 303 and the obtained output of the microphones 109 enables the output of the model 303 to be evaluated relative to the actual output of the microphone 109.
- the comparison provides an error signal 311 as an output.
- the error signal 311 can comprise an error value or other indication of the difference between the estimated speaker output and the actual speaker output.
- the error value or other indication can be provided as an input to the model 303.
- One or more weights or other parameters of the model 303 can be adjusted in response to the error input. For example, if the model 303 comprises a neural network one or more weights of the neural network can be adjusted. The weights or other parameters can be updated to decrease the loss between the output of the model 303 and the output of the microphone 109.
- the process shown in Fig. 3 can be repeated until the outputs of the model 303 have sufficiently converged with the output of the microphone 109.
- the outputs of the model 303 can be determined to have converged with the output of the microphone if the error value is below a given threshold or if any other suitable criteria has been satisfied.
- the trained model 303 can then be used in systems such as the system 101 of Fig. 1 and used to determine a quality of a seal between a user's ear and an earphone 103.
- Any suitable functions can be used in the model 303 to approximate the non-linearities of the speaker 107 and/or microphone 109.
- a Volterra filter can be used. It can be assumed that Volterra kernels have a finite memory length N. The not cubic or higher order terms can be ignored for computational ease because the effect of higher distortions is less.
- x[n] is the input signal and y[n] is the output signal
- h 1 [ k 1 ] and h 2 [ k 1 , k 1 ] are the first and second-order discrete Volterra kernels.
- w 1 and w 2 are weight vectors.
- the filter coefficients can be obtained using a Normalised least mean square algorithm.
- ⁇ is the step size and ⁇ is a small positive number to avoid division by zero and e(n) is the error between the output of the model 303 and the output of the microphone 109.
- Other types of functions could be used in other examples.
- the training of the model 303 can be performed offline.
- the training of the model 303 only needs to be performed once.
- the training can be performed when new earphone 103 is designed and built.
- the training of the model 303 only needs to be performed for a new type of earphone 103 and not for all units of the earphone 103 that are manufactured.
- the trained model 303 can then be provided to the relevant devices for deployment.
- Fig. 4 shows deployment of an example trained model 403.
- the trained model 403 can be deployed after the training of the model 303 from Fig. 3 has been completed.
- the training could be the process of Fig. 3 , or any other suitable training process.
- the earphone 103 comprising the speaker 107 and the microphone 109 is located in or over a user's ear.
- the microphone 109 will detect leaked external audio and also reflections and distortions of the acoustic signals from the space created by the user's ear and the earphone 103.
- the earphone 103 is an in ear phone the reflections and distortions can be from the ear canal that is, at least partly blocked, by the earphone 103.
- an audio signal 401 is provided as an input to the trained model 403.
- the audio signal 401 is denoted as x[n] in Fig. 4 .
- the audio signal 401 can comprise any suitable type of audio.
- the audio signal 401 does not need to be the same as the audio signal 301 that is used for training the model 303.
- the audio signal 401 does not need to be a reference signal or contain any specific parameters.
- the audio signal 401 can comprise audio content that a user is listening to, for example it could comprise speech or music or other types of content.
- the trained model 403 provides a reference microphone signal f ⁇ (. ) as an output.
- the reference microphone signal f ⁇ (. ) is an estimation of a signal that should be expected from a microphone 109.
- the audio signal 401 is also provided to a digital to analogue converter 305 to generate the analogue audio signal x ⁇ ( t ).
- the analogue audio signal x ⁇ ( t ) is then provided as an input to the speaker 107.
- the speaker 107 is configured to convert an input electrical signal into an output acoustic signal 407.
- the output acoustic signal 407 is denoted ⁇ ( t ) in this example.
- the output acoustic signal 407 provides audio content that a user can hear.
- the output acoustic signal 407 is dependent upon the audio signal 401.
- the acoustic signal 407 can then be captured by the microphone 109.
- the microphone 109 captures the acoustic signal 407 and provides a microphone signal as an output in response to the captured acoustic signal 407.
- the microphone signal is provided to an analogue to digital converter 309.
- the analogue to digital converter 309 is configured to convert the analogue output from the microphone 109 into a digital signal. This provides an obtained microphone signal.
- ⁇ ( x [ n ]) represents the captured waveform of the signal emitted from the speaker 107
- r[n] represents the reflections and distortions
- v [ n ] is the external or outside noise as measured by the in-ward facing microphone 109.
- the earphone 103 is in an in ear device and the reflections can come from a user's ear canal and ear drum. If the earphone 103 is an over the ear device then the reflections would come from other parts of the user's ear. The distortions can be caused by putting the speaker 107 under pressure.
- the obtained microphone signal y [ n ] can be compared to the reference microphone signal f ⁇ ( . ) to determine the amount of reflections and external noise 409 in the obtained microphone signal y[n]. This can give an indication of the quality of seal between the earphone and the user's ear.
- Any suitable processing can be used to determine the amount of reflections and external noise in the obtained microphone signal y [ n ].
- one or more parameters indicative of the amount of reflections and noise in the obtained microphone signal y [ n ] can be calculated and used to provide an indication of the quality of the seal.
- the air leakage will be high. This will result in less distortions and reflections. This will result in a smaller value for ⁇ in the equation for y[ n ]. Similarly, if the seal is poor this will increase the amount of external noise that is detected by an inward facing microphone 109. This will result in a higher value for ⁇ in the equation for y[ n ]. Therefore, the ratio between ⁇ and ⁇ gives information about the quality of the seal between the earphone 103 and the user's ear. Separating ⁇ and ⁇ from the obtained microphone signal is not trivial because x[n] and r[n] are correlated.
- e [ n ] is the error between two nonlinear functions f ⁇ (.), ⁇ (.). That is, e [ n ] gives the error between the trained model 403 of the speaker 107 and microphone 109 and the actual performance of the speaker 107 and microphone 109.
- the obtained sealing coefficient K gives information about the overall magnitude of reflections.
- Lower values of K indicate a lower sealing quality. For example, a value of K that is close to zero indicates that there is a poor sealing and not many reflections while a value of K that is close to one indicates that there is good sealing and a larger amount of reflections.
- the sealing coefficient K is obtained using a magnitude-based normalization. This can provide a more reliable discriminator between poor sealing and good sealing compared to normalizations where the autocorrelations at zero lag equal 1. This is mainly because the amplitude of the input signal also changes the magnitude of reflections and distortions. If a normalization using zero lag is used, for example a Pearson correlation coefficient, then the magnitude of the input signal does not affect the resulting sealing coefficient K and does not give a correct indication of the amount of vibrations in the obtained microphone signal and therefore would not give a good indication of the sealing quality.
- the reflections r [ n ] that come from the ear canal and eardrum or other parts of the user's ear are a function of frequency.
- the function will change from person to person.
- the overall pattern of the reflections will be the same for different users because of the occlusion effect which is an enhancement of low-frequency components of sounds in an occluded ear canal. Therefore, in addition to the use of the sealing coefficient K other parameters can be used.
- the other parameters can be selected so as to address the dependency on the user for estimating the sealing quality.
- a good seal between the earphone 103 and the user's ear will leave traces in the low frequency bands of the obtained microphone signals due to the occlusion effect.
- These traces in the low frequency bands can preserve some structure of the motion or body signals of the user 111. For example, it can preserve structure of the heartbeats chewing motions, walking motions or other suitable body signals.
- These low frequency traces can be distinguished from the acoustic signal played back by the speaker 107 and detected by the microphone 109 and analysed to give further parameters that can be used as an indication of the quality of the seal.
- a further parameter that can be used as an indication of the quality of the seal could be a proportion of energy in a low frequency range. Any suitable means or process can be used to determine the proportion of energy in the low frequency range. In some examples the proportion of energy in a low frequency range can be determined by transforming the resultant signal ⁇ [ n ] to obtain energy levels for different frequency points. A fast Fourier transform, or any other suitable type of transform, can be used to transform the signal. The sum of energies at low frequencies can then be divided by the sum of the energies across all frequency ranges.
- the FFT can use any suitable resolution. In some examples the FFT could use 1Hz resolution.
- the ratio, K LP gives information about the low-frequency boost caused by a seal.
- the ratio K LP will be higher if there is a good seal between the user's ear and the earphone 103 and will be lower if there is a poor seal between the user's ear and the earphone 103. This enables the ratio K LP to be used as another parameter that indicates the quality of the seal.
- external artefacts in the ear canal, or other parts of the user's ear can be removed before the ratio K LP is used as a discriminator.
- the external artefacts can originate from the user's body. For example, they could be caused by walking, heartbeats, chewing or any other suitable factor.
- the artefacts can be conducted to the user's ear canal through bone conduction and amplified within the same low frequency region. If the seal is poor then the artefacts could be caused by external sounds.
- the output reference signal from the trained model 403 can be used. If the trained model 403 has been trained with the speaker 107 and the microphone 109 in open air so that they are not positioned within a user's ear then the output of the trained model 403 represents the signal captured by the microphone 109 when there is no sealing. Therefore a correlation coefficient between the output of the trained model 403 and the captured microphone signal 109 can provide another parameter that can be used to estimate the sealing quality.
- a low pass filter can be applied to both the output of the trained model 403 and the resultant signal ⁇ [ n ] and then a Pearson coefficient, or other suitable coefficient can be calculated between them.
- two or more of the parameters can be used to determine the quality of the seal. Using more than one parameter can provide a more robust indication of the quality of the seal. Other parameters and/or combinations of parameters could be used in other examples.
- Fig. 5 , 6 and 7 shows results obtained for examples of the disclosure. These results were obtained from experiments performed using twelve participants. The twelve participants included two females and ten males with an age range from 24 to 50. The participants were informed about the goal of the experiments and asked to perform several tasks in their natural way. The experiments were performed in an office environment while the participants wore an earbud prototype at the usual position.
- Figs. 5 and 6 show a three-dimensional plot for the result obtained from the experiments conducted. Figs. 5 and 6 show the same plot from two different perspectives.
- the different axis in the plot show the different parameters for indicating seal quality as described above.
- the x axis shows a magnitude base correlation
- the y axis shows a low frequency correlation
- the z axis shows a low frequency ratio.
- the different symbols used for the points indicate whether the seal was good or bad or average.
- Figures 5 and 6 clearly show that good, bad or average regions can be clearly defined using the respective parameters.
- Fig. 7 shows the t-distributed stochastic neighbour embedding of extracted features using implementations of the disclosure. This shows that the above-mentioned parameters are discriminative for the sealing quality. Also, the overlapping region between Good-Average and Average-Bad shows that discrete and continuous sealing quality estimation is crucial for earphones 103 because it is much more informative compared to a simple indication of good or bad.
- Examples of the disclosure could be used in different use cases and scenarios. Examples of the disclosure could be used for long-term continuous monitoring of ear canal sealing with ear buds. In such use cases the examples of the disclosure enable a measurement of the air leakage level without using a test or reference signal. This continuous monitoring will therefore not be intrusive because it will not interfere with audio content that the user is listening to.
- Examples of the disclosure can also be used to provide information about the quality of the seal to other applications.
- the other applications could be non-audio applications such as face recognition, user authentication, respiration rate monitoring, activity recognition, or any other suitable type of application that uses sensors in an earphone 103.
- Having information about the quality of the seal can enable the applications to control which sensors or sensor combinations are used for the relevant applications.
- a respiration rate monitoring application can use any of the inertial measurement unit, Photoplethysmography sensors, and speaker-microphone pairs together or separately. If implementations of the disclosure are used the respiration rate monitoring application can select which of these to use based on the sealing quality and how this affects the respective sensors.
- implementations of the disclosure can be used to provide feedback to active noise cancellation applications to prevent divergence.
- Active noise cancellation functions to cancel or remove unwanted noise by introducing an additional, electronically controlled sound field referred to as anti-noise.
- the anti-noise is electronically designed to have the proper pressure amplitude and phase that destructively interferes with the unwanted noise or disturbance. If the active noise cancellation algorithms are used in scenarios with high leakage then the adaptive filters will increase the gain more and more so as to cancel the noise, leading to a possible divergence. This can be prevented by providing feedback about the seal quality to the active noise cancellation algorithm.
- implementations of the disclosure could be used to reduce power consumption of the earphones 103. For instance, it can be used to detect the presence or absence of an ear by detecting whether there is a seal or not. The audio content can then be controlled based on the earphone 103 is still positioned next to the user's ear.
- Fig. 8 shows example earphones 103 that could be used in examples of the disclosure.
- the example earphones 103 comprise earbuds 801. Other types of earphones 103 could be used in other examples.
- the earbuds 801 of Fig. 8 comprise a housing 803 and an in ear portion 805.
- the in ear portion 805 is sized and shaped to fit into the ear of a user. When the earbuds are in use the in ear portion is inserted into the ear of the user.
- the housing 803 can be configured to house an apparatus or any other suitable control means for controlling the earphones 103.
- the apparatus could be housed in a different device such as a mobile phone or other personal electronic device.
- An example apparatus is shown in Fig. 9 .
- Fig. 9 schematically illustrates an apparatus 901 that can be used to implement examples of the disclosure.
- the apparatus 901 comprises a controller 903.
- the controller 903 can be a chip or a chip-set.
- the controller 903 can be provided within any suitable device such as earphones or a device such as a smartphone that can be configured to communicate with the earphones.
- the implementation of the controller 903 can be as controller circuitry.
- the controller 903 can be implemented in hardware alone, have certain aspects in software including firmware alone or can be a combination of hardware and software (including firmware).
- the controller 903 can be implemented using instructions that enable hardware functionality, for example, by using executable instructions of a computer program 909 in a general-purpose or special-purpose processor 905 that may be stored on a computer readable storage medium (disk, memory etc.) to be executed by such a processor 905.
- a general-purpose or special-purpose processor 905 that may be stored on a computer readable storage medium (disk, memory etc.) to be executed by such a processor 905.
- the processor 905 is configured to read from and write to the memory 907.
- the processor 905 can also comprise an output interface via which data and/or commands are output by the processor 905 and an input interface via which data and/or commands are input to the processor 905.
- the memory 907 stores a computer program 909 comprising computer program instructions (computer program code) that controls the operation of the controller 903 when loaded into the processor 905.
- the computer program instructions, of the computer program 909 provide the logic and routines that enables the controller 903. to perform the methods illustrated in the accompanying Figs.
- the processor 905 by reading the memory 907 is able to load and execute the computer program 909.
- the apparatus 901 comprises:
- the computer program 909 can arrive at the controller 903 via any suitable delivery mechanism 913.
- the delivery mechanism 913 can be, for example, a machine readable medium, a computer-readable medium, a non-transitory computer-readable storage medium, a computer program product, a memory device, a record medium such as a Compact Disc Read-Only Memory (CD-ROM) or a Digital Versatile Disc (DVD) or a solid-state memory, an article of manufacture that comprises or tangibly embodies the computer program 909.
- the delivery mechanism can be a signal configured to reliably transfer the computer program 909.
- the controller 903 can propagate or transmit the computer program 909 as a computer data signal.
- the computer program 909 can be transmitted to the controller 903 using a wireless protocol such as Bluetooth, Bluetooth Low Energy, Bluetooth Smart, 6LoWPan (IP v 6 over low power personal area networks) ZigBee, ANT+, near field communication (NFC), Radio frequency identification, wireless local area network (wireless LAN) or any other suitable protocol.
- a wireless protocol such as Bluetooth, Bluetooth Low Energy, Bluetooth Smart, 6LoWPan (IP v 6 over low power personal area networks) ZigBee, ANT+, near field communication (NFC), Radio frequency identification, wireless local area network (wireless LAN) or any other suitable protocol.
- the computer program 909 comprises computer program instructions for causing an apparatus 901 to perform at least the following or for performing at least the following:
- the computer program instructions can be comprised in a computer program 909, a non-transitory computer readable medium, a computer program product, a machine readable medium. In some but not necessarily all examples, the computer program instructions can be distributed over more than one computer program 909.
- memory 907 is illustrated as a single component/circuitry it can be implemented as one or more separate components/circuitry some or all of which can be integrated/removable and/or can provide permanent/semi-permanent/ dynamic/cached storage.
- processor 905 is illustrated as a single component/circuitry it can be implemented as one or more separate components/circuitry some or all of which can be integrated/removable.
- the processor 905 can be a single core or multi-core processor.
- references to 'computer-readable storage medium', 'computer program product', 'tangibly embodied computer program' etc. or a 'controller', 'computer', 'processor' etc. should be understood to encompass not only computers having different architectures such as single /multi- processor architectures and sequential (Von Neumann)/parallel architectures but also specialized circuits such as field-programmable gate arrays (FPGA), application specific circuits (ASIC), signal processing devices and other processing circuitry.
- References to computer program, instructions, code etc. should be understood to encompass software for a programmable processor or firmware such as, for example, the programmable content of a hardware device whether instructions for a processor, or configuration settings for a fixed-function device, gate array or programmable logic device etc.
- circuitry may refer to one or more or all of the following:
- the blocks illustrated in Fig. 2 can represent steps in a method and/or sections of code in the computer program 909.
- the illustration of a particular order to the blocks does not necessarily imply that there is a required or preferred order for the blocks and the order and arrangement of the blocks can be varied. Furthermore, it can be possible for some blocks to be omitted.
- the wording 'connect', 'couple' and 'communication' and their derivatives mean operationally connected/coupled/in communication. It should be appreciated that any number or combination of intervening components can exist (including no intervening components), i.e., so as to provide direct or indirect connection/coupling/communication. Any such intervening components can include hardware and/or software components.
- the term "determine/determining” can include, not least: calculating, computing, processing, deriving, measuring, investigating, identifying, looking up (for example, looking up in a table, a database or another data structure), ascertaining and the like. Also, “determining” can include receiving (for example, receiving information), accessing (for example, accessing data in a memory), obtaining and the like. Also, “ determine/determining” can include resolving, selecting, choosing, establishing, and the like.
- a property of the instance can be a property of only that instance or a property of the class or a property of a sub-class of the class that includes some but not all of the instances in the class. It is therefore implicitly disclosed that a feature described with reference to one example but not with reference to another example, can where possible be used in that other example as part of a working combination but does not necessarily have to be used in that other example.
- 'a', 'an' or ⁇ the' is used in this document with an inclusive not an exclusive meaning. That is any reference to X comprising a/an/the Y indicates that X may comprise only one Y or may comprise more than one Y unless the context clearly indicates the contrary. If it is intended to use 'a', 'an' or ⁇ the' with an exclusive meaning then it will be made clear in the context. In some circumstances the use of 'at least one' or 'one or more' may be used to emphasis an inclusive meaning but the absence of these terms should not be taken to infer any exclusive meaning.
- the presence of a feature (or combination of features) in a claim is a reference to that feature or (combination of features) itself and also to features that achieve substantially the same technical effect (equivalent features).
- the equivalent features include, for example, features that are variants and achieve substantially the same result in substantially the same way.
- the equivalent features include, for example, features that perform substantially the same function, in substantially the same way to achieve substantially the same result.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Circuit For Audible Band Transducer (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP23152475.2A EP4404584A1 (de) | 2023-01-19 | 2023-01-19 | Vorrichtung, verfahren und computerprogramme zur analyse der kopfhörerversiegelung |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP23152475.2A EP4404584A1 (de) | 2023-01-19 | 2023-01-19 | Vorrichtung, verfahren und computerprogramme zur analyse der kopfhörerversiegelung |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP4404584A1 true EP4404584A1 (de) | 2024-07-24 |
Family
ID=85018354
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP23152475.2A Pending EP4404584A1 (de) | 2023-01-19 | 2023-01-19 | Vorrichtung, verfahren und computerprogramme zur analyse der kopfhörerversiegelung |
Country Status (1)
| Country | Link |
|---|---|
| EP (1) | EP4404584A1 (de) |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150365776A1 (en) * | 2008-09-19 | 2015-12-17 | Personics Holdings Llc | Acoustic sealing analysis system |
| EP3166328A1 (de) * | 2014-07-01 | 2017-05-10 | Sony Corporation | Signalverarbeitungsvorrichtung, signalverarbeitungsverfahren und computerprogramm |
| US20190274595A1 (en) * | 2018-03-10 | 2019-09-12 | Staton Techiya, Llc | Method to estimate hearing impairment compensation function |
| US20200162808A1 (en) * | 2017-06-26 | 2020-05-21 | Ecole De Technologie Superieure | System, Device and Method for Assessing a Fit Quality of an Earpiece |
| EP3742756A1 (de) * | 2019-05-23 | 2020-11-25 | Beijing Xiaoniao Tingting Technology Co., Ltd | Verfahren und vorrichtung zur erkennung des verschleisszustandes von ohrhörern, ohrhörer und speichermedium |
-
2023
- 2023-01-19 EP EP23152475.2A patent/EP4404584A1/de active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150365776A1 (en) * | 2008-09-19 | 2015-12-17 | Personics Holdings Llc | Acoustic sealing analysis system |
| EP3166328A1 (de) * | 2014-07-01 | 2017-05-10 | Sony Corporation | Signalverarbeitungsvorrichtung, signalverarbeitungsverfahren und computerprogramm |
| US20200162808A1 (en) * | 2017-06-26 | 2020-05-21 | Ecole De Technologie Superieure | System, Device and Method for Assessing a Fit Quality of an Earpiece |
| US20190274595A1 (en) * | 2018-03-10 | 2019-09-12 | Staton Techiya, Llc | Method to estimate hearing impairment compensation function |
| EP3742756A1 (de) * | 2019-05-23 | 2020-11-25 | Beijing Xiaoniao Tingting Technology Co., Ltd | Verfahren und vorrichtung zur erkennung des verschleisszustandes von ohrhörern, ohrhörer und speichermedium |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11638085B2 (en) | System, device and method for assessing a fit quality of an earpiece | |
| CN114143646B (zh) | 检测方法、装置、耳机和可读存储介质 | |
| US11043210B2 (en) | Sound processing apparatus utilizing an electroencephalography (EEG) signal | |
| RU2626987C2 (ru) | Устройство и способ для улучшения воспринимаемого качества воспроизведения звука путем объединения активного шумоподавления и компенсации воспринимаемого шума | |
| CN103229238B (zh) | 用于产生音频信号的系统和方法 | |
| CN109688498B (zh) | 一种音量调节方法、耳机及存储介质 | |
| CN113676803A (zh) | 一种主动降噪方法及装置 | |
| CN114450745B (zh) | 用于耳戴式播放设备的音频系统和信号处理方法 | |
| US10034087B2 (en) | Audio signal processing for listening devices | |
| CN113316055A (zh) | 用于音频信号评估和调整的系统和方法 | |
| Mawalim et al. | Non-intrusive speech intelligibility prediction using an auditory periphery model with hearing loss | |
| US11671767B2 (en) | Hearing aid comprising a feedback control system | |
| US10966038B2 (en) | Method of fitting a hearing device to a user's needs, a programming device, and a hearing system | |
| US11206502B1 (en) | System and method for evaluating an ear seal using normalization | |
| Demirel et al. | Unobtrusive air leakage estimation for earables with in-ear microphones | |
| EP4404584A1 (de) | Vorrichtung, verfahren und computerprogramme zur analyse der kopfhörerversiegelung | |
| CN115996349A (zh) | 包括反馈控制系统的听力装置 | |
| WO2022247673A1 (zh) | 检测方法、装置、耳机和计算机可读存储介质 | |
| US20260046570A1 (en) | Hearing aid comprising a loop transfer function estimator and a method of training a loop transfer function estimator | |
| US12501214B2 (en) | Audio system | |
| CN118984437A (zh) | 啸叫抑制的助听器、透传耳机和控制方法 | |
| CN207995324U (zh) | 脖戴式语音交互耳机 | |
| Fabry et al. | Time-variant acoustic front-end measurements of active noise cancellation headphones | |
| US20250175733A1 (en) | Hearing protection devices | |
| CN207518792U (zh) | 脖戴式语音交互耳机 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
| 17P | Request for examination filed |
Effective date: 20250124 |
|
| RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: OMNIBUDS LTD |