WO2024149352A1 - 语音交互方法、装置及相关设备 - Google Patents
语音交互方法、装置及相关设备 Download PDFInfo
- Publication number
- WO2024149352A1 WO2024149352A1 PCT/CN2024/071940 CN2024071940W WO2024149352A1 WO 2024149352 A1 WO2024149352 A1 WO 2024149352A1 CN 2024071940 W CN2024071940 W CN 2024071940W WO 2024149352 A1 WO2024149352 A1 WO 2024149352A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- target
- voice
- augmented reality
- instruction
- type
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Definitions
- the present application relates to the field of voice interaction, and in particular to a voice interaction method, apparatus and related equipment.
- augmented reality devices are used to display or collect sensor data, and in most cases they need to be connected to a terminal device compatible with the augmented reality device before they can be used.
- the present application provides a voice interaction method, apparatus and related equipment to at least solve the above technical problems existing in the prior art.
- a voice interaction method comprising:
- the target device When the target device is of the first type, determining that the target response mode to the voice instruction is a first response mode
- the target device When the target device is of the second type or the target device is not detected, determining that the target response mode to the voice instruction is the second response mode;
- the target response mode is used to respond to the voice instruction.
- the adopting the target response mode to respond to the voice command includes:
- the voice command is sent to the target device for response;
- the target response mode is the second response mode, based on the type of the voice instruction, determining One of the augmented reality device and the target device responds.
- determining, based on the type of the voice command, that one of the augmented reality device and the target device responds includes:
- the augmented reality device responds to the voice instruction
- the voice instruction is delivered to the target device for response.
- the voice instruction when the voice instruction is a second type instruction, the voice instruction is sent to the target device for response, including:
- the voice instruction is converted into a target type instruction, and the target type instruction is delivered to the target device for response.
- the audio acquisition unit is used to collect target voice data, and the target voice data includes voice instructions;
- the method further comprises:
- obtaining the voice instruction by determining whether there is a voice instruction in the target voice data; and/or delivering the target voice data to the target device includes:
- the method of obtaining the voice instruction by determining whether there is a voice instruction in the target voice data after noise reduction; and/or delivering the target voice data after noise reduction to the target device includes:
- a voice interaction device comprising:
- An acquisition unit used to acquire a user's voice command based on an audio acquisition unit of an augmented reality device
- a first determining unit configured to determine a type of a target device connected to the augmented reality device
- a second determining unit configured to determine, when the target device is of the first type, that the target response mode to the voice instruction is a first response mode
- a third determining unit configured to determine, when the target device is of the second type or the target device is not detected, that the target response mode to the voice instruction is a second response mode
- the response unit is used to respond to the voice instruction using the target response mode.
- an augmented reality device wherein the augmented reality device at least includes the voice interaction device described in the present application.
- an electronic device including:
- the memory stores instructions that can be executed by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to perform the method described in the present application.
- the audio acquisition unit of the augmented reality device obtains the user's voice command, determines the type of the target device connected to the augmented reality device, and when the target device is of the first type, determines the target response mode to the voice command to be the first response mode; when the target device is of the second type or the target device is not detected, determines the target response mode to the voice command to be the second response mode, and responds to the voice command using the target response mode.
- FIG1 shows a schematic diagram of the implementation flow of the voice interaction method according to an embodiment of the present application.
- FIG. 2 shows a schematic diagram 1 of the implementation process of different target response modes in an embodiment of the present application.
- FIG3 shows a second schematic diagram of the implementation process of different target response modes in an embodiment of the present application.
- FIG4 shows a schematic diagram of data flow on an augmented reality device according to an embodiment of the present application.
- FIG5 shows a schematic diagram of the composition structure of the voice interaction device in an embodiment of the present application.
- FIG6 shows a schematic diagram of the structure of an electronic device according to an embodiment of the present application.
- the augmented reality device can only communicate with the terminal device adapted thereto, which fails to reflect the flexibility and versatility of the augmented reality device.
- AR devices are now the mainstream wearable devices.
- Intelligent voice interaction as the mainstream interaction method of augmented reality devices, can free your hands and easily and quickly complete the input or control of augmented reality devices.
- augmented reality devices are usually not used as complex computing units, but only used for display (projection) and sensor data acquisition functions (such as images, audio, inertial measurement units, etc.). Based on this, augmented reality devices can only be connected to compatible terminal devices through wired or wireless means, and the collected sensor data will be handed over to the terminal device for algorithm calculation.
- augmented reality devices If it is possible to achieve normal voice interaction of augmented reality devices when they are not connected to terminal devices or connected to other terminal devices, it is bound to expand the functionality of augmented reality devices. In this way, it can lay the foundation for the widespread application of augmented reality devices in daily life.
- the technical solution of the embodiment of the present application involves a voice interaction solution.
- the augmented reality device can be connected to different types of target devices for voice interaction, and can also be connected to the target device for voice interaction, which reflects the versatility and flexibility of the augmented reality device. Based on the acquired voice command and the type of target device connected to the augmented reality device, different target response modes can be used to respond to the voice command. It provides technical support for the augmented reality device to be able to perform voice interaction normally in scenarios where it is not connected to the target device or is connected to different target devices, thereby expanding the use scenarios of the augmented reality device.
- the present application provides a voice interaction method, as shown in FIG1 , the method comprising:
- S101 Acquire a user's voice command based on an audio acquisition unit of an augmented reality device.
- the augmented reality device is an electronic device that can perform AR interaction.
- the augmented reality device can be a smart wearable device, including but not limited to smart glasses and smart watches.
- the augmented reality device is a split AR glasses as an example for explanation.
- the augmented reality device includes an audio collection unit, such as a microphone.
- the user's voice command is acquired by collecting the voice command sent by the user to the augmented reality device through the microphone.
- the microphone includes a microphone array sensor (MicArray) for collecting voice commands issued by the user to the augmented reality device.
- MicArray microphone array sensor
- the user can issue a voice command to the augmented reality device when the augmented reality device is not in use.
- a voice command such as "please turn on the screen” or "please turn off the device” can be issued to it.
- the user can also issue voice commands to the augmented reality device when the augmented reality device is used, such as when the augmented reality device is used to project movies or play audio such as songs. That is, when the augmented reality device is used to output multimedia data, the user's voice commands are obtained based on the audio acquisition unit of the augmented reality device.
- augmented reality devices usually output some multimedia data, such as images, audio, etc., during AR interaction.
- the video in the target device can be projected to the augmented reality device for output, such as projecting a movie in the target device to the augmented reality device for output.
- the multimedia data may refer to the video in the target device that can be projected by the AR device.
- the audio in the target device through the augmented reality device, such as answering a call or voice call through the augmented reality device.
- the multimedia data may refer to the audio in the target device that can be output by the AR device.
- the augmented reality device in this application is used as a device to replace the target device for audio and video output.
- This alternative solution mainly takes into account that in some application scenarios, using the target device as an audio and video output device is far less convenient and has a better output effect than using the augmented reality device as an audio and video output device.
- the projectable video in the target device is projected by the augmented reality device, so that the wearer of the augmented reality device can feel the immersive effect.
- the call can be answered by wearing split AR glasses.
- Split AR glasses can be built into ordinary glasses worn by the wearer. Answering calls through split AR glasses can avoid the situation where you cannot take out your mobile phone from your pocket to answer calls in a crowded environment.
- the user can issue voice commands to the augmented reality device when necessary.
- the augmented reality device specifically, a microphone, collects voice commands.
- the multimedia data output by the split AR glasses is the audio and video information played by the user.
- the microphone can collect the voice command issued by the user to obtain the voice command for audio and video playback, so as to increase the volume of the current audio and video playback by responding to the voice command.
- S102 Determine the type of the target device connected to the augmented reality device.
- the target device can be any device that performs voice interaction with the augmented reality device.
- the target device connected to the augmented reality device in this application can be different types of terminals.
- the target device is a self-developed terminal, that is, a terminal adapted to the augmented reality device. It can be understood that after the augmented reality device is produced by the manufacturer, there will usually be a self-developed terminal produced by the manufacturer that is adapted to the augmented reality device.
- the self-developed terminal can be understood as a normal-sized terminal with no display screen or a small display screen but computing power.
- the intelligent voice interaction function of the augmented reality device is realized by connecting the augmented reality device to the adapted self-developed terminal.
- the target device may also be other types of terminals, such as a third-party mobile phone or a third-party computer.
- the intelligent voice interaction function of the augmented reality device is realized by connecting the augmented reality device with other types of terminals.
- the augmented reality device is a split-type AR glasses
- the split-type AR glasses can be connected to a terminal adapted thereto, and can also be connected to a third-party terminal.
- the split-type AR glasses can perform intelligent voice interaction with the device connected to the split-type AR glasses.
- voice interaction as the mainstream interaction method of augmented reality devices, is usually limited to self-developed terminals. That is, if users want to use augmented reality devices, they have to purchase self-developed terminals that are compatible with them in order to perform voice interaction normally. In this case, on the one hand, if users already have mobile phones, they will not purchase additional compatible terminals for cost and portability reasons. On the other hand, when the augmented reality device is connected to a personal computer (PC), the PC, as a computing unit, will no longer be connected to the terminal. In the above two scenarios, the voice interaction function of the augmented reality device will become unavailable, greatly limiting the use scenarios of voice interaction under AR glasses.
- PC personal computer
- the augmented reality device can be connected to different types of terminals, and the voice interaction function of the augmented reality device can be realized by determining the type of target device connected to the augmented reality device.
- the augmented reality device can also be connected to no type of terminal, and realize some basic voice interaction functions such as adjusting volume and brightness through its own voice command control application.
- augmented reality devices can be connected to different types of terminals, or not connected to any terminal, so that users can choose to purchase only augmented reality devices without having to purchase self-developed terminals that are compatible with them. They can also realize the voice interaction function of augmented reality devices by directly connecting to their mobile phones or computers.
- the augmented reality device can access or connect to different types of terminals.
- the response mode to voice commands will be different depending on the type of target device connected to the augmented reality device.
- the types of target devices in this application include the first type and the second type.
- the first type is a terminal that contains voice keyword detection technology services, such as the aforementioned self-developed terminal.
- the second type is a terminal that does not contain voice keyword detection technology services, such as the aforementioned third-party mobile phone terminal, computer terminal, etc.
- the corresponding target response mode to the voice command can be mode A (first response mode).
- the corresponding target response mode to the voice command can be mode B (second response mode).
- the augmented reality device identifies whether it is connected to a target device. If no device is connected, the target response mode for the voice command is determined to be the second response mode. If a device is connected, the identifier of the connected device is obtained, and based on the identifier of the connected device, it is determined whether the connected device is of the first type or the second type. If the identifier of the connected device is identifier A, and identifier A is an identifier representing a device of the first type, the connected device is determined to be of the first type. If the identifier of the connected device is identifier B, and identifier B is an identifier representing a device of the second type, the connected device is determined to be of the second type.
- two response modes are pre-set based on whether the terminal connected to the augmented reality device is a terminal adapted to the augmented reality device or a third-party terminal not adapted to the augmented reality device.
- One of them is a mode used when the terminal connected to the augmented reality device is a terminal adapted to the augmented reality device.
- the other is a mode used when the terminal connected to the augmented reality device is a third-party terminal.
- Two different types of terminals and the modes to be used under each type of terminal are pre-set as a corresponding relationship.
- the mode corresponding to the type of terminal is searched in the corresponding relationship as the target response mode for responding to the voice command.
- the augmented reality device of this application needs to use voice keyword detection technology services to achieve normal voice interaction of the augmented reality device when connected to the second type of target device.
- the augmented reality device of this application includes voice keyword detection technology services, when the augmented reality device is not connected to the target device, that is, when the target device is not detected, the augmented reality device can also use the corresponding target response mode to perform basic voice interaction, such as adjusting brightness, adjusting volume, etc.
- a low-power voice keyword detection technology service is deployed in an augmented reality device. Without significantly increasing the power consumption or heat generation of the augmented reality device, the low-power voice keyword detection technology service is used to complete the recognition of voice commands.
- the augmented reality device is able to maintain voice interaction when connected to different types of terminals.
- Different target response modes are used to respond to voice commands based on the different types of target devices connected, that is, different target response modes are used to analyze what kind of command the voice command is and process the command.
- the voice command is analyzed to be a voice command of "increase the volume", and the output volume of the augmented reality device is increased.
- the voice command is analyzed to be a command of "reduce screen brightness”, and the screen brightness of the augmented reality device is reduced. This is to achieve the operation corresponding to the voice command.
- the augmented reality device can not only communicate with the adapted terminal and the third-party terminal, but also perform voice interaction without connecting to any terminal, which reflects the flexibility and versatility of the augmented reality device.
- the target response mode can be determined based on the type of target device connected to the augmented reality device, and the target response mode can be used to respond to voice commands. Based on the type of target device connected to the augmented reality device, different target response modes can be used to respond to voice commands.
- the augmented reality device maintains voice interaction capabilities when connected to different types of terminals. Technical support is provided for enabling the augmented reality device to perform normal voice interaction in scenarios where different target devices are connected.
- the adopting the target response mode to respond to the voice instruction includes:
- the voice command is sent to the target device for response;
- the target response mode is the second response mode, it is determined that one of the augmented reality device and the target device responds based on the type of the voice instruction.
- the type of the voice command indicates whether the voice command is a type of command to be responded to by the augmented reality device or a type of command to be responded to by the target device.
- target devices there are mainly two types of response entities involved: target devices and augmented reality devices.
- Augmented reality devices mainly include voice keyword detection technical services and voice command control applications.
- the target response mode is the first response mode, that is, based on the fact that the target device type connected to the augmented reality device is of the first type, such as a self-developed terminal, and the target response mode is determined to be the first response mode, the voice command is handed over to the operating system of the target device for full voice control.
- the voice command is converted into an Event Id to notify the voice command control application in the augmented reality device, and the voice command control application determines the instruction. Whether the command is a control command of the augmented reality device itself. If the command is a control command of the augmented reality device itself, the voice command control application responds to the voice command, such as volume adjustment, brightness adjustment, etc.
- the Event Id is converted into a universal keyboard (USB KeyBoard, UniversalSerialBus KeyBoard) protocol Id, and the USB KeyBoard Id is handed over to the target device, which responds to the voice command.
- USB KeyBoard UniversalSerialBus KeyBoard
- the voice keyword detection technology service of the augmented reality device is in operation, and the voice keyword detection technology service determines whether the target device connected to the augmented reality device is of the first type or the second type. By judging the type of the target device connected, different target response modes are adopted.
- the target response mode is the first response mode, that is, when the target response mode is determined to be the first response mode based on the target device type connected to the augmented reality device being of the first type such as a self-developed terminal
- the voice keyword detection technology service in the augmented reality device enters a dormant state, and the voice command is handed over to the operating system of the target device for full voice control.
- the target device connected to the augmented reality device is a self-developed terminal
- the self-developed terminal contains a voice keyword detection technology service
- a predefined voice command set on the self-developed terminal such as a volume increase command to adjust the sound of multimedia data, a brightness increase command to adjust the display brightness of the screen, and a mode switching command to adjust the display mode of multimedia data, such as adjusting from normal mode to 3D mode, or from 3D mode to normal mode.
- the augmented reality device determines that the connected target device is a self-developed terminal, and can hand over the voice commands collected by the augmented reality device to the operating system of the target device for full voice control.
- the voice keyword detection technology service running on the self-developed terminal hands over the voice command to the self-developed terminal.
- the self-developed terminal specifically the application processing unit, responds to the voice command and executes the control actions corresponding to the language command, such as sound adjustment, brightness adjustment, mode switching, etc.
- the voice command obtained by the augmented reality device is a (sound adjustment) command for adjusting the sound of the multimedia data output by the augmented reality device. If the target device connected to the augmented reality device is a self-developed terminal compatible with the augmented reality device, the sound adjustment command will be processed by the self-developed terminal to achieve the adjustment of the sound of the multimedia data output by the augmented reality device through the self-developed terminal.
- the voice command obtained by the augmented reality device is an instruction for adjusting the display brightness of the multimedia data output by the augmented reality device. If the target device connected to the augmented reality device is a self-developed terminal adapted to the augmented reality device, the display brightness adjustment instruction will be handed over to the self-developed terminal for processing, so as to realize the adjustment of the display brightness of the augmented reality device screen through the self-developed terminal.
- the voice command obtained by the augmented reality device is an instruction for adjusting the display mode of the multimedia data output by the augmented reality device. If the target device connected to the augmented reality device is a self-developed terminal adapted to the augmented reality device, the display mode adjustment instruction is handed over to the self-developed terminal for processing, so as to The terminal adjusts the display mode of the multimedia data output by the augmented reality device.
- the target response mode is the second response mode, that is, when the target response mode is determined to be the second response mode based on the fact that the target device type connected to the augmented reality device is the second type such as a third-party mobile phone or computer, or the target device is not detected, the voice keyword detection technical service of the augmented reality device remains running.
- determining that one of the augmented reality device and the target device responds based on the type of the voice instruction includes:
- the augmented reality device responds to the voice instruction
- the voice instruction is delivered to the target device for response.
- the voice commands when the target response mode is the second response mode, also include two types.
- the first type of command is a command that the augmented reality device can respond to, such as the aforementioned sound adjustment command, display brightness adjustment command, and display mode adjustment command.
- the second type of command is a command that the augmented reality device cannot respond to but the target device can respond to, such as the "return to the previous step" command, the "confirm” command, and the "main menu” command.
- the target response mode is the second response mode
- the voice command is a command that the augmented reality device can respond to, such as a sound adjustment command, a display brightness adjustment command, and a display mode adjustment command
- the augmented reality device responds to the voice command. If the voice command is a "return to the previous step" command, an "OK” command, and a "main menu” command, the voice command is handed over to the target device for response.
- delivering the voice instruction to the target device for response includes:
- the voice instruction is converted into a target type instruction, and the target type instruction is delivered to the target device for response.
- the voice command is a command that the augmented reality device cannot respond to but the target device can respond to, it is necessary to convert the voice command into a type that the target device can recognize or respond to, thereby achieving a response to the voice command.
- the target device connected to the augmented reality device is a third-party mobile phone or computer
- the voice keyword detection technology service in the augmented reality device recognizes the voice command
- it converts the voice command into an Event Id and notifies the voice command control application in the augmented reality device.
- the voice command control application matches the command Event Id with the number corresponding to the predefined command function, and makes corresponding feedback according to the predefined command function represented by the corresponding number.
- the predefined command function is the command function represented by different numbers in the predefined command set, such as number 1 represents the volume increase function, number 2 represents the brightness increase function, number 3 represents the mode switching function, etc.
- the instruction corresponding to the instruction Event Id is a first type instruction, that is, when the instruction Event Id corresponds to the number corresponding to the instruction function predefined for the augmented reality device, that is, the instruction corresponding to the instruction Event Id is a self-control-related instruction that the augmented reality device can respond to, such as sound adjustment, brightness adjustment, display switching, mode switching and other instructions related to the hardware of the augmented reality device
- the voice command control application directly completes the corresponding control operation through the system application programming interface (API, Application Programming Interface).
- the instruction Event Id is converted into a USB KeyBoard protocol Id and sent to the connected target device.
- the target device responds to the USB KeyBoard protocol Id and performs operations on the multimedia data output by the augmented reality device.
- the command "return to the previous step” is defined as the "F1" key on the keyboard of the target device.
- the multimedia data output by the augmented reality device will return to the previous step.
- the "main menu” is defined as the "F2" key on the keyboard of the target device.
- the augmented reality device will pop up the main menu interface.
- confirmation can also be defined as the "enter” key on the keyboard of the target device.
- the aforementioned execution process of the augmented reality device in the present application can be realized by a high-performance dedicated chip set in the augmented reality device.
- the setting of the high-performance dedicated chip can improve the computing efficiency and realize the rapid response to the voice command.
- the low-power voice keyword detection technology service is adopted, which has the significant advantages of low required computing power and low resource consumption. Without significantly increasing the power consumption, the augmented reality device is endowed with AR capabilities, which greatly improves the control ability of the augmented reality device itself.
- the voice command can be customized and expanded, which makes up for the shortcomings of key control.
- the augmented reality device converts the recognized voice command into a target type that can be recognized or responded to by the target device, and regards the response of the target device to the voice command as a response to the execution of the voice command by the user operation (such as the aforementioned target device keyboard "F1" key, the target device keyboard “F2” key and the target device keyboard “enter” key, etc.).
- This scheme is equivalent to using the augmented reality device as a control peripheral of the target device, and giving the target device voice interaction capabilities under the condition of low-cost access. In practical applications, the product competitiveness of the augmented reality device is increased.
- the audio acquisition unit is used to acquire target voice data, and the target voice data includes voice instructions;
- the method further comprises:
- the augmented reality device also includes a keyword detection unit and a control application unit.
- the keyword detection unit includes a voice keyword detection technical service, which is used to determine the type of target device connected and recognize voice commands.
- the control application unit includes a voice command control application, which is used to determine the command type and send different types of commands to different response entities for response.
- the target voice data includes voice commands and/or voice data generated when the augmented reality device outputs multimedia data.
- the voice command in the present application refers to the command input into the augmented reality device in the form of voice, which is a kind of command data.
- the target voice data collected by the microphone array sensor may be the command data.
- the voice data collected by the microphone array sensor may be the voice data generated when multimedia data is output, such as when answering a call through an augmented reality device, the voice data collected by the microphone array sensor may be the content of the call. Or when projecting a movie through an augmented reality device, the voice data collected by the microphone array sensor may be the audio content of the movie.
- the augmented reality device collects target voice data through a microphone array sensor, and identifies whether the target voice data contains voice commands and/or voice data generated when the augmented reality device outputs multimedia data. If it is found through identification that the target voice data contains both voice commands and voice data generated when the augmented reality device outputs multimedia data, the two types of voice data can be separated to perform different processing on the two types of voice data.
- the augmented reality device determines different target device types through the keyword detection unit, thereby adopting different response modes. Specifically, when the target device is of the first type, the target device responds to the voice command. When the target device is of the second type or the target device is not detected, the keyword detection unit recognizes the voice command, and the control application unit determines different command types, thereby selecting different response subjects to respond to the voice command.
- the voice data can be delivered to the target device, which will record, perform semantic recognition and other processing on the voice data.
- the method further comprises determining whether there is a voice indicator in the target voice data. command to obtain the voice command; and/or, delivering the target voice data to the target device, including:
- the augmented reality device also includes a noise reduction unit for performing noise reduction processing on the target voice data to obtain the target voice data after noise reduction.
- the augmented reality device collects the target voice data through the microphone array sensor, and performs noise reduction processing on the target voice data through the noise reduction unit to obtain the target voice data after noise reduction.
- the target voice data after noise reduction is transmitted to the augmented reality device end and/or the target device end.
- the augmented reality device end determines different target device types through the keyword detection unit, thereby adopting different response modes.
- the target device type is the first type
- the target device responds to the voice instruction.
- the target device type is the second type or the target device is not detected
- the voice instruction is identified by the keyword detection unit, and the control application unit determines different instruction types, thereby selecting different response subjects to respond to the voice instruction.
- the noise-reduced voice data can be delivered to the target device, which will record, perform semantic recognition and other processing on the noise-reduced voice data.
- the quality of the voice data transmitted to the augmented reality device and/or the target device can be improved, thereby achieving accurate response to the target voice data.
- obtaining the voice instruction by determining whether there is a voice instruction in the target voice data after noise reduction; and/or delivering the target voice data after noise reduction to the target device includes:
- the augmented reality device also includes a copying and shunting unit, which is used to copy the noise-reduced target voice data to obtain two copies of the noise-reduced target voice data, and shun the two copies of the noise-reduced target voice data to obtain the first target voice data and the second target voice data.
- the augmented reality device sends the microphone audio data Audio In read to a specific noise reduction unit for directional noise reduction to obtain enhanced audio data Clean Audio that is free of noise or contains less noise.
- the audio data that is free of noise or contains less noise is obtained by the copying and shunting unit to obtain the first target voice data and the second target voice data. Voice data.
- the first target voice data and the second target voice data are both the same voice data as the Clean Audio that is noise-free or less noise-containing after noise reduction.
- the second target voice data is directly transmitted to the target device Audio Out through hardware wired or wireless means, and the first target voice data is processed by the augmented reality device.
- the augmented reality device obtains the voice instruction, and judges different target device types through the keyword detection unit, thereby adopting different response modes. Specifically, when the target device is of the first type, the target device responds to the voice instruction.
- the target device is of the second type or the target device is not detected, the voice instruction is identified by the keyword detection unit, and the control application unit judges different instruction types, thereby selecting different response subjects to respond to the voice instruction.
- the first target voice data and the second target voice data are obtained.
- the first target voice data is processed by the augmented reality device end for voice command recognition and responses by different response entities.
- the second target voice data is transmitted to the target device end so that the collected target voice data can be used by other applications of the target device. For example, when the user makes a call, the collected target voice data is used as the call content by the call application of the target device. Copying and diverting the target voice data after noise reduction can ensure that the applications on the augmented reality device end and the target device end can obtain the audio data they need without interfering with each other.
- the present application embodiment provides a voice interaction device, as shown in FIG5 , the device comprising:
- An acquisition unit 501 is used to acquire a user's voice command based on an audio acquisition unit of an augmented reality device;
- a first determining unit 502 configured to determine a type of a target device connected to the augmented reality device
- a second determining unit 503 is used to determine that the target response mode to the voice instruction is a first response mode when the target device is of the first type;
- a third determining unit 504 is configured to determine that the target response mode to the voice command is a second response mode when the target device is of the second type or the target device is not detected;
- the response unit 505 is used to respond to the voice instruction using the target response mode.
- the response unit 505 is used to direct the voice instruction to be responded to by the target device when the target response mode is the first response mode; and to determine, based on the type of the voice instruction, whether one of the augmented reality device and the target device will respond when the target response mode is the second response mode.
- the response unit 505 is used to cause the augmented reality device to respond to the voice instruction when the voice instruction is a first type of instruction; and to hand over the voice instruction to the target device for response when the voice instruction is a second type of instruction.
- the response unit 505 is used to convert the voice instruction into a target type instruction when the voice instruction is a second type instruction, and to hand over the target type instruction to the The target device responds.
- the audio acquisition unit is used to acquire target voice data, and the target voice data includes voice instructions;
- the device also includes:
- the voice data unit is used to obtain the voice instruction by determining whether there is a voice instruction in the target voice data; and/or, to deliver the target voice data to the target device.
- the device also includes: a noise reduction unit; the noise reduction unit is used to perform noise reduction processing on the target voice data to obtain the target voice data after noise reduction; accordingly, the voice data unit is also used to obtain the voice instruction by determining whether there is a voice instruction in the target voice data after noise reduction; and/or, delivering the target voice data after noise reduction to the target device.
- a noise reduction unit used to perform noise reduction processing on the target voice data to obtain the target voice data after noise reduction
- the voice data unit is also used to obtain the voice instruction by determining whether there is a voice instruction in the target voice data after noise reduction; and/or, delivering the target voice data after noise reduction to the target device.
- the device also includes: a copy and diversion unit; the copy and diversion unit is used to obtain first target voice data and second target voice data based on the target voice data after noise reduction; accordingly, the voice data unit is also used to obtain the voice instruction by determining whether there is a voice instruction in the first target voice data; and/or, handing over the second target voice data to the target device.
- a copy and diversion unit is used to obtain first target voice data and second target voice data based on the target voice data after noise reduction; accordingly, the voice data unit is also used to obtain the voice instruction by determining whether there is a voice instruction in the first target voice data; and/or, handing over the second target voice data to the target device.
- the voice interaction device of the embodiment of the present application solves the problem in a similar principle to the aforementioned voice interaction method. Therefore, the implementation process, implementation principle and beneficial effects of the device can all be referred to the description of the implementation process, implementation principle and beneficial effects of the aforementioned method, and the repeated parts will not be repeated.
- An embodiment of the present application provides an augmented reality device, which at least includes the voice interaction device described in the present application.
- the present application also provides an electronic device.
- Fig. 6 shows a schematic block diagram of an example electronic device 600 that can be used to implement an embodiment of the present application.
- the electronic device is intended to represent various forms of digital computers, such as laptop computers, desktop computers, workbenches, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers.
- the electronic device can also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices.
- the components shown herein, their connections and relationships, and their functions are merely examples, and are not intended to limit the implementation of the present application described herein and/or required.
- the electronic device 600 includes a computing unit 601, which can perform various appropriate actions and processes according to a computer program stored in a read-only memory (ROM) 602 or a computer program loaded from a storage unit 608 into a random access memory (RAM) 603.
- ROM read-only memory
- RAM random access memory
- Various programs and data required for the operation of the electronic device 600 may also be stored.
- the computing unit 601, the ROM 602, and the RAM 603 are connected to one another via a bus 604.
- An input/output (I/O) interface 605 is also connected to the bus 604.
- the I/O interface 605 includes: an input unit 606, such as a keyboard, a mouse, etc.; an output unit 607, such as various types of displays, speakers, etc.; a storage unit 608, such as a disk, an optical disk, etc.; and a communication unit 609, such as a network card, a modem, a wireless communication transceiver, etc.
- the communication unit 609 allows the electronic device 600 to exchange information/data with other devices through a computer network such as the Internet and/or various telecommunication networks.
- the computing unit 601 may be a variety of general and/or special processing components with processing and computing capabilities. Some examples of the computing unit 601 include, but are not limited to, a central processing unit (CPU), a graphics processing unit (GPU), various dedicated artificial intelligence (AI) computing chips, various computing units running machine learning model algorithms, digital signal processors (DSPs), and any appropriate processors, controllers, microcontrollers, etc.
- the computing unit 601 performs the various methods and processes described above, such as the voice interaction method.
- the voice interaction method may be implemented as a computer software program, which is tangibly contained in a machine-readable medium, such as a storage unit 608.
- part or all of the computer program may be loaded and/or installed on the electronic device 600 via ROM 602 and/or communication unit 609.
- the computer program When the computer program is loaded into RAM 603 and executed by the computing unit 601, one or more steps of the voice interaction method described above may be performed.
- the computing unit 601 may be configured to perform the voice interaction method in any other appropriate manner (e.g., by means of firmware).
- Various implementations of the systems and techniques described above herein can be implemented in digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips (SOCs), complex programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof.
- FPGAs field programmable gate arrays
- ASICs application specific integrated circuits
- ASSPs application specific standard products
- SOCs systems on chips
- CPLDs complex programmable logic devices
- Various implementations can include: being implemented in one or more computer programs that can be executed and/or interpreted on a programmable system including at least one programmable processor, which can be a special purpose or general purpose programmable processor that can receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device.
- a programmable processor which can be a special purpose or general purpose programmable processor that can receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device.
- the program code for implementing the method of the present application can be written in any combination of one or more programming languages. These program codes can be provided to a processor or controller of a general-purpose computer, a special-purpose computer or other programmable data processing device, so that when the program code is executed by the processor or controller, the functions/operations specified in the flow chart and/or block diagram are implemented.
- the program code can be executed entirely on the machine, Partially on the machine, partly on the machine as a stand-alone software package and partly on a remote machine, or entirely on a remote machine or server.
- first and second are used for descriptive purposes only and should not be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Therefore, a feature defined as “first” or “second” may explicitly or implicitly include at least one of the features. In the description of this application, the meaning of “plurality” is two or more, unless otherwise clearly and specifically defined.
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- General Physics & Mathematics (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (10)
- 一种语音交互方法,其中,所述方法包括:基于增强现实设备的音频采集单元获取用户的语音指令;确定与所述增强现实设备连接的目标设备的类型;在所述目标设备为第一类型时,确定对所述语音指令的目标响应模式为第一响应模式;在所述目标设备为第二类型或未检测到目标设备时,确定对所述语音指令的目标响应模式为第二响应模式;采用所述目标响应模式,对所述语音指令进行响应。
- 根据权利要求1所述的方法,其中,所述采用所述目标响应模式,对所述语音指令进行响应,包括:在所述目标响应模式为第一响应模式时,将所述语音指令交由所述目标设备响应;在所述目标响应模式为第二响应模式时,基于所述语音指令的类型,确定由所述增强现实设备和所述目标设备中的其中之一进行响应。
- 根据权利要求2所述的方法,其中,所述基于所述语音指令的类型,确定由所述增强现实设备和所述目标设备中的其中之一进行响应,包括:在所述语音指令为第一类型指令时,由所述增强现实设备对所述语音指令进行响应;在所述语音指令为第二类型指令时,将所述语音指令交由所述目标设备响应。
- 根据权利要求3所述的方法,其中,所述在所述语音指令为第二类型指令时,将所述语音指令交由所述目标设备响应,包括:在所述语音指令为第二类型指令时,将所述语音指令转换成目标类型指令,将所述目标类型指令交由所述目标设备响应。
- 根据权利要求1所述的方法,其中,所述音频采集单元用于对目标语音数据进行采集,所述目标语音数据包括语音指令;所述方法还包括:通过确定所述目标语音数据中是否存在语音指令而获取所述语音指令;和/或,将所述目标语音数据交由所述目标设备。
- 根据权利要求5所述的方法,其中,所述通过确定所述目标语音数据中 是否存在语音指令而获取所述语音指令;和/或,将所述目标语音数据交由所述目标设备,包括:将所述目标语音数据进行降噪处理,得到降噪后的目标语音数据;通过确定所述降噪后的目标语音数据中是否存在语音指令而获取所述语音指令;和/或,将所述降噪后的目标语音数据交由所述目标设备。
- 根据权利要求6所述的方法,其中,所述通过确定所述降噪后的目标语音数据中是否存在语音指令而获取所述语音指令;和/或,将所述降噪后的目标语音数据交由所述目标设备,包括:基于降噪后的目标语音数据,得到第一目标语音数据和第二目标语音数据;通过确定所述第一目标语音数据中是否存在语音指令而获取所述语音指令;和/或,将所述第二目标语音数据交由所述目标设备。
- 一种语音交互装置,其中,所述装置包括:获取单元,用于基于增强现实设备的音频采集单元获取用户的语音指令;第一确定单元,用于确定与所述增强现实设备连接的目标设备的类型;第二确定单元,用于在所述目标设备为第一类型时,确定对所述语音指令的目标响应模式为第一响应模式;第三确定单元,用于在所述目标设备为第二类型或未检测到目标设备时,确定对所述语音指令的目标响应模式为第二响应模式;响应单元,用于采用所述目标响应模式,对所述语音指令进行响应。
- 一种增强现实设备,其中,至少包括权利要求8所述的语音交互装置。
- 一种电子设备,其中,包括:至少一个处理器;以及与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1-7中任一项所述的方法。
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2025534437A JP7838876B2 (ja) | 2023-01-12 | 2024-01-12 | 音声インタラクション方法、装置、及び関連デバイス |
| EP24741348.7A EP4625409A4 (en) | 2023-01-12 | 2024-01-12 | METHOD AND APPARATUS FOR VOICE INTERACTION, AND ASSOCIATED DEVICE |
| KR1020257026716A KR20250163861A (ko) | 2023-01-12 | 2024-01-12 | 음성 상호작용 방법, 장치 및 관련 설비 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310096785.6 | 2023-01-12 | ||
| CN202310096785.6A CN116030810B (zh) | 2023-01-12 | 2023-01-12 | 语音交互方法、装置及相关设备 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024149352A1 true WO2024149352A1 (zh) | 2024-07-18 |
Family
ID=86075860
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2024/071940 Ceased WO2024149352A1 (zh) | 2023-01-12 | 2024-01-12 | 语音交互方法、装置及相关设备 |
Country Status (5)
| Country | Link |
|---|---|
| EP (1) | EP4625409A4 (zh) |
| JP (1) | JP7838876B2 (zh) |
| KR (1) | KR20250163861A (zh) |
| CN (1) | CN116030810B (zh) |
| WO (1) | WO2024149352A1 (zh) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116030810B (zh) * | 2023-01-12 | 2025-11-25 | 杭州灵伴科技有限公司 | 语音交互方法、装置及相关设备 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150293738A1 (en) * | 2014-04-15 | 2015-10-15 | Samsung Display Co., Ltd. | Wearable device |
| US20180261224A1 (en) * | 2017-03-08 | 2018-09-13 | Jetvox Acoustic Corp. | Wireless voice-controlled system and wearable voice transmitting-receiving device thereof |
| CN109658932A (zh) * | 2018-12-24 | 2019-04-19 | 深圳创维-Rgb电子有限公司 | 一种设备控制方法、装置、设备及介质 |
| CN111966321A (zh) * | 2020-08-24 | 2020-11-20 | Oppo广东移动通信有限公司 | 音量调节方法、ar设备及存储介质 |
| US20210366472A1 (en) * | 2019-04-17 | 2021-11-25 | Lg Electronics Inc. | Artificial intelligence apparatus for speech interaction and method for the same |
| CN116030810A (zh) * | 2023-01-12 | 2023-04-28 | 杭州灵伴科技有限公司 | 语音交互方法、装置及相关设备 |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8929954B2 (en) * | 2012-04-25 | 2015-01-06 | Kopin Corporation | Headset computer (HSC) as auxiliary display with ASR and HT input |
| CN105527710B (zh) * | 2016-01-08 | 2018-11-20 | 北京乐驾科技有限公司 | 一种智能抬头显示系统 |
| US11526034B1 (en) * | 2017-02-01 | 2022-12-13 | Ram Pattikonda | Eyewear with flexible audio and advanced functions |
| WO2019079826A1 (en) * | 2017-10-22 | 2019-04-25 | Magical Technologies, Llc | DIGITAL ASSISTANT SYSTEMS, METHODS AND APPARATUSES IN AN INCREASED REALITY ENVIRONMENT AND LOCAL DETERMINATION OF VIRTUAL OBJECT PLACEMENT AND SINGLE OR MULTIDIRECTIONAL OBJECTIVES AS GATEWAYS BETWEEN A PHYSICAL WORLD AND A DIGITAL WORLD COMPONENT OF THE SAME ENVIRONMENT OF INCREASED REALITY |
| WO2019216874A1 (en) * | 2018-05-07 | 2019-11-14 | Google Llc | Methods, systems, and apparatus for providing composite graphical assistant interfaces for controlling connected devices |
| CN108648756A (zh) * | 2018-05-21 | 2018-10-12 | 百度在线网络技术(北京)有限公司 | 语音交互方法、装置和系统 |
| WO2020034104A1 (zh) * | 2018-08-14 | 2020-02-20 | 华为技术有限公司 | 一种语音识别方法、可穿戴设备及系统 |
| KR102597031B1 (ko) * | 2018-08-14 | 2023-11-01 | 삼성전자주식회사 | 전자장치, 서버 및 전자장치의 제어방법 |
| EP3893087A4 (en) * | 2018-12-07 | 2022-01-26 | Sony Group Corporation | RESPONSE PROCESSING DEVICE, RESPONSE PROCESSING METHOD AND RESPONSE PROCESSING PROGRAM |
| CN110362204A (zh) * | 2019-07-11 | 2019-10-22 | Oppo广东移动通信有限公司 | 信息提示方法、装置、存储介质及增强现实设备 |
| WO2021021670A1 (en) * | 2019-07-26 | 2021-02-04 | Magic Leap, Inc. | Systems and methods for augmented reality |
| CN110955332A (zh) * | 2019-11-22 | 2020-04-03 | 深圳传音控股股份有限公司 | 人机交互方法、装置、移动终端与计算机可读存储介质 |
| CN111768757A (zh) * | 2020-07-10 | 2020-10-13 | Oppo(重庆)智能科技有限公司 | 可穿戴设备的控制方法、可穿戴设备及存储介质 |
| CN112767934A (zh) * | 2020-12-22 | 2021-05-07 | 未来穿戴技术有限公司 | 按摩设备控制方法、相关装置及计算机存储介质 |
| MX2023007777A (es) * | 2021-01-12 | 2023-08-24 | Interdigital Ce Patent Holdings Sas | Metodo y sistema de realidad aumentada que permite comandos para controlar dispositivos del mundo real. |
| US11676599B2 (en) * | 2021-05-10 | 2023-06-13 | International Business Machines Corporation | Operational command boundaries |
-
2023
- 2023-01-12 CN CN202310096785.6A patent/CN116030810B/zh active Active
-
2024
- 2024-01-12 EP EP24741348.7A patent/EP4625409A4/en active Pending
- 2024-01-12 KR KR1020257026716A patent/KR20250163861A/ko active Pending
- 2024-01-12 JP JP2025534437A patent/JP7838876B2/ja active Active
- 2024-01-12 WO PCT/CN2024/071940 patent/WO2024149352A1/zh not_active Ceased
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150293738A1 (en) * | 2014-04-15 | 2015-10-15 | Samsung Display Co., Ltd. | Wearable device |
| US20180261224A1 (en) * | 2017-03-08 | 2018-09-13 | Jetvox Acoustic Corp. | Wireless voice-controlled system and wearable voice transmitting-receiving device thereof |
| CN109658932A (zh) * | 2018-12-24 | 2019-04-19 | 深圳创维-Rgb电子有限公司 | 一种设备控制方法、装置、设备及介质 |
| US20210366472A1 (en) * | 2019-04-17 | 2021-11-25 | Lg Electronics Inc. | Artificial intelligence apparatus for speech interaction and method for the same |
| CN111966321A (zh) * | 2020-08-24 | 2020-11-20 | Oppo广东移动通信有限公司 | 音量调节方法、ar设备及存储介质 |
| CN116030810A (zh) * | 2023-01-12 | 2023-04-28 | 杭州灵伴科技有限公司 | 语音交互方法、装置及相关设备 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4625409A4 * |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2026502430A (ja) | 2026-01-23 |
| CN116030810A (zh) | 2023-04-28 |
| CN116030810B (zh) | 2025-11-25 |
| EP4625409A4 (en) | 2026-03-11 |
| EP4625409A1 (en) | 2025-10-01 |
| JP7838876B2 (ja) | 2026-04-01 |
| KR20250163861A (ko) | 2025-11-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12229473B2 (en) | Changing companion communication device behavior based on status of wearable device | |
| CN106293597B (zh) | 无线音频输出装置 | |
| CN107025906A (zh) | 扩展语音识别的周期的方法和产品以及信息处理设备 | |
| CN109101517B (zh) | 信息处理方法、信息处理设备以及介质 | |
| CN108038231A (zh) | 日志处理方法、装置、终端设备及存储介质 | |
| CN112995402A (zh) | 控制方法及装置、计算机可读介质和电子设备 | |
| CN111435354A (zh) | 数据导出方法、装置、存储介质及电子设备 | |
| CN111522524A (zh) | 一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 | |
| CN108932102A (zh) | 数据处理方法、装置以及移动终端 | |
| WO2018157499A1 (zh) | 一种语音输入的方法和相关设备 | |
| WO2024149352A1 (zh) | 语音交互方法、装置及相关设备 | |
| WO2020135131A1 (zh) | 网络热点的切换方法、智能终端及计算机可读存储介质 | |
| WO2019061287A1 (zh) | 一种电子设备和降低功耗的方法及装置 | |
| CN112506460B (zh) | 屏幕控制权限共享方法、装置、终端及存储介质 | |
| CN115408696A (zh) | 应用识别方法及电子设备 | |
| CN113778255A (zh) | 触摸识别方法和装置 | |
| US20250271969A1 (en) | Electronic device and operating method thereof | |
| CN110086941A (zh) | 语音播放方法、装置及终端设备 | |
| CN107277906B (zh) | 模式选择方法、装置、终端及计算机可读存储介质 | |
| WO2020019844A1 (zh) | 语音数据处理方法及相关产品 | |
| CN115412802A (zh) | 基于耳机的控制方法、装置、耳机及计算机可读存储介质 | |
| WO2025139137A1 (zh) | 一种混合虚拟设备的构建方法和装置 | |
| JP2019091444A (ja) | スマートインタラクティブの処理方法、装置、設備及びコンピュータ記憶媒体 | |
| CN115904863A (zh) | 一种pc场景识别方法及电子设备 | |
| CN110879733B (zh) | 电子红包的检测方法、装置、终端及计算机可读存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24741348 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2025534437 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2025534437 Country of ref document: JP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2024741348 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2024741348 Country of ref document: EP Effective date: 20250627 |
|
| ENP | Entry into the national phase |
Ref document number: 1020257026716 Country of ref document: KR Free format text: ST27 STATUS EVENT CODE: A-0-1-A10-A15-NAP-PA0105 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 1020257026716 Country of ref document: KR |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWP | Wipo information: published in national office |
Ref document number: 2024741348 Country of ref document: EP |
|
| WWP | Wipo information: published in national office |
Ref document number: 1020257026716 Country of ref document: KR |