WO2025001769A1 - 一种数字助手虚拟形象的显示方法及相关装置 - Google Patents
一种数字助手虚拟形象的显示方法及相关装置 Download PDFInfo
- Publication number
- WO2025001769A1 WO2025001769A1 PCT/CN2024/097229 CN2024097229W WO2025001769A1 WO 2025001769 A1 WO2025001769 A1 WO 2025001769A1 CN 2024097229 W CN2024097229 W CN 2024097229W WO 2025001769 A1 WO2025001769 A1 WO 2025001769A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- interface
- feature
- digital assistant
- state
- displayed
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/04817—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3206—Monitoring of events, devices or parameters that trigger a change in power modality
- G06F1/3231—Monitoring the presence, absence or movement of users
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/04815—Interaction with a metaphor-based environment or interaction object displayed as three-dimensional [3D], e.g. changing the user viewpoint with respect to the environment or object
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04845—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range for image manipulation, e.g. dragging, rotation, expansion or change of colour
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04847—Interaction techniques to control parameter settings, e.g. interaction with sliders or dials
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/451—Execution arrangements for user interfaces
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/451—Execution arrangements for user interfaces
- G06F9/453—Help systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/442—Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
- H04N21/44213—Monitoring of end-user related data
- H04N21/44218—Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV programme
Definitions
- the present application relates to the field of human-computer interaction technology, and in particular to a method for displaying a virtual image of a digital assistant and related devices.
- the user interface of the digital assistants will display an assistant image, which is usually a two-dimensional pattern that is static or continuously moves in a certain pattern.
- the digital assistant In the user interface of the digital assistant, after the user further inputs voice commands or gesture commands, it is usually necessary to wait for the digital assistant to output the recognition results before confirming whether the digital assistant has received the voice commands or gesture commands. Later, the digital assistant's form does not change during the process of recognizing or processing commands. Therefore, after the user inputs voice commands or gesture commands, it is usually necessary to wait for the digital assistant to output the recognition results before confirming whether the digital assistant has received the voice commands or gesture commands. That is, when the digital assistant recognizes or processes the commands input by the user, the user cannot get feedback from the digital assistant, and it is difficult to identify the current working status of the digital assistant. It is easy to mistakenly believe that the digital assistant has not received the command, which causes the user to repeat the command.
- the present application provides a method for displaying a digital assistant virtual image, which can improve the naturalness of the interaction between the user and the digital assistant.
- the first aspect of the present application provides a method for displaying a digital assistant virtual image, which is applied to an electronic device, and the virtual image of the digital assistant includes multiple feature bodies.
- the virtual image is first displayed in a first form on a first interface; then, a signal instruction generated by the electronic device is detected, and the virtual image is displayed in a second form.
- the signal instruction is generated by the electronic device based on the received first user input, and the user input can be before or after the first form is displayed. That is, after the electronic device receives the first user input, a signal instruction is generated, thereby triggering the virtual image to switch from the first form to the second form, so that the form of the virtual image displayed on the interface of the electronic device changes.
- first form and the second form both display at least one of a plurality of characteristic bodies, and the change from the first form to the second form displays that at least one characteristic body undergoes at least one of a change in shape, a change in position, a change in movement pattern, and a change in display area.
- the different forms of the virtual image represent the different working states of the digital assistant, that is, the digital assistant uses the form of the virtual image to indicate the current working state of itself.
- the digital assistant uses the form of the virtual image to indicate the current working state of itself.
- the virtual image of the digital assistant is set to include multiple features, and after the electronic device obtains user input, the electronic device generates a corresponding signal instruction, thereby triggering the change of the form of the virtual image displayed on the interface, thereby achieving the use of different forms to represent the different working states of the digital assistant, so that the user can confirm the current state of interaction with the digital assistant through the changes in the form of the digital assistant, thereby improving the naturalness of the interaction between the user and the digital assistant.
- the plurality of feature bodies include a first feature body
- the plurality of feature bodies include the first feature body
- the shape change of the first feature body includes at least one of the following changes: enlargement, reduction, stretching, compression, rotation, and switching between a main form, a geometric array, or a strip. That is, the shape of the first feature body includes: a main form, a geometric array, and a strip.
- the shape change of the first feature body includes switching between multiple shapes, and enlarging, reducing, stretching, compressing, or rotating each shape. That is, in addition to switching between different shapes, the shape change of the first feature body may also include enlarging, reducing, stretching, compressing, or rotating under the same shape.
- the main form refers to the form of the first feature body under normal circumstances, while the geometric array and the long strip are the forms of the first feature body under certain specific circumstances.
- the plurality of feature bodies further include a second feature body.
- the main form of the first feature body is a ring
- the second feature body is a sphere
- the center of the ring coincides with the center of the sphere.
- the position change of the first feature body includes at least one of a change in orbital position after flipping around the sphere and a change in position caused by the overall movement or rotation of the virtual image.
- the motion law of the first feature body includes the ring rotating around the sphere with the center of the ring as the fulcrum according to a preset trajectory.
- the motion law of the second feature body includes the sphere reciprocating along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located according to a preset law.
- a third feature is displayed on the first feature, and the third feature can move on the first feature and indicate the position of the user.
- the virtual image has a breathing state
- the virtual image is displayed in a first form on the first interface, including: if the first interface is an off-screen interface, the first form is a breathing state including a first feature body and a second feature body, and the first feature body is displayed in a main form.
- the first interface is a screen-off interface
- the digital assistant in a breathing state moves back and forth on the screen according to a preset pattern.
- a third feature body is displayed on the first feature body, and the third feature body can move on the first feature body and indicate the user's position.
- the first feature body is a ring, and the third feature body can be a light effect area on the ring.
- the virtual image has a waiting state and a listening state
- the first interface is a lock screen interface or any screen interface after unlocking.
- Displaying the virtual image in a first form on the first interface specifically includes: displaying the waiting state as the first form on the first interface, the waiting state includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line, and the navigation prompt line is used as a reference line for screen gesture navigation; wherein, when no input for waking up the digital assistant is received, the waiting state is permanently displayed on the first interface.
- a third user input is detected and displayed on the first interface in a listening state as the first form, and the third user input is used to wake up the digital assistant.
- the listening state includes a first feature body and a second feature body, and the first feature body is displayed in the main form.
- the first interface is a lock screen interface or any screen interface after unlocking, and the first form is a listening state.
- displaying the virtual image in the first form on the first interface specifically includes: the first feature body is first displayed in a geometric array; the first feature body is then changed from the geometric array to the main form, and the second feature body is displayed. That is, in the initial stage, only the first feature body is displayed, and the second feature body is not displayed, and the first feature body is initially displayed in a geometric array, covering the entire screen; then the first feature body is changed from the geometric array to the main form through shape display, and the second feature body is displayed.
- the digital assistant when waking up the digital assistant, the digital assistant first appears in the form of a geometric array (such as a dot matrix) and gradually fills the screen; then, the digital assistant gradually transforms from the geometric array to the main form of the first feature body (i.e., a ring) and displays the second feature body (i.e., a sphere), thereby realizing the awakening process of the digital assistant in the form of an animation, increasing the intuitiveness of the interaction process, and helping users to identify the current working status of the digital assistant.
- a geometric array such as a dot matrix
- the first feature body is first displayed in a geometric array, specifically: a portion of the array of geometric figures is displayed at an initial display position, and then the geometric figures are displayed on the entire screen. That is, the initial display position of the geometric array is first determined; secondly, the portion of the array of geometric figures is displayed at the initial display position, and then the geometric figures are gradually spread over the entire screen. For example, a portion of the dot matrix is first displayed from an initial display position, and then the dot matrix is gradually spread to cover the entire screen.
- the user input refers to the input used to wake up the digital assistant.
- the initial display position may be near the user input that triggers the digital assistant to wake up.
- the initial display position of the geometric array may be on the screen near the power button.
- the initial position is a default position, which is an area where the virtual image is displayed, such as the bottom of the screen.
- the initial display position of the geometric array is determined by user input, which can make the display process of the digital assistant more closely aligned with the user input, enhance the intuitiveness of the interaction between the user and the digital assistant, and improve the user experience.
- the first interface is the lock screen interface or any screen interface after unlocking
- the first form is the wake-up state
- the second form is the listening state
- the signal instruction is generated based on the user performing a preset gesture on the digital assistant in the first form.
- the plurality of feature bodies further include a second feature body; in the second form, the first feature body is in the main form of a ring, the second feature body is a sphere, and the ring is located on the equator of the sphere in a top-down perspective.
- the virtual image is displayed in a first form on the first interface, specifically: the virtual image is displayed at a preset position on the first interface; or the virtual image is displayed by overlaying a target content element in the first interface, wherein the target content element is a content element of a preset type.
- the virtual image is covered on the target content element, so that the target content element is no longer displayed on the first interface.
- the gesture event interface of the target content element is copied to the virtual image so that performing a gesture on the virtual image can realize the gesture corresponding function of the target content element.
- the virtual image inherits the original function of the target content element, so that when a specific gesture (such as single-click) is performed on the virtual image, the gesture-corresponding function of the target content element can still be executed.
- the first interface is a camera interface
- the target content element is a shutter control in the camera interface
- the virtual image also has a task execution state, which includes a first feature body; in at least one process interface during the digital assistant's execution of an automated task, the first feature body in the task execution state corresponds to at least one of a shape change and a position change.
- the digital assistant in the process interface of the digital assistant performing automated tasks, the digital assistant is actually always in the task execution state, and the first feature body of the digital assistant in the task execution state can undergo various changes to show the user the automated process of the digital assistant performing tasks, making the automated process of the digital assistant performing tasks more intuitive and clear, making it easier for users to observe whether there are any errors in the process of the digital assistant performing automated tasks.
- the first feature body is a geometric array
- the display position of the geometric array matches the target control in the process interface
- the next process interface is entered by simulating the click event of the target control.
- the digital assistant can use the first feature body as the geometric array to simulate the user's operation of clicking the control on the interface, thereby visualizing the entire process of the digital assistant simulating the user to execute the automation task.
- the first form is a listening state
- the second form is a task execution state.
- Both the listening state and the task execution state include a first feature body, and the first feature body in the listening state and the task execution state has different forms respectively; in the task execution state, the forms of the first feature body include: a geometric array within the interface, an outer frame of a digital assistant dialogue card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page.
- the first feature body may be represented in different forms so as to be more in line with the content displayed on the current interface.
- the first feature body has different morphological changes at different task progress of the automation task. That is, in the process of the digital assistant performing the automation task, at different task progress, the first feature body may change into different forms, so as to ensure that the form of the first feature body always closely fits the current interface.
- the first feature body is changed from a geometric array covering the entire screen to a geometric array displayed only on the recognized executable object.
- the morphological changes of the first feature body include: initially, the first feature body is displayed in the main form of a ring at an initial position relative to the second feature body (for example, the periphery of the second feature body), and the initial position is the same as or different from the position of the first feature body in the listening state; the first feature body is separated from the second feature body by the ring, and is enlarged and displayed as the outer frame of the digital assistant dialogue card; the first feature body changes from the outer frame of the dialogue card to a geometric array in the dialogue bubble of the digital assistant; after the automated task is executed, the virtual image is displayed in the listening state.
- the method may further include: the electronic device acquires scene information, determines a target form of the first feature body based on the scene information, and displays the first feature body in the target form. That is, the target form is related to the scene information.
- the electronic device may display the first feature body in a specific form.
- the virtual image has a state to be awakened, which includes at least a first feature body, and the first feature body is displayed in a long strip; in response to detecting that the focus of human eye sight falls on the area where the screen is located, the electronic device displays a second feature body.
- the digital assistant when the digital assistant is in the wake-up state, only a long bar is displayed; when the digital assistant detects that the user's eyes fall on the screen, it can be considered that the user has an interactive tendency, so the second feature body is displayed, thereby indicating that the digital assistant detects that the user has an interactive tendency and is waiting for interaction, so as to improve the naturalness of human-computer interaction.
- the method further includes: in response to detecting a human voice, the second feature body moves closer to the first feature body. The direction of the vocal sound source.
- the electronic device is a smart car
- the method further includes: indicating a vehicle function or warning information by deformation or movement of a first feature body among the multiple feature bodies.
- a second aspect of the present application provides a display device for a digital assistant virtual image, which is applied to an electronic device, wherein the virtual image includes a plurality of feature bodies.
- the device includes:
- a display module configured to display a virtual image in a first form on a first interface
- the display module is further used to detect the signal instruction generated by the electronic device and display the virtual image in the second form;
- the signal instruction is generated based on the first user input, the first form and the second form display at least one of the multiple feature bodies, and the change from the first form to the second form means that at least one feature body undergoes at least one of a shape change, a position change, a motion law change, and a display area change.
- the plurality of feature bodies include a first feature body, and the shape change of the first feature body includes at least one of the following changes: enlargement, reduction, stretching, compression, rotation, and switching between a main form, a geometric array, or a strip.
- the plurality of feature bodies further include a second feature body, wherein the main shape of the first feature body is a ring, the second feature body is a sphere, and the center of the ring and the center of the sphere are co-pointed;
- the position change of the first feature body includes at least one of a change in orbital position after flipping around the sphere and a change in position caused by overall movement or rotation of the virtual image;
- the motion law of the first characteristic body includes that the ring rotates around the sphere according to a preset trajectory with the center of the circle as the fulcrum;
- the motion law of the second characteristic body includes that the sphere moves back and forth according to a preset law along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located.
- the display module is further configured to detect a second user input and display a third feature body on the first feature body, where the third feature body can move on the first feature body and indicate the user's position.
- the virtual image has a breathing state
- the virtual image is displayed in a first form on the first interface, including:
- the first interface is a screen-off interface
- the first form is a breathing form including a first feature body and a second feature body, and the first feature body is displayed in a main form.
- the virtual image in a breathing state reciprocates on the screen according to a preset rule.
- the display module is further used to:
- a second user input is detected, and a third feature is displayed on the first feature, the third feature being movable on the first feature and indicating the user's position.
- the virtual image has a waiting state and a listening state
- the first interface is a lock screen interface or any screen interface after unlocking
- Displaying a virtual image in a first form on a first interface includes:
- the first state to be awakened is displayed on the first interface as the first form, the state to be awakened includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line, and the navigation prompt line is used as a reference line for screen gesture navigation; wherein, when no input for waking up the digital assistant is received, the state to be awakened is permanently displayed on the first interface;
- a third user input is detected and displayed on the first interface in a listening state as a first form.
- the third user input is used to wake up the digital assistant.
- the listening state includes a first feature body and a second feature body, and the first feature body is displayed in a main form.
- the first interface is a lock screen interface or any screen interface after unlocking, and the first state is a listening state;
- Displaying the virtual image in the first form on the first interface specifically includes:
- the first feature is first displayed as a geometric array
- the first feature body is then transformed from a geometric array into a main form, and a second feature body is displayed.
- the first feature body is first displayed in a geometric array, specifically:
- the initial display position is determined based on user input
- the initial position is the area where the virtual image is displayed.
- the virtual image is displayed in the first form on the first interface, specifically:
- the virtual image is overlaid and displayed on a target content element in the first interface, wherein the target content element is a content element of a preset type.
- the gesture event interface of the target content element is copied to the virtual image so that performing a gesture on the virtual image can realize the gesture corresponding function of the target content element.
- the first interface is a camera interface
- the target content element is a shutter control in the camera interface
- the first interface is the lock screen interface or any screen interface after unlocking
- the first form is the wake-up state
- the second form is the listening state
- the signal instruction is generated based on the user performing a preset gesture on the digital assistant in the first form.
- the plurality of feature bodies further includes a second feature body
- the first feature body is in the main form of a ring
- the second feature body is a sphere
- the ring is located on the equator of the sphere in a top-down perspective.
- the virtual image further has a task execution state, and the task execution state includes a first feature body
- the first feature body in the task execution state corresponds to at least one of a shape change and a position change.
- the first feature body is a geometric array
- the display position of the geometric array matches the target control in the process interface, so as to simulate a click event of the target control to enter the next process interface.
- the first state is a listening state
- the second state is a task execution state.
- Both the listening state and the task execution state include a first feature body, and the first feature bodies in the listening state and the task execution state have different forms respectively.
- the shape of the first feature body includes: a geometric array within the interface, an outer frame of a dialog card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page.
- the first feature body has different morphological changes at different task progresses of the automation task.
- the first feature body is changed from a geometric array covering the entire screen to a geometric array displayed only on the recognized executable object.
- the morphological change of the first feature body includes:
- the first feature body is first displayed in the main form of a ring on the periphery of the second feature body;
- the first feature body is then separated from the second feature body by the ring and is enlarged and displayed to serve as the outer frame of the conversation card;
- the first feature body changes from the outer frame of the dialogue card to the geometric array inside the dialogue bubble;
- the avatar is displayed in a listening state.
- the device further includes:
- the acquisition module is used to acquire scene information and display a first feature body among multiple feature bodies in a target form based on the scene information.
- the virtual image has a waiting-to-be-awakened state
- the waiting-to-be-awakened state includes at least a first feature body, and the first feature body is displayed in a long strip;
- the display module is also used to display a second feature body in response to detecting that the focus of the human eye falls on the area where the screen is located.
- the display module is further configured to, in response to detecting a human voice, cause the second feature body to move relative to the first feature body toward a direction close to a source of the human voice.
- the electronic device is a smart car
- the display module is further used for:
- the vehicle function or warning information is indicated by the deformation or movement of the first feature body among the multiple feature bodies.
- the third aspect of the present application provides a display device for a digital assistant virtual image, which may include a processor, the processor and a memory are coupled, the memory stores program instructions, and when the program instructions stored in the memory are executed by the processor, the method of the first aspect or any implementation of the first aspect is implemented.
- a display device for a digital assistant virtual image which may include a processor, the processor and a memory are coupled, the memory stores program instructions, and when the program instructions stored in the memory are executed by the processor, the method of the first aspect or any implementation of the first aspect is implemented.
- the fourth aspect of the present application provides a computer-readable storage medium, in which a computer program is stored.
- the computer-readable storage medium is run on a computer, the computer executes the method of any implementation manner of the first aspect.
- a fifth aspect of the present application provides a circuit system, the circuit system includes a processing circuit, and the processing circuit is configured to execute a method in any implementation manner of the above-mentioned first aspect.
- the sixth aspect of the present application provides a computer program product, which, when executed on a computer, enables the computer to execute a method implemented in any one of the first aspects.
- the seventh aspect of the present application provides a chip system, which includes a processor for supporting a server or a threshold value acquisition device to implement the functions involved in any implementation of the first aspect, for example, sending or processing the data and/or information involved in the above method.
- the chip system also includes a memory, which is used to store program instructions and data necessary for the server or communication device.
- the chip system can be composed of chips, or it can include chips and other discrete devices.
- FIG1 is a schematic diagram of an architecture of an application scenario provided in an embodiment of the present application.
- FIG. 2 is a schematic diagram of the structure of an electronic device 101 provided in an embodiment of the present application.
- FIG3 is a schematic diagram of a flow chart of a method for displaying a digital assistant virtual image provided in an embodiment of the present application
- FIG4 is a schematic diagram of a main body image of a digital assistant provided in an embodiment of the present application.
- FIG5 is a schematic diagram of a movement law of a digital assistant in a breathing state provided by an embodiment of the present application.
- FIG6 is a schematic diagram of a digital assistant switching from a breathing state to a waiting-to-be-awakened state provided by an embodiment of the present application;
- FIG7 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a wake-up state
- FIG8 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a listening state
- FIG9 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a thinking state
- FIG10 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a task execution state
- FIG11 is a schematic diagram of various forms of a digital assistant provided in an embodiment of the present application.
- FIG12 is a schematic diagram of a digital assistant provided in an embodiment of the present application displayed in a waiting-to-be-awakened state;
- FIG13 is a schematic diagram of a digital assistant switching from a waiting state to a listening state provided by an embodiment of the present application.
- FIG14 is a schematic diagram of a virtual image overlaid on a camera shutter control provided by an embodiment of the present application.
- FIG15 is a schematic diagram of a virtual image overlaid on an assistant suggestion component provided by an embodiment of the present application.
- FIG16 is a schematic diagram of a process of a digital assistant performing an automated task provided by an embodiment of the present application.
- FIG17 is a schematic diagram of a digital assistant provided in an embodiment of the present application, in which a circular ring is converted into a dot matrix and laid out on a task execution interface;
- FIG18 is a schematic diagram of using a digital assistant to input text on a desktop according to an embodiment of the present application.
- FIG19 is a schematic diagram of a ring of a digital assistant provided by an embodiment of the present application deforming and returning to its original shape;
- FIG20 is a schematic diagram of the morphological changes of a digital assistant in a smart recognition scenario provided by an embodiment of the present application.
- FIG21 is a schematic diagram of a digital assistant appearing in a smart recognition scenario provided by an embodiment of the present application.
- FIG22 is a schematic diagram of a digital assistant that generates deformation based on scene information provided by an embodiment of the present application
- FIG23 is a schematic diagram of a change of a digital assistant on a vehicle-mounted terminal provided in an embodiment of the present application.
- FIG24 is a schematic diagram of a flow chart of a display method of a digital assistant provided in an embodiment of the present application.
- FIG25 is a schematic diagram of an application scenario of a display method of a digital assistant provided in an embodiment of the present application.
- FIG26 is a schematic diagram of a digital assistant in a dormant state provided by an embodiment of the present application.
- FIG27 is a schematic diagram of a region division provided in an embodiment of the present application.
- FIG28 is a schematic diagram of a form of switching display of a digital assistant provided in an embodiment of the present application.
- FIG29 is a schematic diagram of changes in a digital assistant in a wake-up state provided by an embodiment of the present application.
- FIG30 is a schematic diagram of another digital assistant in a wake-up state provided by an embodiment of the present application.
- FIG31 is a schematic diagram of the structure of a display device for a digital assistant virtual image provided in an embodiment of the present application.
- FIG32 is a schematic diagram of a structure of an execution device provided in an embodiment of the present application.
- FIG33 is a schematic diagram of the structure of a computer-readable storage medium provided in an embodiment of the present application.
- the division of units in this application is a logical division. There may be other division methods when it is implemented in actual applications. For example, multiple units can be combined or integrated into another system, or some features can be ignored or not executed.
- the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, and the indirect coupling or communication connection between units can be electrical or other similar forms, which are not limited in this application.
- the units or sub-units described as separate components may or may not be physically separated, may or may not be physical units, or may be distributed in multiple circuit units, and some or all of the units may be selected according to actual needs to achieve the purpose of the present application.
- a digital assistant is a virtual assistant that receives user commands to replace manual operations and helps users complete a series of operations on smart devices.
- most digital assistants are voice assistants that complete the operation of smart devices by receiving user voice commands. For example, when the smart device is a smartphone or smart TV, the digital assistant can receive user voice commands to open an application or play a song.
- a distance sensor also called a displacement sensor, is a type of sensor that is used to sense the distance between it and an object to complete a preset function and has been widely used.
- Distance sensors can be divided into optical distance sensors, infrared distance sensors, ultrasonic distance sensors and other types according to their working principles.
- Most distance sensors used on mobile phones are infrared distance sensors, which have an infrared transmitting tube and an infrared receiving tube. When the infrared rays emitted by the transmitting tube are received by the receiving tube, it indicates that the distance is close and the screen needs to be turned off to avoid misoperation. When the receiving tube cannot receive the infrared rays emitted by the transmitting tube, it indicates that the distance is far and there is no need to turn off the screen.
- the working principles of other types of distance sensors are similar. They also judge the distance through the emission and reception of a certain substance. The emitted substance can be ultrasonic waves, light pulses, etc.
- Figure 1 is a schematic diagram of the architecture of an application scenario provided in an embodiment of the present application.
- the application scenario includes a user and an electronic device with a digital assistant.
- the user can issue instructions to the digital assistant in the electronic device through voice or gestures, so that the digital assistant assists the user to automatically complete a series of operations of the electronic device, such as automatically executing an application in the electronic device.
- the electronic device is equipped with a digital assistant, and the electronic device may be equipped with a command receiving device such as a microphone or a camera, which can receive the user's voice commands or gesture commands, and then automatically complete a series of operations by processing the user's input commands through the digital assistant, and feedback the processing results to the user.
- the electronic device also has a display screen, which can display the form of the digital assistant on the display screen.
- the electronic device when the electronic device is at a certain distance from the user, the electronic device recognizes the user's spatial interaction behavior by sensing the user's position change or posture change, and displays the morphological changes of the digital assistant in response to the user's spatial interaction behavior on the display screen.
- the electronic device adjusts the form of the digital assistant by obtaining the spatial interaction behavior of the user in the real physical world.
- the electronic device can also be a virtual reality (VR) device or an augmented reality (AR) device.
- the electronic device adjusts the form of the digital assistant by acquiring the spatial interaction behavior of the user in a virtual world such as a VR world or an AR world.
- the user interacting with the electronic device may be a virtual image representing the real user in the virtual world.
- the real user controls the virtual image to perform spatial interaction behavior, thereby changing the form of the digital assistant in the virtual world.
- electronic devices with digital assistants may be, for example, smartphones (mobile phones), personal computers (PCs), laptops, tablet computers, smart TVs, mobile internet devices (MIDs), display devices in autonomous vehicles, smart speakers, wearable devices, VR devices, AR devices, display devices in industrial control, display devices in remote medical surgery, display devices in smart grids, wireless communication devices in smart cities, display devices in smart homes, etc.
- smartphones mobile phones
- PCs personal computers
- laptops laptops
- tablet computers smart TVs
- MIDs mobile internet devices
- display devices in autonomous vehicles smart speakers
- wearable devices VR devices
- AR devices display devices in industrial control
- display devices in remote medical surgery display devices in smart grids
- wireless communication devices in smart cities display devices in smart homes, etc.
- FIG. 2 is a schematic diagram of the structure of an electronic device 101 provided in an embodiment of the present application.
- the electronic device 101 includes a processor 103, and the processor 103 is coupled to a system bus 105.
- the processor 103 may be one or more processors, each of which may include one or more processor cores.
- a display adapter (video adapter) 107 which may drive a display 109, and the display 109 is coupled to the system bus 105.
- the system bus 105 is coupled to an input/output (I/O) bus via a bus bridge 111.
- An I/O interface 115 is coupled to the I/O bus.
- the I/O interface 115 communicates with a variety of I/O devices, such as an input device 117 (such as a touch screen, etc.), an external memory 121 (for example, a hard disk, a floppy disk, an optical disk or a USB flash drive), a multimedia interface, etc.).
- I/O devices such as an input device 117 (such as a touch screen, etc.), an external memory 121 (for example, a hard disk, a floppy disk, an optical disk or a USB flash drive), a multimedia interface, etc.).
- a transceiver 123 which may send and/or receive radio communication signals
- a camera 155 which may capture static and dynamic digital video images
- an external USB port 125 may be a USB interface.
- the processor 103 may be any conventional processor, including a reduced instruction set computing (RISC) processor, a complex instruction set computing (CISC) processor, or a combination thereof.
- the processor may be a dedicated device such as an ASIC.
- the electronic device 101 can communicate with the software deployment server 149 through the network interface 129.
- the network interface 129 is a hardware network interface, such as a network card.
- the network 127 can be an external network, such as the Internet, or an internal network, such as Ethernet or a virtual private network (VPN).
- the network 127 can also be a wireless network, such as a WiFi network, a cellular network, etc.
- the hard disk drive interface 131 is coupled to the system bus 105.
- the hard disk drive interface is connected to the hard disk drive 133.
- the internal memory 135 is coupled to the system bus 105.
- the data running in the internal memory 135 may include an operating system (OS) 137 of the electronic device 101, an application 143, and a scheduler.
- OS operating system
- the operating system consists of a shell 139 and a kernel 141.
- Shell 139 is an interface between the user and the kernel of the operating system.
- the shell is the outermost layer of the operating system.
- the shell manages the interaction between the user and the operating system: it waits for user input, interprets user input to the operating system, and processes various operating system output results.
- the kernel 141 consists of those parts of the operating system that manage memory, files, peripherals, and system resources.
- the kernel 141 interacts directly with the hardware.
- the operating system kernel usually runs processes and provides communication between processes, provides CPU time slice management, interrupts, memory management, IO management, etc.
- Figure 3 is a flow chart of a method for displaying a digital assistant virtual image provided in an embodiment of the present application.
- the method for displaying a digital assistant virtual image includes the following steps 301-302, and the method for displaying a digital assistant virtual image is applied to an electronic device equipped with a digital assistant.
- Step 301 displaying the virtual image of the digital assistant in a first form on a first interface.
- the virtual image of the digital assistant may include multiple features.
- the multiple features included in the virtual image will change.
- the first form is one of the multiple forms of the virtual image of the digital assistant, which is displayed on the first interface of the electronic device.
- Step 302 based on the signal instruction generated by the electronic device, display the virtual image in a second form.
- the signal instruction is generated by the electronic device based on the received first user input. After the electronic device receives the first user input, the signal instruction is generated, thereby triggering the virtual image to switch from the first form to the second form, so that the form of the virtual image displayed on the interface of the electronic device changes.
- the virtual image in the first form and the second form each includes at least one of the plurality of features.
- the virtual image changes from the first form to the second form in particular when at least one of the features changes in shape, position, motion pattern, or display area. One less change.
- the first user input received by the electronic device may be before the electronic device displays the first form of the virtual image, or after the first form of the virtual image is displayed. If the first user input received may be before the electronic device displays the first form of the virtual image, it means that the first user input will prompt the electronic device to display the first form of the virtual image, and change from the first form to the second form, that is, cause at least two form changes. If the first user input received may be after the electronic device displays the first form of the virtual image, it means that the first user input will prompt the digital assistant's virtual image to change from the first form to the first form, that is, cause at least one form change.
- different forms of the virtual image represent different working states of the digital assistant, that is, the digital assistant uses the form of the virtual image to indicate the current working state of the digital assistant.
- the digital assistant uses the form of the virtual image to indicate the current working state of the digital assistant.
- the plurality of feature bodies include a first feature body, and the shape of the first feature body includes: a main form, a geometric array, and a long strip.
- the shape change of the first feature body includes switching between multiple shapes, and enlarging, reducing, stretching, compressing, or rotating each shape. That is, in addition to switching between different shapes, the shape change of the first feature body may also include enlarging, reducing, stretching, compressing, or rotating the same shape.
- the main form refers to the form of the first feature body under normal circumstances, while the geometric array and the long strip are the forms of the first feature body under certain specific circumstances.
- the plurality of feature bodies further include a second feature body.
- the main form of the first feature body is a ring
- the second feature body is a sphere
- the center of the ring coincides with the center of the sphere.
- the position change of the first feature body includes at least one of the orbital position change after flipping around the sphere and the position change caused by the overall movement or rotation of the virtual image;
- the motion law of the first feature body includes the ring rotating around the sphere with the center of the ring as the fulcrum according to a preset trajectory;
- the motion law of the second feature body includes the sphere reciprocating along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located according to a preset law.
- the digital assistant can have the following six different forms.
- Breathing state corresponds to the state of the digital assistant when there is no task processing, that is, it is used to indicate that the digital assistant is in a state of no task processing.
- the breathing state is the state of the digital assistant in the screen-off interface.
- Waiting state corresponds to the state of the digital assistant waiting to be awakened (essentially, the state of the digital assistant when there is no task to be processed), that is, it is used to indicate that the digital assistant is in a state of waiting for a wake-up command.
- the waiting state is the state of the digital assistant in the bright screen interface, where the bright screen interface includes the lock screen interface or the interface after the screen is unlocked.
- Wake-up state corresponds to the state when the digital assistant is awakened and ready to receive commands, that is, it is used to indicate that the digital assistant is in the state of waiting to receive commands.
- Listening state corresponds to the state of the digital assistant during the process of receiving commands, that is, it is used to indicate that the digital assistant is in the command receiving state.
- Task execution state The state of the task obtained when the digital assistant executes the parsed instruction, which is used to indicate that the digital assistant is in the task execution state.
- the above-mentioned multiple forms are arranged from low to high according to the degree of interaction intention: breathing state, waiting to be awakened state, awakened state, listening state, thinking state, and task execution state.
- the breathing state can be transformed into the awakened state.
- the awakened state can be transformed into the listening state instead of the awakened state.
- the forms of digital assistants may also be different.
- the forms of digital assistants when applied to smartphones, the forms of digital assistants include the above-mentioned breathing state, waiting to be awakened state, listening state, thinking state, and task execution state, and the forms can be switched back and forth in sequence; when applied to smart TVs, the forms of digital assistants include the above-mentioned breathing state, awakened state, listening state, thinking state, and task execution state, and the forms can be switched back and forth in sequence.
- the main image of the digital assistant includes at least two features, and the different forms of the digital assistant are distinguished by the changes in the shape, position, and movement law of the feature bodies.
- each feature body can be dynamically presented to represent the concept of the digital assistant's living body.
- Different forms of digital assistants are distinguished by the changes in the shapes, positions, and/or movement laws of different feature bodies.
- the change in the shape of the feature body can mean that the feature body maintains its current position, while adjusting the position of the feature body presented on the display screen (for example, adjusting the display of the front of the feature body to the display of the back of the feature body), switching between a plurality of pre-set different shapes (including enlarging, reducing, stretching, compressing or rotating each shape), adjusting the position or movement of certain parts of the feature body (for example, For example, the arm on the feature body is adjusted from being stationary to waving.
- the change in the position of the feature body may refer to the change in the position where the feature body is displayed, for example, from being displayed at the bottom of the screen to being displayed at the top of the screen or in the center of the screen.
- the change in the motion law of the feature body may refer to the change in the motion of the entire feature body on the display screen, for example, the feature body changes from being stationary to moving along a preset trajectory, or the feature body changes from moving along a sine curve trajectory to the feature body rotating around a certain point.
- Figure 4 is a schematic diagram of the main image of a digital assistant provided in an embodiment of the present application.
- the main image of the digital assistant includes three characteristic bodies: a ring, a sphere, and an image body.
- the image body is wrapped in the sphere; the ring surrounds the sphere, similar to the ring of a planet.
- the ring, sphere, and image body can have different postures and different motion laws.
- the posture and/or motion law of at least one characteristic body has changed.
- a reference system is defined for the main image of the digital assistant: in the breathing state, the plane where the ring is located is the xoy plane, and the x-axis points to the screen of the electronic device, and the vertical direction is the z-axis.
- the width direction of the electronic device is the y-axis
- the height direction of the electronic device is the z-axis.
- the ring, sphere and image body When the digital assistant is in the breathing state, the ring, sphere and image body all move regularly along the preset trajectory and according to the preset period. During the movement, the relative positions between the ring, sphere and image body remain unchanged, that is, the center points of the ring, sphere and image body always coincide.
- the image body and the sphere can rotate around an axis or a point, for example, around the center of the sphere. That is, the image body and the sphere rotate while moving along the preset trajectory (similar to the earth rotating around the sun while rotating itself).
- the avatar when the avatar has a breathing state, the avatar is displayed in the first form on the first interface, which may specifically include: if the first interface is a screen-off interface, the first form is a breathing state.
- the breathing state includes a first feature body and a second feature body, and the first feature body is displayed in the main form.
- the digital assistant in a breathing state makes reciprocating motion on the screen according to a preset pattern.
- Figure 5 is a schematic diagram of the movement law of a digital assistant in a breathing state provided by an embodiment of the present application.
- Figure 5 shows the breathing state image of the digital assistant at time t, and the ring moves back and forth along the z-axis direction, and the function of the moving distance and time conforms to a sine curve.
- the sphere and the image body rotate around the center of the sphere.
- corresponding light and shadow effects can be added to the sphere to indicate the rotation of the sphere.
- the circular ring of the digital assistant i.e., the first characteristic body mentioned above
- a gesture prompt line which can specifically be a long strip.
- the gesture prompt line is designed to match the full-screen gestures.
- the gesture prompt line is a global control that can be displayed at the bottom of the screen to provide a reference for full-screen gestures. For example, swiping up from the bottom usually starts from the position of the gesture prompt line.
- the digital assistant only displays the first feature body, and the shape of the first feature body is a long strip. Other feature bodies of the digital assistant (such as the sphere and image body of the digital assistant) are hidden.
- gesture prompt lines are usually displayed at the bottom of the screen and have specific functions.
- the digital assistant in the wake-up state also has the function of gesture prompt lines, such as swiping left and right to switch applications.
- Figure 6 is a schematic diagram of a digital assistant switching from a breathing state to a to-be-awakened state provided in an embodiment of the present application.
- the virtual image of the digital assistant includes a ring, a sphere, and an image body.
- the digital assistant is converted from the breathing state to the to-be-awakened state.
- the ring of the digital assistant is transformed into a long strip at the bottom of the screen, and the sphere and image body of the digital assistant are hidden.
- the ring When the digital assistant is in the awake state, the ring is fixed on the equatorial plane of the sphere and rotates around the z-axis as the user moves.
- part of the area on the ring is set as a light effect area.
- the brightness of the light effect area on the ring is higher than the non-light effect area on the ring.
- the light effect area on the ring points to the user, and when the ring rotates, the light effect area moves and points to the user.
- the ring may also rotate with the user, so that the light effect area always points to the user's location.
- the ring may change its pitch angle (i.e., rotate around the plane where the display screen of the electronic device is located), so that the ring can simulate the "head up” or “head down” action when seeing the user approaching.
- the first interface is a screen-off interface and the first form displayed on the first interface is a breathing state
- a third feature body is displayed on the first feature body, and the third feature body is used to indicate the user's orientation information and can move on the first feature body.
- the first feature body is the above-mentioned ring, and the third feature body can be a light effect area on the ring.
- Figure 7 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a wake-up state.
- the partial area on the ring close to the side of the screen is the light effect area, which indicates the user's orientation relative to the electronic device.
- the ring rotates around the z-axis so that the light effect area always faces the user.
- the distance between the user and the electronic device is less than a preset value, the user continues to approach the TV, and in addition to rotating around the z-axis, the ring will also produce a change in pitch angle around the y-axis. For example, after the ring rotates upward around the y-axis for a certain angle, it gradually stops following inertia, so that the ring can simulate the "head-up" action when seeing the user approaching.
- the ring rotates around the y-axis on the xoy plane. After the rotation, the ring is located on the yoz plane and no longer blocks the sphere and the image. In addition, when the ring flips, it can drive the sphere and the image to flip together, so that the bottom or top of the sphere and the image is facing the screen.
- Figure 8 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a listening state. As shown in Figure 8, the ring in Figure 8 rotates to the yoz plane, and the sphere and the image body also rotate.
- the ring and the image body are always in a rotating state, and the fulcrum around which the ring and the image body rotate can be the center of the sphere.
- the trajectory of the rotation of the ring and the image body can be random or preset.
- part of the area on the ring in order to indicate the rotation relationship of the ring, can also be set as a light effect area, and the light effect area moves with the movement of the ring, or the light effect area continuously flows and moves on the ring.
- the method of setting the light effect area on the ring can refer to the description of the above-mentioned awakening state, which will not be repeated here.
- Figure 9 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a thinking state.
- Figure 9 shows two states of the digital assistant during rotation, and in these two states, the ring and the image body are always rotating.
- the ring of the digital assistant When the digital assistant is in the task execution state, the ring of the digital assistant may be converted into other shapes (such as a geometric array) and separated from the sphere and the image body, and displayed in other locations on the interface.
- Figure 10 is a schematic diagram of a digital assistant provided in an embodiment of the present application when the form is in the task execution state. As shown in Figure 10, the ring of the digital assistant is transformed into a geometric array, and is separated from the sphere and the image body of the digital assistant, and is displayed in other locations.
- Figure 11 is a schematic diagram of multiple forms of a digital assistant provided in an embodiment of the present application. Among them, Figure 11 shows multiple forms included in the digital assistant.
- the first interface showing the first form of the virtual image may be a lock screen interface or any screen interface after unlocking.
- displaying the virtual image in the first form on the first interface may specifically include: displaying the to-be-awakened state as the first form on the first interface.
- the to-be-awakened state includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line.
- the navigation prompt line is used as a reference line for screen gesture navigation.
- the to-be-awakened state is permanently displayed on the display interface.
- the electronic device determines a response strategy based on the recognition of the user input and displays the response result on the screen.
- the response result displayed on the screen by the electronic device may be that the interface remains unchanged or that the interface has changed.
- the electronic device is a smart phone
- the user moves the probe close to the smart phone, and the face recognition hardware on the smart phone recognizes the face
- the smart phone detects the user input.
- the smart phone detects the approach of a human hand through an infrared sensor, and it can also be considered that the smart phone detects the user input.
- the screen is switched from the off-screen interface to the lock-screen interface (the lock-screen interface is essentially the bright-screen interface).
- the face recognition verification on the lock-screen interface is passed, it is unlocked to enter the desktop or the running application interface.
- the user directly double-clicks the off-screen interface, and the electronic device switches to the lock-screen interface.
- the digital assistant When the electronic device switches from the off-screen interface to the bright-screen interface and the digital assistant is not awakened, the digital assistant is displayed in the waiting-to-be-awakened state.
- the ring of the digital assistant closes inward and is displayed in the form of a gesture prompt line, which can be specifically a long strip.
- the gesture prompt line is a global control and is displayed at the bottom of the screen in any interface. It is used to provide a reference for full-screen gestures. For example, swiping up from the bottom usually swipes up from the position of the gesture prompt line.
- the digital assistant is always displayed in a location that the user can reach and in different forms.
- the digital assistant in the lock screen interface after the screen is turned off, and in the desktop or in-application interface when the user does not wake up the digital assistant through a preset operation, the digital assistant can be displayed in a waiting-to-be-awakened state. That is, the ring of the digital assistant closes to form a long strip as a system gesture prompt line (usually displayed at the bottom of the screen), and as the ring closes, the image body and the sphere in the ring disappear (which may be accompanied by a relevant disappearance animation).
- FIG. 12 is a schematic diagram of a digital assistant provided in an embodiment of the present application being displayed in a waiting state.
- the digital assistant in the off-screen interface, the digital assistant is displayed in a breathing state and can move in the off-screen interface according to a preset rule.
- FIG. 12 (b) when the user's input causes the electronic device to enter the lock screen interface from the off-screen interface, the digital assistant moves to the bottom, the ring shrinks inward, and the image body and the sphere are squeezed to be completely hidden (this process is reflected in animation, and the animation can also be hidden).
- the digital assistant becomes a strip and is displayed at the bottom of the screen as a system gesture prompt line.
- FIG. 12 (a) in the off-screen interface, the digital assistant is displayed in a breathing state and can move in the off-screen interface according to a preset rule.
- FIG. 12 (b) when the user's input causes the electronic device to enter the lock screen interface from the off-screen interface, the digital assistant moves to the bottom,
- the user has not woken up the digital assistant, so the digital assistant is always displayed in a waiting state and displayed on the interface in the form of a gesture prompt line.
- the digital assistant switches from the first form to the second form, which can be specifically switching from the wake-up state to the listening state.
- the first form is a waiting state
- the second form displayed based on the signal instruction generated by the electronic device is a listening state.
- the signal instruction that triggers the display of the virtual image in the second form can be generated after the user performs a preset gesture on the digital assistant in the first form.
- the user performs a long press gesture or a double-click gesture on the digital assistant in the first form.
- This embodiment does not specifically limit the preset gesture.
- the plurality of features include a first feature and a second feature.
- the first feature is in the main form of a ring
- the second feature is a sphere
- the ring is located on the equator of the sphere in a top-down perspective.
- the above describes an embodiment in which the first state is the waiting state, and the following describes an embodiment in which the first state is the listening state.
- the virtual image of the digital assistant is displayed on the first interface in a listening state as a first form, wherein the third user input is used to wake up the digital assistant.
- the third user input may be, for example, a user's voice input, an input from an external input device (such as a remote control or a mouse), a touch input generated by a user's direct interaction with an electronic device screen, or an input from a user's interaction with a physical button, a back panel gesture, or other shortcut operations.
- the listening state of the virtual image includes a first characteristic body (such as the aforementioned ring) and a second characteristic body (such as the aforementioned sphere), and the first characteristic body is displayed in the main state.
- a first characteristic body such as the aforementioned ring
- a second characteristic body such as the aforementioned sphere
- Figure 13 is a schematic diagram of a digital assistant switching from a waiting state to a listening state provided in an embodiment of the present application.
- the user interacts with the gesture prompt line where the digital assistant is located with gestures such as single-click, double-click, or long press to trigger the awakening of the digital assistant.
- the digital assistant is displayed in the current interface in a listening state.
- specific content may be displayed on the interface of the electronic device, and there may be less effective space on the interface to display the digital assistant.
- the virtual image of the digital assistant can be integrated with the specific content on the interface.
- the above-mentioned display of the virtual image in the first form on the first interface may specifically include: displaying the virtual image at a preset position on the first interface; or, overlaying and displaying the virtual image on a target content element in the first interface, wherein the target content element is a content element of a preset type. That is, the virtual image is overlaid on the target content element, so that the target content element is no longer displayed on the first interface.
- the target content element is, for example, a control or icon of a preset type predefined by the system.
- a logo may refer to a space or icon having a circular shape.
- the gesture event interface of the target content element can be copied to the avatar, so that the gesture-corresponding function of the target content element can be realized by performing a gesture on the avatar.
- the avatar inherits the original function of the target content element, so that when a specific gesture (such as single-click) is performed on the avatar, the gesture-corresponding function of the target content element can still be executed.
- the target content element is a shutter control in the camera interface.
- the virtual image is displayed overlaid on the shutter control of the camera interface, and clicking the virtual image can still realize the shooting of photos or videos.
- the gesture can be used to implement the function of the original control.
- the display area of the virtual image implements the function of the original content element when receiving the gesture input
- the mapping relationship between the gesture and function of the target content element can be associated with the digital assistant, so that the digital assistant triggers the corresponding function after receiving the gesture input.
- the digital assistant integrates the original function of the control, it only needs to synchronize the trigger interface of the touch instruction to the digital assistant, so that when the area where the digital assistant is located is touched, the corresponding touch instruction is generated to realize the function of the original control.
- the function corresponding to the control is executed according to the gesture type, regardless of whether there are other layers displayed above the control.
- the digital assistant function itself, it is only necessary to recognize the gesture corresponding to the digital assistant and then trigger the function corresponding to the gesture.
- Figure 14 is a schematic diagram of a virtual image overlaid on a camera shutter control provided in an embodiment of the present application.
- the digital assistant in one of the initial states, the digital assistant is not awakened, and is displayed on the interface in a state to be awakened as a navigation prompt line.
- the digital assistant When the user interacts with the digital assistant in the state to be awakened (i.e., the navigation prompt line), such as long pressing the navigation prompt line or long pressing the power button to wake up the digital assistant, the digital assistant is now in a listening state.
- the system displays the digital assistant in the listening state, it first checks the type of control in the current interface. Since the camera shutter control in the current interface is a preset type of control, the digital assistant is displayed in a fusion with the camera shutter control. After the fusion display, the digital assistant is displayed in a listening state overlaid on the position of the original camera shutter control.
- the digital assistant in another initial state, is not displayed in any state on the current interface.
- the digital assistant is called out by long pressing a control (such as the power button) or by voice, the digital assistant can also be displayed over a preset type of control or icon on the current interface.
- clicking the display area of the digital assistant's avatar can also trigger taking a photo. This can be achieved by the digital assistant passing instructions to the camera application to control taking photos, or by the camera application directly recognizing gestures.
- the navigation prompt line is permanently displayed in the system globally. Then, when the digital assistant is in the wake-up state, it can also be understood that the virtual image hides the sphere, changes the circle, and merges it with the navigation prompt line, so that the navigation prompt line not only has the function of a navigation prompt line, but also serves as the permanent state of the digital assistant virtual image.
- the digital assistant can switch between different controls or icons.
- the digital assistant can be cut and merged from the controls of the first displayed interface to the controls or icons in the latest interface in the order of display time.
- Figure 15 is a schematic diagram of a virtual image overlaid on an assistant suggestion component provided by an embodiment of the present application.
- the digital assistant is displayed in a state to be awakened. In this state, the digital assistant merges the navigation prompt line, and the deformed ring overlaps the navigation prompt line.
- the main screen page contains an assistant suggestion widget, and the icon of the virtual image is displayed in the assistant suggestion widget, then the digital assistant can flow from the navigation prompt line to the virtual icon in the assistant suggestion widget.
- the navigation prompt line when the navigation prompt line is globally resident, it is the scheme shown in (b) of Figure 15; when the navigation prompt line is only a deformed ring, then after the digital assistant flows from the navigation prompt line to the virtual icon in the assistant suggestion widget, the navigation prompt line is no longer displayed.
- the virtual image of the digital assistant also has a task execution state, which includes a first feature body.
- the first feature body in the task execution state generates at least one of a shape change and a position change.
- a digital assistant when a digital assistant performs an automated task, a series of changes will occur on the display interface of the electronic device.
- the electronic device When the digital assistant performs an automated task of calling a certain contact, the electronic device will first open the contact interface, then select a specific contact in the contact interface, and then enter the phone interface for calling the specific contact. Then, in the process interface of the digital assistant performing an automated task, the digital assistant is actually always in the task execution state, and the first feature body of the digital assistant in the task execution state can undergo various changes to show the user the automated process of the digital assistant performing the task, making the automated process of the digital assistant performing the task more intuitive and clear, and making it easier for the user to observe whether there are any errors in the process of the digital assistant performing the automated task.
- the first feature body is displayed in the form of a geometric array, and the display position of the geometric array is adapted to the target control in the process interface.
- the geometric array enters the next process interface by simulating the click event of the target control.
- the digital assistant in the process of performing the automation task, can simulate the operation of the user clicking the control on the interface with the first feature body as the geometric array, thereby visualizing the entire process of the digital assistant simulating the user to perform the automation task.
- the application icon is the execution target of this step.
- the dot matrix that changes to a circle can be displayed on the application icon.
- Figure 16 is a schematic diagram of the process of a digital assistant performing an automated task provided by an embodiment of the present application.
- the digital assistant receives the user's voice command "Share this photo to WeChat Moments”.
- the digital assistant displays the dot matrix on the "Send" control of the current interface to simulate the user's operation of clicking the "Send” control.
- the interface changes, and the current interface changes to display applications that can send photos, and these applications include the "WeChat” application; in the interface after the change, the dot matrix is displayed on the icon of the "WeChat” application to simulate the user's operation of clicking the "WeChat” application icon.
- the interface changes again, changing to an interface for editing a picture to be published in the "WeChat” application's Moments, and at this time the dot matrix is displayed on the "Publish” control to simulate the user's operation of clicking the "Publish” control.
- both the listening state and the task execution state include a first feature body, and the first feature bodies in the listening state and the task execution state have different forms.
- the first feature body includes: a geometric array in the interface, an outer frame of a digital assistant dialog card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page. That is, in different scenarios, the first feature body can be represented in different forms to better fit the content displayed on the current interface.
- the first feature body has different morphological changes corresponding to the task progress of the automated task. That is, in the process of the digital assistant performing the automated task, at different task progresses, the first feature body may change into different forms, so that the form of the first feature body always closely fits the current interface.
- the morphological change of the first feature body includes the following process: initially, the first feature body is displayed in the main form of a ring at an initial position relative to the second feature body, and the initial position is the same as or different from the position of the first feature body in the listening state; then, the first feature body is separated from the second feature body, and after being enlarged and deformed, it serves as the outer frame of the digital assistant dialogue card; secondly, the first feature body is changed from the outer frame of the dialogue card to a geometric array within the dialogue bubble of the dialogue card; finally, after the automated task is executed, the virtual image is displayed in the listening state.
- the task execution interface refers to the interface when the digital assistant receives the task instruction and transmits the instruction to the local application for execution based on the analysis of the task instruction, or the execution result interface. Therefore, the task execution interface can be part of the interface or the entire interface of the execution application.
- the morphological changes of the ring include: the ring concentric with the image body and the sphere disappears and transforms into a geometric array (such as a dot matrix) of the task execution interface on the screen, or an outer frame surrounding the task execution interface.
- a geometric array such as a dot matrix
- the process of transforming from a ring to a geometric array or an outer frame can be accompanied by animation.
- the ring is decomposed into several points, and then laid on the task execution interface to form a dot matrix; the ring is deformed and elongated, transformed into a frame, and enlarged into the outer frame of the task execution interface.
- Figure 17 is a schematic diagram of a digital assistant provided by an embodiment of the present application, wherein a circular ring is converted into a dot matrix and laid on a task execution interface.
- (a) of Figure 17 corresponds to the listening state of the digital assistant, and the language instruction input by the user at this time is "help me order a cup of Xinba coffee."
- Figure 17(b) corresponds to the thinking state of the digital assistant (where the thinking state is a dynamic image, the ring keeps rotating according to a certain rule, and Figure 17(b) is a certain instantaneous state).
- the thinking state corresponds to the digital assistant's task parsing process.
- the interface for users to interact with the digital assistant shows the process of the digital assistant parsing the task: "Recommendations are being made for you based on Xinba's historical orders.”
- the task execution phase begins, and the digital assistant is in the task execution state, corresponding to Figure 17 (c).
- the ring of the digital assistant icon at the bottom disappears, and the task card obtained by executing the task is displayed above (i.e., a dialogue bubble, but presented in the form of a card).
- a number of dots are displayed on the task card, which is formed by the transformation of the original ring of the digital assistant.
- the dot matrix can disappear after being displayed for a period of time.
- the dot matrix of the upper task card has disappeared, and the lower task card shows an instantaneous state in the process of the dot matrix disappearing (the dot matrix and interface content are displayed at the same time, and both are displayed translucently).
- the dot matrix disappears. That is, for the task execution interface in Figure 17, the display process is: only the dot matrix is displayed ⁇ the dot matrix gradually disappears, and the interface content is gradually displayed (the dot matrix and the interface exist at the same time) - only the interface content is displayed.
- the task execution interface can be an input box or a pop-up card interface.
- These cards can be components or subroutines provided by locally installed applications.
- Figure 18 is a schematic diagram of using a digital assistant to input text on the desktop provided by an embodiment of the present application.
- the digital assistant is enabled by long pressing the power button. At this time, the digital assistant is in a listening state, and the digital assistant calls the microphone to obtain the user's voice input.
- the digital assistant receives the voice input, and fills the voice into the input box after converting it into text; at this time, the digital assistant is in a task execution state, the ring disappears from the outside of the sphere, and is enlarged and deformed and transferred to the outside of the search box to form an outer frame surrounding the search box.
- the search widget is expanded into a search card, and the outer frame converted by the ring can be expanded and surround the search card; of course, the outer frame converted by the ring can also disappear after the search card is presented, or disappear after a preset time.
- the ring in the initial state (undeformed) can be displayed outside the sphere.
- Figure 19 is a schematic diagram of a ring of a digital assistant provided in an embodiment of the present application deforming and returning to its original state. As shown in (a) of Figure 19, after the digital assistant is initially called out, the digital assistant is displayed in a listening state, and the ring on the digital assistant is displayed outside the sphere.
- the digital assistant is displayed in the task execution state.
- the ring of the digital assistant deforms outward, overlaps with the border position of the digital assistant's dialogue card, and is displayed on the border or outside the border, or displayed on the upper layer of the border.
- the ring of the digital assistant becomes the outer frame of the dialogue card.
- the task execution interface is the application interface currently being operated.
- a smart recognition result is usually generated, which is one or more executable objects obtained by recognition (such as smart removal of people/smart removal of objects in album editing), then the task execution interface can be the area where the executable object is located.
- the dot matrix corresponding to the circle can also be adapted to the task execution process, that is, in the recognition process stage, the dot matrix is spread all over the interface, and after the executable object is recognized, the dot matrix is displayed only in the area of the executable object.
- Figure 20 is a schematic diagram of the morphological changes of a digital assistant in a smart recognition scenario provided by an embodiment of the present application.
- the user calls out the digital assistant and uses the smart recognition function.
- a picture is displayed on the interface of the smartphone, and the picture is displayed in full screen. After that, the user long presses anywhere on the interface to trigger the smart recognition function.
- the icon of the digital assistant is displayed on the current interface. When the digital assistant just appears, the digital assistant is in a listening state.
- the digital assistant performs smart recognition on the picture.
- the ring of the digital assistant disappears from the outside of the sphere and is converted into a dot matrix displayed on the interface, and the dot matrix fills the screen.
- the dot matrix fills the screen.
- the points of the dot matrix may not be a regular dot matrix, but can move dynamically in the display area where the executable object is located. The dot matrix becomes multiple irregular discrete points.
- the system defines long pressing the power button as a shortcut operation for smart recognition
- the operating system displays the digital assistant icon on the interface and directly displays the task execution status, that is, after long pressing the power button, it directly enters the display interface of (b) in Figure 20.
- the image body and sphere of the digital assistant may not be displayed on the interface, but only the changed shape of the ring body may be displayed.
- the dot matrix may be displayed without displaying the sphere and image body below.
- displaying the virtual image in the first form on the first interface specifically includes: in the initial stage, only the first feature body is displayed, and the second feature body is not displayed, and the first feature body is initially displayed in a geometric array, covering the entire screen; then, the first feature body is changed from the geometric array to be displayed in the main form, and the second feature body is displayed.
- the digital assistant when waking up the digital assistant, the digital assistant first appears in the form of a geometric array (such as a dot matrix) and gradually fills the screen; then, the digital assistant gradually transforms from the geometric array to the main form of the first feature body (i.e., a ring) and displays the second feature body (i.e., a sphere), thereby realizing the awakening process of the digital assistant in the form of an animation, increasing the intuitiveness of the interaction process, and helping users to identify the current working status of the digital assistant.
- a geometric array such as a dot matrix
- the geometric array may be, for example, a dot matrix, that is, an array formed by a plurality of dots arranged in a regular pattern.
- the geometric array may also be an array formed by other geometric shapes (such as polygons such as triangles and rectangles) arranged in a regular pattern.
- the first feature body is initially displayed in a geometric array, which may specifically include: first determining an initial display position of the geometric array; secondly, displaying a portion of the geometric pattern of the array at the initial display position, and then gradually spreading the geometric pattern to cover the entire screen. For example, first displaying a portion of the dot matrix from an initial display position, and then gradually spreading the dot matrix to cover the entire screen.
- the initial display position of the geometric array is determined based on user input.
- the user input refers to the input used to wake up the digital assistant.
- the initial display position may be near the user input that triggers the waking up of the digital assistant. For example, when the user input is a long press of the power button of a smartphone, the initial display position of the geometric array may be on the screen near the power button.
- the initial position is a default position, which is an area where the virtual image is displayed, such as a preset area at the bottom of the screen.
- the initial display position of the geometric array is determined by user input, which can make the display process of the digital assistant more closely aligned with the user input, enhance the intuitiveness of the interaction between the user and the digital assistant, and improve the user experience.
- the appearance of the dot matrix can have a preset pattern.
- the initial display position of the digital assistant is determined based on the user's operation, and the dot matrix is expanded from the initial display position to the full image display.
- the display position of the digital assistant's virtual image is used as the initial display position, and the dot matrix is expanded from the initial position to the full image display.
- Figure 21 is a schematic diagram of the appearance of a digital assistant in a smart recognition scenario provided by an embodiment of the present application.
- the digital assistant when the user wakes up the digital assistant by long pressing the power button on the right side of the smartphone, the digital assistant is initially displayed in the form of a dot matrix on the screen near the power button, and then gradually spreads to the entire screen.
- the digital assistant When the user wakes up the digital assistant by long pressing the digital assistant at the bottom of the screen, the digital assistant is initially displayed in the form of a dot matrix near the area where the virtual image of the digital assistant is located at the bottom, and then gradually spreads to the entire screen.
- the morphological changes of the digital assistant may also be related to the scene information, so that some information in the current usage scenario can be accurately represented through the digital assistant's virtual image.
- the electronic device may obtain scene information and determine the first feature of the digital assistant based on the scene information.
- the target form of the body is used to display the first feature body in the target form. That is, the target form is related to the scene information.
- the electronic device can display the first feature body in the specific form.
- the scene information may include external environment information and system event information.
- External environment information is a signal obtained by a sensor to indicate the external environment, which may be obtained through a sensor on an electronic device or a network, such as temperature, humidity, weather and other external environment information.
- System event information is an event obtained from the system of an electronic device and occurring within a preset time, such as music being played, an alarm notification within a preset time, searching for a file and other event information.
- the ring can be deformed based on the external environment information. For example, on a rainy day, the ring shape changes into an umbrella shape.
- Figure 22 is a schematic diagram of a digital assistant deformed based on scene information provided in an embodiment of the present application. As shown in Figure 22, after calling out the digital assistant, if the current interface is maintained, the digital assistant is always displayed in a listening state. After maintaining a certain state (such as the listening state in Figure 22) for a preset time, the ring of the digital assistant is deformed based on the scene information. In Figure 2, the current weather is light rain, so the ring becomes an umbrella shape.
- the ring can also be deformed based on system event information. For example, when the screen is turned off to play music, the ring of the digital assistant changes to the shape of headphones; when there is an alarm notification within the preset time, the ring of the digital assistant changes to the shape of a physical alarm clock. For another example, when the user calls the digital assistant to search for a file, the ring of the digital assistant changes to a magnifying glass.
- the state to be awakened includes at least a first feature body, and the first feature body is currently displayed in a long strip.
- the electronic device In response to the electronic device detecting that the focus of the human eye falls on the area where the screen is located, the electronic device displays a second feature body.
- the digital assistant when the digital assistant is in the wake-up state, only a long bar is displayed; when the digital assistant detects that the user's eyes fall on the screen, it can be considered that the user has an interactive tendency, so the second feature body is displayed, thereby indicating that the digital assistant detects that the user has an interactive tendency and is waiting for interaction, so as to improve the naturalness of human-computer interaction.
- the second feature body in response to detecting a human voice, may move relative to the first feature body in a direction closer to the source of the human voice. Specifically, when the second feature body just appears, its position is relatively fixed, for example, located directly above or below the first feature body.
- the electronic device may move the second feature body according to the direction of the human voice, so that the second feature body moves with the human voice to improve the naturalness of the interaction.
- the in-vehicle terminal can also be resident on the display screen of the vehicle (such as the central control screen) and interact with the user visually through at least one deformation in a ring or a sphere, thereby keeping the user's attention on the digital assistant and enhancing the sense of conversation.
- scene information mainly includes human information, system information and external environment information.
- the human information may come from at least one of the driver and the passenger, including head orientation, eye contact, gestures, posture, and voice, and may be distinguished as the main driver or the front passenger (it is usually difficult for the back seat to interact with the central control screen).
- System information may refer to changes in manual or adaptive adjustments of vehicle control functions, such as driving mode, air conditioning settings, window defogger, seat belt status changes, etc., as well as some vehicle condition change information, such as changes in vehicle speed, driving direction, vehicle bumps, special traffic conditions, etc.
- vehicle condition change information such as changes in vehicle speed, driving direction, vehicle bumps, special traffic conditions, etc.
- system information may also include details such as the type of the aforementioned running application.
- External environmental information usually includes changes in weather, light, temperature, etc.
- virtual images corresponding to different states can trigger different deformations of the virtual form, such as the deformation of the above-mentioned ring, or the deformation of the sphere.
- the central control screen basically does not have an off state, so the digital assistant on the vehicle terminal is displayed in the wake-up state after it is turned on. In this way, in the wake-up state, the digital assistant has different deformation states according to the different scene conditions detected.
- FIG. 23 is a schematic diagram of the changes of a digital assistant on a vehicle-mounted terminal provided by an embodiment of the present application.
- the initial state of the digital assistant to be awakened is to display only the deformed ring.
- the driver or co-driver is detected looking at the central control screen, a part of the sphere is displayed. If a human voice is further detected on this basis, more sphere area is displayed, such as a hemisphere.
- the ring can be elongated, and the sphere is located on the side of the ring in the direction of the voice source.
- the hemisphere when the driver's voice is detected, the hemisphere is located on the side of the elongated ring body close to the driver, that is, the hemisphere moves toward the driver; when the co-driver's voice is detected, the hemisphere is located on the side of the elongated ring body close to the co-driver, that is, the hemisphere moves toward the co-driver.
- a digital assistant can be displayed on the screen of the electronic device based on the activation or alarm of the vehicle-mounted function, and the corresponding function or alarm information can be indicated by the deformation or movement of the first feature body.
- a digital assistant can also be used to prompt vehicle condition information. For example, when it is detected that the vehicle is in a bumpy road environment, the awakened digital assistant is displayed, and the sphere vibrates up and down along the screen where the ring is located. Alternatively, the sphere and the ring vibrate up and down together, and the vibration frequency of the two can be consistent or inconsistent.
- the awakened digital assistant is displayed, and the sphere is rendered in white, and then the ring uses the x-axis as the fixed axis to produce an up and down swing in the z-axis direction, and as it swings, the transparency of the white rendered on the sphere becomes lower until it is completely transparent and the ring stops swinging.
- the awakened digital assistant is displayed, and after the ring body is tilted at a preset angle, a part of the ring body is broken in the middle and becomes an arc, and the arc gradually closes to form a ring.
- the above mainly introduces the process of the digital assistant's form changing when the electronic device is a smartphone or a car terminal.
- the following will introduce the process of the digital assistant's form changing when the electronic device is a smart TV.
- Figure 24 is a flow chart of a display method of a digital assistant provided in an embodiment of the present application.
- the display method of the digital assistant includes the following steps 2401-2403, and the display method of the digital assistant is applied to an electronic device equipped with a digital assistant.
- Step 2401 obtain the user's location and display the first form of the digital assistant based on the user's location.
- the electronic device configured with the digital assistant may be deployed with a sensor, such as an infrared sensor, an ultrasonic sensor, or an optical distance sensor, so that the electronic device can detect the user's position based on the deployed sensor.
- a sensor such as an infrared sensor, an ultrasonic sensor, or an optical distance sensor
- the electronic device emits infrared light through the infrared sensor and receives infrared light returned by the user's body, and then calculates the direction and distance of the user relative to the electronic device, and then determines the user's position.
- the electronic device may also obtain the user's location in other ways, such as obtaining the user's location from other devices.
- the electronic device when the electronic device is a smart TV or smart speaker in a smart home, the electronic device can connect to other smart devices (such as home cameras, sweeping robots, or smart routers) through the network, so that after the other smart devices determine the user's location through their own sensors or positioning modules, the electronic device can obtain the user's location from the other smart devices.
- other smart devices such as home cameras, sweeping robots, or smart routers
- the first form of the digital assistant can be displayed on the display screen of the electronic device based on the user's location, that is, the first form of the digital assistant displayed by the electronic device is related to the user's location.
- the farther the user's location is from the electronic device the lower the user's tendency to interact with the electronic device, so the electronic device can display a digital assistant form with a lower degree of interaction intention; the closer the user's location is to the electronic device, the higher the user's tendency to interact with the electronic device, so the electronic device can display a digital assistant form with a higher degree of interaction intention.
- Step 2402 obtaining the user's spatial interaction behavior, where the spatial interaction behavior is used to indicate the user's activities in the space.
- the electronic device After determining the user's position and displaying the first form of the digital assistant, the electronic device can further capture the user's spatial interaction behavior, that is, the user's activities in the space, through sensors, and then determine the user's further interaction tendencies.
- the user's spatial interaction behavior may include, but is not limited to, any of the following behaviors: approaching the electronic device, moving away from the electronic device, moving to the left, moving to the right, pointing a finger at the electronic device, pointing an arm at the electronic device, facing the face toward the electronic device, and a preset gesture.
- the user's spatial interaction behavior can be used to indicate the interaction tendency between the user and the digital assistant of the electronic device. For example, when the user's spatial interaction behavior is approaching the electronic device, it means that the user expects to conduct further interaction with the digital assistant of the electronic device; when the user's spatial interaction behavior is moving away from the electronic device, it means that the user does not expect to conduct further interaction with the digital assistant of the electronic device.
- Step 2403 in response to the spatial interaction behavior, display the second form of the digital assistant.
- the electronic device switches from the first form of the digital assistant to displaying the second form of the digital assistant.
- the first form of the digital assistant and the second form of the digital assistant are used to represent different states of the digital assistant and the user's interaction process, thereby facilitating the user to identify the current interaction state of the digital assistant and issue corresponding interaction instructions.
- the second form of the digital assistant displayed by the electronic device may be related only to the spatial interaction behavior. That is, after the electronic device obtains the user's spatial interaction behavior, no matter what the first form of the digital assistant currently displayed is, it determines to display the corresponding form of the digital assistant (i.e., the second form mentioned above) according to the pre-established mapping relationship between the spatial interaction behavior and the form of the digital assistant.
- the second form of the digital assistant displayed by the electronic device may also be related to the first form of the digital assistant and the spatial interaction behavior displayed by the electronic device.
- the electronic device is actually based on the first form currently displayed and the user
- the spatial interaction behavior of the digital assistant is used to determine the second form of the digital assistant that needs to be displayed.
- the digital assistant can have multiple forms, and different forms correspond to different degrees of interaction intention between the user and the electronic device as judged by the electronic device.
- the electronic device When the electronic device judges that the user expects the degree of interaction intention with the electronic device to increase based on the acquired spatial interaction behavior, the electronic device can determine to display a second form with a higher degree of interaction intention than the first form based on the first form of the digital assistant currently displayed, and display the second form; conversely, when the electronic device judges that the degree of interaction intention between the user and the electronic device decreases based on the acquired spatial interaction behavior, the electronic device can determine to display another second form with a lower degree of interaction intention than the first form based on the first form of the digital assistant currently displayed, and display the other second form.
- the electronic device may divide the area near the electronic device into multiple areas based on the location of the electronic device, and different areas are not the same relative to the location of the electronic device.
- the electronic device can be divided into three areas, namely the first area, the second area and the third area.
- the electronic device is located in the first area
- the second area is located outside the first area relative to the electronic device and surrounds the first area
- the third area is located outside the second area relative to the electronic device and surrounds the second area.
- the first area, the second area and the third area may be areas divided with the electronic device as the center, and the plane shape of the first area is a circle or an ellipse, the plane shape of the second area is a ring surrounded outside the first area, and the plane shape of the third area is a ring surrounded outside the second area.
- the electronic device recognizes that the user's spatial interaction behavior is moving between different areas, it can accurately determine the user's interaction tendency with the electronic device, and then adjust the form of the displayed digital assistant.
- the electronic device switches the first form of the digital assistant to the second form.
- the first target area belongs to any one of the above-mentioned first area, second area and third area, and the second target area can be an area adjacent to the first target area.
- the electronic device may display a second form of the digital assistant; wherein the second form is the form corresponding to the second target area.
- each area divided around the electronic device corresponds to a form of the digital assistant, and when the user moves from one area to another, the form of the digital assistant is switched so that the currently displayed form is the form corresponding to the area where the user moves.
- the second form is used to indicate that the digital assistant is in a state of waiting to receive instructions.
- the first target area is outside the second target area
- the user moves from the first area to the second area, which means that the user moves from an area farther away from the electronic device to an area closer to the electronic device, that is, the user is approaching the electronic device, so the electronic device can recognize that the user has a further tendency to interact, thereby switching the form of the digital assistant to show the user that the digital assistant has switched from a non-task processing state to a state of waiting to receive instructions.
- the non-task processing state may be a state indicating that the digital assistant has no tasks to be processed and is in a dormant state;
- the state of waiting to receive instructions may refer to a state in which the digital assistant has been awakened and is ready to receive instructions at any time.
- the state of waiting to receive instructions of the digital assistant is a state with a higher degree of interaction intention than the state of non-task processing.
- the second form is used to indicate that the digital assistant is in a state to be received.
- the user moves from the first target area to the second target area, which means that the user has a further tendency to interact, thereby switching the form of the digital assistant to show the user that the digital assistant has switched from a state to be received to a state to be received.
- the instruction receiving state may refer to a state in which the digital assistant has received instructions issued by the user through the corresponding hardware on the electronic device, that is, the hardware receiving the instructions is already in a working state, for example, the digital assistant receives voice instructions issued by the user through the microphone on the electronic device.
- the instruction receiving state of the digital assistant is a state with a higher degree of interaction intention than the state to be received.
- the second form is used to indicate that the digital assistant is in a state of waiting to receive instructions
- the second target area is outside the first target area
- the second form is used to indicate that the digital assistant is in a state of no task processing. It is understandable that when the second target area is outside the first target area, the user moves from the first target area to the second target area, which means that the user is moving away from the electronic device, so the electronic device can recognize that the user has no further interaction tendency, and thus switches from the state of waiting to receive instructions to the state of no task processing with a lower degree of interaction intention.
- the user's interaction tendency with the digital assistant is identified by obtaining the user's movement between areas with different distances from the electronic device, and then the morphology of the digital assistant is adjusted. This can well fit the relationship between social status and distance between humans and improve the naturalness of the morphology change of the digital assistant.
- the electronic device may also obtain the user's spatial activity behavior.
- the spatial activity behavior is used to indicate the user's activities in the space.
- the spatial activity behavior is the behavior performed by the user without the intention to interact with the digital assistant, that is, the spatial interaction behavior can be used to indicate that the user does not have the intention to interact with the digital assistant.
- the electronic device displays the adjusted first form of the digital assistant; wherein the adjusted first form is obtained by adjusting at least one feature of the first form, and the adjusted first form and the first form are used to represent the same state in the process of interaction between the digital assistant and the user. That is, in the case where the user does not have an intention to interact, the spatial activity behavior performed by the user will change the features of the form of the digital assistant, but will not change the form of the digital assistant, that is, will not cause the interactive state of the digital assistant to change.
- the features adjusted in the first form may include, but are not limited to, the following features of the feature body used to represent the digital assistant: the posture of the feature body, the movement law of the feature body, the light effect of the feature body, the material of the feature body, the color of the feature body, the brightness of the feature body, etc.
- the feature body of the digital assistant such as a ring body
- the feature body of the digital assistant may rotate with the user; for another example, when the user points his finger in different directions, the feature body of the digital assistant may rotate or rotate with the position pointed by the user's finger; for another example, when the user gradually approaches the display screen of the electronic device, the ring body of the digital assistant may rotate and change the pitch angle.
- the first form and the second form of the digital assistant both include multiple features, and compared with the first form, the posture and/or motion law of at least one feature in the second form is changed. That is, the different forms of the digital assistant are all composed of the same multiple features, and the difference between the different forms is that the posture and/or motion law of at least one feature has changed.
- different feature bodies can independently adjust posture and movement rules.
- the digital assistant in different states can be represented, forming a unified style of feedback method that responds to spatial interactive behavior input, which helps users understand the feedback of the digital assistant.
- the multiple feature bodies used to represent the image of the digital assistant include a torus, a sphere, and an image body.
- the image body is wrapped in the sphere, and the torus surrounds the sphere. That is, in the image of the digital assistant, from the inside to the outside, they are the image body-the sphere-the torus.
- the sphere may be transparent or transparent so that the user can see through the sphere to see the image body wrapped in the sphere.
- the image body may be an object of any shape designed for the digital assistant, such as an object similar to the shape of the human body, an object of irregular shape, etc.
- the first form or the second form when used to indicate that the digital assistant is in a non-task processing state, the first form or the second form is that multiple feature bodies move along a preset trajectory, and the sphere and the image body rotate during the movement.
- the form of the digital assistant can be manifested as moving according to predetermined rules without any interactive intention, that is, multiple feature bodies move along preset trajectories and the sphere and the image body rotate during the movement.
- the first form or the second form when used to indicate that the digital assistant is in a state of waiting to receive commands, the first form or the second form is that a partial area in the ring body has a light effect, and the partial area is facing the user's position.
- the digital assistant When the digital assistant is in the state of waiting to receive instructions, it means that the user has the tendency to interact with the digital assistant, and the digital assistant is ready to receive the user's instructions at any time. Therefore, the digital assistant can be manifested as always paying attention to the user (that is, the area with light effects is facing the user's position) and waiting for the user to issue instructions.
- the second form when the second form is used to indicate that the digital assistant is in a command receiving state, the second form is that the top or bottom of the multiple features are facing the user. That is, when the digital assistant is in a command receiving state, it means that the digital assistant is receiving the user's command, so the form of the digital assistant can be expressed as the state in which the digital assistant is concentrating on receiving the command (that is, the top or bottom of the multiple features are facing the user, indicating listening to the user's command).
- the first form and the second form are both forms of a virtual human body, and the different forms of the virtual human body differ in any one or more of the following features: the expression, action, and size of the virtual human body.
- the form of the digital assistant is also composed of multiple feature bodies, and the multiple feature bodies can be divided according to human body parts, that is, different feature bodies correspond to
- the plurality of feature bodies can be divided into a head feature body, an eye feature body, a mouth feature body, a left hand feature body, a right hand feature body, etc.
- Figure 25 is a schematic diagram of an application scenario of the display method of the digital assistant provided in an embodiment of the present application.
- the electronic device 402 can be, for example, a smart TV in the user's home, and a digital assistant is deployed on the smart TV.
- the user 401 can control the electronic device 402 through voice commands, so that the electronic device 402 automatically executes certain applications, such as automatically opening an application for playing movies and TV shows.
- the electronic device 402 is configured with hardware such as a distance sensor and a microphone, wherein the distance sensor is used to detect the user's position and the microphone is used to receive the user's voice command.
- the electronic device 402 is equipped with a digital assistant in software to process the user's voice command, thereby automatically executing certain applications according to the voice command.
- a smart TV as an example of an electronic device that executes the display method of a digital assistant, and will describe in detail the process of a smart TV executing the display method of a digital assistant.
- FIG. 26 is a schematic diagram of a digital assistant in a dormant state provided by an embodiment of the present application.
- the smart TV when the smart TV is not used for a long time and there are no users around the smart TV, the smart TV enters a lock screen standby state and displays the lock screen content on the display screen.
- the digital assistant is not displayed on the display screen of the smart TV, and the digital assistant runs in the background, waiting to be awakened.
- the lock screen content displayed on the display screen may be content unrelated to the digital assistant.
- the lock screen content displayed on the display screen may also be a certain form of the digital assistant, such as the breathing form of the digital assistant.
- the smart TV when the smart TV detects a user through a sensor, it can first determine the user's location based on the sensor, and then determine the area where the user is located based on the user's location, and determine the initial display form of the digital assistant (i.e., the first form mentioned above) according to the area where the user is located.
- the initial display form of the digital assistant i.e., the first form mentioned above
- a preset logic may be set in the smart TV to divide the space around the smart TV into multiple areas.
- different forms of digital assistants are displayed accordingly.
- Figure 27 is a schematic diagram of a regional division provided in an embodiment of the present application.
- the space around the smart TV is divided into four areas according to different radii, from the outside to the inside: non-perception area A1, presence perception area A2, natural behavior area A3 and typical task area A4.
- three radii are defined: R1, R2 and R3, where R1>R2>R3.
- the distance between the non-perception area A1 and the smart TV is greater than R1
- the distance between the presence perception area A2 and the smart TV is between R1 and R2
- the distance between the natural behavior area A3 and the smart TV is between R2 and R3
- the distance between the typical task area A4 and the smart TV is less than R3.
- the non-perception area corresponds to the breathing state
- the presence perception area corresponds to the awakening state
- the natural behavior area and the typical task area correspond to the listening state.
- the smart TV When the digital assistant of the smart TV is activated, for example, the user activates the digital assistant through the wake-up word/remote control, the smart TV will determine an initial form of the digital assistant from the breathing state, the wake-up state and the thinking state according to the user's position, and display it on the screen.
- the initial state corresponding to the non-sensing area is a breathing state
- the smart TV activates the main program of the digital assistant and displays the breathing state of the digital assistant.
- the awakening state of the digital assistant is displayed.
- the smart TV After the smart TV displays the initial form of the digital assistant based on the user's location, the smart TV will also capture the user's activities in the space through the sensors deployed by itself, and parse the sensor data according to preset logic to identify whether there is spatial interaction behavior and the type of spatial interaction behavior from the captured user spatial activities.
- a smart TV uses a distance sensor to sense the user's position and distance relative to the TV, and determines whether the user is walking towards the TV based on the changes in position and distance over a period of time. If it is detected that the distance between the user and the TV continues to decrease over a period of time, and the real-time distance between the user and the TV is less than a preset value, it is considered that a walking input is detected. Similarly, when the user moves left or right relative to the TV, the user's horizontal left/right movement input can be detected.
- the smart TV After detecting the user's spatial interaction behavior, the smart TV can adjust at least one feature of the initial form of the digital assistant according to the type of spatial interaction behavior, thereby achieving form switching of the digital assistant.
- Figure 28 is a schematic diagram of a switching display of the form of a digital assistant provided in an embodiment of the present application.
- the digital assistant in the smart TV when the digital assistant in the smart TV is activated, the user is in the non-perception zone A1, and the breathing state of the digital assistant is displayed on the smart TV, that is, the ring body moves regularly up and down along the z-axis.
- the smart TV When the user is walking in the non-perception zone A1, the smart TV will not perceive the spatial input of the non-perception zone A1, so the user's movement will not affect the form of the digital assistant, that is, the smart TV always displays the breathing state of the digital assistant.
- the smart TV detects that the user's spatial interaction behavior approaches the smart TV, and the user is in the presence sensing area A2, so the smart TV switches the digital assistant to the awake state.
- the digital assistant is in the awake state, the ring body is fixed on the equatorial plane of the sphere and rotates around the z-axis following the user's movement.
- the light effect area on the ring body points to the user.
- Figure 29 is a schematic diagram of the change of a digital assistant in the awake state provided by an embodiment of the present application.
- the state of the digital assistant switches from the awake state to the listening state, and the position and movement law of the ring body change.
- the ring body rotates 90 degrees around the y-axis from the equatorial plane of the sphere to the yoz plane.
- the light effect can change, and the entire ring body is filled with light effects.
- the movement law of the sphere and the image body changes. For example, after the sphere and the image body rotate 90 degrees around the y-axis once, they rotate continuously around the z-axis.
- the digital assistant displayed on the smart TV switches from the awake state to the listening state, indicating that the digital assistant is receiving the voice command issued by the user.
- the digital assistant When the digital assistant is in the listening state, when the user inputs a voice command, for example, the voice command is "What's the weather like today?", the digital assistant will parse the voice command and perform the corresponding operation.
- the smart TV may display the thinking state of the digital assistant, and the display duration of the thinking state is positively correlated with the length of time the digital assistant takes to parse the voice command and perform the corresponding operation.
- the digital assistant when the user is in the natural behavior area A3, the digital assistant is in a listening state.
- the user When the user inputs a voice command to the digital assistant, the user often needs to first input a voice command as a wake-up command, such as "Hello, Xiaoyi” or "Xiaoyi Xiaoyi", so that the digital assistant can receive and parse the voice command after the wake-up command.
- a voice command such as "Hello, Xiaoyi" or "Xiaoyi Xiaoyi”
- the digital assistant when the user enters the typical task area A4, the digital assistant is still in a listening state, but the user can directly input a voice command to the digital assistant without having to input a voice command as a wake-up command.
- the digital assistant can change its own display form according to the content of the user's voice command.
- the digital assistant's form can be to dynamically change color or rotation rhythm to follow the rhythm of the music, that is, the digital assistant will also adjust its own display form accordingly when performing music playback operations.
- the digital assistant's form can change in brightness, color, or material following the action of turning on the lights, turning off the lights, or the type of weather being reported.
- the digital assistant when the digital assistant parses voice commands and performs corresponding tasks based on the content of the voice commands, the digital assistant can change its own display form according to the progress of its own task execution. For example, the digital assistant can gradually change the material of the ring body, change the area of the ring body filled with specific materials, etc. according to the progress of the task.
- the smart TV can decide based on the user's spatial interaction behavior and the area where the user is located after moving. Display the state of the digital assistant, so as to realize the state switching of the digital assistant. For example, when the user moves from the natural behavior area A3 to the presence perception area A2, the digital assistant displayed on the smart TV switches from the listening state to the awake state; when the user moves from the presence perception area A2 to the non-perception area A1, the digital assistant displayed on the smart TV switches from the awake state to the breathing state.
- the above describes the morphological change process of the digital assistant when the main body image of the digital assistant is composed of a torus, a sphere, and an image body.
- the following describes the morphological change process of the digital assistant when the main body image of the digital assistant is composed of a virtual human body.
- the main body image of the digital assistant can be a virtual human body, and the shape of the digital assistant is also composed of multiple feature bodies.
- the multiple feature bodies can be divided according to human body parts, that is, different feature bodies correspond to different parts of the virtual human body.
- the multiple feature bodies can be divided into head feature bodies, eye feature bodies, mouth feature bodies, left hand feature bodies, right hand feature bodies, etc.
- the smart TV can also divide the space near the smart TV into multiple areas with the location of the smart TV as the center of the circle. In this way, after the digital assistant in the smart TV is activated, the smart TV can display the corresponding form of the digital assistant according to the area where the user is located in the space and the user's spatial interaction behavior.
- the space near the smart TV can be divided into three areas, area 1 is an area 1.2-3.7 meters away from the smart TV, area 2 is an area 0.5-1.2 meters away from the smart TV, and area 3 is an area less than 0.5 meters away from the smart TV.
- the smart TV When the smart TV recognizes that the user is in area 1, the smart TV can consider the distance between the user and the digital assistant to be an identification distance, and the spatial interaction behaviors of the user obtained by the smart TV include: the user faces the digital assistant, the user's gaze is on the digital assistant, and the user gazes at the digital assistant for more than 3 seconds, then the smart TV can display the form of the digital assistant as a form with expressions and actions.
- the form of the digital assistant can be a virtual human body representing the digital assistant looking at the user, blinking and smiling at the user, and the body and head swaying slightly, indicating that the digital assistant is willing to interact with the user.
- 1.2 meters can be used as the dividing distance to control the virtual human body representing the digital assistant to look in the direction of different users in turn.
- the smart TV When the smart TV recognizes that the user is in area 2, the smart TV can consider the distance between the user and the digital assistant to be a greeting distance, and the spatial interaction behavior of the user obtained by the smart TV includes: the user faces the digital assistant, the user's gaze point is on the digital assistant, and the user gazes at the digital assistant for more than 3 seconds, then the smart TV can display the form of the digital assistant as a form with expressions and actions, and the smart TV outputs the corresponding voice of the digital assistant.
- the form of the digital assistant can be a virtual human body representing the digital assistant waving to the user and greeting the user; in addition, the voice of the digital assistant output by the smart TV can be to inquire about the user's needs, for example, the voice of the digital assistant can be specifically "Excuse me, how can I help you?"
- the smart TV When the smart TV recognizes that the user is in area 3, the smart TV can consider that the distance between the user and the digital assistant is a manual touch screen operation distance, and the spatial interaction behavior of the user obtained by the smart TV includes: the user faces the digital assistant, the user's gaze point is on the digital assistant, and the user gazes at the digital assistant for more than 3 seconds, then the smart TV can display the task information that requires touch screen operation on the display screen, and proportionally reduce the main image of the digital assistant.
- H is the projected height of the main image of the digital assistant on the screen
- the actual height of the main image of the digital assistant is the real height of the virtual human body representing the digital assistant.
- the real height of the male virtual human body can be, for example, 174cm
- the real height of the female virtual human body can be, for example, 161cm.
- the specific value can be adjusted according to the actual situation, and this embodiment does not make specific limitations on this.
- FIG. 31 is a schematic diagram of the structure of a display device for a digital assistant virtual image provided by an embodiment of the present application.
- the display device for the digital assistant virtual image is applied to an electronic device, and the virtual image includes multiple feature bodies.
- the device includes:
- Display module 3101 used for displaying a virtual image in a first form on a first interface
- the display module 3101 is further used to detect the signal instruction generated by the electronic device and display the virtual image in the second form;
- the signal instruction is generated based on the first user input, the first form and the second form display at least one of the multiple feature bodies, and the change from the first form to the second form means that at least one feature body undergoes at least one of a shape change, a position change, a motion law change, and a display area change.
- the plurality of feature bodies include a first feature body, and the shape change of the first feature body includes the following changes: At least one of: zoom in, zoom out, stretch, compress, rotate, and switch between main form, geometric array, or stripe.
- the plurality of feature bodies further include a second feature body, wherein the main shape of the first feature body is a ring, the second feature body is a sphere, and the center of the ring and the center of the sphere are co-pointed;
- the position change of the first feature body includes at least one of a change in orbital position after flipping around the sphere and a change in position caused by overall movement or rotation of the virtual image;
- the motion law of the first characteristic body includes that the ring rotates around the sphere according to a preset trajectory with the center of the circle as the fulcrum;
- the motion law of the second characteristic body includes that the sphere moves back and forth according to a preset law along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located.
- the display module 3101 is further configured to detect a second user input and display a third feature body on the first feature body, where the third feature body can move on the first feature body and indicate the user's position.
- the virtual image has a breathing state
- the virtual image is displayed in a first form on the first interface, including:
- the first interface is a screen-off interface
- the first form is a breathing form including a first feature body and a second feature body, and the first feature body is displayed in a main form.
- the virtual image in a breathing state reciprocates on the screen according to a preset rule.
- the display module 3101 is further used to:
- a second user input is detected, and a third feature is displayed on the first feature, the third feature being movable on the first feature and indicating the user's position.
- the virtual image has a waiting state and a listening state
- the first interface is a lock screen interface or any screen interface after unlocking
- Displaying a virtual image in a first form on a first interface includes:
- the first state to be awakened is displayed on the first interface as the first form, the state to be awakened includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line, and the navigation prompt line is used as a reference line for screen gesture navigation; wherein, when no input for waking up the digital assistant is received, the state to be awakened is permanently displayed on the first interface;
- a third user input is detected and displayed on the first interface in a listening state as a first form.
- the third user input is used to wake up the digital assistant.
- the listening state includes a first feature body and a second feature body, and the first feature body is displayed in a main form.
- the first interface is a lock screen interface or any screen interface after unlocking, and the first state is a listening state;
- Displaying the virtual image in the first form on the first interface specifically includes:
- the first feature is first displayed as a geometric array
- the first feature body is then transformed from a geometric array into a main form, and a second feature body is displayed.
- the first feature body is first displayed in a geometric array, specifically:
- the initial display position is determined based on user input
- the initial position is the area where the virtual image is displayed.
- the virtual image is displayed in the first form on the first interface, specifically:
- the virtual image is overlaid and displayed on a target content element in the first interface, wherein the target content element is a content element of a preset type.
- the gesture event interface of the target content element is copied to the virtual image so that performing a gesture on the virtual image can realize the gesture corresponding function of the target content element.
- the first interface is a camera interface
- the target content element is a shutter control in the camera interface
- the first interface is the lock screen interface or any screen interface after unlocking
- the first form is the wake-up state
- the second form is the listening state
- the signal instruction is generated based on the user performing a preset gesture on the digital assistant in the first form.
- the plurality of feature bodies further includes a second feature body
- the first feature body is a main form of a ring
- the second feature body is a sphere
- the ring is located at the red center of the sphere when viewed from above. On the road.
- the virtual image further has a task execution state, and the task execution state includes a first feature body
- the first feature body in the task execution state corresponds to at least one of a shape change and a position change.
- the first feature body is a geometric array
- the display position of the geometric array matches the target control in the process interface, so as to simulate a click event of the target control to enter the next process interface.
- the first state is a listening state
- the second state is a task execution state.
- Both the listening state and the task execution state include a first feature body, and the first feature bodies in the listening state and the task execution state have different forms respectively.
- the shape of the first feature body includes: a geometric array within the interface, an outer frame of a dialog card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page.
- the first feature body has different morphological changes at different task progresses of the automation task.
- the first feature body is changed from a geometric array covering the entire screen to a geometric array displayed only on the recognized executable object.
- the morphological change of the first feature body includes:
- the first feature body is first displayed in the main form of a ring on the periphery of the second feature body;
- the first feature body is then separated from the second feature body by the ring and is enlarged and displayed to serve as the outer frame of the conversation card;
- the first feature body changes from the outer frame of the dialogue card to the geometric array inside the dialogue bubble;
- the avatar is displayed in a listening state.
- the device further includes:
- the acquisition module 3102 is used to acquire scene information and display a first feature body among multiple feature bodies in a target form based on the scene information.
- the virtual image has a waiting-to-be-awakened state
- the waiting-to-be-awakened state includes at least a first feature body, and the first feature body is displayed in a long strip;
- the display module 3101 is further configured to display a second feature body in response to detecting that the focus of the human eye falls on the area where the screen is located.
- the display module 3101 is further configured to, in response to detecting a human voice, cause the second feature body to move relative to the first feature body toward a direction close to a source of the human voice.
- the electronic device is a smart car
- the display module 3101 is further used for:
- the vehicle function or warning information is indicated by the deformation or movement of the first feature body among the multiple feature bodies.
- FIG 32 is a schematic diagram of the structure of an execution device provided in an embodiment of the present application.
- the execution device 3200 can be specifically manifested as a mobile phone, a tablet, a laptop computer, an intelligent wearable device, a server, etc., which is not limited here.
- the execution device 3200 includes: a receiver 3201, a transmitter 3202, a processor 3203 and a memory 3204 (wherein the number of processors 3203 in the execution device 3200 can be one or more, and one processor is taken as an example in Figure 32), wherein the processor 3203 may include an application processor 32031 and a communication processor 32032.
- the receiver 3201, the transmitter 3202, the processor 3203 and the memory 3204 may be connected via a bus or other means.
- the memory 3204 may include a read-only memory and a random access memory, and provides instructions and data to the processor 3203. A portion of the memory 3204 may also include a non-volatile random access memory (NVRAM).
- NVRAM non-volatile random access memory
- the memory 3204 stores processor and operation instructions, executable modules or data structures, or subsets thereof, or extended sets thereof, wherein the operation instructions may include various operation instructions for implementing various operations.
- the processor 3203 controls the operation of the execution device.
- the various components of the execution device are coupled together through a bus system, wherein the bus system includes not only a data bus but also a power bus, a control bus, and a status signal bus, etc.
- the bus system includes not only a data bus but also a power bus, a control bus, and a status signal bus, etc.
- various buses are referred to as bus systems in the figure.
- the method disclosed in the above-mentioned embodiment of the present application can be applied to the processor 3203, or implemented by the processor 3203.
- the processor 3203 can be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above-mentioned method can be completed by an integrated logic circuit of hardware in the processor 3203 or an instruction in the form of software.
- the above-mentioned processor 3203 can be a general-purpose processor, a digital signal processor (digital signal processing, DSP), a microprocessor or a microcontroller, and can further include an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
- DSP digital signal processing
- ASIC application specific integrated circuit
- FPGA field-programmable gate array
- the processor 3203 can implement or execute the methods, steps and logic block diagrams disclosed in the embodiments of the present application.
- the general processor can be a microprocessor or the processor can also be any conventional processor, etc.
- the steps of the method disclosed in the embodiments of the present application can be directly embodied as a hardware decoding processor for execution, or a combination of hardware and software modules in the decoding processor for execution.
- the software module can be located in a mature storage medium in the field such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory or an electrically erasable programmable memory, a register, etc.
- the storage medium is located in the memory 3204, and the processor 3203 reads the information in the memory 3204 and completes the steps of the above method in combination with its hardware.
- the receiver 3201 can be used to receive input digital or character information and generate signal input related to the relevant settings and function control of the execution device.
- the transmitter 3202 can be used to output digital or character information through the first interface; the transmitter 3202 can also be used to send instructions to the disk group through the first interface to modify the data in the disk group; the transmitter 3202 can also include a display device such as a display screen.
- the electronic device provided in the embodiment of the present application may specifically be a chip, and the chip includes: a processing unit and a communication unit, the processing unit may be, for example, a processor, and the communication unit may be, for example, an input/output interface, a pin or a circuit, etc.
- the processing unit may execute the computer execution instructions stored in the storage unit so that the chip in the execution device executes the method described in the above embodiment.
- the storage unit is a storage unit in the chip, such as a register, a cache, etc.
- the storage unit may also be a storage unit located outside the chip in the wireless access device end, such as a read-only memory (ROM) or other types of static storage devices that can store static information and instructions, a random access memory (RAM), etc.
- ROM read-only memory
- RAM random access memory
- Figure 33 is a schematic diagram of the structure of a computer-readable storage medium provided in an embodiment of the present application.
- the present application also provides a computer-readable storage medium.
- the method disclosed in Figure 3 above can be implemented as computer program instructions encoded in a machine-readable format on a computer-readable storage medium or encoded on other non-transient media or products.
- FIG. 33 schematically illustrates a conceptual partial view of an example computer-readable storage medium including a computer program for executing a computer process on a computing device, arranged in accordance with at least some embodiments presented herein.
- computer readable storage medium 3300 is provided using signal bearing medium 3301.
- Signal bearing medium 3301 may include one or more program instructions 3302, which when executed by one or more processors may provide the functionality or portions of the functionality described above with respect to FIG.
- the signal bearing medium 3301 may include a computer readable medium 3303 such as, but not limited to, a hard drive, a compact disk (CD), a digital video disk (DVD), a digital tape, a memory, a ROM or RAM, and the like.
- a computer readable medium 3303 such as, but not limited to, a hard drive, a compact disk (CD), a digital video disk (DVD), a digital tape, a memory, a ROM or RAM, and the like.
- the signal bearing medium 3301 may include a computer recordable medium 3304, such as, but not limited to, a memory, a read/write (R/W) CD, a R/W DVD, etc.
- the signal bearing medium 3301 may include a communication medium 3305, such as, but not limited to, a digital and/or analog communication medium (e.g., a fiber optic cable, a waveguide, a wired communication link, a wireless communication link, etc.).
- a wireless form of the communication medium 3305 e.g., a wireless communication medium complying with the IEEE 802.X standard or other transmission protocol.
- the one or more program instructions 3302 may be, for example, computer executable instructions or logic implementing instructions.
- the computing device of the computing device may be configured to provide various operations, functions, or actions in response to the program instructions 3302 communicated to the computing device via one or more of the computer readable medium 3303, the computer recordable medium 3304, and/or the communication medium 3305.
- the device embodiments described above are merely schematic, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the scheme of this embodiment.
- the connection relationship between the modules indicates that there is a communication connection between them, which may be specifically implemented as one or more communication buses or signal lines.
- the storage medium such as a computer floppy disk, USB flash drive, mobile hard disk, ROM, RAM, magnetic disk or optical disk, includes a number of instructions for enabling a computer device (which may be a personal computer, training device, or network device, etc.) to execute the methods of the various embodiments of the present application.
- all or part of the embodiments may be implemented by software, hardware, firmware or any combination thereof.
- all or part of the embodiments may be implemented in the form of a computer program product.
- the computer program product includes one or more computer instructions.
- the computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
- the computer instructions can be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
- the computer instructions can be transmitted from a website site, a computer, a training device or a data center by wired (e.g., coaxial cable, optical fiber, digital subscriber line) or wireless (e.g., infrared, wireless, microwave, etc.) mode to another website site, computer, training device or data center.
- wired e.g., coaxial cable, optical fiber, digital subscriber line
- wireless e.g., infrared, wireless, microwave, etc.
- the computer-readable storage medium can be any available medium that a computer can store or a data storage device such as a training device, a data center, etc. that contains one or more available media integration.
- Available media can be magnetic media, (e.g., floppy disk, hard disk, tape), optical media (e.g., DVD), or semiconductor media (e.g., solid-state hard disk (SSD)), etc.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Software Systems (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Social Psychology (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Databases & Information Systems (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (55)
- 一种数字助手虚拟形象的显示方法,应用于电子设备,其特征在于,所述虚拟形象包括多个特征体,所述方法包括:在第一界面上以第一形态显示所述虚拟形象;检测到所述电子设备产生的信号指令,以第二形态显示所述虚拟形象;其中,所述信号指令为基于第一用户输入产生的,所述第一形态和所述第二形态展示所述多个特征体中的至少一个,从所述第一形态变化为所述第二形态是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
- 根据权利要求1所述的方法,其特征在于,所述多个特征体包括第一特征体,所述第一特征体的形状变化包括以下变化中的至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。
- 根据权利要求2所述的方法,其特征在于,所述多个特征体还包括第二特征体,其中,所述第一特征体的主形态为一圆环,所述第二特征体为一球体,所述圆环的圆心与所述球体的球心共点;所述第一特征体的位置变化包括,围绕所述球体翻转后的轨道位置变化、所述虚拟形象整体移动或旋转所导致的位置变化中的至少一种;所述第一特征体的运动规律包括,所述圆环以圆心为支点,按照预设的轨迹环绕所述球体转动;所述第二特征体的运动规律包括,所述球体沿着经过所述圆环的圆心、且垂直于所述圆环所在平面的直线按照预设的规律做往复运动。
- 根据权利要求3所述的方法,其特征在于,所述方法还包括:检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
- 根据权利要求1-3任意一项所述的方法,其特征在于,所述虚拟形象具有呼吸态,所述在第一界面上以第一形态显示所述虚拟形象,包括:若所述第一界面为熄屏界面,则所述第一形态为包括第一特征体和第二特征体的呼吸态,且所述第一特征体以主形态显示。
- 根据权利要求5所述的方法,其特征在于,在所述第一界面为熄屏界面时,所述呼吸态的虚拟形象在屏幕上按照预设的规律做往复运动。
- 根据权利要求5所述的方法,其特征在于,在所述第一界面为熄屏界面,所述第一形态为呼吸态时,所述方法还包括:检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
- 根据权利要求1-4任意一项所述的方法,其特征在于,所述虚拟形象具有待唤醒态和聆听态,所述第一界面为锁屏界面或解锁后的任意屏幕界面;所述在第一界面上以第一形态显示所述虚拟形象,包括:以待唤醒态作为第一形态显示在所述第一界面上,所述待唤醒态包括第一特征体,且所述第一特征体作为导航提示线以长条形显示,所述导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,所述待唤醒态在所述第一界面常驻显示;或者,检测到第三用户输入,以所述聆听态作为所述第一形态显示在所述第一界面上,所述第三用户输入用于唤醒所述数字助手,所述聆听态包括第一特征体和第二特征体,且所述第一特征体以主形态显示。
- 根据权利要求8所述的方法,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面,所述第一形态为聆听态;所述在第一界面上以第一形态显示所述虚拟形象,具体包括:所述第一特征体先以几何阵列显示;所述第一特征体再由几何阵列变化为主形态,并显示所述第二特征体。
- 根据权利要求9所述的方法,其特征在于,所述第一特征体先以几何阵列显示,具体为:在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示所述几何图形。
- 根据权利要求10所述的方法,其特征在于,所述初始显示位置是基于用户输入确定的;且在未检测到用户输入时,所述初始位置为显示所述虚拟形象的区域。
- 根据权利要求1-11任意一项所述的方法,其特征在于,所述在第一界面上以第一形态显示所述虚拟形象,具体为:在所述第一界面上的预设位置显示所述虚拟形象;或者,将所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上,其中,所述目标内容元素为预设类型的内容元素。
- 根据权利要求12所述的方法,其特征在于,当所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上时,将所述目标内容元素的手势事件接口复制到所述虚拟形象上,以使得对所述虚拟形象执行手势能够实现所述目标内容元素的手势对应功能。
- 根据权利要求13所述的方法,其特征在于,所述第一界面为相机界面,所述目标内容元素为相机界面内的快门控件。
- 根据权利要求8所述的方法,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面,所述第一形态为待唤醒态,所述第二形态为聆听态;所述信号指令是基于用户对所述第一形态的数字助手执行预设手势后产生。
- 根据权利要求15所述的方法,其特征在于,所述多个特征体还包括第二特征体;在所述第二形态下,所述第一特征体为圆环的主形态,所述第二特征体为球体,且所述圆环位于所述球体的俯视视角的赤道上。
- 根据权利要求8所述的方法,其特征在于,所述虚拟形象还具有任务执行态,所述任务执行态包括所述第一特征体;在所述数字助手执行自动化任务过程中的至少一个过程界面,所述任务执行态下的所述第一特征体对应产生形状变化、位置变化中的至少一种。
- 根据权利要求17所述的方法,其特征在于,所述第一特征体为几何阵列,且所述几何阵列的显示位置与过程界面内的目标控件相匹配,用于模拟所述目标控件的点击事件以进入下一过程界面。
- 根据权利要求17所述的方法,其特征在于,所述第一形态为聆听态,所述第二形态为任务执行态,所述聆听态和所述任务执行态均包括所述第一特征体,且所述聆听态和所述任务执行态下的第一特征体分别具有不同的形态;在所述任务执行态下,所述第一特征体的形态包括:界面内的几何阵列、对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
- 根据权利要求19所述的方法,其特征在于,在自动化任务的不同任务进度下,所述第一特征体具有不同的形态变化。
- 根据权利要求19所述的方法,其特征在于,在所述自动化任务为智慧识别任务时,所述第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
- 根据权利要求19或20所述的方法,其特征在于,在所述任务执行态下,所述第一特征体的形态变化包括:所述第一特征体先以圆环的主形态显示在第二特征体的外围;所述第一特征体再由圆环脱离所述第二特征体,并放大显示,以作为对话卡片的外框;所述第一特征体由所述对话卡片的外框变为对话气泡内的几何阵列;在自动化任务执行完成后,所述虚拟形象以聆听态显示。
- 根据权利要求1-4所述的方法,其特征在于,所述方法还包括:获取场景信息,并基于所述场景信息以目标形态显示所述多个特征体中的第一特征体。
- 根据权利要求1-4所述的方法,其特征在于,所述虚拟形象具有待唤醒态,所述待唤醒态至少包括第一特征体,所述第一特征体以长条形显示;响应于检测到人眼视线焦点落在屏幕所在区域,显示第二特征体。
- 根据权利要求24所述的方法,其特征在于,所述方法还包括:响应于检测到人声,所述第二特征体相对所述第一特征体向靠近人声声源的方向移动。
- 根据权利要求1-4所述的方法,其特征在于,所述电子设备为智能汽车,所述方法还包括:通过所述多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
- 一种数字助手虚拟形象的显示装置,应用于电子设备,其特征在于,所述虚拟形象包括多个特征体;所述装置包括:显示模块,用于在第一界面上以第一形态显示所述虚拟形象;所述显示模块,还用于检测到所述电子设备产生的信号指令,以第二形态显示所述虚拟形象;其中,所述信号指令为基于第一用户输入产生的,所述第一形态和所述第二形态展示所述多个特征体中的至少一个,从所述第一形态变化为所述第二形态是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
- 根据权利要求27所述的装置,其特征在于,所述多个特征体包括第一特征体,所述第一特征体的形状变化包括以下变化中的至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。
- 根据权利要求28所述的装置,其特征在于,所述多个特征体还包括第二特征体,其中,所述第一特征体的主形态为一圆环,所述第二特征体为一球体,所述圆环的圆心与所述球体的球心共点;所述第一特征体的位置变化包括,围绕所述球体翻转后的轨道位置变化、所述虚拟形象整体移动或旋转所导致的位置变化中的至少一种;所述第一特征体的运动规律包括,所述圆环以圆心为支点,按照预设的轨迹环绕所述球体转动;所述第二特征体的运动规律包括,所述球体沿着经过所述圆环的圆心、且垂直于所述圆环所在平面的直线按照预设的规律做往复运动。
- 根据权利要求29所述的装置,其特征在于,所述显示模块,还用于检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
- 根据权利要求27-29任意一项所述的装置,其特征在于,所述虚拟形象具有呼吸态,所述在第一界面上以第一形态显示所述虚拟形象,包括:若所述第一界面为熄屏界面,则所述第一形态为包括第一特征体和第二特征体的呼吸态,且所述第一特征体以主形态显示。
- 根据权利要求31所述的装置,其特征在于,在所述第一界面为熄屏界面时,所述呼吸态的虚拟形象在屏幕上按照预设的规律做往复运动。
- 根据权利要求31所述的装置,其特征在于,在所述第一界面为熄屏界面,所述第一形态为呼吸态时,所述显示模块,还用于:检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
- 根据权利要求27-30任意一项所述的装置,其特征在于,所述虚拟形象具有待唤醒态和聆听态,所述第一界面为锁屏界面或解锁后的任意屏幕界面;所述在第一界面上以第一形态显示所述虚拟形象,包括:以待唤醒态作为第一形态显示在所述第一界面上,所述待唤醒态包括第一特征体,且所述第一特征体作为导航提示线以长条形显示,所述导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,所述待唤醒态在所述第一界面常驻显示;或者,检测到第三用户输入,以所述聆听态作为所述第一形态显示在所述第一界面上,所述第三用户输入用于唤醒所述数字助手,所述聆听态包括第一特征体和第二特征体,且所述第一特征体以主形态显示。
- 根据权利要求34所述的装置,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面, 所述第一形态为聆听态;所述在第一界面上以第一形态显示所述虚拟形象,具体包括:所述第一特征体先以几何阵列显示;所述第一特征体再由几何阵列变化为主形态,并显示所述第二特征体。
- 根据权利要求35所述的装置,其特征在于,所述第一特征体先以几何阵列显示,具体为:在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示所述几何图形。
- 根据权利要求36所述的装置,其特征在于,所述初始显示位置是基于用户输入确定的;且在未检测到用户输入时,所述初始位置为显示所述虚拟形象的区域。
- 根据权利要求27-37任意一项所述的装置,其特征在于,所述在第一界面上以第一形态显示所述虚拟形象,具体为:在所述第一界面上的预设位置显示所述虚拟形象;或者,将所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上,其中,所述目标内容元素为预设类型的内容元素。
- 根据权利要求38所述的装置,其特征在于,当所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上时,将所述目标内容元素的手势事件接口复制到所述虚拟形象上,以使得对所述虚拟形象执行手势能够实现所述目标内容元素的手势对应功能。
- 根据权利要求39所述的装置,其特征在于,所述第一界面为相机界面,所述目标内容元素为相机界面内的快门控件。
- 根据权利要求34所述的装置,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面,所述第一形态为待唤醒态,所述第二形态为聆听态;所述信号指令是基于用户对所述第一形态的数字助手执行预设手势后产生。
- 根据权利要求39所述的装置,其特征在于,所述多个特征体还包括第二特征体;在所述第二形态下,所述第一特征体为圆环的主形态,所述第二特征体为球体,且所述圆环位于所述球体的俯视视角的赤道上。
- 根据权利要求34所述的装置,其特征在于,所述虚拟形象还具有任务执行态,所述任务执行态包括所述第一特征体;在所述数字助手执行自动化任务过程中的至少一个过程界面,所述任务执行态下的所述第一特征体对应产生形状变化、位置变化中的至少一种。
- 根据权利要求43所述的装置,其特征在于,所述第一特征体为几何阵列,且所述几何阵列的显示位置与过程界面内的目标控件相匹配,用于模拟所述目标控件的点击事件以进入下一过程界面。
- 根据权利要求43所述的装置,其特征在于,所述第一形态为聆听态,所述第二形态为任务执行态,所述聆听态和所述任务执行态均包括所述第一特征体,且所述聆听态和所述任务执行态下的第一特征体分别具有不同的形态;在所述任务执行态下,所述第一特征体的形态包括:界面内的几何阵列、对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
- 根据权利要求45所述的装置,其特征在于,在自动化任务的不同任务进度下,所述第一特征体具有不同的形态变化。
- 根据权利要求46所述的装置,其特征在于,在所述自动化任务为智慧识别任务时,所述第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
- 根据权利要求45或46所述的装置,其特征在于,在所述任务执行态下,所述第一特征体的形态变化包括:所述第一特征体先以圆环的主形态显示在第二特征体的外围;所述第一特征体再由圆环脱离所述第二特征体,并放大显示,以作为对话卡片的外框;所述第一特征体由所述对话卡片的外框变为对话气泡内的几何阵列;在自动化任务执行完成后,所述虚拟形象以聆听态显示。
- 根据权利要求27-30所述的装置,其特征在于,所述装置还包括:获取模块,用于获取场景信息,并基于所述场景信息以目标形态显示所述多个特征体中的第一特征体。
- 根据权利要求27-30所述的装置,其特征在于,所述虚拟形象具有待唤醒态,所述待唤醒态至少包括第一特征体,所述第一特征体以长条形显示;所述显示模块,还用于响应于检测到人眼视线焦点落在屏幕所在区域,显示第二特征体。
- 根据权利要求50所述的装置,其特征在于,所述显示模块,还用于响应于检测到人声,使得所述第二特征体相对所述第一特征体向靠近人声声源的方向移动。
- 根据权利要求27-30所述的装置,其特征在于,所述电子设备为智能汽车,所述显示模块,还用于:通过所述多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
- 一种数字助手虚拟形象的显示装置,其特征在于,包括存储器和处理器;所述存储器存储有代码,所述处理器被配置为执行所述代码,当所述代码被执行时,所述装置执行如权利要求1至26任意一项所述的方法。
- 一种计算机存储介质,其特征在于,所述计算机存储介质存储有指令,所述指令在由计算机执行时使得所述计算机实施权利要求1至26任意一项所述的方法。
- 一种计算机程序产品,其特征在于,所述计算机程序产品存储有指令,所述指令在由计算机执行时使得所述计算机实施权利要求1至26任意一项所述的方法。
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP24830416.4A EP4645047A4 (en) | 2023-06-29 | 2024-06-04 | METHOD FOR DISPLAYING A DIGITAL ASSISTANT AVATAR AND ASSOCIATED DEVICE |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310786258.8 | 2023-06-29 | ||
| CN202310786258.8A CN117008762A (zh) | 2023-06-29 | 2023-06-29 | 一种数字助手的显示方法及相关装置 |
| CN202410178141.6 | 2024-02-08 | ||
| CN202410178141.6A CN119225568A (zh) | 2023-06-29 | 2024-02-08 | 一种数字助手虚拟形象的显示方法及相关装置 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US19/433,973 Continuation US20260127793A1 (en) | 2023-06-29 | 2025-12-29 | Method for displaying virtual image of digital assistant and related apparatus |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025001769A1 true WO2025001769A1 (zh) | 2025-01-02 |
Family
ID=93937318
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2024/097229 Ceased WO2025001769A1 (zh) | 2023-06-29 | 2024-06-04 | 一种数字助手虚拟形象的显示方法及相关装置 |
Country Status (2)
| Country | Link |
|---|---|
| EP (1) | EP4645047A4 (zh) |
| WO (1) | WO2025001769A1 (zh) |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109358923A (zh) * | 2018-08-29 | 2019-02-19 | 华为技术有限公司 | 一种虚拟机器人形象的呈现方法及装置 |
| CN110825469A (zh) * | 2019-09-18 | 2020-02-21 | 华为技术有限公司 | 语音助手显示方法及装置 |
| US20200210142A1 (en) * | 2018-12-29 | 2020-07-02 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and apparatus for controlling virtual speech assistant, user device and storage medium |
| CN116168701A (zh) * | 2023-02-24 | 2023-05-26 | 上海传英信息技术有限公司 | 交互方法、智能终端及存储介质 |
| CN116204253A (zh) * | 2021-11-30 | 2023-06-02 | 华为技术有限公司 | 一种语音助手显示方法及相关装置 |
| CN117008762A (zh) * | 2023-06-29 | 2023-11-07 | 华为技术有限公司 | 一种数字助手的显示方法及相关装置 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9338493B2 (en) * | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US12045437B2 (en) * | 2020-05-22 | 2024-07-23 | Apple Inc. | Digital assistant user interfaces and response modes |
| US11955137B2 (en) * | 2021-03-11 | 2024-04-09 | Apple Inc. | Continuous dialog with a digital assistant |
-
2024
- 2024-06-04 EP EP24830416.4A patent/EP4645047A4/en active Pending
- 2024-06-04 WO PCT/CN2024/097229 patent/WO2025001769A1/zh not_active Ceased
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109358923A (zh) * | 2018-08-29 | 2019-02-19 | 华为技术有限公司 | 一种虚拟机器人形象的呈现方法及装置 |
| US20200210142A1 (en) * | 2018-12-29 | 2020-07-02 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and apparatus for controlling virtual speech assistant, user device and storage medium |
| CN110825469A (zh) * | 2019-09-18 | 2020-02-21 | 华为技术有限公司 | 语音助手显示方法及装置 |
| CN116204253A (zh) * | 2021-11-30 | 2023-06-02 | 华为技术有限公司 | 一种语音助手显示方法及相关装置 |
| CN116168701A (zh) * | 2023-02-24 | 2023-05-26 | 上海传英信息技术有限公司 | 交互方法、智能终端及存储介质 |
| CN117008762A (zh) * | 2023-06-29 | 2023-11-07 | 华为技术有限公司 | 一种数字助手的显示方法及相关装置 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4645047A1 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4645047A1 (en) | 2025-11-05 |
| EP4645047A4 (en) | 2026-04-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11500510B2 (en) | Information processing apparatus and non-transitory computer readable medium | |
| CN107801413B (zh) | 对电子设备进行控制的终端及其处理方法 | |
| CN109739361B (zh) | 基于眼睛跟踪的可见度提高方法和电子装置 | |
| US12039161B2 (en) | Electronic device comprising a plurality of touch screen displays and screen division method | |
| WO2021057830A1 (zh) | 一种信息处理方法及电子设备 | |
| WO2021244443A1 (zh) | 分屏显示方法、电子设备及计算机可读存储介质 | |
| US11886673B2 (en) | Trackpad on back portion of a device | |
| US20180274936A1 (en) | Method and apparatus for providing augmented reality function in electronic device | |
| US20250265755A1 (en) | Hand-tracked text selection and modification | |
| TWI659333B (zh) | 用以處理與移動相關的資料之計算裝置及方法 | |
| CN112181207B (zh) | 一种显示设备及几何图形识别方法 | |
| US10936184B2 (en) | Display apparatus and controlling method thereof | |
| CN119225568A (zh) | 一种数字助手虚拟形象的显示方法及相关装置 | |
| EP4589977A1 (en) | Gesture sensing method, device and storage medium | |
| US20250199621A1 (en) | Gesture-based application invocation | |
| WO2023235672A1 (en) | Ar-based virtual keyboard | |
| US20220350997A1 (en) | Pointer-based content recognition using a head-mounted device | |
| CN111240483A (zh) | 操作控制方法、头戴式设备及介质 | |
| US12483776B2 (en) | Photographing method and electronic device | |
| WO2025001769A1 (zh) | 一种数字助手虚拟形象的显示方法及相关装置 | |
| CN114282544A (zh) | 显示设备和控件识别方法 | |
| US12436610B2 (en) | Display method and electronic device | |
| US20260127793A1 (en) | Method for displaying virtual image of digital assistant and related apparatus | |
| CN117472262A (zh) | 交互方法和电子设备 | |
| CN119013650A (zh) | 控制多个窗口的显示的电子设备及其操作方法和存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24830416 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2024830416 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2024830416 Country of ref document: EP Effective date: 20250728 |
|
| ENP | Entry into the national phase |
Ref document number: 2024830416 Country of ref document: EP Effective date: 20250728 |
|
| WWP | Wipo information: published in national office |
Ref document number: 2024830416 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |